WHY BOTHER WITH PROTEOMICS?
• Proteins are the machines that drive much
of biology
• Genes are merely the recipe
• The direct characterization of a sample’s
proteins en masse.
• What proteins are present?
• How much of each protein is present?
WHY NOT MICROARRAYS?
Is Proteomics the New Genomics? Jürgen Cox and Matthias Mann, Cell 130, August 10, 2007
AN ANALYTICAL CHALLENGE
Dynamic range
of protein
abundances is
a challenge for
separation
sciences
No equivalent
of PCR for
proteins-deal
with µ- to nmol
concentrations
Alternate
splice forms of
a gene can
make different
proteins
>200 Post
translational
modifications;
cannot be
deduced from
a gene or
mRNA
Edman sequencing cannot provide the solutions !!!
MASS SPECTROMETRY
Analytical method to measure the molecular or atomic
weight of samples
Slide adopted from: Dr.. Ahna Skop. Mass Spectrometry: Methods & Theory
SOFT IONIZATION METHODS
337 nm UV laser
MALDI
cyano-hydroxy
cinnamic acid
Gold tip needle
Fluid (no salt)
ESI
+
_
Slide adopted from: Nathan Edwards
Center for Bioinformatics and Computational Biology(UMIACS)
MS INSTRUMENTS
A Brief Summary of the Different Types of Mass Spectrometers Used in Proteomics
Methods in Molecular Biology, vol. 484: Functional Proteomics: Methods and Protocols
IDENTIFICATION STRATEGIES
Experimental
masses
Theoretical
Masses
(database)
1. Peptide mass fingerprinting(PMF)
2. MS/MS spectral matching
Experimental spectrum
Theoretical spectra
3.De novo sequencing*
72.0 129.0 97.0 101.0 113.1 174.1
A E P T I R H2O
*Adopted from: Brian C. Searle, Proteome Software
Inc. Portland, Oregon USA
4. Spectral library search
PEPTIDE MASS FINGERPRINT
The proteins from a sample are separated on 2D gels
Protein of interest is digested by trypsin (or any other site
specific cleavage)
Ionization of peptides in a MALDI mass spectrometer
m/z values detected and plotted as mass spectrum
PMF database search to identify the protein
22
MODIFICATIONS
Fixed modifications: will be present on any
occurrence of the affected amino acid.Eg.+57@C
Variable modifications: may be present on some
or all positions of the affected amino acid.
Eg.+16@M
Slide adopted from: Nathan Edwards
Center for Bioinformatics and Computational Biology(UMIACS)
26
FRAGMENTATION
PEPTIDE
MW ion ion MW
98 b1 P EPTIDE y6 703
227 b2 PE PTIDE y5 574
324 b3 PEP TIDE y4 477
425 b4 PEPT IDE y3 376
538 b5 PEPTI DE y2 263
653 b6 PEPTID E y2 148
SHOTGUN PROTEOMICS & DATABASE
SEARCH
The pros and cons of peptide-centric proteomics. Mark W. Duncan, Ruedi Aebersold, Richard M. Caprioli
Nature Biotechnology, Vol. 28, No. 7. (01 July 2010), pp. 659-664
30
DE NOVO INTERPRETATION
100
0
250 500 750 1000
m/z
%Intensity
Slide adopted from: Nathan Edwards
Center for Bioinformatics and Computational Biology(UMIACS)
31
DE NOVO INTERPRETATION
100
0
250 500 750 1000
m/z
%Intensity
E L
Slide adopted from: Nathan Edwards
Center for Bioinformatics and Computational Biology(UMIACS)
32
DE NOVO INTERPRETATION
100
0
250 500 750 1000
m/z
%Intensity
E L F
KL
SGF G
E D
E
L E
E D E L
Slide adopted from: Nathan Edwards
Center for Bioinformatics and Computational Biology(UMIACS)
33
SUMMARY
Proteomics is large-scale study (qualitative and
quantitative) study of proteins by mass spec.
Mass spectrometry + sequence databases
represent a huge leap for protein (bio-)chemistry.
ProteinSeparation - 2DGE and HPLC
Ionization - MALDI and ESI
Identification - PMF, MS/MS and de novo
sequencing
Sample prep, instruments and algorithms still
maturing, much work to be done.