Mascot is a software package from Matrix Science that interprets mass spectral data into protein identities.
In this presentation we will study about MASCOT and also on how to use it.
2. Basics
Simple MS – molecular weight of peptide mixture.
MS/MS (Tandem MS) – sequence structural information by
recording the fragment ion spectrum of peptide.
4. PURPOSE OF MS
Elemental composition.
Masses of particles of molecules.
Identify unknown compounds.
Isotopic Composition.
5. INTRODUCTION
Mascot is a software package from Matrix Science
(www.matrixscience.com) that interprets mass spectral data into
protein identities.
It uses mass spectrometry data to identify proteins from primary
sequence databases.
6. INTRODUCTION
The experimental mass values are then compared with calculated
peptide mass by applying cleavage rules to the entries in a
comprehensive primary sequence database.
If unknown protein is present, we will get precise entry otherwise pull
out those entries which exhibit the closest homology(related species).
8. Algorithm used..
• Program MASCOT is based on the MOWSE algorithm; this program also
evaluates a possibility of random matching of experimental and
theoretical peptide masses.
• The Algorithm MOWSE (Molecular Weight Search) is more selective and
sensitive.
9. Two Mascot Choices
Matrix Sciences offers two choice for users:
A free, open access web-based system for occasional (1-10) queries.
A locally installed version for heavy use or highthroughput MS (100’s
queries/day)
11. MASCOT SEARCH STRATEGIES
Mascot has three main search modes:
Peptide Mass Fingerprint(based on a list of peptide mass values).
Sequence Query (based on one or more peptide mass values
associated with information such as partial sequence information).
MS/MS Ion Search (based on raw MS/MS data from one or more
peptides).
12. PEPTIDE MASS FINGERPRINT
• It is possible to identify the protein from available mass spectrum of the peptide mixture
resulting from the digestion of a protein by specific enzyme.
• This method is useful for identifications of protein with already known sequences.
• Requires enzymes of great specificity.
• Mascot looks for the highest scoring set of peptide matches which are within a contiguous
stretch of sequence less than or equal to the specified protein molecular weight.
13. SEQUENCE QUERY
• The sequence query, in which one or more peptide molecular masses are combined with sequence,
composition and fragment ion data
• It is potentially the most powerful search.
• The source of the information is MS/MS spectrum.
• The sequence query mode of Mascot supports both standard and error tolerant sequence tags.
14. MS/MS IONS SEARCH
• The MS/MS ions search accepts data in the form of peak lists containing mass and intensity
pairs.
• The high level of specificity of an MS/MS ions search means that it is not essential to choose
an enzyme.
• Obtaining matches to a number of peptides from a single protein provides a very high level of
confidence that the result is correct.
17. Parameters used in database
searching
Database searched
Taxonomy
Enzyme
Missed cleavages
Fixed versus variable modifications (PTMs)
18. SCORING SCHEMES
PROBABILITY BASED SCORING
Mascot incorporates a probability based implementation of the Mowse
algorithm
The total score is the absolute probability that the observed match is
a random event.
Advantages :
Different types of matching (peptide masses and fragment ions) can
be combined in a single search.
Scores from different searches and on different databases can be
compared.
Search parameters can be optimised more readily by iteration.
19. For this reason,
We report scores as -10*LOG10(P), where P is the absolute probability.
Probability of 10-20 thus becomes a score of 200.
21. The best result is obtained for PML_HUMAN having a score of 194, we
can confirm it by referring the graph where there is one hit and 194
lies in the significant region having a threshold score of 70 so the
chances of randomness is reduced.
The hits below the threshold region are insignificant.
24. Parameters
Additional parameters within each query are optional, and can be used to
specify:
TITLE for spectrum identification
CHARGE state of the precursor peptide
TOL peptide tolerance
TOLU peptide tolerance units
SEQ for a sequence qualifier (multiple SEQ qualifiers are allowed)
COMP for a composition qualifier (only one COMP qualifier is allowed)
TAG for a sequence tag (multiple TAG qualifiers are allowed)
28. There is only one hit having a score of 180 that falls in the significant
region.
In MS/MS Ion search the best result is taken by number of queries
matched and the score should be highlighted in bold & red.
33. Here we obtain 3 hits for the score 76, which fall in the significant
region.
So, here the best match is selected by the numbers of the queries
matched.
LYSCO_PHACO is the best match for this result.