SlideShare une entreprise Scribd logo
1  sur  7
Télécharger pour lire hors ligne
Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23
www.ijera.com 17 | P a g e
A Review on Mass Spectrometry: Technique and Tools
Ms. Ashwini Yerlekar*, Dr. M.M. Kshirsagar **
*(Department of Computer Technology, Yeshwantrao Chavan college of Engineering, Nagpur, India)
** (Department of Computer Technology, Yeshwantrao Chavan college of Engineering, Nagpur, India)
ABSTRACT
Protein structure prediction has gain important in area of life sciences, because of its complex structure. The
protein-protein interaction is necessary to study the behavior of protein in a specific environment, and study
molecular relationship in living systems. Therefore, large scale proteomics technologies are required to measure
physical connection of proteins in living organisms. Mass Spectrometry uses the technique to measure mass-to-
charge ratio of ion. It‘s an evolving technique for characterization of proteins. A Mass Spectrometer can be
more sensitive and specific, also complement with other LC detectors. Liquid Chromatography, unlike gas
chromatography is a separation technique which helps to separate wide range of organic compounds from small
molecular metabolites to peptides and proteins. This paper addresses the study of data analysis using mass
Spectrometry. It also includes the study of various methods of Mass Spectrometry data analysis, the tools and
various applications of Mass Spectrometry.This review briefs on Mass Spectrometry technique, its application,
usage, and tools used by Mass Spectrometry.
Keywords – complex, LC (Liquid chromatography) detectors, Mass Spectrometry, mass-to-charge ratio,
Protein-protein interaction.
I. INTRODUCTION
Bioinformatics is a novel approach in recent
investigations on sequence analysis and structure
prediction of proteins. The central challenge for
Bioinformatics is not only to store and retrieve data
but also to develop tools for data analysis. A number
of tools are also available for protein
characterization, structure identification, and
visualization.
Mass Spectrometer is a quantitative tool used to
measure the mass-to-charge ratio of ions. An ion is
atom or group of atoms which have gained or lost
one or more electrons making them negatively or
positively charged. The two primary methods for
ionization of whole proteins are electrospray
ionization (ESI-which turns the sample proteins into
ions) and matrix-assisted laser desorption/ionization
(MALDI- It uses laser to ionize the sample proteins
and then push the proteins into the analyzer to
produce a Mass Spectrum.). As it is an essential tool
in proteomics, it is necessary to understand the results
and the principles of Mass Spectrometer. The various
steps in mass spectrometry can be performed on a
single sample by some mass spectrometer. They
prefer the modularity method i.e.to generate the
spectrum for a single ion, then fragmenting the ion
and getting the spectrum. So it can work with
complex molecules piece by piece until its structure
is determined.
A typical Mass Spectrometer consists of three parts: a
mass analyzer, a detector and an ion source. The ion
source produces ions from the sample. The Mass
Analyzer separates ions with different mass-to-charge
ratios, these different ions are detected by detector.
Finally, the mass spectrum is generated after all the
data have been collected. Fig.1 is a scheme graph of
the Mass Spectrometer [1].
FIG1: SCHEME GRAPH OF MASS SPECTROMETRY
II. Various tools available for Protein
Prediction using Mass Spectrometry
Peptide identification algorithms fall into
two broad classes: database search and de novo
search. The first search takes place against a
database containing all amino acid whereas the latter
deduce peptide sequences without knowledge of
genomic data. At present, database search is more
reliable and considered to produce higher quality
results for most uses. With advancement in
RESEARCH ARTICLE OPEN ACCESS
SAMPLE
ION
SOURCE
MASS
ANALYZER
DETECTOR
DATA ANALYSIS
Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23
www.ijera.com 18 | P a g e
instrument precision, the de novo search may
become increasingly captivating.
2.1 Database Search Algorithms
2.1.1. SEQUEST
SEQUEST [2] is a patented tandem mass
spectrometry data analysis program. It was developed
by John Yates and Jimmy Eng in 1994. The
algorithm used by this program is covered by several
US and European software copyrights. SEQUEST
identifies collections of tandem mass spectra to
peptide sequences that have been evolved from
databases of protein sequences. SEQUEST, like
many engines, learns each tandem mass spectrum one
by one. The software evaluates protein sequences
from a database to calculate the list of peptides. The
peptide's intact mass is known from the mass
spectrum, and SEQUEST uses this information to
find the group of candidate peptides sequences that
could be compared to the spectrum by including only
those which are equally near the mass of the observed
peptide ion.
2.1.2 Mascot
Mascot [3] is a proprietary identification program
readily available from Matrix Science. Instead of
using the cross correlation method, it carries out mass
spectrometry data analysis through statistical
comparisons of matches between projected and
observed peptide fragments. In addition to the
identification features, support for peptide
quantization methods is provided, as per version 2.2.
2.1.3. PEAKS DB
PEAKS DB [4] is a proprietary database
search engine, run in cohesion with de novo
sequencing to automatically formalize the search
results, allowing for a higher number of searched
sequences for the available false search rate. In
addition to providing an independent database search,
results can be integrated as part of the software‘s
multi-engine (Sequest, Mascot, X!Tandem, OMSSA,
PEAKS DB) consensus reporting tool, in Chorus. The
tool also provides a list of sequences identified
exclusively by de novo sequencing.
2.2 De novo Sequencing Algorithms
DeNovoX performs de novo sequencing [5] on
CID spectra obtained with ion trap mass
spectrometers. The software was launched by
Thermo Fisher Scientific in 2002, was the first
commercial software for low-resolution data.
DeNovoX presents complete and/or partial peptide
sequences. Each output sequence comes with the
probable symptoms of how likely it is for the
sequence to be obtained. Probabilistic results are used
while implementing the software.
2.2.1 DeNoS
DeNoS [6] is part of the software tool Protein
matching Analysis Software (PAS) which is part of
the software package; Medicwave Bioinformatics
Suite (MBS).DeNoS performs sequencing of peptides
by gathering the data from CAD and ECD spectra
with high reliability. Hierarchal algorithm is used by
DeNoS. In the initial step fragments that are
confirmed in both CAD and ECD (so called Golden
Complementary Pairs) along with fragments that are
only found in CAD (so called Complementary Pairs)
are used. After that, one-by-one fragments with less
assurance are used. Ultimately, if the peptide is still
not fully sequenced, the software uses a crucial
application from the graph theory to sequence the left
out peptide parts with "unreliable" fragments.
2.2.2 PEAKS
PEAKS[7] de novo automatically provides a
confidence scores on individual amino acid
assignments, a complete sequence for individual
peptide, greater knowledge for scientifically
sensitive, in-depth investigations simple reporting
and greater knowledge for scientifically sensitive, in-
depth investigations for high-throughput analysis,. A
de novo, manually assisted mode, is present for users
who wish to idealize their results further. Automated
de novo sequencing on full LC run processed data
faster than 1 spectrum per second.
2.2.3 Lutefisk
For the de novo interpretation of peptide
spectrum Lutefisk software is available [8].
2.3 MS/MS peptide quantification
2.3.1 OpenMS / TOPP
OpenMS [9] is a software C++ library for LC-
MS/MS data management and analysis; it offers an
infrastructure for the development of softwares
related to mass spectrometry. It is free software
available under the 2-clause BSD license (previously
under the LGPL).TOPP - The Opens Proteomics
Pipeline - is a set of small applications that can be
sequenced to create analysis pipelines tailored for a
particular problem. TOPP is developed using the
structures and algorithms provided by OpenMS.
TOPPView is viewer software that visualization of
mass spectrometric data on MS1 and MS2 level as
well as in 3D; additionally it also displays
chromatographic data from SRM experiments (in
version 1.10). OpenMS and TOPP is a collaboration
project of the Algorithmic Bioinformatics group at
the Free University of Berlin and the Applied
Bioinformatics group at Tubingen University.
2.3.2. MaxQuant
MaxQuant is a proprietary software for numerical
proteomics developed by Jorgen Cox and others at
the Max Planck Institute of Biochemistry in
Martinsried, Germany.[10] The software is released
Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23
www.ijera.com 19 | P a g e
as freeware under the "MaxQuant‖ Freeware
Software License Agreement" and written in C#. The
software provides its own search engine called
Andromeda, and also accepts the analysis of label
free and SILAC based proteomics experiments.
2.4 Other Softwares
2.4.1 AnalyzerPro
AnalyzerPro is proprietary software by
SpectralWorks Limited. It is a software application
for processing mass spectrometry data and is vendor
independent. Using proprietary algorithms,
AnalyzerPro can process data by both GC-MS and
LC-MS using both qualitative and quantitative
processing. On large scale, it is used for
metabolomics data processing using MatrixAnalyzer
for the comparison of multiple data sets.
2.4.2 Analyst
Analyst is licensed software by AB Scitex, a
division of The Danaher Corporation.
2.4.3 RemoteAnalyzer
RemoteAnalyzer is proprietary software by
Spectral Works Limited. It is a vendor independent
'Open Access' client/server based solution to provide
a walk-up and use LC-MS and GC-MS data system.
Data processing support and Instrument control for
multiple vendors' hardware is provided. massXpert
[11] is another tools used in Mass spectrometry.
2.4.4 ESIprot 1.0 / ESIprot Online
Electrospray ionization (ESI) mass spectrometry
(MS) devices with relatively low resolution are
widely used for metabolomics and proteomics. Ion
trap devices like the Agilent MSD/XCT ultra or the
Bruker HCT ultra are typical representatives. The
ESI-MS data of most of the naturally occurring
proteins can be computed, the availability of data
evaluation software for such ESI protein spectra with
low resolution is quite moderate. ESIprot 1.0 allows
the charge state determination and molecular weight
calculation for low resolution electrospray ionization
(ESI) mass spectrometry (MS) data of proteins.
Whereas ESIprot 1.0 is written in Python (GPL v3
License), the online freely available PHP web
application is ESIprot.
III. MASS SPECTROMETRY DATA ANALYSIS
Key to making this paradigm shift possible
has been the development of bioinformatics software
that allows one to correlate bimolecular MS data
directly with protein sequence databases [12]. Two
kinds of MS bioinformatics software exist:
(1) Software for identifying proteins from peptide
mass fingerprints and
(2) Software for identifying peptides or proteins
directly from uninterrupted tandem (MS/MS) mass
spectra.
Peptide mass fingerprinting was developed
in the early 1990s to identify proteins from
proteolysis fragments (Pappies al., 1993; Yates al.,
1993; MANNet al., 1993). Specifically, if a pure
protein is digested with a protease that cuts at
predictable locations (say trypsin), the result will be a
peptide mixture containing a unique collection of
between 10–50 different peptides, each with a
different or characteristic mass. Running this mixture
on a modern ESI or MALDI instrument will lead to
an MS spectrum with dozens of peaks corresponding
to the masses of each of these peptides. Because no
two proteins are likely to share the same set of
constituent peptides, this mixture is called a peptide
mass fingerprint. By comparing the observed masses
of the mixture with predicted peptide masses derived
from all known protein sequences it is theoretically
possible to identify the protein of interest (providing
the protein has been previously sequenced).
Specifically, in the course of performing a mass
fingerprint search, database sequences are
theoretically ―cleaved‖ using known protease cutting
rules, the resulting hypothetical peptide masses are
calculated and the whole protein is ranked according
to the number of exact (or near exact) cleavage
fragment matches made to the observed set of peptide
masses. The sequence with the highest number and
quality of matches is usually selected as the most
attractive candidate. Most of the mass finger printing
softwares are freely available while some are sold as
commercial products. When peptide masses are read
from the protein databases, it must be taking into
notice that mono-isotopic masses from500-3000mz-1
is accurately readable or restricting the size of
database is also a wise decision.
Because of the inclusion of noise the search
for peptide mass fingerprinting is not an easy task.
The common complications included in it are as
follows [4]:
(1) disappearance of key peaks due to nonspecific
ion suppression,
(2) appearance of extra peaks from protease
autolysis,
(3) appearance of peaks from post-translational or
artifactual chemical modification,
(4) appearance of peaks from non-specific
cleavage, or from contaminating proteases,
and
(5) appearance of peaks from contaminating
impurities, contaminating homologs, or splice
variants.
Due to such complications the reliability and
performance of search gets reduced. The secondary
searches are called ―orphan‖ masses are made readily
available. It also allows comparing the results with
other search algorithms as SEARCH. But the best
way is to apply various methods of mass
fingerprinting and combining the result, called
‗Single Averaging‘. The programs works typically by
scanning the databases and probability of a protein
Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23
www.ijera.com 20 | P a g e
matching increases with maximum matched
sequence.
3.1 Data representations
Mass spectrometry produces different types
of data. The most common visualization technique is
the mass spectrum i.e. Graphical Representation.
Various types of mass spectrometry data are best
represented as a mass chromatogram. Types of
chromatograms include total ion current (TIC),
selected ion monitoring (SIM), and selected reaction
monitoring (SRM), among many others. Other types
of mass spectrometry data are well represented as a
three-dimensional map. In this form, intensity the y-
axis, the mass-to-charge, m/z is on the x-axis, and an
additional parameter, is plotted on the z-axis.
3.2 Data analysis
Mass spectrometry data analysis particular
to the type of experiment producing the data.
Fundamental divisions of data are general in
understanding any data. Many mass spectrometers
work in either electron mode or positive ion mode.
Sometimes it is very necessary to know whether the
obtained ions are negatively or positively charged. It
is often essential in determining the neutral mass but
it also symbolizes something about the nature of the
molecules.
Various ion source results in arrays of fragments
obtained from the fundamental molecules. The
ionization source creates many fragments and mostly
single-charged (1- ) radicals (odd number of
electrons), whereas non-radical quasimolecular ions,
an electrospray source usually produces that are
frequently multiply charged. Tandem mass
spectrometry produces fragment ions post-source and
can change the kind of data obtained by an
experiment.
A strongly prepared biological sample will
probably contain a specific amount of salt,
adducts get formed with the analyte molecules in
various analyses Knowledge of the initial sample can
provide insight into the component molecules of the
sample and their peptides(protein in form of
fragments). A sample from this
synthesis/manufacturing process will probably
consist of impurities chemically related to the target
component.
Results can also depend heavily on sample
preparation and how it was introduced. Much of the
energetics of the desorption/ionization event is
controlled by the matrix rather than the laser power.
Mass spectrometry can measure sample purity,
molecular structure and molar mass. Each of these
queries requires a different experimental procedure;
so adequate definition of the experimental goal is an
early requirement for collecting the proper data and
evolving the data.
3.3 Interpretation of mass spectra
Since the specific structure or peptide sequence of a
molecule is decrypted through the set of fragment of
masses, the combined usage of various techniques
helps in the interpretation of mass spectra. The first
way of identifying an unknown compound is to
compare its experimental mass spectrum against
databases of mass spectrum. The software assisted
interpretation or manual interpretation [15] of mass
spectra must be performed if match in spectral
database in not found. Computer simulation
of ionization and fragmentation processes occurring
in mass spectrometer is the fundamental tool for
assigning structure or peptide sequence to a
molecule. Such simulation is often supported by a
fragmentation library [16] that contains published
patterns of known decomposition
reactions. Software developed on this idea has been
developed for both small molecules and proteins.
3.4 Analysis of mass spectra
It can also be spectra with accurate mass. A mass-to-
charge ratio value (m/z) with only integer precision
can represent an immense number of theoretically
possible ion structures; so specific mass figures
mainly reduce the number of candidate in molecular
formulas. All molecular formulas that theoretically fit
a given mass with specified tolerance can be
calculated by the computer algorithm called as
formula generator. The precursor ion fingerprinting,
technique used for structure prediction identifies
individual pieces of structural information by
carrying a search of the tandem spectra of the
molecule under investigation against a library of
the product-ion spectra of structurally characterized
precursor ions. Mass spectrometry can measure,
sample purity, molecular structure and molar mass.
These queries require a different experimental
procedure; so an adequate definition of the
experimental goal is an early requirement for
collecting the proper data and successfully
interpreting it.
IV. R PACKAGES FOR MASS
SPECTROMETRY DATA ANALYSIS
R language is generally used by the life
sciences people for evaluating the results obtained. It
is useful for analysing the data. R packages in
combination with Bioconductor packages also help to
analyse the proteomics data. The various R packages
available are as follows:
4.1 The mzR Package
The mzR package [19] provides a unified
interface to various mass spectrometry open formats.
This code chunk, taken mainly from the openMSfile
documentation illustrated how to open a connection
to the raw data file. mzR package is also applicable to
an mzXML, mzData or netCDF file. mzR is used by
other packages, like MSnbase [14], TargetSearch
Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23
www.ijera.com 21 | P a g e
[15] and xcms that provide a higher level abstraction
to the data.
4.2 MSnbase
MSnbase provides base functions and classes for
MS-based proteomics that allow facile data and meta-
data processing, manipulation and plotting [13] [14].
4.3 The MALDIquant package
MALDIquant gives a combined analysis pipeline
for MALDI-TOF and other mass spectrometry data.
Differentiating features include baseline subtraction
methods such as SNIP TopHat or TopHat, peak
alignment using warping functions used for handling
of replicated measurements and also allowing spectra
with different resolutions.
The MALDIquant package contains of two main R
packages:
 MALDIquant consist of the base
functionality for accessing mass
spectrometry data.
 MALDIquantForeign contains interface for
importing and exporting data.
4.4 The isobar package
The isobar package [15] provides methods for
the statistical analysis of isobarically tagged MS2
experiments.
4.5 The synapter package
The synapter package comes with a detailed
vignette that describes how to prepare the MSE data
and then process it in R. Several interfaces are
available provided the user with easy batch
processing capabilities, or a graphical user interface
and maximum control. The conversion into MSnSet
instances and filter and combination thereof as well
as statistical analysis are also described.
4.6 digeR16
This is available on CRAN but not listed in the
Chemometrics and Computational Physics Task
View, provides a GUI interface for analysing 2D
DIGE data. (http://cran.r-
project.org/web/packages/digeR/index.html) using R
and Bioconductor for Proteomics Data Analysis. It
allows performing score plot, feature selection,
correlation analysis, classification and power analysis
for 2D DIGE experiment data.
V. ADVANTAGES AND DISADVANTAGES OF
MASS SPECTROMETRY
5.1 Advantages
i) High Sensitivity– ability to detect very small
amounts)
ii) High Selectivity– Ability to tell molecules
apart in a mixture
iii) High Time Resolution
iv) Low Cost
v) small sample size
vi) fast
vii) differentiates isotopes
viii) can be combined with GC and LC to run
mixtures, or can also be run in tandem for
proteins or peptides etc.
5.2 Disadvantages
i) doesn't directly give structural information
(although we can often figure it out)
ii) Needs pure compounds.
iii) Difficult with non-volatile compounds.
VI. APPLICATIONS OF MASS SPECTROMETRY
Mass Spectrometry consists of determining
the isotopic composition of elements in a molecule,
identifying unknown compounds, and determining
the structure of a compound by understanding its
fragmentation. Other uses include evaluating the
amount of a compound in a sample or studying the
fundamentals of gas phase in chemistry MS is now in
very common use in analytical laboratories that study
chemical, physical, or biological properties of a great
variety of compounds.
6.1 Isotope ratio MS: isotope dating and tracking
Mass spectrometry is also used to determine
the composition of elements in a sample. Variations
in mass among isotopes of an element are very less,
and the less abundant isotopes of an element are
typically very rare, so a very sensitive instrument is
requirement of Mass Spectrometry. These
instruments, sometimes allude to as isotope ratio
mass spectrometers (IR-MS), it uses a single magnet
to bend a beam of ionized particles towards a series
of Faraday cups which convert particle impacts
to electric current. Isotope ratios are important
markers of a variety of processes. The age of
materials for example as in carbon dating is
determined by isotopes ratios. Marking with isotopes
which are stable are also used for protein
quantification.
6.2 Protein characterization
Mass spectrometry is an important method for the
characterization and sequence prediction of proteins.
Fundamentally, the two methods for ionization of
whole proteins are electrospray ionization (ESI) and
matrix-assisted laser desorption/
ionization (MALDI). For characterizing Proteins two
approaches are used. In the first, intact proteins are
ionized by either of the two techniques described, and
then introduced to a mass analyzer. This approach of
Protein analysis is referred to as "top-down" strategy.
In the other approach, proteins are enzymatically
digested into smaller peptides using proteases.
The mass analyzer gets introduced from the
collection of peptide. When the characteristic pattern
of peptides is used for the identification of the protein
the method is called peptide mass
fingerprinting (PMF), if the identification is
performed using the sequence data determined
Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23
www.ijera.com 22 | P a g e
in tandem MS analysis, called as de novo sequencing.
This is called as ―bottom-up approach.
6.3 Space exploration
Mass Spectrometry has also reached to other
planets and moons, as a standard method for analysis.
In early 2005 the Cassini–Huygens mission delivered
a specialized GC-MS instrument aboard. It is
the Huygens probe through the atmosphere of the
largest moon of the planet Saturn, Titan, This
instrument analyzed atmospheric samples, once the
probe had landed, along its descent ,trajectory and
was able to vaporize and analyze samples of Titan's
frozen, hydrocarbon covered surface. These
measurements compare the abundance of isotope(s)
of each particle comparatively to earth's natural
abundance.[17] Also on board the Cassini–
Huygens spacecraft is an ion and neutral mass
spectrometer which has been taking measurements of
Titan's atmospheric composition .A Thermal and
Evolved Gas Analyzer mass spectrometer was carried
by the Mars Phoenix Lander launched in 2007[18].
Mass spectrometers are also widely used in space
missions to measure the composition of plasmas.
6.4 Respired gas monitor
Mass spectrometers were used in hospitals for
respiratory gas analysis beginning around 1975
through the end of the century. Found mostly in
the operating room, a part of a complex system,
which consist of respired gas samples from patients
undergoing anaesthesia were drawn into the
instrument through a valve mechanism. The
instrument is designed to sequentially connect up to
32 rooms to the mass spectrometer. All operations of
the system are directed by the computers. The
anaesthesiologist uses the data collected from the
mass spectrometer was delivered to the individual
rooms.
VII. CONCLUSION
The data originated in the field of
Proteomics is very large .It‘s the need of the day that
these huge amount of data need to be managed and
analysed. Various tools and techniques are available
for the protein prediction. Mass Spectrometry is an
analytical method for data analysis which originated
in large amount. We have seen the various aspects of
Mass Spectrometry. The mass Spectrometry tools and
databases available. Also we have seen the
advantages and disadvantages of the method and
various application of it.
REFERENCES
[1] Cheng Lu, Introduction to mass
Spectrometry.
[2] Jimmy K. Eng, Ashley L. McCormack, and
John R. Yates, III (1994). "An Approach to
Correlate Tandem Mass Spectral Data of
Peptides with Amino Acid Sequences in a
Protein Database". J Am Soc Mass
Spectrom 5 (11): 976–989
[3] Perkins, David N.; Pappin, Darryl J. C.;
Creasy, David M.; Cottrell, John S. (1999).
"Probability-based protein identification by
searching sequence databases using mass
spectrometry data".Electrophoresis 20 (18):
3551–67
[4] Liang, C; Smith, JC; Hendrie, Christopher
(2003). A Comparative Study of Peptide
Sequencing Software Tools for MS/MS.
American Society for Mass Spectrometry.
[5] Bartels, Christian (31 May 1990). "Fast
algorithm for peptide sequencing by mass
spectroscopy". Biological Mass
Spectrometry 19 (6): 363–368.
[6] Savitski, Mikhail M.; Nielsen, Michael L.;
Kjeldsen, Frank; Zubarev, Roman A. (2005).
"Proteomics-Grade de Novo Sequencing
Approach". Journal of Proteome
Research 4 (6): 2348–54
[7] Ma, Bin; Zhang, Kaizhong; Hendrie,
Christopher; Liang, Chengzhi; Li, Ming;
Doherty-Kirby, Amanda; Lajoie, Gilles
(2003). "PEAKS: powerful software for
peptidede novo sequencing by tandem mass
spectrometry". Rapid Communications in
Mass Spectrometry 17 (20): 2337–42.
[8] "Lutefisk - de novo MS/MS Sequencing". J.
Alex Taylor. Retrieved 2009-11-16.
[9] Junker, J.; Bielow, C.; Bertsch, A.; Sturm,
M.; Reinert, K.; Kohlbacher, O. (2012).
"TOPPAS: A Graphical Workflow Editor for
the Analysis of High-Throughput Proteomics
Data". Journal of Proteome Research 11 (7):
3914–3920.
[10] Cox, J.; Mann, M. (Dec 2008). "MaxQuant
enables high peptide identification rates,
individualized p.p.b.-range mass accuracies
and proteome-wide protein
quantification". Nat Biotechnol 26 (12):
1367–72.
[11] Rusconi, F. (2009). "massXpert 2: a cross-
platform software environment for polymer
chemistry modelling and simulation/analysis
of mass spectrometric data". Bioinformatics
25 (20): 2741–2
[12] Catherine P. Riley*, Erik S. Gough, Jing He,
Shrinivas S. Jandhyala, Brad Kennedy, Seza
Orcun, Mourad Ouzzani, Charles Buck, Ali
M. Roumani and Xiang Zhang, ― The
Proteome Discovery Pipeline – A Data
Analysis Pipeline for Mass Spectrometry-
Based Differential Proteomics Discovery‖ :
The Open Proteomics Journal, 2010, vol 3,
pg[8-19]
[13] L. Gatto and K. S. Lilley. MSnbase _ an
R/Bioconductor package for isobaric tagged
mass spectrometry data visualization,
processing and quantitation. Bioinformatics,
28(2):288_9, Jan 2012.
Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23
www.ijera.com 23 | P a g e
[14] R. C. Gentleman, V. J. Carey, D. M. Bates,
B. Bolstad, M. Dettling, S. Dudoit, B. Ellis,
L. Gautier, Y. Ge, J. Gentry, K. Hornik, T.
Hothorn, W. Huber, S. Iacus, R. Irizarry, F.
Leisch, C. Li, M. Maechler, A. J. Rossini, G.
Sawitzki, C. Smith, G. Smyth, L. Tierney, J.
Y. H. Yang, and J. Zhang. Bioconductor:
open software development for computational
biology and bioinformatics. Genome Biol,
5(10):_ 80, 2004.
[22] Tureček, František; McLafferty, Fred W.
(1993). Interpretation of mass spectra.
Sausalito: University Science Books.
[16] Mistrik, R. (2004). A New Concept for the
Interpretation of Mass Spectra Based on a
Combination of a Fragmentation Mechanism
Database and a Computer Expert System. in
Ashcroft, A.E., Brenton, G., Monaghan, J.J.
(Eds.), Advances in Mass Spectrometry,
Elsevier, Amsterdam, vol. 16, pp. 821
[17] Petrie, S. and Bohme, D. K. (2007). "Ions in
space". Mass Spectrometry Reviews 26 (2):
258–
280. doi:10.1002/mas.20114. PMID 1711134
6.
[18] Hoffman, J; Chaney, R; Hammack, H (2008).
"Phoenix Mars Mission—the Thermal
Evolved Gas Analyzer". Journal of the
American Society for Mass
Spectrometry 19 (10): 1377–83.

Contenu connexe

Tendances

Improving the performance of Intrusion detection systems
Improving the performance of Intrusion detection systemsImproving the performance of Intrusion detection systems
Improving the performance of Intrusion detection systemsyasmen essam
 
Cheminformatics in drug design
Cheminformatics in drug designCheminformatics in drug design
Cheminformatics in drug designSurmil Shah
 
Genome structure prediction a review over soft computing techniques
Genome structure prediction a review over soft computing techniquesGenome structure prediction a review over soft computing techniques
Genome structure prediction a review over soft computing techniqueseSAT Journals
 
Dependable fire detection_system_with_multifunctio
Dependable fire detection_system_with_multifunctioDependable fire detection_system_with_multifunctio
Dependable fire detection_system_with_multifunctioBharath Kumar
 
Analysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOTAnalysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOTIJERA Editor
 
Minkowski Distance based Feature Selection Algorithm for Effective Intrusion ...
Minkowski Distance based Feature Selection Algorithm for Effective Intrusion ...Minkowski Distance based Feature Selection Algorithm for Effective Intrusion ...
Minkowski Distance based Feature Selection Algorithm for Effective Intrusion ...IJMER
 
Elevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringElevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringeSAT Publishing House
 
Elevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringElevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringeSAT Journals
 
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MININGANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MININGijbbjournal
 
NeuroPG - Open source software for optical pattern generation and data acquis...
NeuroPG - Open source software for optical pattern generation and data acquis...NeuroPG - Open source software for optical pattern generation and data acquis...
NeuroPG - Open source software for optical pattern generation and data acquis...Daniel Murphy
 
USING ARTIFICIAL NEURAL NETWORK IN DIAGNOSIS OF THYROID DISEASE: A CASE STUDY
USING ARTIFICIAL NEURAL NETWORK IN DIAGNOSIS OF THYROID DISEASE: A CASE STUDYUSING ARTIFICIAL NEURAL NETWORK IN DIAGNOSIS OF THYROID DISEASE: A CASE STUDY
USING ARTIFICIAL NEURAL NETWORK IN DIAGNOSIS OF THYROID DISEASE: A CASE STUDYijcsa
 
EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795Michael Williams
 
NetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonNetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonAlexander Pico
 
USE OF NETWORK FORENSIC MECHANISMS TO FORMULATE NETWORK SECURITY
USE OF NETWORK FORENSIC MECHANISMS TO FORMULATE NETWORK SECURITYUSE OF NETWORK FORENSIC MECHANISMS TO FORMULATE NETWORK SECURITY
USE OF NETWORK FORENSIC MECHANISMS TO FORMULATE NETWORK SECURITYIJMIT JOURNAL
 
Srge most important publications 2020
Srge most important  publications 2020Srge most important  publications 2020
Srge most important publications 2020Aboul Ella Hassanien
 
Software Defect Prediction Using Radial Basis and Probabilistic Neural Networks
Software Defect Prediction Using Radial Basis and Probabilistic Neural NetworksSoftware Defect Prediction Using Radial Basis and Probabilistic Neural Networks
Software Defect Prediction Using Radial Basis and Probabilistic Neural NetworksEditor IJCATR
 
Chemo informatics scope and applications
Chemo informatics scope and applicationsChemo informatics scope and applications
Chemo informatics scope and applicationsshyam I
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designingW Roseybala Devi
 
Network embedding in biomedical data science
Network embedding in biomedical data scienceNetwork embedding in biomedical data science
Network embedding in biomedical data scienceArindam Ghosh
 

Tendances (20)

Improving the performance of Intrusion detection systems
Improving the performance of Intrusion detection systemsImproving the performance of Intrusion detection systems
Improving the performance of Intrusion detection systems
 
Cheminformatics in drug design
Cheminformatics in drug designCheminformatics in drug design
Cheminformatics in drug design
 
CLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORK
CLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORKCLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORK
CLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORK
 
Genome structure prediction a review over soft computing techniques
Genome structure prediction a review over soft computing techniquesGenome structure prediction a review over soft computing techniques
Genome structure prediction a review over soft computing techniques
 
Dependable fire detection_system_with_multifunctio
Dependable fire detection_system_with_multifunctioDependable fire detection_system_with_multifunctio
Dependable fire detection_system_with_multifunctio
 
Analysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOTAnalysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOT
 
Minkowski Distance based Feature Selection Algorithm for Effective Intrusion ...
Minkowski Distance based Feature Selection Algorithm for Effective Intrusion ...Minkowski Distance based Feature Selection Algorithm for Effective Intrusion ...
Minkowski Distance based Feature Selection Algorithm for Effective Intrusion ...
 
Elevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringElevating forensic investigation system for file clustering
Elevating forensic investigation system for file clustering
 
Elevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringElevating forensic investigation system for file clustering
Elevating forensic investigation system for file clustering
 
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MININGANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
 
NeuroPG - Open source software for optical pattern generation and data acquis...
NeuroPG - Open source software for optical pattern generation and data acquis...NeuroPG - Open source software for optical pattern generation and data acquis...
NeuroPG - Open source software for optical pattern generation and data acquis...
 
USING ARTIFICIAL NEURAL NETWORK IN DIAGNOSIS OF THYROID DISEASE: A CASE STUDY
USING ARTIFICIAL NEURAL NETWORK IN DIAGNOSIS OF THYROID DISEASE: A CASE STUDYUSING ARTIFICIAL NEURAL NETWORK IN DIAGNOSIS OF THYROID DISEASE: A CASE STUDY
USING ARTIFICIAL NEURAL NETWORK IN DIAGNOSIS OF THYROID DISEASE: A CASE STUDY
 
EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795
 
NetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonNetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina Kutmon
 
USE OF NETWORK FORENSIC MECHANISMS TO FORMULATE NETWORK SECURITY
USE OF NETWORK FORENSIC MECHANISMS TO FORMULATE NETWORK SECURITYUSE OF NETWORK FORENSIC MECHANISMS TO FORMULATE NETWORK SECURITY
USE OF NETWORK FORENSIC MECHANISMS TO FORMULATE NETWORK SECURITY
 
Srge most important publications 2020
Srge most important  publications 2020Srge most important  publications 2020
Srge most important publications 2020
 
Software Defect Prediction Using Radial Basis and Probabilistic Neural Networks
Software Defect Prediction Using Radial Basis and Probabilistic Neural NetworksSoftware Defect Prediction Using Radial Basis and Probabilistic Neural Networks
Software Defect Prediction Using Radial Basis and Probabilistic Neural Networks
 
Chemo informatics scope and applications
Chemo informatics scope and applicationsChemo informatics scope and applications
Chemo informatics scope and applications
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designing
 
Network embedding in biomedical data science
Network embedding in biomedical data scienceNetwork embedding in biomedical data science
Network embedding in biomedical data science
 

En vedette

Study on the Influence of Ice and Snow on Pavement Evaluation
Study on the Influence of Ice and Snow on Pavement EvaluationStudy on the Influence of Ice and Snow on Pavement Evaluation
Study on the Influence of Ice and Snow on Pavement EvaluationIJERA Editor
 
Investigation on Growth and Optical Properties of LVCC Single Crystals
Investigation on Growth and Optical Properties of LVCC Single CrystalsInvestigation on Growth and Optical Properties of LVCC Single Crystals
Investigation on Growth and Optical Properties of LVCC Single CrystalsIJERA Editor
 
The Mechanical Behavior Of A Nylon Seat Belt Exposed To Cyclical Loads: A Num...
The Mechanical Behavior Of A Nylon Seat Belt Exposed To Cyclical Loads: A Num...The Mechanical Behavior Of A Nylon Seat Belt Exposed To Cyclical Loads: A Num...
The Mechanical Behavior Of A Nylon Seat Belt Exposed To Cyclical Loads: A Num...IJERA Editor
 
A survey on RBF Neural Network for Intrusion Detection System
A survey on RBF Neural Network for Intrusion Detection SystemA survey on RBF Neural Network for Intrusion Detection System
A survey on RBF Neural Network for Intrusion Detection SystemIJERA Editor
 
Game Design & Business Models
Game Design & Business ModelsGame Design & Business Models
Game Design & Business ModelsHeroEngine
 

En vedette (20)

Cl4201593597
Cl4201593597Cl4201593597
Cl4201593597
 
Be4301314318
Be4301314318Be4301314318
Be4301314318
 
Dg4201725728
Dg4201725728Dg4201725728
Dg4201725728
 
Study on the Influence of Ice and Snow on Pavement Evaluation
Study on the Influence of Ice and Snow on Pavement EvaluationStudy on the Influence of Ice and Snow on Pavement Evaluation
Study on the Influence of Ice and Snow on Pavement Evaluation
 
Cu4201638643
Cu4201638643Cu4201638643
Cu4201638643
 
Investigation on Growth and Optical Properties of LVCC Single Crystals
Investigation on Growth and Optical Properties of LVCC Single CrystalsInvestigation on Growth and Optical Properties of LVCC Single Crystals
Investigation on Growth and Optical Properties of LVCC Single Crystals
 
A42040104
A42040104A42040104
A42040104
 
The Mechanical Behavior Of A Nylon Seat Belt Exposed To Cyclical Loads: A Num...
The Mechanical Behavior Of A Nylon Seat Belt Exposed To Cyclical Loads: A Num...The Mechanical Behavior Of A Nylon Seat Belt Exposed To Cyclical Loads: A Num...
The Mechanical Behavior Of A Nylon Seat Belt Exposed To Cyclical Loads: A Num...
 
A045070104
A045070104A045070104
A045070104
 
K43015659
K43015659K43015659
K43015659
 
A42020106
A42020106A42020106
A42020106
 
J42046469
J42046469J42046469
J42046469
 
A survey on RBF Neural Network for Intrusion Detection System
A survey on RBF Neural Network for Intrusion Detection SystemA survey on RBF Neural Network for Intrusion Detection System
A survey on RBF Neural Network for Intrusion Detection System
 
De4201715719
De4201715719De4201715719
De4201715719
 
N43017277
N43017277N43017277
N43017277
 
E43012628
E43012628E43012628
E43012628
 
Q43019295
Q43019295Q43019295
Q43019295
 
Game Design & Business Models
Game Design & Business ModelsGame Design & Business Models
Game Design & Business Models
 
M44086974
M44086974M44086974
M44086974
 
Aj4301193196
Aj4301193196Aj4301193196
Aj4301193196
 

Similaire à C044041723

Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuKAUSHAL SAHU
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformaticscontactsoorya
 
Principle Component Analysis for Classification of the Quality of Aromatic Rice
Principle Component Analysis for Classification of the Quality of Aromatic RicePrinciple Component Analysis for Classification of the Quality of Aromatic Rice
Principle Component Analysis for Classification of the Quality of Aromatic RiceIJCSIS Research Publications
 
An evaluation of machine learning algorithms coupled to an electronic olfact...
An evaluation of machine learning algorithms coupled to an  electronic olfact...An evaluation of machine learning algorithms coupled to an  electronic olfact...
An evaluation of machine learning algorithms coupled to an electronic olfact...IJECEIAES
 
Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...ijdms
 
Comparative study of artificial neural network based classification for liver...
Comparative study of artificial neural network based classification for liver...Comparative study of artificial neural network based classification for liver...
Comparative study of artificial neural network based classification for liver...Alexander Decker
 
Quantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserQuantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserNeil Swainston
 
GASCAN: A Novel Database for Gastric Cancer Genes and Primers
GASCAN: A Novel Database for Gastric Cancer Genes and PrimersGASCAN: A Novel Database for Gastric Cancer Genes and Primers
GASCAN: A Novel Database for Gastric Cancer Genes and Primersijdmtaiir
 
IRJET - A Framework for Predicting Drug Effectiveness in Human Body
IRJET - A Framework for Predicting Drug Effectiveness in Human BodyIRJET - A Framework for Predicting Drug Effectiveness in Human Body
IRJET - A Framework for Predicting Drug Effectiveness in Human BodyIRJET Journal
 
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...gerogepatton
 
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...ijaia
 
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...gerogepatton
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsAyeshaYousaf20
 
Protein structure prediction by means
Protein structure prediction by meansProtein structure prediction by means
Protein structure prediction by meansijaia
 

Similaire à C044041723 (20)

Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
 
1207.2600
1207.26001207.2600
1207.2600
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformatics
 
Bioinformatics ppt
Bioinformatics pptBioinformatics ppt
Bioinformatics ppt
 
Principle Component Analysis for Classification of the Quality of Aromatic Rice
Principle Component Analysis for Classification of the Quality of Aromatic RicePrinciple Component Analysis for Classification of the Quality of Aromatic Rice
Principle Component Analysis for Classification of the Quality of Aromatic Rice
 
woot2
woot2woot2
woot2
 
An evaluation of machine learning algorithms coupled to an electronic olfact...
An evaluation of machine learning algorithms coupled to an  electronic olfact...An evaluation of machine learning algorithms coupled to an  electronic olfact...
An evaluation of machine learning algorithms coupled to an electronic olfact...
 
B.3.5
B.3.5B.3.5
B.3.5
 
Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...
 
Comparative study of artificial neural network based classification for liver...
Comparative study of artificial neural network based classification for liver...Comparative study of artificial neural network based classification for liver...
Comparative study of artificial neural network based classification for liver...
 
Quantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserQuantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To Browser
 
GASCAN: A Novel Database for Gastric Cancer Genes and Primers
GASCAN: A Novel Database for Gastric Cancer Genes and PrimersGASCAN: A Novel Database for Gastric Cancer Genes and Primers
GASCAN: A Novel Database for Gastric Cancer Genes and Primers
 
Disease Prediction Using Machine Learning
Disease Prediction Using Machine LearningDisease Prediction Using Machine Learning
Disease Prediction Using Machine Learning
 
D1803012022
D1803012022D1803012022
D1803012022
 
IRJET - A Framework for Predicting Drug Effectiveness in Human Body
IRJET - A Framework for Predicting Drug Effectiveness in Human BodyIRJET - A Framework for Predicting Drug Effectiveness in Human Body
IRJET - A Framework for Predicting Drug Effectiveness in Human Body
 
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
 
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
 
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomics
 
Protein structure prediction by means
Protein structure prediction by meansProtein structure prediction by means
Protein structure prediction by means
 

Dernier

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 

Dernier (20)

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 

C044041723

  • 1. Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23 www.ijera.com 17 | P a g e A Review on Mass Spectrometry: Technique and Tools Ms. Ashwini Yerlekar*, Dr. M.M. Kshirsagar ** *(Department of Computer Technology, Yeshwantrao Chavan college of Engineering, Nagpur, India) ** (Department of Computer Technology, Yeshwantrao Chavan college of Engineering, Nagpur, India) ABSTRACT Protein structure prediction has gain important in area of life sciences, because of its complex structure. The protein-protein interaction is necessary to study the behavior of protein in a specific environment, and study molecular relationship in living systems. Therefore, large scale proteomics technologies are required to measure physical connection of proteins in living organisms. Mass Spectrometry uses the technique to measure mass-to- charge ratio of ion. It‘s an evolving technique for characterization of proteins. A Mass Spectrometer can be more sensitive and specific, also complement with other LC detectors. Liquid Chromatography, unlike gas chromatography is a separation technique which helps to separate wide range of organic compounds from small molecular metabolites to peptides and proteins. This paper addresses the study of data analysis using mass Spectrometry. It also includes the study of various methods of Mass Spectrometry data analysis, the tools and various applications of Mass Spectrometry.This review briefs on Mass Spectrometry technique, its application, usage, and tools used by Mass Spectrometry. Keywords – complex, LC (Liquid chromatography) detectors, Mass Spectrometry, mass-to-charge ratio, Protein-protein interaction. I. INTRODUCTION Bioinformatics is a novel approach in recent investigations on sequence analysis and structure prediction of proteins. The central challenge for Bioinformatics is not only to store and retrieve data but also to develop tools for data analysis. A number of tools are also available for protein characterization, structure identification, and visualization. Mass Spectrometer is a quantitative tool used to measure the mass-to-charge ratio of ions. An ion is atom or group of atoms which have gained or lost one or more electrons making them negatively or positively charged. The two primary methods for ionization of whole proteins are electrospray ionization (ESI-which turns the sample proteins into ions) and matrix-assisted laser desorption/ionization (MALDI- It uses laser to ionize the sample proteins and then push the proteins into the analyzer to produce a Mass Spectrum.). As it is an essential tool in proteomics, it is necessary to understand the results and the principles of Mass Spectrometer. The various steps in mass spectrometry can be performed on a single sample by some mass spectrometer. They prefer the modularity method i.e.to generate the spectrum for a single ion, then fragmenting the ion and getting the spectrum. So it can work with complex molecules piece by piece until its structure is determined. A typical Mass Spectrometer consists of three parts: a mass analyzer, a detector and an ion source. The ion source produces ions from the sample. The Mass Analyzer separates ions with different mass-to-charge ratios, these different ions are detected by detector. Finally, the mass spectrum is generated after all the data have been collected. Fig.1 is a scheme graph of the Mass Spectrometer [1]. FIG1: SCHEME GRAPH OF MASS SPECTROMETRY II. Various tools available for Protein Prediction using Mass Spectrometry Peptide identification algorithms fall into two broad classes: database search and de novo search. The first search takes place against a database containing all amino acid whereas the latter deduce peptide sequences without knowledge of genomic data. At present, database search is more reliable and considered to produce higher quality results for most uses. With advancement in RESEARCH ARTICLE OPEN ACCESS SAMPLE ION SOURCE MASS ANALYZER DETECTOR DATA ANALYSIS
  • 2. Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23 www.ijera.com 18 | P a g e instrument precision, the de novo search may become increasingly captivating. 2.1 Database Search Algorithms 2.1.1. SEQUEST SEQUEST [2] is a patented tandem mass spectrometry data analysis program. It was developed by John Yates and Jimmy Eng in 1994. The algorithm used by this program is covered by several US and European software copyrights. SEQUEST identifies collections of tandem mass spectra to peptide sequences that have been evolved from databases of protein sequences. SEQUEST, like many engines, learns each tandem mass spectrum one by one. The software evaluates protein sequences from a database to calculate the list of peptides. The peptide's intact mass is known from the mass spectrum, and SEQUEST uses this information to find the group of candidate peptides sequences that could be compared to the spectrum by including only those which are equally near the mass of the observed peptide ion. 2.1.2 Mascot Mascot [3] is a proprietary identification program readily available from Matrix Science. Instead of using the cross correlation method, it carries out mass spectrometry data analysis through statistical comparisons of matches between projected and observed peptide fragments. In addition to the identification features, support for peptide quantization methods is provided, as per version 2.2. 2.1.3. PEAKS DB PEAKS DB [4] is a proprietary database search engine, run in cohesion with de novo sequencing to automatically formalize the search results, allowing for a higher number of searched sequences for the available false search rate. In addition to providing an independent database search, results can be integrated as part of the software‘s multi-engine (Sequest, Mascot, X!Tandem, OMSSA, PEAKS DB) consensus reporting tool, in Chorus. The tool also provides a list of sequences identified exclusively by de novo sequencing. 2.2 De novo Sequencing Algorithms DeNovoX performs de novo sequencing [5] on CID spectra obtained with ion trap mass spectrometers. The software was launched by Thermo Fisher Scientific in 2002, was the first commercial software for low-resolution data. DeNovoX presents complete and/or partial peptide sequences. Each output sequence comes with the probable symptoms of how likely it is for the sequence to be obtained. Probabilistic results are used while implementing the software. 2.2.1 DeNoS DeNoS [6] is part of the software tool Protein matching Analysis Software (PAS) which is part of the software package; Medicwave Bioinformatics Suite (MBS).DeNoS performs sequencing of peptides by gathering the data from CAD and ECD spectra with high reliability. Hierarchal algorithm is used by DeNoS. In the initial step fragments that are confirmed in both CAD and ECD (so called Golden Complementary Pairs) along with fragments that are only found in CAD (so called Complementary Pairs) are used. After that, one-by-one fragments with less assurance are used. Ultimately, if the peptide is still not fully sequenced, the software uses a crucial application from the graph theory to sequence the left out peptide parts with "unreliable" fragments. 2.2.2 PEAKS PEAKS[7] de novo automatically provides a confidence scores on individual amino acid assignments, a complete sequence for individual peptide, greater knowledge for scientifically sensitive, in-depth investigations simple reporting and greater knowledge for scientifically sensitive, in- depth investigations for high-throughput analysis,. A de novo, manually assisted mode, is present for users who wish to idealize their results further. Automated de novo sequencing on full LC run processed data faster than 1 spectrum per second. 2.2.3 Lutefisk For the de novo interpretation of peptide spectrum Lutefisk software is available [8]. 2.3 MS/MS peptide quantification 2.3.1 OpenMS / TOPP OpenMS [9] is a software C++ library for LC- MS/MS data management and analysis; it offers an infrastructure for the development of softwares related to mass spectrometry. It is free software available under the 2-clause BSD license (previously under the LGPL).TOPP - The Opens Proteomics Pipeline - is a set of small applications that can be sequenced to create analysis pipelines tailored for a particular problem. TOPP is developed using the structures and algorithms provided by OpenMS. TOPPView is viewer software that visualization of mass spectrometric data on MS1 and MS2 level as well as in 3D; additionally it also displays chromatographic data from SRM experiments (in version 1.10). OpenMS and TOPP is a collaboration project of the Algorithmic Bioinformatics group at the Free University of Berlin and the Applied Bioinformatics group at Tubingen University. 2.3.2. MaxQuant MaxQuant is a proprietary software for numerical proteomics developed by Jorgen Cox and others at the Max Planck Institute of Biochemistry in Martinsried, Germany.[10] The software is released
  • 3. Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23 www.ijera.com 19 | P a g e as freeware under the "MaxQuant‖ Freeware Software License Agreement" and written in C#. The software provides its own search engine called Andromeda, and also accepts the analysis of label free and SILAC based proteomics experiments. 2.4 Other Softwares 2.4.1 AnalyzerPro AnalyzerPro is proprietary software by SpectralWorks Limited. It is a software application for processing mass spectrometry data and is vendor independent. Using proprietary algorithms, AnalyzerPro can process data by both GC-MS and LC-MS using both qualitative and quantitative processing. On large scale, it is used for metabolomics data processing using MatrixAnalyzer for the comparison of multiple data sets. 2.4.2 Analyst Analyst is licensed software by AB Scitex, a division of The Danaher Corporation. 2.4.3 RemoteAnalyzer RemoteAnalyzer is proprietary software by Spectral Works Limited. It is a vendor independent 'Open Access' client/server based solution to provide a walk-up and use LC-MS and GC-MS data system. Data processing support and Instrument control for multiple vendors' hardware is provided. massXpert [11] is another tools used in Mass spectrometry. 2.4.4 ESIprot 1.0 / ESIprot Online Electrospray ionization (ESI) mass spectrometry (MS) devices with relatively low resolution are widely used for metabolomics and proteomics. Ion trap devices like the Agilent MSD/XCT ultra or the Bruker HCT ultra are typical representatives. The ESI-MS data of most of the naturally occurring proteins can be computed, the availability of data evaluation software for such ESI protein spectra with low resolution is quite moderate. ESIprot 1.0 allows the charge state determination and molecular weight calculation for low resolution electrospray ionization (ESI) mass spectrometry (MS) data of proteins. Whereas ESIprot 1.0 is written in Python (GPL v3 License), the online freely available PHP web application is ESIprot. III. MASS SPECTROMETRY DATA ANALYSIS Key to making this paradigm shift possible has been the development of bioinformatics software that allows one to correlate bimolecular MS data directly with protein sequence databases [12]. Two kinds of MS bioinformatics software exist: (1) Software for identifying proteins from peptide mass fingerprints and (2) Software for identifying peptides or proteins directly from uninterrupted tandem (MS/MS) mass spectra. Peptide mass fingerprinting was developed in the early 1990s to identify proteins from proteolysis fragments (Pappies al., 1993; Yates al., 1993; MANNet al., 1993). Specifically, if a pure protein is digested with a protease that cuts at predictable locations (say trypsin), the result will be a peptide mixture containing a unique collection of between 10–50 different peptides, each with a different or characteristic mass. Running this mixture on a modern ESI or MALDI instrument will lead to an MS spectrum with dozens of peaks corresponding to the masses of each of these peptides. Because no two proteins are likely to share the same set of constituent peptides, this mixture is called a peptide mass fingerprint. By comparing the observed masses of the mixture with predicted peptide masses derived from all known protein sequences it is theoretically possible to identify the protein of interest (providing the protein has been previously sequenced). Specifically, in the course of performing a mass fingerprint search, database sequences are theoretically ―cleaved‖ using known protease cutting rules, the resulting hypothetical peptide masses are calculated and the whole protein is ranked according to the number of exact (or near exact) cleavage fragment matches made to the observed set of peptide masses. The sequence with the highest number and quality of matches is usually selected as the most attractive candidate. Most of the mass finger printing softwares are freely available while some are sold as commercial products. When peptide masses are read from the protein databases, it must be taking into notice that mono-isotopic masses from500-3000mz-1 is accurately readable or restricting the size of database is also a wise decision. Because of the inclusion of noise the search for peptide mass fingerprinting is not an easy task. The common complications included in it are as follows [4]: (1) disappearance of key peaks due to nonspecific ion suppression, (2) appearance of extra peaks from protease autolysis, (3) appearance of peaks from post-translational or artifactual chemical modification, (4) appearance of peaks from non-specific cleavage, or from contaminating proteases, and (5) appearance of peaks from contaminating impurities, contaminating homologs, or splice variants. Due to such complications the reliability and performance of search gets reduced. The secondary searches are called ―orphan‖ masses are made readily available. It also allows comparing the results with other search algorithms as SEARCH. But the best way is to apply various methods of mass fingerprinting and combining the result, called ‗Single Averaging‘. The programs works typically by scanning the databases and probability of a protein
  • 4. Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23 www.ijera.com 20 | P a g e matching increases with maximum matched sequence. 3.1 Data representations Mass spectrometry produces different types of data. The most common visualization technique is the mass spectrum i.e. Graphical Representation. Various types of mass spectrometry data are best represented as a mass chromatogram. Types of chromatograms include total ion current (TIC), selected ion monitoring (SIM), and selected reaction monitoring (SRM), among many others. Other types of mass spectrometry data are well represented as a three-dimensional map. In this form, intensity the y- axis, the mass-to-charge, m/z is on the x-axis, and an additional parameter, is plotted on the z-axis. 3.2 Data analysis Mass spectrometry data analysis particular to the type of experiment producing the data. Fundamental divisions of data are general in understanding any data. Many mass spectrometers work in either electron mode or positive ion mode. Sometimes it is very necessary to know whether the obtained ions are negatively or positively charged. It is often essential in determining the neutral mass but it also symbolizes something about the nature of the molecules. Various ion source results in arrays of fragments obtained from the fundamental molecules. The ionization source creates many fragments and mostly single-charged (1- ) radicals (odd number of electrons), whereas non-radical quasimolecular ions, an electrospray source usually produces that are frequently multiply charged. Tandem mass spectrometry produces fragment ions post-source and can change the kind of data obtained by an experiment. A strongly prepared biological sample will probably contain a specific amount of salt, adducts get formed with the analyte molecules in various analyses Knowledge of the initial sample can provide insight into the component molecules of the sample and their peptides(protein in form of fragments). A sample from this synthesis/manufacturing process will probably consist of impurities chemically related to the target component. Results can also depend heavily on sample preparation and how it was introduced. Much of the energetics of the desorption/ionization event is controlled by the matrix rather than the laser power. Mass spectrometry can measure sample purity, molecular structure and molar mass. Each of these queries requires a different experimental procedure; so adequate definition of the experimental goal is an early requirement for collecting the proper data and evolving the data. 3.3 Interpretation of mass spectra Since the specific structure or peptide sequence of a molecule is decrypted through the set of fragment of masses, the combined usage of various techniques helps in the interpretation of mass spectra. The first way of identifying an unknown compound is to compare its experimental mass spectrum against databases of mass spectrum. The software assisted interpretation or manual interpretation [15] of mass spectra must be performed if match in spectral database in not found. Computer simulation of ionization and fragmentation processes occurring in mass spectrometer is the fundamental tool for assigning structure or peptide sequence to a molecule. Such simulation is often supported by a fragmentation library [16] that contains published patterns of known decomposition reactions. Software developed on this idea has been developed for both small molecules and proteins. 3.4 Analysis of mass spectra It can also be spectra with accurate mass. A mass-to- charge ratio value (m/z) with only integer precision can represent an immense number of theoretically possible ion structures; so specific mass figures mainly reduce the number of candidate in molecular formulas. All molecular formulas that theoretically fit a given mass with specified tolerance can be calculated by the computer algorithm called as formula generator. The precursor ion fingerprinting, technique used for structure prediction identifies individual pieces of structural information by carrying a search of the tandem spectra of the molecule under investigation against a library of the product-ion spectra of structurally characterized precursor ions. Mass spectrometry can measure, sample purity, molecular structure and molar mass. These queries require a different experimental procedure; so an adequate definition of the experimental goal is an early requirement for collecting the proper data and successfully interpreting it. IV. R PACKAGES FOR MASS SPECTROMETRY DATA ANALYSIS R language is generally used by the life sciences people for evaluating the results obtained. It is useful for analysing the data. R packages in combination with Bioconductor packages also help to analyse the proteomics data. The various R packages available are as follows: 4.1 The mzR Package The mzR package [19] provides a unified interface to various mass spectrometry open formats. This code chunk, taken mainly from the openMSfile documentation illustrated how to open a connection to the raw data file. mzR package is also applicable to an mzXML, mzData or netCDF file. mzR is used by other packages, like MSnbase [14], TargetSearch
  • 5. Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23 www.ijera.com 21 | P a g e [15] and xcms that provide a higher level abstraction to the data. 4.2 MSnbase MSnbase provides base functions and classes for MS-based proteomics that allow facile data and meta- data processing, manipulation and plotting [13] [14]. 4.3 The MALDIquant package MALDIquant gives a combined analysis pipeline for MALDI-TOF and other mass spectrometry data. Differentiating features include baseline subtraction methods such as SNIP TopHat or TopHat, peak alignment using warping functions used for handling of replicated measurements and also allowing spectra with different resolutions. The MALDIquant package contains of two main R packages:  MALDIquant consist of the base functionality for accessing mass spectrometry data.  MALDIquantForeign contains interface for importing and exporting data. 4.4 The isobar package The isobar package [15] provides methods for the statistical analysis of isobarically tagged MS2 experiments. 4.5 The synapter package The synapter package comes with a detailed vignette that describes how to prepare the MSE data and then process it in R. Several interfaces are available provided the user with easy batch processing capabilities, or a graphical user interface and maximum control. The conversion into MSnSet instances and filter and combination thereof as well as statistical analysis are also described. 4.6 digeR16 This is available on CRAN but not listed in the Chemometrics and Computational Physics Task View, provides a GUI interface for analysing 2D DIGE data. (http://cran.r- project.org/web/packages/digeR/index.html) using R and Bioconductor for Proteomics Data Analysis. It allows performing score plot, feature selection, correlation analysis, classification and power analysis for 2D DIGE experiment data. V. ADVANTAGES AND DISADVANTAGES OF MASS SPECTROMETRY 5.1 Advantages i) High Sensitivity– ability to detect very small amounts) ii) High Selectivity– Ability to tell molecules apart in a mixture iii) High Time Resolution iv) Low Cost v) small sample size vi) fast vii) differentiates isotopes viii) can be combined with GC and LC to run mixtures, or can also be run in tandem for proteins or peptides etc. 5.2 Disadvantages i) doesn't directly give structural information (although we can often figure it out) ii) Needs pure compounds. iii) Difficult with non-volatile compounds. VI. APPLICATIONS OF MASS SPECTROMETRY Mass Spectrometry consists of determining the isotopic composition of elements in a molecule, identifying unknown compounds, and determining the structure of a compound by understanding its fragmentation. Other uses include evaluating the amount of a compound in a sample or studying the fundamentals of gas phase in chemistry MS is now in very common use in analytical laboratories that study chemical, physical, or biological properties of a great variety of compounds. 6.1 Isotope ratio MS: isotope dating and tracking Mass spectrometry is also used to determine the composition of elements in a sample. Variations in mass among isotopes of an element are very less, and the less abundant isotopes of an element are typically very rare, so a very sensitive instrument is requirement of Mass Spectrometry. These instruments, sometimes allude to as isotope ratio mass spectrometers (IR-MS), it uses a single magnet to bend a beam of ionized particles towards a series of Faraday cups which convert particle impacts to electric current. Isotope ratios are important markers of a variety of processes. The age of materials for example as in carbon dating is determined by isotopes ratios. Marking with isotopes which are stable are also used for protein quantification. 6.2 Protein characterization Mass spectrometry is an important method for the characterization and sequence prediction of proteins. Fundamentally, the two methods for ionization of whole proteins are electrospray ionization (ESI) and matrix-assisted laser desorption/ ionization (MALDI). For characterizing Proteins two approaches are used. In the first, intact proteins are ionized by either of the two techniques described, and then introduced to a mass analyzer. This approach of Protein analysis is referred to as "top-down" strategy. In the other approach, proteins are enzymatically digested into smaller peptides using proteases. The mass analyzer gets introduced from the collection of peptide. When the characteristic pattern of peptides is used for the identification of the protein the method is called peptide mass fingerprinting (PMF), if the identification is performed using the sequence data determined
  • 6. Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23 www.ijera.com 22 | P a g e in tandem MS analysis, called as de novo sequencing. This is called as ―bottom-up approach. 6.3 Space exploration Mass Spectrometry has also reached to other planets and moons, as a standard method for analysis. In early 2005 the Cassini–Huygens mission delivered a specialized GC-MS instrument aboard. It is the Huygens probe through the atmosphere of the largest moon of the planet Saturn, Titan, This instrument analyzed atmospheric samples, once the probe had landed, along its descent ,trajectory and was able to vaporize and analyze samples of Titan's frozen, hydrocarbon covered surface. These measurements compare the abundance of isotope(s) of each particle comparatively to earth's natural abundance.[17] Also on board the Cassini– Huygens spacecraft is an ion and neutral mass spectrometer which has been taking measurements of Titan's atmospheric composition .A Thermal and Evolved Gas Analyzer mass spectrometer was carried by the Mars Phoenix Lander launched in 2007[18]. Mass spectrometers are also widely used in space missions to measure the composition of plasmas. 6.4 Respired gas monitor Mass spectrometers were used in hospitals for respiratory gas analysis beginning around 1975 through the end of the century. Found mostly in the operating room, a part of a complex system, which consist of respired gas samples from patients undergoing anaesthesia were drawn into the instrument through a valve mechanism. The instrument is designed to sequentially connect up to 32 rooms to the mass spectrometer. All operations of the system are directed by the computers. The anaesthesiologist uses the data collected from the mass spectrometer was delivered to the individual rooms. VII. CONCLUSION The data originated in the field of Proteomics is very large .It‘s the need of the day that these huge amount of data need to be managed and analysed. Various tools and techniques are available for the protein prediction. Mass Spectrometry is an analytical method for data analysis which originated in large amount. We have seen the various aspects of Mass Spectrometry. The mass Spectrometry tools and databases available. Also we have seen the advantages and disadvantages of the method and various application of it. REFERENCES [1] Cheng Lu, Introduction to mass Spectrometry. [2] Jimmy K. Eng, Ashley L. McCormack, and John R. Yates, III (1994). "An Approach to Correlate Tandem Mass Spectral Data of Peptides with Amino Acid Sequences in a Protein Database". J Am Soc Mass Spectrom 5 (11): 976–989 [3] Perkins, David N.; Pappin, Darryl J. C.; Creasy, David M.; Cottrell, John S. (1999). "Probability-based protein identification by searching sequence databases using mass spectrometry data".Electrophoresis 20 (18): 3551–67 [4] Liang, C; Smith, JC; Hendrie, Christopher (2003). A Comparative Study of Peptide Sequencing Software Tools for MS/MS. American Society for Mass Spectrometry. [5] Bartels, Christian (31 May 1990). "Fast algorithm for peptide sequencing by mass spectroscopy". Biological Mass Spectrometry 19 (6): 363–368. [6] Savitski, Mikhail M.; Nielsen, Michael L.; Kjeldsen, Frank; Zubarev, Roman A. (2005). "Proteomics-Grade de Novo Sequencing Approach". Journal of Proteome Research 4 (6): 2348–54 [7] Ma, Bin; Zhang, Kaizhong; Hendrie, Christopher; Liang, Chengzhi; Li, Ming; Doherty-Kirby, Amanda; Lajoie, Gilles (2003). "PEAKS: powerful software for peptidede novo sequencing by tandem mass spectrometry". Rapid Communications in Mass Spectrometry 17 (20): 2337–42. [8] "Lutefisk - de novo MS/MS Sequencing". J. Alex Taylor. Retrieved 2009-11-16. [9] Junker, J.; Bielow, C.; Bertsch, A.; Sturm, M.; Reinert, K.; Kohlbacher, O. (2012). "TOPPAS: A Graphical Workflow Editor for the Analysis of High-Throughput Proteomics Data". Journal of Proteome Research 11 (7): 3914–3920. [10] Cox, J.; Mann, M. (Dec 2008). "MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification". Nat Biotechnol 26 (12): 1367–72. [11] Rusconi, F. (2009). "massXpert 2: a cross- platform software environment for polymer chemistry modelling and simulation/analysis of mass spectrometric data". Bioinformatics 25 (20): 2741–2 [12] Catherine P. Riley*, Erik S. Gough, Jing He, Shrinivas S. Jandhyala, Brad Kennedy, Seza Orcun, Mourad Ouzzani, Charles Buck, Ali M. Roumani and Xiang Zhang, ― The Proteome Discovery Pipeline – A Data Analysis Pipeline for Mass Spectrometry- Based Differential Proteomics Discovery‖ : The Open Proteomics Journal, 2010, vol 3, pg[8-19] [13] L. Gatto and K. S. Lilley. MSnbase _ an R/Bioconductor package for isobaric tagged mass spectrometry data visualization, processing and quantitation. Bioinformatics, 28(2):288_9, Jan 2012.
  • 7. Ms. Ashwini Yerlekar et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 4( Version 4), April 2014, pp.17-23 www.ijera.com 23 | P a g e [14] R. C. Gentleman, V. J. Carey, D. M. Bates, B. Bolstad, M. Dettling, S. Dudoit, B. Ellis, L. Gautier, Y. Ge, J. Gentry, K. Hornik, T. Hothorn, W. Huber, S. Iacus, R. Irizarry, F. Leisch, C. Li, M. Maechler, A. J. Rossini, G. Sawitzki, C. Smith, G. Smyth, L. Tierney, J. Y. H. Yang, and J. Zhang. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol, 5(10):_ 80, 2004. [22] Tureček, František; McLafferty, Fred W. (1993). Interpretation of mass spectra. Sausalito: University Science Books. [16] Mistrik, R. (2004). A New Concept for the Interpretation of Mass Spectra Based on a Combination of a Fragmentation Mechanism Database and a Computer Expert System. in Ashcroft, A.E., Brenton, G., Monaghan, J.J. (Eds.), Advances in Mass Spectrometry, Elsevier, Amsterdam, vol. 16, pp. 821 [17] Petrie, S. and Bohme, D. K. (2007). "Ions in space". Mass Spectrometry Reviews 26 (2): 258– 280. doi:10.1002/mas.20114. PMID 1711134 6. [18] Hoffman, J; Chaney, R; Hammack, H (2008). "Phoenix Mars Mission—the Thermal Evolved Gas Analyzer". Journal of the American Society for Mass Spectrometry 19 (10): 1377–83.