SlideShare une entreprise Scribd logo
1  sur  18
Protein Structure Prediction
Protein
Sequence +
Structure
By:-
KARAMVEER
M.Sc. LIFE SCIENCES SPECIALIZATION
WITH BIOINFORMATICS
WEL-COME
1
Different Levels of Protein Structure
2
METHODS FOR PROTEIN STRUCTURE PREDICTION
1. EXPERIMANTAL METHODS
2. COMPUTIONAL METHODS
3
Experimental Protein StructureExperimental Protein Structure
DeterminationDetermination
• X-ray crystallographyX-ray crystallography
– most accuratemost accurate
– in vitroin vitro
– needs crystalsneeds crystals
– ~$100-200K per structure~$100-200K per structure
– time consuming and expansive.time consuming and expansive.
• NMRNMR
– fairly accuratefairly accurate
– in vivoin vivo
– no need for crystalsno need for crystals
– limited to very small proteinslimited to very small proteins
– time consuming and hardly .time consuming and hardly .
• Electron-microscopyElectron-microscopy
– imaging technologyimaging technology
– low resolutionlow resolution
– not more observable.not more observable.
4
Computational method
• Major Techniques
– Template Modeling
• Homology Modeling
• Threading
• Both are use known protein structure
– Template-Free Modeling
• ab initio Methods
– Physics-Based
– Knowledge-Based
– without use of known protein structure
5
Homology Modelling
• also called comparitive modeling.
• predict protein structures based on sequence
homology with known structure.
• Principle:-
• if two proteins share a high enough sequence
similarity,they are likely to have very similar
three dimensional structure.
• modeling server:-modbase,swiss-model etc.
• Fail in absence of homology
6
Homology Modelling
 six steps:-
1.template selection (BLAST and FASTA)
2.sequence alignment (T-coffee and PRALINE)
3.model building (CODA)
(a)backbone model building
(b)loop modeling
4.side chain refinement (SCWRL)
5.model refinement using energy function (GROMOS)
6.model evalution (PROCHECK & WHAT IF)
• Has been observed that even proteins with 30% sequence identity
fold into similar structures
• Does not work for remote homologs (< 30% pairwise identity)
7
Homology Modeling: How it works
o Find template
o Align target sequence
with template
o Generate model:
- add loops
- add sidechains
o Refine model
8
Threading
• Given:
– sequence of protein 'P 'with unknown structure
– Database of known folds
• Find:
– Most plausible fold for 'P'
– Evaluate quality of such arrangement
• Places the residues of unknown 'P' along the
backbone of a known structure and determines
stability of side chains in that arrangement
9
Threading and fold recognition
• predicts the structural fold of unknown protein
sequences by fitting the sequence into a
structural database and selecting the best
fitting fold.
• we can identify structurally similar proteins
even without detectablesequence similarity.
• two algorithms:-
1.pairwise energy based method
2.profile based method
10
1.Pairwise energy based method (threading)
• Searched for a structural fold database to find the best
matching structural fold using energy based criteria.
• Using dynamic programming and heuristic approaches.
• Calculate energy for raw model.
• Lowest energy fold that correspond to the structurally a
group of most compatible fold.
2.profile based method (fold recognition)
• A profile is constructed for related protein structures.
• Generated by superimposition of the structures to
expose corresponding residues.
• Secondary structure type,polarity,hydrophobicity.
• The protein fold to be predicted does not exist in the
fold library,method will fail. 3D-PSSM
http://www.sbg.bio.ic.ac.uk/~3dpssm/
11
12
13
Ab InitioAb Initio Protein Structure PredictionProtein Structure Prediction
• Ab initio protein structure prediction methods build protein 3D
structures from sequence based on physical principles.
• Importance
– The ab initio methods are important even though they are
computationally demanding
– Ab initio methods predict protein structure based on physical models,
they are indispensable complementary methods to Knowledge-based
approach.
Knowledge-based approach would fail in following conditions:
• Structure homologues are not available
• Possible undiscovered new fold exists.
Anfinsen’s theory: Protein native structure corresponds to the
state with the lowest free energy of the protein-solvent system.
Rosetta
• web server for protein 3d structure prediction.
• mini threading method.
 breaks down the quary sequence into many short segments (3
to 9).
 predicts the secondary structure of small segments using
HMMSTR.
 Segments with assigned secondary structure are subsequently
assembled into a 3D configuration.
 random combination of fragments, a large number of models
are built and their over all energy potential calculated.
 conformation with lowest free energy is choosen as the best
model.
14
15o CASP
Protein Sequence
Database Searching
Multiple Sequence
Alignment
Homologue
in PDB
Homology
Modelling
Secondary
Structure
Prediction
No
Yes
3-D Protein Model
Fold
Recognition
Predicted
Fold
Sequence-Structure
Alignment
Ab-initio
Structure
Prediction
No
Yes
Overall Approach
16
Reference
• http://www.golujain/biology-protein-
structure-in-cloud-computing
• http://infoman.teikav.edu.gr/~stpapad/Ess
entialBioinformatics.pdf
• https://en.wikipedia.org/wiki/Ab_Initio_Soft
ware
• Dictionary of bioinformatics/ by k.
manikandakumar.
17
18.

Contenu connexe

Tendances

Tendances (20)

Protein micro array
Protein micro arrayProtein micro array
Protein micro array
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
shotgun sequncing
 shotgun sequncing shotgun sequncing
shotgun sequncing
 
Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins
 
Threading modeling methods
Threading modeling methodsThreading modeling methods
Threading modeling methods
 
Fasta
FastaFasta
Fasta
 
2d Page
2d Page2d Page
2d Page
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
Genome sequencing
Genome sequencingGenome sequencing
Genome sequencing
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interaction
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
 
Finding ORF
Finding ORFFinding ORF
Finding ORF
 
Protein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modelingProtein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modeling
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
DNA microarray
DNA microarrayDNA microarray
DNA microarray
 
Rasmol
RasmolRasmol
Rasmol
 
Homology modelling
Homology modellingHomology modelling
Homology modelling
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 

En vedette (9)

In silico structure prediction
In silico structure predictionIn silico structure prediction
In silico structure prediction
 
The Lac operon
The Lac operonThe Lac operon
The Lac operon
 
Nucleic Acid Sequence Databases
Nucleic Acid Sequence DatabasesNucleic Acid Sequence Databases
Nucleic Acid Sequence Databases
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignment
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
Biological databases
Biological databasesBiological databases
Biological databases
 

Similaire à methods for protein structure prediction

Computational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxComputational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptx
ashharnomani
 
protein structure prediction presentation .pptx
protein structure prediction presentation .pptxprotein structure prediction presentation .pptx
protein structure prediction presentation .pptx
abdulahad563527
 
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screening
Deependra Ban
 

Similaire à methods for protein structure prediction (20)

Protien Structure Prediction
Protien Structure PredictionProtien Structure Prediction
Protien Structure Prediction
 
protein Modeling Abi.pptx
protein Modeling Abi.pptxprotein Modeling Abi.pptx
protein Modeling Abi.pptx
 
Computer Aided Molecular Modeling
Computer Aided Molecular ModelingComputer Aided Molecular Modeling
Computer Aided Molecular Modeling
 
L1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptxL1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptx
 
protein structure prediction.pptx
protein structure prediction.pptxprotein structure prediction.pptx
protein structure prediction.pptx
 
De novo str_prediction
De novo str_predictionDe novo str_prediction
De novo str_prediction
 
Computational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxComputational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptx
 
HOMOLOGY MODELING IN EASIER WAY
HOMOLOGY MODELING IN EASIER WAYHOMOLOGY MODELING IN EASIER WAY
HOMOLOGY MODELING IN EASIER WAY
 
Homology modeling: Modeller
Homology modeling: ModellerHomology modeling: Modeller
Homology modeling: Modeller
 
Class powerpoint.ppt
Class powerpoint.pptClass powerpoint.ppt
Class powerpoint.ppt
 
Drug discovery presentation
Drug discovery presentationDrug discovery presentation
Drug discovery presentation
 
protein structure prediction presentation .pptx
protein structure prediction presentation .pptxprotein structure prediction presentation .pptx
protein structure prediction presentation .pptx
 
Modelling Proteins By Computational Structural Biology
Modelling Proteins By Computational Structural BiologyModelling Proteins By Computational Structural Biology
Modelling Proteins By Computational Structural Biology
 
protein design, principles and examples.pptx
protein design, principles and examples.pptxprotein design, principles and examples.pptx
protein design, principles and examples.pptx
 
Presentation1
Presentation1Presentation1
Presentation1
 
Protein structure 2
Protein structure 2Protein structure 2
Protein structure 2
 
Computational Prediction of Protein Structure.pptx
Computational Prediction of Protein Structure.pptxComputational Prediction of Protein Structure.pptx
Computational Prediction of Protein Structure.pptx
 
Molecular modelling (1)
Molecular modelling (1)Molecular modelling (1)
Molecular modelling (1)
 
Protein structure analysis
Protein structure analysis Protein structure analysis
Protein structure analysis
 
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screening
 

Dernier

Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Sérgio Sacani
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
Lokesh Kothari
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Sérgio Sacani
 

Dernier (20)

GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 

methods for protein structure prediction

  • 1. Protein Structure Prediction Protein Sequence + Structure By:- KARAMVEER M.Sc. LIFE SCIENCES SPECIALIZATION WITH BIOINFORMATICS WEL-COME 1
  • 2. Different Levels of Protein Structure 2
  • 3. METHODS FOR PROTEIN STRUCTURE PREDICTION 1. EXPERIMANTAL METHODS 2. COMPUTIONAL METHODS 3
  • 4. Experimental Protein StructureExperimental Protein Structure DeterminationDetermination • X-ray crystallographyX-ray crystallography – most accuratemost accurate – in vitroin vitro – needs crystalsneeds crystals – ~$100-200K per structure~$100-200K per structure – time consuming and expansive.time consuming and expansive. • NMRNMR – fairly accuratefairly accurate – in vivoin vivo – no need for crystalsno need for crystals – limited to very small proteinslimited to very small proteins – time consuming and hardly .time consuming and hardly . • Electron-microscopyElectron-microscopy – imaging technologyimaging technology – low resolutionlow resolution – not more observable.not more observable. 4
  • 5. Computational method • Major Techniques – Template Modeling • Homology Modeling • Threading • Both are use known protein structure – Template-Free Modeling • ab initio Methods – Physics-Based – Knowledge-Based – without use of known protein structure 5
  • 6. Homology Modelling • also called comparitive modeling. • predict protein structures based on sequence homology with known structure. • Principle:- • if two proteins share a high enough sequence similarity,they are likely to have very similar three dimensional structure. • modeling server:-modbase,swiss-model etc. • Fail in absence of homology 6
  • 7. Homology Modelling  six steps:- 1.template selection (BLAST and FASTA) 2.sequence alignment (T-coffee and PRALINE) 3.model building (CODA) (a)backbone model building (b)loop modeling 4.side chain refinement (SCWRL) 5.model refinement using energy function (GROMOS) 6.model evalution (PROCHECK & WHAT IF) • Has been observed that even proteins with 30% sequence identity fold into similar structures • Does not work for remote homologs (< 30% pairwise identity) 7
  • 8. Homology Modeling: How it works o Find template o Align target sequence with template o Generate model: - add loops - add sidechains o Refine model 8
  • 9. Threading • Given: – sequence of protein 'P 'with unknown structure – Database of known folds • Find: – Most plausible fold for 'P' – Evaluate quality of such arrangement • Places the residues of unknown 'P' along the backbone of a known structure and determines stability of side chains in that arrangement 9
  • 10. Threading and fold recognition • predicts the structural fold of unknown protein sequences by fitting the sequence into a structural database and selecting the best fitting fold. • we can identify structurally similar proteins even without detectablesequence similarity. • two algorithms:- 1.pairwise energy based method 2.profile based method 10
  • 11. 1.Pairwise energy based method (threading) • Searched for a structural fold database to find the best matching structural fold using energy based criteria. • Using dynamic programming and heuristic approaches. • Calculate energy for raw model. • Lowest energy fold that correspond to the structurally a group of most compatible fold. 2.profile based method (fold recognition) • A profile is constructed for related protein structures. • Generated by superimposition of the structures to expose corresponding residues. • Secondary structure type,polarity,hydrophobicity. • The protein fold to be predicted does not exist in the fold library,method will fail. 3D-PSSM http://www.sbg.bio.ic.ac.uk/~3dpssm/ 11
  • 12. 12
  • 13. 13 Ab InitioAb Initio Protein Structure PredictionProtein Structure Prediction • Ab initio protein structure prediction methods build protein 3D structures from sequence based on physical principles. • Importance – The ab initio methods are important even though they are computationally demanding – Ab initio methods predict protein structure based on physical models, they are indispensable complementary methods to Knowledge-based approach. Knowledge-based approach would fail in following conditions: • Structure homologues are not available • Possible undiscovered new fold exists. Anfinsen’s theory: Protein native structure corresponds to the state with the lowest free energy of the protein-solvent system.
  • 14. Rosetta • web server for protein 3d structure prediction. • mini threading method.  breaks down the quary sequence into many short segments (3 to 9).  predicts the secondary structure of small segments using HMMSTR.  Segments with assigned secondary structure are subsequently assembled into a 3D configuration.  random combination of fragments, a large number of models are built and their over all energy potential calculated.  conformation with lowest free energy is choosen as the best model. 14
  • 16. Protein Sequence Database Searching Multiple Sequence Alignment Homologue in PDB Homology Modelling Secondary Structure Prediction No Yes 3-D Protein Model Fold Recognition Predicted Fold Sequence-Structure Alignment Ab-initio Structure Prediction No Yes Overall Approach 16
  • 17. Reference • http://www.golujain/biology-protein- structure-in-cloud-computing • http://infoman.teikav.edu.gr/~stpapad/Ess entialBioinformatics.pdf • https://en.wikipedia.org/wiki/Ab_Initio_Soft ware • Dictionary of bioinformatics/ by k. manikandakumar. 17
  • 18. 18.