SlideShare a Scribd company logo
1 of 66
WWW.RCSB.ORG/PDB
• Introduction
• Supported and funded by
• History
• PDB Holdings list
• Member organizations
• Task forces
• PDB ID
• PDB File format
• Browse to WWW.RCSB.ORG/PDB/
 “The repository reservoir data bank to store the authenticated structures
  of Protein and Nucleic acid”

 Single worldwide database and hundreds of secondary databases categorize the
  data differently.

 Key resource in the area of structural biology, stores 3D structural data of
  large biological molecules such as Proteins and Nucleic acids.

 Data is   submitted by Biologists and Biochemists from all around the world to
  be freely accessible on internet via its member organizations’ websites and is
  updated weekly.

 The   mission is to maintain a single Protein Data Bank Archive of
  Macromolecular Structural data.
 The Protein Data Bank (PDB) is operated by:


 Rutgers, The State University of New Jersey.

 The San Diego Supercomputer Center at the University of California, San
  Diego.

 RCSB-the Research Collaborator for Structural Bioinformatics


 The PDB is supported by funds from the National Science Foundation, the
  Department of Energy, and the National Institutes of Health.
Two forces to initiate PDB:
   Growing collection of sets of protein structural data by X-Ray diffraction.
   NMR-nuclear Magnetic Resonance method to visualize protein structures
  in 3D, emerged in 1968.


 In 1969,   Dr Edger Meyer began to write software to store atomic
  coordinates files in a common format to make them available for geometric
  and graphical evaluation.


 In 1971, one of Dr   Meyer’s programs- SEARCH- enabled networking i.e
  enabled the researchers to access information from database to study protein
  structures offline.
 In 1973, upon Hamilton’s death, Dr Tom Koetzle took over direction of PDB
  for 20 years.
 mmCIF project completed and Structural genomics began in 1970s.


 In 1980s, IUCr guidelines established, number of structures deposited increases
  and independent biological databases established – e.g., the NDB.

 In Oct, 1998; PDB was transferred to Research Collaboratory for Structural
  Bioinformatics (RCSB), complete transfer since 1999. Dr Helen M Berman of
  Rutgers University was the new director.

 In 2003, with the formation of wwPDB, the PDB became an international
  organization having three member organizations.

 In 2006, the BMRB joined PDB.
Experimental                                             Protein/Nucleic Acid
                                Proteins    Nucleic Acids                                     Other       Total
            Method                                                       complexes


X-ray diffraction                   62750              1323                         3050              2   67125


NMR                                  7962               960                             179           7     9108


Electron microscopy                   262                   22                           96           0      380


Hybrid                                 41                    3                            1           1       46


Other                                 133                    4                            5       13         155


                       Total:       71148              2312                         3331          23      76814
 Act as Data deposition, Data processing and Distribution centers for PDB data.


 Three are founding member organizations:


 PDBe…Protein Data Bank in Europe.
 PDBj…Protein Data Bank in Japan.
 RCSB…Research Collaboratory for Structural Bioinformatics.


 The Biological Magnetic Resonance Data Bank (BMRB) joined later in 2006.


 Another organization Worldwide Protein Data Bank (wwPDB) oversees PDB.
  wwPDB reviews and annotates each submitted entry and then it is automatically
  checked for plausibility( the source code) for validation software is available.
 X-Ray diffraction :-Spacing of atoms determined by location intensities
  spot on photographic plate by X-Ray e.g lyzozyme.

 Limited to just crystal structures only


 NMR (about 15% e.g., hemoglobin)…estimations of distances between
  pairs of atoms of proteins. Final conformation is obtained after solving
  distance geometry problem.

 Illuminate dynamic side,conformatonal changes, protein folding as well
 Each structure published in PDB receives a four character alphanumeric
  identifier or accession number. Like, 1ANG or 4hhb.

 However, this cant be used as an identifier for biomolecules. Because
  several structures for the same molecule in different environments or
  conformations-are contained in PDB with different PDB IDs.


                          HAEMOGLOBIN
                             (2DN2)
 Standard data representation…encoded in data
     dictionary. The metadata model supporting this
     representation is used by all PDB data processing and
     database software tools.

1. PDB file format was restricted to 80 characters per line
   initially.
2. In 1996, macromolecular Crystallographic Information
   File (mmCIF) format started.
3. In 2005, XML version called as PDBML, was
   described.
 The Protein Data Bank (pdb) file format is a textual
  file format
 describing the three dimensional structures of
  molecules held in the Protein Data Bank.
 provides description and annotation structure
   atomic coordinates,
   side chains,
   secondary structure,   as well as
   atomic connectivity
 Water , ions, nucleic acids, ligands…
 mmCIF is the acronym for the macromolecular Crystallographic
  Information File.
 mmCIF is based on a subset of the syntax rules for the Self
  Defining Text Archive (STAR) file.
 A Dictionary Description Language (DDL) defines the structure of
  mmCIF dictionaries. Dictionaries provide the metadata which define
  the content of mmCIF data files.
 mmCIF data files, dictionaries and DDLs are all expressed in a
  common syntax.
 basic information, more detailed


 description of PDB, PDBML and mmCIF file formats
 can be found at Protein Data Bank web sites.
 highly recommended to get familiar with all rules of
 PDB format (such as gaps between columns)

 BEACAUSE…
put either a search term (for example, a protein name) or a PDB
                             number
2DN2
HAEMOGLOBIN
 If the contents of the PDB are thought of as primary
  data,
THEN
 hundreds of derived (i.e., secondary) databases
  categorize the data differently.
For example
 SCOP & CATH :
   categorize structures according to type of structure and
    assumed evolutionary relations;
 GO categorize structures based on genes.
The Structural Classification of
Proteins (SCOP) database is a
largely manual classification of
protein structural domains based
on similarities of
their structures and amino
acid sequences
Class:the overall secondary-structure content of the
domain
Architecture:high structural similarity but no evidence
of homology.
Topology:a large-scale grouping of topologies which
share particular structural features
Homologous superfamily:indicative of a demonstrable
evolutionary relationship.
Pfam is a database
of protein
families that
includes their
annotations
and multiple
sequence
alignment
generated
using hidden
Markov models
CLICK
Select your desire
method
CLICK
CLICK
CLICK
CLICK
& FINALLY
Show the gradual updating
Structural View of Biology   released entries
You can also
select different
display view
can also download in
different view
 Text file can be viewed or modified in editor.
 Structure files may be viewed using various free and commercial
    visualizations programs and Web browsers plug-ins like
   OPEN SOURCE PDB SOFTWERES
   Jmol
   Molekel
   MeshLab (able to import PDB data set and buildup surfaces from them)
   QuteMol
 Avogadro
 OPEN BUT NOT FREE
 PYMOL , RASMOL, VIST PROT 3DS & STAR BIOCHEM
The RCSB PDB website contains
an extensive list of both free and
commercial molecule visualization
programs and web browser plug-in.
 central archive of experimentally solved bimolecular structures.
  But
 only allows data retrieval
 does not provide collaboration or user feedback.


 In contrast, PDBWiki allows for sharing expert knowledge
  about structures deposited in the PDB.
 provides tools for discussing and annotating proteins in a
  collaborative way.
 The Protein Data Bank (PDB) is the central archive of
  experimentally solved bimolecular structures. However, the
  PDB only allows data retrieval and does not provide
  functionality for collaboration or user feedback.
 In contrast, PDBWiki allows for sharing expert knowledge
  about structures deposited in the PDB. It provides tools for
  discussing and annotating proteins in a collaborative way. The
  goal is to create a central and freely-accessible repository of
  user-contributed information that will be useful for anyone
  working with PDB structures. As such PDBWiki can be
  considered a part of a wider effort in community-based
  biological databases curation.
Lo0k there is something more for
you…..
Protein Data Bank

More Related Content

What's hot (20)

Introduction to pdb
Introduction to pdbIntroduction to pdb
Introduction to pdb
 
Dot matrix
Dot matrixDot matrix
Dot matrix
 
Cath
CathCath
Cath
 
Introduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbjIntroduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbj
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
 
EMBL
EMBLEMBL
EMBL
 
BLAST
BLASTBLAST
BLAST
 
Protein database
Protein  databaseProtein  database
Protein database
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Protein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modelingProtein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modeling
 
Protein 3 d structure prediction
Protein 3 d structure predictionProtein 3 d structure prediction
Protein 3 d structure prediction
 
Swiss pdb viewer
Swiss pdb viewerSwiss pdb viewer
Swiss pdb viewer
 
Gene bank by kk sahu
Gene bank by kk sahuGene bank by kk sahu
Gene bank by kk sahu
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Gen bank
Gen bankGen bank
Gen bank
 
Protein sequence databases
Protein sequence databasesProtein sequence databases
Protein sequence databases
 
Protein structure classification/domain prediction: SCOP and CATH (Bioinforma...
Protein structure classification/domain prediction: SCOP and CATH (Bioinforma...Protein structure classification/domain prediction: SCOP and CATH (Bioinforma...
Protein structure classification/domain prediction: SCOP and CATH (Bioinforma...
 
Structural databases
Structural databases Structural databases
Structural databases
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
 

Viewers also liked

Structural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its ScopeStructural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its ScopeNixon Mendez
 
Protein database ..... of NCBI
Protein database ..... of NCBI Protein database ..... of NCBI
Protein database ..... of NCBI Alagppa University
 
Errors and Limitaions of Next Generation Sequencing
Errors and Limitaions of Next Generation SequencingErrors and Limitaions of Next Generation Sequencing
Errors and Limitaions of Next Generation SequencingNixon Mendez
 
Addressing the shortage of medical doctors in zambia
Addressing the shortage of medical doctors in zambiaAddressing the shortage of medical doctors in zambia
Addressing the shortage of medical doctors in zambiaNixon Mendez
 
PERL- Bioperl modules
PERL- Bioperl modulesPERL- Bioperl modules
PERL- Bioperl modulesNixon Mendez
 
Clustering and Visualisation using R programming
Clustering and Visualisation using R programmingClustering and Visualisation using R programming
Clustering and Visualisation using R programmingNixon Mendez
 
Protein-protein interaction (PPI)
Protein-protein interaction (PPI)Protein-protein interaction (PPI)
Protein-protein interaction (PPI)N Poorin
 
Cytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScapeCytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScapeNixon Mendez
 
Kegg database resources
Kegg database resources Kegg database resources
Kegg database resources innocent87
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactionsPrianca12
 
Protein databases
Protein databasesProtein databases
Protein databasessarumalay
 

Viewers also liked (19)

Structural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its ScopeStructural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its Scope
 
Protein database ..... of NCBI
Protein database ..... of NCBI Protein database ..... of NCBI
Protein database ..... of NCBI
 
Lyme disease
Lyme diseaseLyme disease
Lyme disease
 
PROTEIN DATABASE
PROTEIN DATABASEPROTEIN DATABASE
PROTEIN DATABASE
 
Errors and Limitaions of Next Generation Sequencing
Errors and Limitaions of Next Generation SequencingErrors and Limitaions of Next Generation Sequencing
Errors and Limitaions of Next Generation Sequencing
 
PowerMV
PowerMV PowerMV
PowerMV
 
Addressing the shortage of medical doctors in zambia
Addressing the shortage of medical doctors in zambiaAddressing the shortage of medical doctors in zambia
Addressing the shortage of medical doctors in zambia
 
Sequence database
Sequence databaseSequence database
Sequence database
 
PERL- Bioperl modules
PERL- Bioperl modulesPERL- Bioperl modules
PERL- Bioperl modules
 
Clustering and Visualisation using R programming
Clustering and Visualisation using R programmingClustering and Visualisation using R programming
Clustering and Visualisation using R programming
 
MASCOT
MASCOTMASCOT
MASCOT
 
Protein-protein interaction (PPI)
Protein-protein interaction (PPI)Protein-protein interaction (PPI)
Protein-protein interaction (PPI)
 
Genome Database Systems
Genome Database Systems Genome Database Systems
Genome Database Systems
 
2D-PAGE & DIGE
2D-PAGE & DIGE2D-PAGE & DIGE
2D-PAGE & DIGE
 
Cytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScapeCytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScape
 
Kegg database resources
Kegg database resources Kegg database resources
Kegg database resources
 
Protein database
Protein databaseProtein database
Protein database
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
Protein databases
Protein databasesProtein databases
Protein databases
 

Similar to Protein Data Bank

Molecular Structures 2009
Molecular Structures 2009Molecular Structures 2009
Molecular Structures 2009lyonja
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiiiMuhammad Younis
 
Protein Data Bank ( PDB ) - Bioinformatics
Protein Data Bank ( PDB ) - BioinformaticsProtein Data Bank ( PDB ) - Bioinformatics
Protein Data Bank ( PDB ) - Bioinformaticskarmandeepkaur7
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS
 
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)R.P MAURYA
 
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINEPROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINEijsc
 
Protein Structure Prediction Using Support Vector Machine
Protein Structure Prediction Using Support Vector Machine  Protein Structure Prediction Using Support Vector Machine
Protein Structure Prediction Using Support Vector Machine ijsc
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein databasechinmayeec
 
2009 CSBB LAB 新生訓練
2009 CSBB LAB 新生訓練 2009 CSBB LAB 新生訓練
2009 CSBB LAB 新生訓練 Abner Huang
 
MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...
 MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo... MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...
MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...Peter Rose
 
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MININGANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MININGijbbjournal
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptxSilpa87
 
A Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkA Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkLisa Muthukumar
 
Chemoinformatic File Format.pptx
Chemoinformatic File Format.pptxChemoinformatic File Format.pptx
Chemoinformatic File Format.pptxwadhava gurumeet
 
Structural bioinformatics.
Structural bioinformatics.Structural bioinformatics.
Structural bioinformatics.SALIHAMUGHAL
 
Molecular modelling and dcoking.pptx
Molecular modelling and dcoking.pptxMolecular modelling and dcoking.pptx
Molecular modelling and dcoking.pptx12nikitaborade1
 
MADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxMADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxChris Mungall
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbgetSurendraKumar338
 

Similar to Protein Data Bank (20)

Molecular Structures 2009
Molecular Structures 2009Molecular Structures 2009
Molecular Structures 2009
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiii
 
Protein Data Bank ( PDB ) - Bioinformatics
Protein Data Bank ( PDB ) - BioinformaticsProtein Data Bank ( PDB ) - Bioinformatics
Protein Data Bank ( PDB ) - Bioinformatics
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequences
 
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
 
Ppi
PpiPpi
Ppi
 
Bind database
Bind databaseBind database
Bind database
 
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINEPROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
 
Protein Structure Prediction Using Support Vector Machine
Protein Structure Prediction Using Support Vector Machine  Protein Structure Prediction Using Support Vector Machine
Protein Structure Prediction Using Support Vector Machine
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein database
 
2009 CSBB LAB 新生訓練
2009 CSBB LAB 新生訓練 2009 CSBB LAB 新生訓練
2009 CSBB LAB 新生訓練
 
MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...
 MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo... MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...
MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...
 
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MININGANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
 
A Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkA Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration Framework
 
Chemoinformatic File Format.pptx
Chemoinformatic File Format.pptxChemoinformatic File Format.pptx
Chemoinformatic File Format.pptx
 
Structural bioinformatics.
Structural bioinformatics.Structural bioinformatics.
Structural bioinformatics.
 
Molecular modelling and dcoking.pptx
Molecular modelling and dcoking.pptxMolecular modelling and dcoking.pptx
Molecular modelling and dcoking.pptx
 
MADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxMADICES Mungall 2022.pptx
MADICES Mungall 2022.pptx
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbget
 

Recently uploaded

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Recently uploaded (20)

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Protein Data Bank

  • 2. • Introduction • Supported and funded by • History • PDB Holdings list • Member organizations • Task forces • PDB ID • PDB File format • Browse to WWW.RCSB.ORG/PDB/
  • 3.  “The repository reservoir data bank to store the authenticated structures of Protein and Nucleic acid”  Single worldwide database and hundreds of secondary databases categorize the data differently.  Key resource in the area of structural biology, stores 3D structural data of large biological molecules such as Proteins and Nucleic acids.  Data is submitted by Biologists and Biochemists from all around the world to be freely accessible on internet via its member organizations’ websites and is updated weekly.  The mission is to maintain a single Protein Data Bank Archive of Macromolecular Structural data.
  • 4.  The Protein Data Bank (PDB) is operated by:  Rutgers, The State University of New Jersey.  The San Diego Supercomputer Center at the University of California, San Diego.  RCSB-the Research Collaborator for Structural Bioinformatics  The PDB is supported by funds from the National Science Foundation, the Department of Energy, and the National Institutes of Health.
  • 5. Two forces to initiate PDB: Growing collection of sets of protein structural data by X-Ray diffraction. NMR-nuclear Magnetic Resonance method to visualize protein structures in 3D, emerged in 1968.  In 1969, Dr Edger Meyer began to write software to store atomic coordinates files in a common format to make them available for geometric and graphical evaluation.  In 1971, one of Dr Meyer’s programs- SEARCH- enabled networking i.e enabled the researchers to access information from database to study protein structures offline.
  • 6.  In 1973, upon Hamilton’s death, Dr Tom Koetzle took over direction of PDB for 20 years.  mmCIF project completed and Structural genomics began in 1970s.  In 1980s, IUCr guidelines established, number of structures deposited increases and independent biological databases established – e.g., the NDB.  In Oct, 1998; PDB was transferred to Research Collaboratory for Structural Bioinformatics (RCSB), complete transfer since 1999. Dr Helen M Berman of Rutgers University was the new director.  In 2003, with the formation of wwPDB, the PDB became an international organization having three member organizations.  In 2006, the BMRB joined PDB.
  • 7. Experimental Protein/Nucleic Acid Proteins Nucleic Acids Other Total Method complexes X-ray diffraction 62750 1323 3050 2 67125 NMR 7962 960 179 7 9108 Electron microscopy 262 22 96 0 380 Hybrid 41 3 1 1 46 Other 133 4 5 13 155 Total: 71148 2312 3331 23 76814
  • 8.  Act as Data deposition, Data processing and Distribution centers for PDB data.  Three are founding member organizations:  PDBe…Protein Data Bank in Europe.  PDBj…Protein Data Bank in Japan.  RCSB…Research Collaboratory for Structural Bioinformatics.  The Biological Magnetic Resonance Data Bank (BMRB) joined later in 2006.  Another organization Worldwide Protein Data Bank (wwPDB) oversees PDB. wwPDB reviews and annotates each submitted entry and then it is automatically checked for plausibility( the source code) for validation software is available.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.  X-Ray diffraction :-Spacing of atoms determined by location intensities spot on photographic plate by X-Ray e.g lyzozyme.  Limited to just crystal structures only  NMR (about 15% e.g., hemoglobin)…estimations of distances between pairs of atoms of proteins. Final conformation is obtained after solving distance geometry problem.  Illuminate dynamic side,conformatonal changes, protein folding as well
  • 15.
  • 16.
  • 17.  Each structure published in PDB receives a four character alphanumeric identifier or accession number. Like, 1ANG or 4hhb.  However, this cant be used as an identifier for biomolecules. Because several structures for the same molecule in different environments or conformations-are contained in PDB with different PDB IDs. HAEMOGLOBIN (2DN2)
  • 18.  Standard data representation…encoded in data dictionary. The metadata model supporting this representation is used by all PDB data processing and database software tools. 1. PDB file format was restricted to 80 characters per line initially. 2. In 1996, macromolecular Crystallographic Information File (mmCIF) format started. 3. In 2005, XML version called as PDBML, was described.
  • 19.  The Protein Data Bank (pdb) file format is a textual file format  describing the three dimensional structures of molecules held in the Protein Data Bank.  provides description and annotation structure  atomic coordinates,  side chains,  secondary structure, as well as  atomic connectivity  Water , ions, nucleic acids, ligands…
  • 20.
  • 21.  mmCIF is the acronym for the macromolecular Crystallographic Information File.  mmCIF is based on a subset of the syntax rules for the Self Defining Text Archive (STAR) file.  A Dictionary Description Language (DDL) defines the structure of mmCIF dictionaries. Dictionaries provide the metadata which define the content of mmCIF data files.  mmCIF data files, dictionaries and DDLs are all expressed in a common syntax.
  • 22.
  • 23.  basic information, more detailed  description of PDB, PDBML and mmCIF file formats  can be found at Protein Data Bank web sites.  highly recommended to get familiar with all rules of PDB format (such as gaps between columns)  BEACAUSE…
  • 24.
  • 25. put either a search term (for example, a protein name) or a PDB number
  • 26.
  • 27. 2DN2
  • 28.
  • 29.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.  If the contents of the PDB are thought of as primary data, THEN  hundreds of derived (i.e., secondary) databases categorize the data differently. For example  SCOP & CATH :  categorize structures according to type of structure and assumed evolutionary relations;  GO categorize structures based on genes.
  • 39.
  • 40. The Structural Classification of Proteins (SCOP) database is a largely manual classification of protein structural domains based on similarities of their structures and amino acid sequences
  • 41. Class:the overall secondary-structure content of the domain Architecture:high structural similarity but no evidence of homology. Topology:a large-scale grouping of topologies which share particular structural features Homologous superfamily:indicative of a demonstrable evolutionary relationship.
  • 42. Pfam is a database of protein families that includes their annotations and multiple sequence alignment generated using hidden Markov models
  • 43.
  • 44. CLICK
  • 46. CLICK
  • 47. CLICK
  • 48. CLICK
  • 49. CLICK
  • 51. Show the gradual updating Structural View of Biology released entries
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57. You can also select different display view
  • 58. can also download in different view
  • 59.  Text file can be viewed or modified in editor.  Structure files may be viewed using various free and commercial visualizations programs and Web browsers plug-ins like  OPEN SOURCE PDB SOFTWERES  Jmol  Molekel  MeshLab (able to import PDB data set and buildup surfaces from them)  QuteMol  Avogadro  OPEN BUT NOT FREE  PYMOL , RASMOL, VIST PROT 3DS & STAR BIOCHEM
  • 60. The RCSB PDB website contains an extensive list of both free and commercial molecule visualization programs and web browser plug-in.
  • 61.
  • 62.
  • 63.  central archive of experimentally solved bimolecular structures. But  only allows data retrieval  does not provide collaboration or user feedback.  In contrast, PDBWiki allows for sharing expert knowledge about structures deposited in the PDB.  provides tools for discussing and annotating proteins in a collaborative way.
  • 64.  The Protein Data Bank (PDB) is the central archive of experimentally solved bimolecular structures. However, the PDB only allows data retrieval and does not provide functionality for collaboration or user feedback.  In contrast, PDBWiki allows for sharing expert knowledge about structures deposited in the PDB. It provides tools for discussing and annotating proteins in a collaborative way. The goal is to create a central and freely-accessible repository of user-contributed information that will be useful for anyone working with PDB structures. As such PDBWiki can be considered a part of a wider effort in community-based biological databases curation.
  • 65. Lo0k there is something more for you…..