SlideShare une entreprise Scribd logo
1  sur  54
NCBI
National Centre For Biotechnology
Information
Site: www.ncbi.nlm.nih.gov
By Richa Sharma
M.Sc. Biomedical Sciences
Dr. BR Ambedkar Center for Biomedical
aresearch (ACBR)
INTRODUCTION
NCBI was established in the year 1988, as a part of the
National Library of Medicine at the National Institutes of
Health, Maryland, USA
NCBI HOME PAGE
DIFFERENCES BETWEEN
DATABASE AND TOOL
DATABASE
 It is a collection of data
that is structured,
searchable, updated
periodically and cross-
referenced.
 Different databases are:
 Genome Database
 Sequence Database
 Protein Database
 Literature Database
 Disease Database
TOOL
 A program that is used to
extract or retrieve the
desired information from
the database.
 Different types of tools are:
 Database Retrieval Tool i.e.
Entrez
 BLAST
 ORF Finder
 ePCR
 Spidey
DATABASES AND TOOLS OF NCBI
TOOLS OF NCBI
DATABASE RETRIEVAL TOOL-
ENTREZ
Entrez is an integrated database search and retrieval
system that extracts information from DNA and protein
sequence data, population sets, whole genome,
macromolecular structures, and the biomedical literature
via PubMed.
Entrez provides extensive links within and between
database records.
http://www.ncbi.nlm.nih.gov/gquery/
ARCHITECTURE OF THE ENTREZ SYSTEM
BLAST-BASIC LOCAL ALIGNMENT
SEARCH TOOL
The BLAST programs perform sequence-similarity searches
against a variety of sequence databases, returning a set of
gapped alignments with links to full database records, to
UniGene, Gene, the MMDB, or GEO.
The BLAST tools available at NCBI are classified into
different categories.
Two important ones are:
 Standard BLAST
 MegaBLAST
STANDARD BLAST
Standard BLAST includes:
 blastn : Comparing the nucleotide sequence query
against a nucleotide sequence database.
 blastp : Comparing the amino acid query against a
protein sequence database.
 blastx : Comparing the nucleotide query sequence
translated in all reading frames against a protein
database.
• tblastn : Comparing the protein query
sequence against a nucleotide database
translated in all reading frames.
tblastx : Comparing the six –reading
frame translations of the nucleotide
query against six frame translations of
the nucleotide sequence database.
MegaBLAST
MegaBLAST is a program optimized for aligning long
sequences.
It can only work with DNA sequences, hence the only
program it supports is “blastn”.
It is faster than blastn but less sensitive,
SEQUENCE SUBMISSION TO NCBI
The databases are constantly updated through newer
submissions of sequences, and this is done using the
following sequence submission tools :
1. BankIt
2. Sequin
BankIt
BankIT is a web based GenBank sequence submission tool.
It is a tool of choice for simple submissions, especially
when only one or small number of records are to be
submitted. It can also be used by submitters to update
their existing GenBank records. Sequence analysis tools are
not required for submission through this process.
SEQUIN
Sequin is a stand-alone software tool developed by NCBI
which aids in submission and updating entries to the
sequence databases. It helps in handling multiple
sequence submissions, provides increased capacity for
complex submissions containing long sequences, multiple
annotations, segmented sets of DNA or phylogenetic and
population studies.
It also provides graphical viewing and editing options.
NCBI HOME PAGE
SPECIALISED TOOLS
Some of the specialized tools for the sequence analysis are
:
1. ORF Finder
2. e-PCR
3. Spidey
Open Reading Frame (ORF)
Finder
ORF Finder is an essential graphical analysis tool, which
finds all open reading frames of a selectable minimum size
in a user’s sequence or in a sequence already in the
database.
It uses the standard or alternative genetic codes to identify
all open reading frames.
This is helpful in preparing complete and accurate
sequence submissions. It is also packaged with the Sequin
sequence submission software.
e-PCR (Electronic Polymerase
Chain Reaction)
e-PCR is a computational procedure that is used to identify
sequence-tagged sites (STSs) within DNA sequeces. While
looking for potential STSs in DNA sequences e-PCR searches
for sub-sequences that closely match the PCR primers and
have the correct order, orientation, and spacing that could
represent the PCR primers used to generate known
STSs.The new version of e-PCr provides a search mode
using a query sequence against a sequence database.
SPIDEY
This is an m-RNA to genomic alignment program ,which
uses the local alignment tools like BLAST to find its
alignment. Spidey takes as an input a single genomic
sequence and a set of mRNA-FASTA sequences. At first,
Spidey defines windows on the genomic sequence and then
perform the mRNA-to-genomic alignment separately within
each window to avoid including exons from paralogs and
pseudogenes. It has no maximum intron size and does not
favour shorter or longer introns.
Databases
 Structured collection of information.
 Consists of basic units called record or enteries.
 The prefect database-
 Comprehensive but easy to search
 Cross referenced
 Minimum redundancy
NCBI Databases
 Nucleotide database
 Literature database
 Protein database
 Gene expression database
 Structural database
 Chemical database
 Other databases
Kinds of databases
Primary database
 Original submissions by
experimentalists.
 Database staff organise
but don’t add additional
information.
 Example - Genbank
Derivative databases
 Derived from primary
data
 Content controlled by
third party.
 Examples – Refseq,
SWISS-PROT, unigene
Nucleotide database
 GENBANK
 NCBI’s primary sequence data
 It is a comprehensive public database of nucleotide
sequences.
 Genbank along with EMBL and DDBJ comprises the INSD.
 It is a collaborative approach for exchanging data daily
to ensure a uniform and comprehensive collection of
sequence information.
Accession numbers are labels for
sequences
 DNA sequences and other molecular data are tagged with
accession numbers that are used to identify a sequence or
other record relevant to molecular data.
 It is string of letters and/or numbers that corresponds to a
molecular sequence.
 It is shared among the 3 collaborating databases and
remains constant over the lifetime of record.
 The DNA sequence within a Genbank record is also assigned
a unique NCBI identifier called a ‘gi’ that apperas on the
version line of flat file records following the accession
number.
Retrieval of nucleotide sequence of
beta-globin gene from Xenopus laevis
NCBI’s Derivative Sequence
Database
 RefSeq
 It is a collection of non redundant set of nucleotide and
protein sequences.
 It is derived from the primary submissions available in the
GenBank.
 RefSeq records can be distinguished from GenBank records
by the format of the accession series
 RefSeq accession numbers are formatted as two alphabetic
characters followed by an underscore ‘-’
 The GenBank accession never include an underscore.
Literature database
 PMC – PubMed Central
 It is a digital archive of peer-reviewed journals in the
life sciences providing access to full-text articles.
 All PMC free articles are identified in PubMed search
results and PMC itself can be searched using Entrez.
Retrieval of complete entry of role of
remorin protein in the pubmed
database
Protein database
 Entrez protein is the protein sequence database of NCBI.
 The protein sequences in this database come from several
different sources such as Swiss-Prot,PDB.
 There are GenPept translations for each of the coding
sequences within the GenBank nucleotide database.
 The Entrez protein database is cross linked to the Entrez
taxonomy database.
 It is also linled to CDD.
 After clicking on the individual search results of Entrez
protein,the protein sequence is displayed in a particular
format which is known as GenPept.
Expression database
 GEO-Gene Expression Omnibus
 Distribution and regulation of the transcriptional
products of normal and abnormal cell types.
 SAGE map- serial analysis of gene expression map.
Structural database
 MMDB-Molecular modelling database.
 3D macromolecular structures.
 XRD and NMR are being used for the experimental structure
determination.
 These provide a wealth of information regarding the biological
function,mechanism linked to the function,the evolutionary history of the
function and relationship between the macromolecules.
Chemical database
 PubChem is a database of chemical molecules
maintained by NCBI.
 It focuses on the chemical,structural and biological
properties of small molecules
 Molecular mass below 2000u.
Other databases
 OMIM-Online Mendelian Inheritance in Man.
 It is a comprehensive,authoritative and timely
knowledge base of human genes and genetic disorders.
 OMIA-Online Mendelian Inheritance in Animals.
 It is a database of genes,inhertited disorders and traits
in animal species other than human and mouse.
THANK
YOU… !!! 

Contenu connexe

Tendances (20)

Swiss PROT
Swiss PROT Swiss PROT
Swiss PROT
 
Kegg
KeggKegg
Kegg
 
EMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology LaboratoryEMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology Laboratory
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
 
UniProt
UniProtUniProt
UniProt
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Ddbj
DdbjDdbj
Ddbj
 
Protein database
Protein databaseProtein database
Protein database
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 

Similaire à Ncbi

Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuKAUSHAL SAHU
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformaticsAtai Rabby
 
Primary sequencing of nucleic acids
Primary sequencing of nucleic acidsPrimary sequencing of nucleic acids
Primary sequencing of nucleic acidsvibhakumari12
 
Sequencedatabases
SequencedatabasesSequencedatabases
SequencedatabasesAbhik Seal
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsAyeshaYousaf20
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES nadeem akhter
 
database retrival.pdf
database retrival.pdfdatabase retrival.pdf
database retrival.pdfSrimathideviJ
 
Introduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxIntroduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxRAJESHKUMAR428748
 
JEVBase: An Interactive Resource for Protein Annotationof JE Virus
JEVBase: An Interactive Resource for Protein Annotationof JE VirusJEVBase: An Interactive Resource for Protein Annotationof JE Virus
JEVBase: An Interactive Resource for Protein Annotationof JE VirusCSCJournals
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...Elufer Akram
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsRaj Varun
 
02. Biological sequence databases.pptx
02. Biological sequence databases.pptx02. Biological sequence databases.pptx
02. Biological sequence databases.pptxHussainTaqi1
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformaticsVinaKhan1
 

Similaire à Ncbi (20)

Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
NCBI
NCBINCBI
NCBI
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
Article
ArticleArticle
Article
 
Primary sequencing of nucleic acids
Primary sequencing of nucleic acidsPrimary sequencing of nucleic acids
Primary sequencing of nucleic acids
 
Understanding Genome
Understanding Genome Understanding Genome
Understanding Genome
 
Sequencedatabases
SequencedatabasesSequencedatabases
Sequencedatabases
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomics
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Databases_L2.pptx
Databases_L2.pptxDatabases_L2.pptx
Databases_L2.pptx
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES
 
database retrival.pdf
database retrival.pdfdatabase retrival.pdf
database retrival.pdf
 
Introduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxIntroduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptx
 
JEVBase: An Interactive Resource for Protein Annotationof JE Virus
JEVBase: An Interactive Resource for Protein Annotationof JE VirusJEVBase: An Interactive Resource for Protein Annotationof JE Virus
JEVBase: An Interactive Resource for Protein Annotationof JE Virus
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
02. Biological sequence databases.pptx
02. Biological sequence databases.pptx02. Biological sequence databases.pptx
02. Biological sequence databases.pptx
 
Intro to databases
Intro to databasesIntro to databases
Intro to databases
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 

Dernier

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Servicenishacall1
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxRizalinePalanog2
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Mohammad Khajehpour
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flyPRADYUMMAURYA1
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Servicemonikaservice1
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONrouseeyyy
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 

Dernier (20)

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 

Ncbi

  • 1. NCBI National Centre For Biotechnology Information Site: www.ncbi.nlm.nih.gov By Richa Sharma M.Sc. Biomedical Sciences Dr. BR Ambedkar Center for Biomedical aresearch (ACBR)
  • 2. INTRODUCTION NCBI was established in the year 1988, as a part of the National Library of Medicine at the National Institutes of Health, Maryland, USA
  • 4. DIFFERENCES BETWEEN DATABASE AND TOOL DATABASE  It is a collection of data that is structured, searchable, updated periodically and cross- referenced.  Different databases are:  Genome Database  Sequence Database  Protein Database  Literature Database  Disease Database TOOL  A program that is used to extract or retrieve the desired information from the database.  Different types of tools are:  Database Retrieval Tool i.e. Entrez  BLAST  ORF Finder  ePCR  Spidey
  • 7. DATABASE RETRIEVAL TOOL- ENTREZ Entrez is an integrated database search and retrieval system that extracts information from DNA and protein sequence data, population sets, whole genome, macromolecular structures, and the biomedical literature via PubMed. Entrez provides extensive links within and between database records. http://www.ncbi.nlm.nih.gov/gquery/
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13. ARCHITECTURE OF THE ENTREZ SYSTEM
  • 14. BLAST-BASIC LOCAL ALIGNMENT SEARCH TOOL The BLAST programs perform sequence-similarity searches against a variety of sequence databases, returning a set of gapped alignments with links to full database records, to UniGene, Gene, the MMDB, or GEO. The BLAST tools available at NCBI are classified into different categories. Two important ones are:  Standard BLAST  MegaBLAST
  • 15. STANDARD BLAST Standard BLAST includes:  blastn : Comparing the nucleotide sequence query against a nucleotide sequence database.  blastp : Comparing the amino acid query against a protein sequence database.  blastx : Comparing the nucleotide query sequence translated in all reading frames against a protein database.
  • 16. • tblastn : Comparing the protein query sequence against a nucleotide database translated in all reading frames. tblastx : Comparing the six –reading frame translations of the nucleotide query against six frame translations of the nucleotide sequence database.
  • 17. MegaBLAST MegaBLAST is a program optimized for aligning long sequences. It can only work with DNA sequences, hence the only program it supports is “blastn”. It is faster than blastn but less sensitive,
  • 18. SEQUENCE SUBMISSION TO NCBI The databases are constantly updated through newer submissions of sequences, and this is done using the following sequence submission tools : 1. BankIt 2. Sequin
  • 19. BankIt BankIT is a web based GenBank sequence submission tool. It is a tool of choice for simple submissions, especially when only one or small number of records are to be submitted. It can also be used by submitters to update their existing GenBank records. Sequence analysis tools are not required for submission through this process.
  • 20. SEQUIN Sequin is a stand-alone software tool developed by NCBI which aids in submission and updating entries to the sequence databases. It helps in handling multiple sequence submissions, provides increased capacity for complex submissions containing long sequences, multiple annotations, segmented sets of DNA or phylogenetic and population studies. It also provides graphical viewing and editing options.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26. SPECIALISED TOOLS Some of the specialized tools for the sequence analysis are : 1. ORF Finder 2. e-PCR 3. Spidey
  • 27. Open Reading Frame (ORF) Finder ORF Finder is an essential graphical analysis tool, which finds all open reading frames of a selectable minimum size in a user’s sequence or in a sequence already in the database. It uses the standard or alternative genetic codes to identify all open reading frames. This is helpful in preparing complete and accurate sequence submissions. It is also packaged with the Sequin sequence submission software.
  • 28. e-PCR (Electronic Polymerase Chain Reaction) e-PCR is a computational procedure that is used to identify sequence-tagged sites (STSs) within DNA sequeces. While looking for potential STSs in DNA sequences e-PCR searches for sub-sequences that closely match the PCR primers and have the correct order, orientation, and spacing that could represent the PCR primers used to generate known STSs.The new version of e-PCr provides a search mode using a query sequence against a sequence database.
  • 29. SPIDEY This is an m-RNA to genomic alignment program ,which uses the local alignment tools like BLAST to find its alignment. Spidey takes as an input a single genomic sequence and a set of mRNA-FASTA sequences. At first, Spidey defines windows on the genomic sequence and then perform the mRNA-to-genomic alignment separately within each window to avoid including exons from paralogs and pseudogenes. It has no maximum intron size and does not favour shorter or longer introns.
  • 30. Databases  Structured collection of information.  Consists of basic units called record or enteries.  The prefect database-  Comprehensive but easy to search  Cross referenced  Minimum redundancy
  • 31. NCBI Databases  Nucleotide database  Literature database  Protein database  Gene expression database  Structural database  Chemical database  Other databases
  • 32.
  • 33. Kinds of databases Primary database  Original submissions by experimentalists.  Database staff organise but don’t add additional information.  Example - Genbank Derivative databases  Derived from primary data  Content controlled by third party.  Examples – Refseq, SWISS-PROT, unigene
  • 34. Nucleotide database  GENBANK  NCBI’s primary sequence data  It is a comprehensive public database of nucleotide sequences.  Genbank along with EMBL and DDBJ comprises the INSD.  It is a collaborative approach for exchanging data daily to ensure a uniform and comprehensive collection of sequence information.
  • 35.
  • 36. Accession numbers are labels for sequences  DNA sequences and other molecular data are tagged with accession numbers that are used to identify a sequence or other record relevant to molecular data.  It is string of letters and/or numbers that corresponds to a molecular sequence.  It is shared among the 3 collaborating databases and remains constant over the lifetime of record.  The DNA sequence within a Genbank record is also assigned a unique NCBI identifier called a ‘gi’ that apperas on the version line of flat file records following the accession number.
  • 37. Retrieval of nucleotide sequence of beta-globin gene from Xenopus laevis
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43. NCBI’s Derivative Sequence Database  RefSeq  It is a collection of non redundant set of nucleotide and protein sequences.  It is derived from the primary submissions available in the GenBank.  RefSeq records can be distinguished from GenBank records by the format of the accession series  RefSeq accession numbers are formatted as two alphabetic characters followed by an underscore ‘-’  The GenBank accession never include an underscore.
  • 44. Literature database  PMC – PubMed Central  It is a digital archive of peer-reviewed journals in the life sciences providing access to full-text articles.  All PMC free articles are identified in PubMed search results and PMC itself can be searched using Entrez.
  • 45. Retrieval of complete entry of role of remorin protein in the pubmed database
  • 46.
  • 47.
  • 48.
  • 49. Protein database  Entrez protein is the protein sequence database of NCBI.  The protein sequences in this database come from several different sources such as Swiss-Prot,PDB.  There are GenPept translations for each of the coding sequences within the GenBank nucleotide database.  The Entrez protein database is cross linked to the Entrez taxonomy database.  It is also linled to CDD.  After clicking on the individual search results of Entrez protein,the protein sequence is displayed in a particular format which is known as GenPept.
  • 50. Expression database  GEO-Gene Expression Omnibus  Distribution and regulation of the transcriptional products of normal and abnormal cell types.  SAGE map- serial analysis of gene expression map.
  • 51. Structural database  MMDB-Molecular modelling database.  3D macromolecular structures.  XRD and NMR are being used for the experimental structure determination.  These provide a wealth of information regarding the biological function,mechanism linked to the function,the evolutionary history of the function and relationship between the macromolecules.
  • 52. Chemical database  PubChem is a database of chemical molecules maintained by NCBI.  It focuses on the chemical,structural and biological properties of small molecules  Molecular mass below 2000u.
  • 53. Other databases  OMIM-Online Mendelian Inheritance in Man.  It is a comprehensive,authoritative and timely knowledge base of human genes and genetic disorders.  OMIA-Online Mendelian Inheritance in Animals.  It is a database of genes,inhertited disorders and traits in animal species other than human and mouse.