SlideShare une entreprise Scribd logo
1  sur  21
Télécharger pour lire hors ligne
Integrated database biology
    with well-curated and
 well-circulated knowledge

        Dr. Hidemasa Bono
    Database Center for Life Science(DBCLS)
Research Organization of Information and Systems


                  © ライフサイエンス統合データベースセンター/大学共同利用機関法人 情報・システム研究機構
English?




           2
日本語?




       3
DBCLS: Database Center for Life Science
• Since 2007               バイオDB7-8
• Located at Hongo campus(Asano area)
  of the University of Tokyo(UT)
• Not affiliated to UT


            http://dbcls.rois.ac.jp/


                                        4
NBDC
• National Bioscience Database Center
• Since 2011                                                     バイオDB1-3
• Affiliated to Japan Science and
  Technology Agency (JST)




        http://biosciencedbc.jp/nbdc.cgi?lng=en&gg=projects_and_activities   5
http://biosciencedbc.jp/
                  2P-0220




                            6
DB Catalog




http://biosciencedbc.jp/   7
http://biosciencedbc.jp/dbcatalog/   8
DB Cross Search




http://biosciencedbc.jp/   9
2P-0240
http://biosciencedbc.jp/dbsearch/en/
                                   10
DB Archive




http://biosciencedbc.jp/   11
12
What is DBCLS doing now?




    http://biosciencedbc.jp/nbdc.cgi?lng=en&gg=projects_and_activities   13
Technology development
 of database integration
1.Database integration with RDF
2.Development and maintenance of research
 environment for accessing databases 2P-0978
3.Technology development of the integrated
 database search
4.Maintenance and standardization of ontologies,
 dictionaries, and corpus
5.Technology development of huge amount of
 public biological data
6.Development and distribution of the system for
 manual curation                        2P-0269
7.Development and maintenance of contents
 concerning the integrated database     1P-0881  14
統合TV (TogoTV)
•Curated tutorial movies for DB&tools
 ‒Freely available from YouTube & iTunes Store
 ‒Lectures from various classes
                       Kawano S, Ono H, Takagi T, Bono H
 ‒ over 500   contents Brief Bioinform. Jul 29 (2011)




                                                      15
BodyParts3D/Anatomography
                                            d /
                                         p 3
                                       /b
                                   . jp
                                d b
                             c e
                         ie n
                   s c
               if e
            / l
     p :/
  t t
 h                                                16
Wikimedia commons for circulation




                                    17
RefEx: curated expression dataset
for circulation of knowledge obtained
  http://refex.dbcls.jp/
 • RefEx: Reference Expression dataset
   ‒GGRNA 2P-0131                 2P-0113
  ‒Bodyparts3D




                                  18
How to deal with big data
                      2P-0132
        from NGS
• Before publication
                             SRR001356.1 2023DAAXX:5:1:123:563 length=33
                             TGTCGGTCCAGCTCGGCCTTGGGCTCCGTTTTC




                                FASTQ
                             +SRR001356.1 2023DAAXX:5:1:123:563 length=33
                             -IIIIIIII8IIIIIIIIIII6IIIIIIIII9I
                             @SRR001356.2 2023DAAXX:5:1:123:476 length=33
                             TCTGAACCCGACTCCCTTTCGATCGGCCGCGGG
                             +SRR001356.2 2023DAAXX:5:1:123:476 length=33


 ‒TogoTV: How to...
                             IIIIIIIIIIIIIIIIIIIIIGIIIIIII-III
                             @SRR001356.3 2023DAAXX:5:1:121:746 length=33
                             GTGGCAGCGTTTTTGGGCCCGCCGCTTGCCGTT
                             +SRR001356.3 2023DAAXX:5:1:121:746 length=33
                             IIIII&IIIIIIIIIIIIIIIIHI1IIIIIIII



   •Make use of available tools
   •Handle huge amount of data
   •Submit to DDBJ Sequence Read Archive(DRA)

• After publication
 ‒Promote recycle of archived data
   •Metadata(Experimental condition etc)                                                      19
                                                             © 2011 DBCLS Licensed under CC 表示 2.1 日本
Digest of archived NGS data
• http://sra.dbcls.jp/
• Search by publications




• Search by diseases     2P-0133
                                   20
Conclusion
★   Curation and circulation of data required.
    ✴ So many raw data, so little curated data and
      circulated knowledge.
    ✴ 「さらさらで、知の巡りのよい分子生物学」


★   Sharing information needed.
    ✴   Join our forum tomorrow! 3F5
    ✴   「もし分子生物学者がGoogle+の招待を受けたら」
    ✴   Join tutorial online and offline(統合DB講習会)
★   For information in Japanese.
    ✴                    バイオDB7-8
        Visit our booth ʻバイオDB7-8ʻ                   21

Contenu connexe

Tendances

Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)Dag Endresen
 
Persistent identifiers for digitized specimens (2013)
Persistent identifiers for digitized specimens (2013)Persistent identifiers for digitized specimens (2013)
Persistent identifiers for digitized specimens (2013)Dag Endresen
 
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...Dag Endresen
 
Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Dag Endresen
 
Darwin Core extension for germplasm (11th December 2013)
Darwin Core extension for germplasm (11th December 2013)Darwin Core extension for germplasm (11th December 2013)
Darwin Core extension for germplasm (11th December 2013)Dag Endresen
 
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014Dag Endresen
 
TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31Dag Endresen
 
EURISCO needs and priorities, at CGIAR ICT-KM Workshop, IPGRI, Rome (2005)
EURISCO needs and priorities, at CGIAR ICT-KM Workshop, IPGRI, Rome (2005)EURISCO needs and priorities, at CGIAR ICT-KM Workshop, IPGRI, Rome (2005)
EURISCO needs and priorities, at CGIAR ICT-KM Workshop, IPGRI, Rome (2005)Dag Endresen
 
SemanticCampLondon, 16th February 2008
SemanticCampLondon, 16th February 2008SemanticCampLondon, 16th February 2008
SemanticCampLondon, 16th February 2008Andrew Walkingshaw
 
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenForschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenHeinz Pampel
 
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014Dag Endresen
 
Managing research data at Bristol
Managing research data at BristolManaging research data at Bristol
Managing research data at BristolSimon Price
 
What is DataCite?
What is DataCite?What is DataCite?
What is DataCite?datacite
 
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)Dag Endresen
 
Architecture of ContentMine Components contentmine.org
Architecture of ContentMine Components contentmine.orgArchitecture of ContentMine Components contentmine.org
Architecture of ContentMine Components contentmine.orgpetermurrayrust
 

Tendances (15)

Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)
 
Persistent identifiers for digitized specimens (2013)
Persistent identifiers for digitized specimens (2013)Persistent identifiers for digitized specimens (2013)
Persistent identifiers for digitized specimens (2013)
 
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
 
Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)
 
Darwin Core extension for germplasm (11th December 2013)
Darwin Core extension for germplasm (11th December 2013)Darwin Core extension for germplasm (11th December 2013)
Darwin Core extension for germplasm (11th December 2013)
 
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
 
TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31
 
EURISCO needs and priorities, at CGIAR ICT-KM Workshop, IPGRI, Rome (2005)
EURISCO needs and priorities, at CGIAR ICT-KM Workshop, IPGRI, Rome (2005)EURISCO needs and priorities, at CGIAR ICT-KM Workshop, IPGRI, Rome (2005)
EURISCO needs and priorities, at CGIAR ICT-KM Workshop, IPGRI, Rome (2005)
 
SemanticCampLondon, 16th February 2008
SemanticCampLondon, 16th February 2008SemanticCampLondon, 16th February 2008
SemanticCampLondon, 16th February 2008
 
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenForschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
 
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
 
Managing research data at Bristol
Managing research data at BristolManaging research data at Bristol
Managing research data at Bristol
 
What is DataCite?
What is DataCite?What is DataCite?
What is DataCite?
 
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
 
Architecture of ContentMine Components contentmine.org
Architecture of ContentMine Components contentmine.orgArchitecture of ContentMine Components contentmine.org
Architecture of ContentMine Components contentmine.org
 

Similaire à Integrated database biology with well-curated and circulated knowledge

Database Integration toward Semantic Web: Development of Ontologies and RDF ...
Database Integration toward Semantic Web: Development of  Ontologies and RDF ...Database Integration toward Semantic Web: Development of  Ontologies and RDF ...
Database Integration toward Semantic Web: Development of Ontologies and RDF ...Database Center for Life Science
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18Dag Endresen
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary databaseKAUSHAL SAHU
 
Nucleic Acid Sequence Databases
Nucleic Acid Sequence DatabasesNucleic Acid Sequence Databases
Nucleic Acid Sequence Databasesfarwa fayaz
 
Scott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScienceScott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScienceGigaScience, BGI Hong Kong
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptxvijayapraba1
 
Prototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyPrototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyArchiver
 
Haystack Live tallison_202010_v2
Haystack Live tallison_202010_v2Haystack Live tallison_202010_v2
Haystack Live tallison_202010_v2Tim Allison
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourcePhilippa Griffin
 
Data management plans archeology class 10 18 2012
Data management plans archeology class 10 18 2012Data management plans archeology class 10 18 2012
Data management plans archeology class 10 18 2012Elizabeth Brown
 
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating CenterInternational Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating CenterNeuro, McGill University
 
IDB-Cloud Providing Bioinformatics Services on Cloud
IDB-Cloud Providing Bioinformatics Services on CloudIDB-Cloud Providing Bioinformatics Services on Cloud
IDB-Cloud Providing Bioinformatics Services on Cloudstratuslab
 

Similaire à Integrated database biology with well-curated and circulated knowledge (20)

Database Integration toward Semantic Web: Development of Ontologies and RDF ...
Database Integration toward Semantic Web: Development of  Ontologies and RDF ...Database Integration toward Semantic Web: Development of  Ontologies and RDF ...
Database Integration toward Semantic Web: Development of Ontologies and RDF ...
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
Ddbj
DdbjDdbj
Ddbj
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Nucleic Acid Sequence Databases
Nucleic Acid Sequence DatabasesNucleic Acid Sequence Databases
Nucleic Acid Sequence Databases
 
Bioinformatica 06-10-2011-t2-databases
Bioinformatica 06-10-2011-t2-databasesBioinformatica 06-10-2011-t2-databases
Bioinformatica 06-10-2011-t2-databases
 
Scott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScienceScott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScience
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx
 
Prototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyPrototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and Ceremony
 
Protein Database
Protein DatabaseProtein Database
Protein Database
 
Haystack Live tallison_202010_v2
Haystack Live tallison_202010_v2Haystack Live tallison_202010_v2
Haystack Live tallison_202010_v2
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data Resource
 
2015 genome-center
2015 genome-center2015 genome-center
2015 genome-center
 
Data management plans archeology class 10 18 2012
Data management plans archeology class 10 18 2012Data management plans archeology class 10 18 2012
Data management plans archeology class 10 18 2012
 
NCBO Technology
NCBO TechnologyNCBO Technology
NCBO Technology
 
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating CenterInternational Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
 
IDB-Cloud Providing Bioinformatics Services on Cloud
IDB-Cloud Providing Bioinformatics Services on CloudIDB-Cloud Providing Bioinformatics Services on Cloud
IDB-Cloud Providing Bioinformatics Services on Cloud
 

Plus de Hidemasa Bono

DDBJing on 20140612 by Hidemasa Bono
DDBJing on 20140612 by Hidemasa BonoDDBJing on 20140612 by Hidemasa Bono
DDBJing on 20140612 by Hidemasa BonoHidemasa Bono
 
新規医療開発に関わる統計学 (バイオインフォマティクス)
新規医療開発に関わる統計学 (バイオインフォマティクス)新規医療開発に関わる統計学 (バイオインフォマティクス)
新規医療開発に関わる統計学 (バイオインフォマティクス)Hidemasa Bono
 
What was togofarm on earth?
What was togofarm on earth?What was togofarm on earth?
What was togofarm on earth?Hidemasa Bono
 
“これから”のライフサイエンス研究とバイオインフォマティクス (Next Generation Life Science & Bioinformatics)
“これから”のライフサイエンス研究とバイオインフォマティクス (Next Generation Life Science & Bioinformatics)“これから”のライフサイエンス研究とバイオインフォマティクス (Next Generation Life Science & Bioinformatics)
“これから”のライフサイエンス研究とバイオインフォマティクス (Next Generation Life Science & Bioinformatics)Hidemasa Bono
 
データベース活用による 知のめぐりのよい細胞生物学
データベース活用による 知のめぐりのよい細胞生物学データベース活用による 知のめぐりのよい細胞生物学
データベース活用による 知のめぐりのよい細胞生物学Hidemasa Bono
 
バイオインフォマティクス(2013年度以降用改訂版)
バイオインフォマティクス(2013年度以降用改訂版)バイオインフォマティクス(2013年度以降用改訂版)
バイオインフォマティクス(2013年度以降用改訂版)Hidemasa Bono
 
データベースから始まる分子生物学~トランスクリプトーム解析研究の新しいスタイル~
データベースから始まる分子生物学~トランスクリプトーム解析研究の新しいスタイル~データベースから始まる分子生物学~トランスクリプトーム解析研究の新しいスタイル~
データベースから始まる分子生物学~トランスクリプトーム解析研究の新しいスタイル~Hidemasa Bono
 
第57回日本人類遺伝学会大会 教育講演「バイオインフォマティクス:データベース統合化によるアプローチ」
第57回日本人類遺伝学会大会 教育講演「バイオインフォマティクス:データベース統合化によるアプローチ」第57回日本人類遺伝学会大会 教育講演「バイオインフォマティクス:データベース統合化によるアプローチ」
第57回日本人類遺伝学会大会 教育講演「バイオインフォマティクス:データベース統合化によるアプローチ」Hidemasa Bono
 
bonohu's presentation in Osaka.R#6
bonohu's presentation in Osaka.R#6bonohu's presentation in Osaka.R#6
bonohu's presentation in Osaka.R#6Hidemasa Bono
 

Plus de Hidemasa Bono (10)

DDBJing on 20140612 by Hidemasa Bono
DDBJing on 20140612 by Hidemasa BonoDDBJing on 20140612 by Hidemasa Bono
DDBJing on 20140612 by Hidemasa Bono
 
新規医療開発に関わる統計学 (バイオインフォマティクス)
新規医療開発に関わる統計学 (バイオインフォマティクス)新規医療開発に関わる統計学 (バイオインフォマティクス)
新規医療開発に関わる統計学 (バイオインフォマティクス)
 
What was togofarm on earth?
What was togofarm on earth?What was togofarm on earth?
What was togofarm on earth?
 
“これから”のライフサイエンス研究とバイオインフォマティクス (Next Generation Life Science & Bioinformatics)
“これから”のライフサイエンス研究とバイオインフォマティクス (Next Generation Life Science & Bioinformatics)“これから”のライフサイエンス研究とバイオインフォマティクス (Next Generation Life Science & Bioinformatics)
“これから”のライフサイエンス研究とバイオインフォマティクス (Next Generation Life Science & Bioinformatics)
 
データベース活用による 知のめぐりのよい細胞生物学
データベース活用による 知のめぐりのよい細胞生物学データベース活用による 知のめぐりのよい細胞生物学
データベース活用による 知のめぐりのよい細胞生物学
 
バイオインフォマティクス(2013年度以降用改訂版)
バイオインフォマティクス(2013年度以降用改訂版)バイオインフォマティクス(2013年度以降用改訂版)
バイオインフォマティクス(2013年度以降用改訂版)
 
データベースから始まる分子生物学~トランスクリプトーム解析研究の新しいスタイル~
データベースから始まる分子生物学~トランスクリプトーム解析研究の新しいスタイル~データベースから始まる分子生物学~トランスクリプトーム解析研究の新しいスタイル~
データベースから始まる分子生物学~トランスクリプトーム解析研究の新しいスタイル~
 
第57回日本人類遺伝学会大会 教育講演「バイオインフォマティクス:データベース統合化によるアプローチ」
第57回日本人類遺伝学会大会 教育講演「バイオインフォマティクス:データベース統合化によるアプローチ」第57回日本人類遺伝学会大会 教育講演「バイオインフォマティクス:データベース統合化によるアプローチ」
第57回日本人類遺伝学会大会 教育講演「バイオインフォマティクス:データベース統合化によるアプローチ」
 
TogoRecipes 120907
TogoRecipes 120907TogoRecipes 120907
TogoRecipes 120907
 
bonohu's presentation in Osaka.R#6
bonohu's presentation in Osaka.R#6bonohu's presentation in Osaka.R#6
bonohu's presentation in Osaka.R#6
 

Dernier

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 

Dernier (20)

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 

Integrated database biology with well-curated and circulated knowledge

  • 1. Integrated database biology with well-curated and well-circulated knowledge Dr. Hidemasa Bono Database Center for Life Science(DBCLS) Research Organization of Information and Systems © ライフサイエンス統合データベースセンター/大学共同利用機関法人 情報・システム研究機構
  • 4. DBCLS: Database Center for Life Science • Since 2007 バイオDB7-8 • Located at Hongo campus(Asano area) of the University of Tokyo(UT) • Not affiliated to UT http://dbcls.rois.ac.jp/ 4
  • 5. NBDC • National Bioscience Database Center • Since 2011 バイオDB1-3 • Affiliated to Japan Science and Technology Agency (JST) http://biosciencedbc.jp/nbdc.cgi?lng=en&gg=projects_and_activities 5
  • 12. 12
  • 13. What is DBCLS doing now? http://biosciencedbc.jp/nbdc.cgi?lng=en&gg=projects_and_activities 13
  • 14. Technology development of database integration 1.Database integration with RDF 2.Development and maintenance of research environment for accessing databases 2P-0978 3.Technology development of the integrated database search 4.Maintenance and standardization of ontologies, dictionaries, and corpus 5.Technology development of huge amount of public biological data 6.Development and distribution of the system for manual curation 2P-0269 7.Development and maintenance of contents concerning the integrated database 1P-0881 14
  • 15. 統合TV (TogoTV) •Curated tutorial movies for DB&tools ‒Freely available from YouTube & iTunes Store ‒Lectures from various classes Kawano S, Ono H, Takagi T, Bono H ‒ over 500 contents Brief Bioinform. Jul 29 (2011) 15
  • 16. BodyParts3D/Anatomography d / p 3 /b . jp d b c e ie n s c if e / l p :/ t t h 16
  • 17. Wikimedia commons for circulation 17
  • 18. RefEx: curated expression dataset for circulation of knowledge obtained http://refex.dbcls.jp/ • RefEx: Reference Expression dataset ‒GGRNA 2P-0131 2P-0113 ‒Bodyparts3D 18
  • 19. How to deal with big data 2P-0132 from NGS • Before publication SRR001356.1 2023DAAXX:5:1:123:563 length=33 TGTCGGTCCAGCTCGGCCTTGGGCTCCGTTTTC FASTQ +SRR001356.1 2023DAAXX:5:1:123:563 length=33 -IIIIIIII8IIIIIIIIIII6IIIIIIIII9I @SRR001356.2 2023DAAXX:5:1:123:476 length=33 TCTGAACCCGACTCCCTTTCGATCGGCCGCGGG +SRR001356.2 2023DAAXX:5:1:123:476 length=33 ‒TogoTV: How to... IIIIIIIIIIIIIIIIIIIIIGIIIIIII-III @SRR001356.3 2023DAAXX:5:1:121:746 length=33 GTGGCAGCGTTTTTGGGCCCGCCGCTTGCCGTT +SRR001356.3 2023DAAXX:5:1:121:746 length=33 IIIII&IIIIIIIIIIIIIIIIHI1IIIIIIII •Make use of available tools •Handle huge amount of data •Submit to DDBJ Sequence Read Archive(DRA) • After publication ‒Promote recycle of archived data •Metadata(Experimental condition etc) 19 © 2011 DBCLS Licensed under CC 表示 2.1 日本
  • 20. Digest of archived NGS data • http://sra.dbcls.jp/ • Search by publications • Search by diseases 2P-0133 20
  • 21. Conclusion ★ Curation and circulation of data required. ✴ So many raw data, so little curated data and circulated knowledge. ✴ 「さらさらで、知の巡りのよい分子生物学」 ★ Sharing information needed. ✴ Join our forum tomorrow! 3F5 ✴ 「もし分子生物学者がGoogle+の招待を受けたら」 ✴ Join tutorial online and offline(統合DB講習会) ★ For information in Japanese. ✴ バイオDB7-8 Visit our booth ʻバイオDB7-8ʻ 21