SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
BioRuby




BioRuby
― Bioinformatics Library
― 生物情報科学用ライブラリ

Naohisa Goto / 後藤直久
Genome Information Research Center, Research
Institute for Microbial Diseases, Osaka Univ.
大阪大学微生物病研究所附属遺伝情報実験センター
Email: ngoto@gen-info.osaka-u.ac.jp
twitter: @ngotogenome
BioRuby


   Who am I? / 自己紹介
    Name: Naohisa Goto
    名前: 後藤 直久
    Affiliation: Genome Information Research Center,
     Research Institute for Microbial Diseases, Osaka
     University
    所属: 大阪大学微生物病研究所附属遺伝情報実験センター

    Twitter: @ngotogenome
    Email: ngoto@gen-info.osaka-u.ac.jp

    First Ruby experience: 1.2.6 (compiled in 22/Jun/1999)
BioRuby


   BioRuby
    Bioinformatics software library and tools written
     in the Ruby Language
    Rubyで書かれた生物情報科学(バイオインフォマティ
     クス)用ライブラリとツール集

    Free software (Ruby License)
    http://bioruby.org/
    https://github.com/bioruby/bioruby
    % gem install bio
BioRuby


   DNA
                           DNA is a chain made of the
                           four molecules.
                           DNAは4種類の分子からなる鎖
                            A (Adenine)
                            C (Cytosine)
                            G (Guanine)
                            T (Thymine)
                           DNA can be treated as String.
                           DNAは文字列として扱える。
                          (ところで、Encoding どうしよう…)

                    Human: Total 3GB (49-247MB/chromosome)
                    ヒト: 合計3GB (染色体1本あたり49~247MB)
図: WikiPedia: 染色体
BioRuby


   An example DNA data (with metadata / 付加情報含む)
   >gi|60459557|gb|AY948115.1| Homo sapiens alcohol dehydrogenase 1A (class
   I), alpha polypeptide (ADH1A) gene, complete cds
   GAGGGCGACAAAAGGGAACAGACCCAAAACCACAGGAGAGATGCTAGCATGACAGGGATGCAGAGACATA
   AAGCACAACAGTGAGATGGAGTTAATATACCTCCACGAGGGTGACCTTGTCCTGCATCTCAAATTTTGGG
   TAGGATTTGAATGGGCCAGAGGGACAGAAAAGAAGAGAAAGAGCATGATGAGCAAGGGCTTGAATGTTAA
   ATAGATTCCTCTTTGGGGGACCAGGGAGATACAAGCTTCTAAAGCACATACGCCCTGTATTGGAGAATGG
   GGAGGAGTAGATAGATGAGAAGGTTGAAGCCATATTACGAAGCCTTGAATGCTGAACATCAGATCTGGGG
   CTATATTCTTACCTTGATACATTTCAGAAGCAACTGAAATCGTAGGACCTTCCTTGCTTCTCTATTGGGT
   GAATGTTTCTCAGTCTTGGTGTGAGTCTCAGTGCCTACGTAGTTAAAGCTTACTGAAATGTTCCCTTTAC
   AATTCTAGAGAGATATGTCCTTTATGTTGACATGTTCATGCTGACAGACTGCATCTGATTAAACAGCTGC
   CTGTGCAATGCCTCCAAGTGTGGATAAAAGAAAAATTAAACTCATAATCTTGGACAGCCATGTGTAGACT
   AGTTACATTGATCAAAGGGCAATAGAAATGATCCAGTGAGGATTTGTCTGAATTTCCCACAATTATTTAA
   AATCTACCTCAAATACCTGTTCATCTATAATGCCTCCCCTGAGGCCTTCATTCTGAATAGTACCTCTGTC
   TCTGTCCCCAAAGCACTAACTGATCCCTGTGATAGCGCACTTCCCAGCCAGGCTGATATGTAGACTTGGC
   TGCCTGTGTATCTTTTCCCCATAGACTGTGAGCTTCCTTTTATGAATAATAATTGTAGCTAGCATTTAGT
   AGGGTGCTCCTACCTGTTAAACTCTATGATGAGTGCTTTACATAGATTATATCATTTATTCACTAAACAG
   TCCTTTAAAATGGTGCTATATTCACTAAACAGTCCTTTAAAATGGTGCTATATTCACTAAACAGTCATTT
   AAAATGGTATTATTCTTCTTCATCTTACAGGTAAACAAACTAAGGCAAAAAAAAAAGTGAAATAATAAGT
   GCCAGTACACAGAGCTAGTAAGGAATAGGGTCTGCCAGGTCCCAAAAAGCATGCCATCACCTTTGCCCCA
   TACTGCCTCTGGTACAGATAGAGGTAATGTCTTATTTATCACTGCCATCCACTGGACCCAGCTTAGTGCC
   TGACACACAGAGGGGCTCAGTCAATGCTGATTGGTTTGAGGTGGAGCAAAAATGCTTAGCAGGGTGAGCA
   CCTTTGCTGTGATTGAGTATCTGATTCTCTATGAAGAGAAGGGGAGTCCTGAGCCAAACACATTCCTCTG
   GCTCCTGGCTGTCATCTTTATTTGCCCGGCTTCTTTGCTCTTCCTCCTTCCTAACTGCACCGTTTGGATT
   (snip / 以下略)
             http://togows.dbcls.jp/entry/genbank/AY948115.1.fasta
BioRuby


   What BioRuby can do? / できること
    Biological data analysis / 生物データの解析
           DNA, RNA
           Protein / タンパク質
           Relation of genes / 遺伝子間の関係性
           Phylogenetic tree / 系統樹
           Bibliography / 文献情報
          …
    I/O with other software / 他のソフトの入出力
    Utilize web services / ウェブサービス利用
BioRuby


   Code example
      DNA → Protein translation / DNA→タンパク質の翻訳

          require "rubygems"
          require "bio"

          f = Bio::FlatFile.open(ARGF)
          f.each do |entry|
            dna = entry.naseq
            aa = dna.translate
            seq = Bio::Sequence.new(seq)
            print seq.output_fasta(e.definition)
          end
BioRuby


   Status
   Latest version: BioRuby 1.4.1 (22/Oct/2010)
    Supported Ruby version: 1.8.x
           Will soon be migrated to1.9 / 速やかに1.9に移行予定
    Files
       Library: 230 files / 35,000 lines (w/o comments, void lines)
       Tests: 120 files / 22,000 lines
       Sample codes: 70 files
    Functionality
       580 classes/modules
       2,800 methods
    Plugin system introduced (using gem)
BioRuby


   BioRuby developer’s community
    Core developers (6 persons)
             Toshiaki Katayama (leader) (Univ. of Tokyo, Japan)
             Naohisa Goto (release manager) (Osaka Univ., Japan)
             Mitsuteru Nakao (Japan)
             Pjotr Prins (Wageningen University, Netherlands)
             Raoul Bonnal (INGM, Italy)
             Jan Aerts (Belgium)
    Total >30 contributors in 10 years
     10年間で延べ30人以上の貢献者
    Active developers / users in the world
     世界中にアクティブな開発者/利用者
BioRuby
BioRuby


   Brief history
   11/2000     BioRuby project started
   06/2001     The first version (BioRuby 0.1)

   2005-2006   IPA Exploratory (未踏) Software Project
   02/2006     BioRuby 1.0.0 released

   09/2008     moved from CVS to Git

   08/2010     Published BioRuby research paper / 学術論文

   10/2010     BioRuby 1.4.1 released
BioRuby


   Preceding projects / 先行プロジェクト
   Demanded by genome projects in late 90’s
   1990年代後半のゲノムプロジェクトに伴う

   BioPerl – since 1996 (Perl 1987)
   Biopython – since 1999 (Python 1991)
   BioJava – since 1999 (Java 1995)
   BioRuby – since 2000 (Ruby 1995)

   Together with Open Bioinformatics Foundation
   http://open-bio.org/
   Google Summer of Code 2009, 2010, 2011
BioRuby


   BioHackathon
    Open Bio* Hackathon (2002, 2003)
    Phyloinformatics Hackathon (2006)
    DBCLS BioHackathon (2008-2010)
BioRuby


   Academic Community / 学会
   •      Bioinformatics Open Source Conference
   •      GIW / JSBi (日本バイオインフォマティクス学会)
   •      MBSJ (日本分子生物学会)
   •      Open Bio Japan
          (オープンバイオ研究会)
BioRuby


   Open Source Community
   • Ruby Kansai (関西Ruby勉強会) (2005-)
   • IPA Exploratory (未踏) Software Project   (2005-2006)

   • RubyKaigi (2006-)

   • Google Summer of Code (2009-2011)
     (Open Bioinformatics Foundation)
BioRuby


   Recent topics
    Release of new version (BioRuby 1.4.2)
    Ruby 1.9.3 migration

    Revolution of DNA sequencing technique
     DNA塩基配列決定技術の飛躍的向上
BioRuby


   Next-Generation Sequencer (NGS)
   Example: Illumina HiSeq 2000
   >600GB DNA sequences in 10-days
   10日間で600GB超のDNA塩基配列を決定する装置
                   Lack of Resources / 足りないもの
                   • HDD
                   • CPU
                   • Memory
                   • Software
                   • Human
                   • Money
                   • ...
BioRuby


   Join us
   BioRuby
     Web      http://bioruby.org
     ML       bioruby@lists.open-bio.org
     GitHub   https://github.org/bioruby/bioruby

                   BioRubyユーザーが書いた本
                   多田雅人著「Rubyではじめる
                   バイオインフォマティクス」
                   発売中!!

Contenu connexe

Tendances

Correlagen next gen presentation 042711
Correlagen next gen presentation 042711Correlagen next gen presentation 042711
Correlagen next gen presentation 042711
algunduz28
 
Next-generation genomics: an integrative approach
Next-generation genomics: an integrative approachNext-generation genomics: an integrative approach
Next-generation genomics: an integrative approach
Hong ChangBum
 

Tendances (12)

NGS overview
NGS overviewNGS overview
NGS overview
 
Correlagen next gen presentation 042711
Correlagen next gen presentation 042711Correlagen next gen presentation 042711
Correlagen next gen presentation 042711
 
Neurotech seminar ish wish 2014 maduna
Neurotech seminar ish wish 2014 madunaNeurotech seminar ish wish 2014 maduna
Neurotech seminar ish wish 2014 maduna
 
ECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing TutorialECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing Tutorial
 
New High Throughput Sequencing technologies at the Norwegian Sequencing Centr...
New High Throughput Sequencing technologies at the Norwegian Sequencing Centr...New High Throughput Sequencing technologies at the Norwegian Sequencing Centr...
New High Throughput Sequencing technologies at the Norwegian Sequencing Centr...
 
How to write bioinformatics software people will use and cite - t.seemann - ...
How to write bioinformatics software people will use and cite -  t.seemann - ...How to write bioinformatics software people will use and cite -  t.seemann - ...
How to write bioinformatics software people will use and cite - t.seemann - ...
 
Single-molecule real-time (SMRT) Nanopore sequencing for Plant Pathology appl...
Single-molecule real-time (SMRT) Nanopore sequencing for Plant Pathology appl...Single-molecule real-time (SMRT) Nanopore sequencing for Plant Pathology appl...
Single-molecule real-time (SMRT) Nanopore sequencing for Plant Pathology appl...
 
Next-generation genomics: an integrative approach
Next-generation genomics: an integrative approachNext-generation genomics: an integrative approach
Next-generation genomics: an integrative approach
 
Sweden_eemis_big_data
Sweden_eemis_big_dataSweden_eemis_big_data
Sweden_eemis_big_data
 
How to Standardise and Assemble Raw Data into Sequences: What Does it Mean fo...
How to Standardise and Assemble Raw Data into Sequences: What Does it Mean fo...How to Standardise and Assemble Raw Data into Sequences: What Does it Mean fo...
How to Standardise and Assemble Raw Data into Sequences: What Does it Mean fo...
 
Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)
 
Genome assembly: the art of trying to make one big thing from millions of ver...
Genome assembly: the art of trying to make one big thing from millions of ver...Genome assembly: the art of trying to make one big thing from millions of ver...
Genome assembly: the art of trying to make one big thing from millions of ver...
 

En vedette (9)

D03-NextGen-Bio-NGS
D03-NextGen-Bio-NGSD03-NextGen-Bio-NGS
D03-NextGen-Bio-NGS
 
Prins Bio Lib Bosc2008
Prins Bio Lib Bosc2008Prins Bio Lib Bosc2008
Prins Bio Lib Bosc2008
 
Amistad
AmistadAmistad
Amistad
 
Prins Bio Lib Bosc 2009
Prins Bio Lib Bosc 2009Prins Bio Lib Bosc 2009
Prins Bio Lib Bosc 2009
 
Experiences with logic programming in bioinformatics
Experiences with logic programming in bioinformaticsExperiences with logic programming in bioinformatics
Experiences with logic programming in bioinformatics
 
Sharing Data: An Introductory Workshop from OpenAIRE and Foster
Sharing Data: An Introductory Workshop from OpenAIRE and FosterSharing Data: An Introductory Workshop from OpenAIRE and Foster
Sharing Data: An Introductory Workshop from OpenAIRE and Foster
 
Open Bioinformatics Foundation: 2014 Update & Some Introspection
Open Bioinformatics Foundation: 2014 Update & Some IntrospectionOpen Bioinformatics Foundation: 2014 Update & Some Introspection
Open Bioinformatics Foundation: 2014 Update & Some Introspection
 
yw jakartarb20101031
yw jakartarb20101031yw jakartarb20101031
yw jakartarb20101031
 
Ch5andch6
Ch5andch6Ch5andch6
Ch5andch6
 

Similaire à BioRuby -- Bioinformatics Library

Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
David Ruau
 
Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes
Genome resources at EMBL-EBI: Ensembl and Ensembl GenomesGenome resources at EMBL-EBI: Ensembl and Ensembl Genomes
Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes
EBI
 
Java Introductie
Java IntroductieJava Introductie
Java Introductie
mbruggen
 
SLE/GPCE Keynote: What's the value of an end user? Platforms and Research: Th...
SLE/GPCE Keynote: What's the value of an end user? Platforms and Research: Th...SLE/GPCE Keynote: What's the value of an end user? Platforms and Research: Th...
SLE/GPCE Keynote: What's the value of an end user? Platforms and Research: Th...
Stéphane Ducasse
 
Antao Biopython Bosc2008
Antao Biopython Bosc2008Antao Biopython Bosc2008
Antao Biopython Bosc2008
bosc_2008
 

Similaire à BioRuby -- Bioinformatics Library (20)

ICAR 2015 Workshop - Nick Provart
ICAR 2015 Workshop - Nick ProvartICAR 2015 Workshop - Nick Provart
ICAR 2015 Workshop - Nick Provart
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental Biology
 
Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes
Genome resources at EMBL-EBI: Ensembl and Ensembl GenomesGenome resources at EMBL-EBI: Ensembl and Ensembl Genomes
Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes
 
Collaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeCollaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of Life
 
Talk6 biopython bosc2011
Talk6 biopython bosc2011Talk6 biopython bosc2011
Talk6 biopython bosc2011
 
Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128
 
Thesis biobix
Thesis biobixThesis biobix
Thesis biobix
 
Java Introductie
Java IntroductieJava Introductie
Java Introductie
 
University of Toronto Chemistry Librarians Workshop June 2012
University of Toronto Chemistry Librarians Workshop June 2012University of Toronto Chemistry Librarians Workshop June 2012
University of Toronto Chemistry Librarians Workshop June 2012
 
Biopython Project Update 2013
Biopython Project Update 2013Biopython Project Update 2013
Biopython Project Update 2013
 
The quality of the python ecosystem - and how we can protect it!
The quality of the python ecosystem - and how we can protect it!The quality of the python ecosystem - and how we can protect it!
The quality of the python ecosystem - and how we can protect it!
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
 
SLE/GPCE Keynote: What's the value of an end user? Platforms and Research: Th...
SLE/GPCE Keynote: What's the value of an end user? Platforms and Research: Th...SLE/GPCE Keynote: What's the value of an end user? Platforms and Research: Th...
SLE/GPCE Keynote: What's the value of an end user? Platforms and Research: Th...
 
Making Use of NGS Data: From Reads to Trees and Annotations
Making Use of NGS Data: From Reads to Trees and AnnotationsMaking Use of NGS Data: From Reads to Trees and Annotations
Making Use of NGS Data: From Reads to Trees and Annotations
 
Antao Biopython Bosc2008
Antao Biopython Bosc2008Antao Biopython Bosc2008
Antao Biopython Bosc2008
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
2014 bangkok-talk
2014 bangkok-talk2014 bangkok-talk
2014 bangkok-talk
 
BOSC 2008 Biopython
BOSC 2008 BiopythonBOSC 2008 Biopython
BOSC 2008 Biopython
 
Biopython
BiopythonBiopython
Biopython
 

Dernier

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Dernier (20)

Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

BioRuby -- Bioinformatics Library

  • 1. BioRuby BioRuby ― Bioinformatics Library ― 生物情報科学用ライブラリ Naohisa Goto / 後藤直久 Genome Information Research Center, Research Institute for Microbial Diseases, Osaka Univ. 大阪大学微生物病研究所附属遺伝情報実験センター Email: ngoto@gen-info.osaka-u.ac.jp twitter: @ngotogenome
  • 2. BioRuby Who am I? / 自己紹介  Name: Naohisa Goto  名前: 後藤 直久  Affiliation: Genome Information Research Center, Research Institute for Microbial Diseases, Osaka University  所属: 大阪大学微生物病研究所附属遺伝情報実験センター  Twitter: @ngotogenome  Email: ngoto@gen-info.osaka-u.ac.jp  First Ruby experience: 1.2.6 (compiled in 22/Jun/1999)
  • 3. BioRuby BioRuby  Bioinformatics software library and tools written in the Ruby Language  Rubyで書かれた生物情報科学(バイオインフォマティ クス)用ライブラリとツール集  Free software (Ruby License)  http://bioruby.org/  https://github.com/bioruby/bioruby  % gem install bio
  • 4. BioRuby DNA DNA is a chain made of the four molecules. DNAは4種類の分子からなる鎖  A (Adenine)  C (Cytosine)  G (Guanine)  T (Thymine) DNA can be treated as String. DNAは文字列として扱える。 (ところで、Encoding どうしよう…) Human: Total 3GB (49-247MB/chromosome) ヒト: 合計3GB (染色体1本あたり49~247MB) 図: WikiPedia: 染色体
  • 5. BioRuby An example DNA data (with metadata / 付加情報含む) >gi|60459557|gb|AY948115.1| Homo sapiens alcohol dehydrogenase 1A (class I), alpha polypeptide (ADH1A) gene, complete cds GAGGGCGACAAAAGGGAACAGACCCAAAACCACAGGAGAGATGCTAGCATGACAGGGATGCAGAGACATA AAGCACAACAGTGAGATGGAGTTAATATACCTCCACGAGGGTGACCTTGTCCTGCATCTCAAATTTTGGG TAGGATTTGAATGGGCCAGAGGGACAGAAAAGAAGAGAAAGAGCATGATGAGCAAGGGCTTGAATGTTAA ATAGATTCCTCTTTGGGGGACCAGGGAGATACAAGCTTCTAAAGCACATACGCCCTGTATTGGAGAATGG GGAGGAGTAGATAGATGAGAAGGTTGAAGCCATATTACGAAGCCTTGAATGCTGAACATCAGATCTGGGG CTATATTCTTACCTTGATACATTTCAGAAGCAACTGAAATCGTAGGACCTTCCTTGCTTCTCTATTGGGT GAATGTTTCTCAGTCTTGGTGTGAGTCTCAGTGCCTACGTAGTTAAAGCTTACTGAAATGTTCCCTTTAC AATTCTAGAGAGATATGTCCTTTATGTTGACATGTTCATGCTGACAGACTGCATCTGATTAAACAGCTGC CTGTGCAATGCCTCCAAGTGTGGATAAAAGAAAAATTAAACTCATAATCTTGGACAGCCATGTGTAGACT AGTTACATTGATCAAAGGGCAATAGAAATGATCCAGTGAGGATTTGTCTGAATTTCCCACAATTATTTAA AATCTACCTCAAATACCTGTTCATCTATAATGCCTCCCCTGAGGCCTTCATTCTGAATAGTACCTCTGTC TCTGTCCCCAAAGCACTAACTGATCCCTGTGATAGCGCACTTCCCAGCCAGGCTGATATGTAGACTTGGC TGCCTGTGTATCTTTTCCCCATAGACTGTGAGCTTCCTTTTATGAATAATAATTGTAGCTAGCATTTAGT AGGGTGCTCCTACCTGTTAAACTCTATGATGAGTGCTTTACATAGATTATATCATTTATTCACTAAACAG TCCTTTAAAATGGTGCTATATTCACTAAACAGTCCTTTAAAATGGTGCTATATTCACTAAACAGTCATTT AAAATGGTATTATTCTTCTTCATCTTACAGGTAAACAAACTAAGGCAAAAAAAAAAGTGAAATAATAAGT GCCAGTACACAGAGCTAGTAAGGAATAGGGTCTGCCAGGTCCCAAAAAGCATGCCATCACCTTTGCCCCA TACTGCCTCTGGTACAGATAGAGGTAATGTCTTATTTATCACTGCCATCCACTGGACCCAGCTTAGTGCC TGACACACAGAGGGGCTCAGTCAATGCTGATTGGTTTGAGGTGGAGCAAAAATGCTTAGCAGGGTGAGCA CCTTTGCTGTGATTGAGTATCTGATTCTCTATGAAGAGAAGGGGAGTCCTGAGCCAAACACATTCCTCTG GCTCCTGGCTGTCATCTTTATTTGCCCGGCTTCTTTGCTCTTCCTCCTTCCTAACTGCACCGTTTGGATT (snip / 以下略) http://togows.dbcls.jp/entry/genbank/AY948115.1.fasta
  • 6. BioRuby What BioRuby can do? / できること  Biological data analysis / 生物データの解析  DNA, RNA  Protein / タンパク質  Relation of genes / 遺伝子間の関係性  Phylogenetic tree / 系統樹  Bibliography / 文献情報 …  I/O with other software / 他のソフトの入出力  Utilize web services / ウェブサービス利用
  • 7. BioRuby Code example DNA → Protein translation / DNA→タンパク質の翻訳 require "rubygems" require "bio" f = Bio::FlatFile.open(ARGF) f.each do |entry| dna = entry.naseq aa = dna.translate seq = Bio::Sequence.new(seq) print seq.output_fasta(e.definition) end
  • 8. BioRuby Status Latest version: BioRuby 1.4.1 (22/Oct/2010)  Supported Ruby version: 1.8.x  Will soon be migrated to1.9 / 速やかに1.9に移行予定  Files  Library: 230 files / 35,000 lines (w/o comments, void lines)  Tests: 120 files / 22,000 lines  Sample codes: 70 files  Functionality  580 classes/modules  2,800 methods  Plugin system introduced (using gem)
  • 9. BioRuby BioRuby developer’s community  Core developers (6 persons)  Toshiaki Katayama (leader) (Univ. of Tokyo, Japan)  Naohisa Goto (release manager) (Osaka Univ., Japan)  Mitsuteru Nakao (Japan)  Pjotr Prins (Wageningen University, Netherlands)  Raoul Bonnal (INGM, Italy)  Jan Aerts (Belgium)  Total >30 contributors in 10 years 10年間で延べ30人以上の貢献者  Active developers / users in the world 世界中にアクティブな開発者/利用者
  • 11. BioRuby Brief history 11/2000 BioRuby project started 06/2001 The first version (BioRuby 0.1) 2005-2006 IPA Exploratory (未踏) Software Project 02/2006 BioRuby 1.0.0 released 09/2008 moved from CVS to Git 08/2010 Published BioRuby research paper / 学術論文 10/2010 BioRuby 1.4.1 released
  • 12. BioRuby Preceding projects / 先行プロジェクト Demanded by genome projects in late 90’s 1990年代後半のゲノムプロジェクトに伴う BioPerl – since 1996 (Perl 1987) Biopython – since 1999 (Python 1991) BioJava – since 1999 (Java 1995) BioRuby – since 2000 (Ruby 1995) Together with Open Bioinformatics Foundation http://open-bio.org/ Google Summer of Code 2009, 2010, 2011
  • 13. BioRuby BioHackathon  Open Bio* Hackathon (2002, 2003)  Phyloinformatics Hackathon (2006)  DBCLS BioHackathon (2008-2010)
  • 14. BioRuby Academic Community / 学会 • Bioinformatics Open Source Conference • GIW / JSBi (日本バイオインフォマティクス学会) • MBSJ (日本分子生物学会) • Open Bio Japan (オープンバイオ研究会)
  • 15. BioRuby Open Source Community • Ruby Kansai (関西Ruby勉強会) (2005-) • IPA Exploratory (未踏) Software Project (2005-2006) • RubyKaigi (2006-) • Google Summer of Code (2009-2011) (Open Bioinformatics Foundation)
  • 16. BioRuby Recent topics  Release of new version (BioRuby 1.4.2)  Ruby 1.9.3 migration  Revolution of DNA sequencing technique DNA塩基配列決定技術の飛躍的向上
  • 17. BioRuby Next-Generation Sequencer (NGS) Example: Illumina HiSeq 2000 >600GB DNA sequences in 10-days 10日間で600GB超のDNA塩基配列を決定する装置 Lack of Resources / 足りないもの • HDD • CPU • Memory • Software • Human • Money • ...
  • 18. BioRuby Join us BioRuby Web http://bioruby.org ML bioruby@lists.open-bio.org GitHub https://github.org/bioruby/bioruby BioRubyユーザーが書いた本 多田雅人著「Rubyではじめる バイオインフォマティクス」 発売中!!