SlideShare a Scribd company logo
1 of 34
Download to read offline
RNA-Seq


                       (yag_ays)
http://yag-ays.jp/pdf/20110602labseminar_pub.pdf
r e d
             n s o
          c e
usagi



usamimi
NGS
  (Next Generation Sequencing)

         RNA-Seq
    (Transcriptome Analysis)

        de novo
Transcriptome Assembly
Central Dogma
           A T G C

 DNA




 mRNA




 Protein
Central Dogma
           A T G C

 DNA




 mRNA




 Protein
Central Dogma
           A T G C

 DNA
           Transcriptome


 mRNA




 Protein
NGS       RNA-Seq
A T G C
                NGS
                      •   illumina / Solexa GA
                      •   ABI / SOLiD
                      •   Roche / 454
                      •   PacBio
                      •   Helicos / Heliscope
                      •   ion torrent    etc...




mRNA




               TTAGCCTTAGCTTCC
               GTCGCAACTTCCTTA
               TTCACGAGCTTGATG
               TTGCGGATCACTTTG
NGS               RNA-Seq
A T G C
          NGS           NGS
                              •   illumina / Solexa GA
          •                   •   ABI / SOLiD
                              •   Roche / 454
              •               •   PacBio
                              •   Helicos / Heliscope
                              •   ion torrent    etc...

          •
mRNA




                       TTAGCCTTAGCTTCC
                       GTCGCAACTTCCTTA
                       TTCACGAGCTTGATG
                       TTGCGGATCACTTTG
RNA-Seq




 ʻalign-then-assembleʼ   ʻassemble-then-alignʼ
        approach               approach
RNA-Seq




                          •


                          •


                          •

 ʻalign-then-assembleʼ   ʻassemble-then-aliignʼ
        approach               approach
RNA-Seq




 • 454


 •
 •


 ʻalign-then-assembleʼ   ʻassemble-then-alignʼ
        approach               approach
RNA-Seq




 ʻalign-then-assembleʼ   ʻassemble-then-alignʼ
        approach               approach
RNA-Seq




 ʻalign-then-assembleʼ   ʻassemble-then-alignʼ
        approach               approach
RNA-Seq




 ʻalign-then-assembleʼ   ʻassemble-then-alignʼ
        approach               approach
RNA-Seq




              •


               • cDNA
 ʻalign-then-assembleʼ   ʻassemble-then-alignʼ
        approach               approach
Sujai Kumar and Mark L Blaxter : Comparing de novo
assemblers for 454 transcriptome data (2010)
Newbler 2.5
Sujai Kumar and Mark L Blaxter : Comparing de novo
assemblers for 454 transcriptome data (2010)
Newbler 2.5


                                      ...
Sujai Kumar and Mark L Blaxter : Comparing de novo
assemblers for 454 transcriptome data (2010)
Newbler 2.5


                                      ...


                                            Trinity...!!
1. Newbler 2.5
                 • Roche 454
                 • 454
                 •

     2. Trinity
                 • Broad Institute


                 • 454                                                                                                                (        )


                 • Nat Biotechnol. 2011 May                                                               *
* Grabherr MG, Haas BJ,Yassour M et al. : Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011 May 15
1. Newbler 2.5
    • Overlap-Layout-Consensus (OLC)


2. Trinity
    I. Inchworm : k-mer graph
    II. Chrysalis : Contig pool
    III.Butterfly : De Bruijn Graph


                       2
Roche 454 pyrosequencing
           usamimi                                     0.3M reads
                                     (sff or fastq format)




Newbler 2.5                              Trinity


   (fasta format)                          (fasta format)

                    GMAP
                    with usagi CDS




    (gff format)                             (gff format)
S. Kumar et al.(2010)




  •
                        (   )

  •
Newbler 2.5 Trinity

              Newbler 2.5    Trinity
Number of
                19,753       20,758
 contigs

Total Bases    9,651,390    10,275,166

Max contig
                 2,878        2,151
  length
Mean contig
                 488.6         495
  length

   N50            581          616
Newbler 2.5                             N = 19,753




     Trinity                            N = 20,758




http://edwards.sdsu.edu/prinseq_beta/
usagi CDS


                                                       all
  usagi CDS       30,000                               ≧ 80% alignment
                                                       ≧ 90% alignment
                                                       ≧ 95% alignment
        Newbler 2.5   Trinity                          100% alignment
                                16000
 all     15,498       15,524
                                12000
≧ 80%    14,583       14,697
                                 8000
≧ 90%     8,466       8,665
≧ 95%     1,059       1,191      4000


100%        66          30          0
                                        Newbler 2.5   Trinity
usagi


     Newbler 2.5           Trinity

12,417                               10,433
 genes                                genes


          2,990    9,427   1,006
...



            S. Kumar et al.(2010)
 Poly(A/T)


Poly(A/T)


                       Poly(A/T)
Poly(A/T)                                        Trinity > Newbler 2.5

                          Newbler 2.5                            Trinity


                                          257                            3,773
Poly T                                  (1.30%)                        (18.18%)

                      20 bp                              20 bp




                                          539                            2,349
Poly A                                  (2.73%)                        (11.32%)

                              20 bp                      20 bp

http://edwards.sdsu.edu/prinseq_beta/               ()
Poly(A/T)                                           Trinity > Newbler 2.5

                           Newbler 2.5                              Trinity


                                             257                            3,773
Poly T                                     (1.30%)                        (18.18%)
Poly(A/T)                                Quality Value
                       20 bp                                20 bp

    →Newbler Quality                                 trimming                 ...?
                                             539                            2,349
Poly A                                     (2.73%)                        (11.32%)

                               20 bp                        20 bp

 http://edwards.sdsu.edu/prinseq_beta/                 ()
Trinity Newbler 2.5

1
      usagi CDS




2     Poly(A/T)       Trinity
      Newbler 2.5
Trinity Newbler 2.5

1
      usagi CDS
                      Trinity
                      454
2     Poly(A/T)         Trinity
      Newbler 2.5
Trinity Newbler 2.5

1
      usagi CDS
                      Trinity
                      454
2     Poly(A/T)         Trinity
      Newbler 2.5
Method : Parameters
• Newbler 2.5   • Trinity (20110519 ver.)
 • -notrim       • --seqType=fq
 • -urt          • --single
                 • --min_contig_length 50
                 • --run_butterfly
                 • --CPU 4
                 • --bfly_opts "--
                   compatible_path_extensi
                   on --stderr "

More Related Content

Viewers also liked

Rを用いた解析の実例 Project *ぶろったん*
Rを用いた解析の実例 Project *ぶろったん*Rを用いた解析の実例 Project *ぶろったん*
Rを用いた解析の実例 Project *ぶろったん*裕樹 奥田
 
Variational AutoEncoder
Variational AutoEncoderVariational AutoEncoder
Variational AutoEncoderKazuki Nitta
 
Humor Recognition and Humor Anchor Extraction
Humor Recognition and Humor Anchor ExtractionHumor Recognition and Humor Anchor Extraction
Humor Recognition and Humor Anchor Extraction裕樹 奥田
 
変分ベイズ法の説明
変分ベイズ法の説明変分ベイズ法の説明
変分ベイズ法の説明Haruka Ozaki
 
パターン認識 04 混合正規分布
パターン認識 04 混合正規分布パターン認識 04 混合正規分布
パターン認識 04 混合正規分布sleipnir002
 
数式を使わずイメージで理解するEMアルゴリズム
数式を使わずイメージで理解するEMアルゴリズム数式を使わずイメージで理解するEMアルゴリズム
数式を使わずイメージで理解するEMアルゴリズム裕樹 奥田
 
現在のDNNにおける未解決問題
現在のDNNにおける未解決問題現在のDNNにおける未解決問題
現在のDNNにおける未解決問題Daisuke Okanohara
 
IIBMP2016 深層生成モデルによる表現学習
IIBMP2016 深層生成モデルによる表現学習IIBMP2016 深層生成モデルによる表現学習
IIBMP2016 深層生成モデルによる表現学習Preferred Networks
 
猫でも分かるVariational AutoEncoder
猫でも分かるVariational AutoEncoder猫でも分かるVariational AutoEncoder
猫でも分かるVariational AutoEncoderSho Tatsuno
 

Viewers also liked (12)

Re revenge chap03-1
Re revenge chap03-1Re revenge chap03-1
Re revenge chap03-1
 
NICOMI
NICOMINICOMI
NICOMI
 
Rを用いた解析の実例 Project *ぶろったん*
Rを用いた解析の実例 Project *ぶろったん*Rを用いた解析の実例 Project *ぶろったん*
Rを用いた解析の実例 Project *ぶろったん*
 
Chapter9 2
Chapter9 2Chapter9 2
Chapter9 2
 
Variational AutoEncoder
Variational AutoEncoderVariational AutoEncoder
Variational AutoEncoder
 
Humor Recognition and Humor Anchor Extraction
Humor Recognition and Humor Anchor ExtractionHumor Recognition and Humor Anchor Extraction
Humor Recognition and Humor Anchor Extraction
 
変分ベイズ法の説明
変分ベイズ法の説明変分ベイズ法の説明
変分ベイズ法の説明
 
パターン認識 04 混合正規分布
パターン認識 04 混合正規分布パターン認識 04 混合正規分布
パターン認識 04 混合正規分布
 
数式を使わずイメージで理解するEMアルゴリズム
数式を使わずイメージで理解するEMアルゴリズム数式を使わずイメージで理解するEMアルゴリズム
数式を使わずイメージで理解するEMアルゴリズム
 
現在のDNNにおける未解決問題
現在のDNNにおける未解決問題現在のDNNにおける未解決問題
現在のDNNにおける未解決問題
 
IIBMP2016 深層生成モデルによる表現学習
IIBMP2016 深層生成モデルによる表現学習IIBMP2016 深層生成モデルによる表現学習
IIBMP2016 深層生成モデルによる表現学習
 
猫でも分かるVariational AutoEncoder
猫でも分かるVariational AutoEncoder猫でも分かるVariational AutoEncoder
猫でも分かるVariational AutoEncoder
 

Similar to 20110602labseminar pub

Craig Hawker of UCSB: Commercial Applications of Polymer as Nanomaterials
Craig Hawker of UCSB: Commercial Applications of Polymer as NanomaterialsCraig Hawker of UCSB: Commercial Applications of Polymer as Nanomaterials
Craig Hawker of UCSB: Commercial Applications of Polymer as Nanomaterialsucsb.ira
 
Jag Trasgo Helsinki091002
Jag Trasgo Helsinki091002Jag Trasgo Helsinki091002
Jag Trasgo Helsinki091002Miguel Morales
 
Glycomics2004-CrKa
Glycomics2004-CrKaGlycomics2004-CrKa
Glycomics2004-CrKaCrKa
 
20110524zurichngs 1st pub
20110524zurichngs 1st pub20110524zurichngs 1st pub
20110524zurichngs 1st pubsesejun
 
monsanto monmouth1
monsanto monmouth1monsanto monmouth1
monsanto monmouth1finance28
 
Preparation and characterization of pla pbat organoclay composites
Preparation and characterization of pla pbat organoclay compositesPreparation and characterization of pla pbat organoclay composites
Preparation and characterization of pla pbat organoclay compositesJunaedy Keputet
 
PYReco present at Cologne Expo Cologne 14 16th Feb 2012 = Final
PYReco present at Cologne Expo    Cologne  14 16th Feb 2012 = FinalPYReco present at Cologne Expo    Cologne  14 16th Feb 2012 = Final
PYReco present at Cologne Expo Cologne 14 16th Feb 2012 = FinalPYReco
 
Desarrollo de cultivos energéticos para producción de biogás en condiciones d...
Desarrollo de cultivos energéticos para producción de biogás en condiciones d...Desarrollo de cultivos energéticos para producción de biogás en condiciones d...
Desarrollo de cultivos energéticos para producción de biogás en condiciones d...ainia centro tecnológico
 
"Phylogeny-driven studies in genomics and metagenomics" talk by Jonathan Eise...
"Phylogeny-driven studies in genomics and metagenomics" talk by Jonathan Eise..."Phylogeny-driven studies in genomics and metagenomics" talk by Jonathan Eise...
"Phylogeny-driven studies in genomics and metagenomics" talk by Jonathan Eise...Jonathan Eisen
 
20150601 bio sb_assembly_course
20150601 bio sb_assembly_course20150601 bio sb_assembly_course
20150601 bio sb_assembly_coursehansjansen9999
 
GROTESQUE - ANS 2012
GROTESQUE - ANS 2012GROTESQUE - ANS 2012
GROTESQUE - ANS 2012jdbess
 
生命を理解する道具としての計算機  SCSN@UCLA
生命を理解する道具としての計算機  SCSN@UCLA生命を理解する道具としての計算機  SCSN@UCLA
生命を理解する道具としての計算機  SCSN@UCLAKeiichiro Ono
 
Linked Data for integrating life-science databases
Linked Data for integrating life-science databasesLinked Data for integrating life-science databases
Linked Data for integrating life-science databasesShuichi Kawashima
 
Edward Muge Presentation
Edward Muge  PresentationEdward Muge  Presentation
Edward Muge Presentationobutuz
 
identification of genes and gene-near regions related to active compounds in ...
identification of genes and gene-near regions related to active compounds in ...identification of genes and gene-near regions related to active compounds in ...
identification of genes and gene-near regions related to active compounds in ...World Agroforestry (ICRAF)
 
Edward Muge Presentation
Edward Muge  PresentationEdward Muge  Presentation
Edward Muge Presentationguestd2d93b8
 

Similar to 20110602labseminar pub (20)

Craig Hawker of UCSB: Commercial Applications of Polymer as Nanomaterials
Craig Hawker of UCSB: Commercial Applications of Polymer as NanomaterialsCraig Hawker of UCSB: Commercial Applications of Polymer as Nanomaterials
Craig Hawker of UCSB: Commercial Applications of Polymer as Nanomaterials
 
Jag Trasgo Helsinki091002
Jag Trasgo Helsinki091002Jag Trasgo Helsinki091002
Jag Trasgo Helsinki091002
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Glycomics2004-CrKa
Glycomics2004-CrKaGlycomics2004-CrKa
Glycomics2004-CrKa
 
20110524zurichngs 1st pub
20110524zurichngs 1st pub20110524zurichngs 1st pub
20110524zurichngs 1st pub
 
monsanto monmouth1
monsanto monmouth1monsanto monmouth1
monsanto monmouth1
 
Preparation and characterization of pla pbat organoclay composites
Preparation and characterization of pla pbat organoclay compositesPreparation and characterization of pla pbat organoclay composites
Preparation and characterization of pla pbat organoclay composites
 
PCR Primer desining
PCR Primer desiningPCR Primer desining
PCR Primer desining
 
PYReco present at Cologne Expo Cologne 14 16th Feb 2012 = Final
PYReco present at Cologne Expo    Cologne  14 16th Feb 2012 = FinalPYReco present at Cologne Expo    Cologne  14 16th Feb 2012 = Final
PYReco present at Cologne Expo Cologne 14 16th Feb 2012 = Final
 
Desarrollo de cultivos energéticos para producción de biogás en condiciones d...
Desarrollo de cultivos energéticos para producción de biogás en condiciones d...Desarrollo de cultivos energéticos para producción de biogás en condiciones d...
Desarrollo de cultivos energéticos para producción de biogás en condiciones d...
 
"Phylogeny-driven studies in genomics and metagenomics" talk by Jonathan Eise...
"Phylogeny-driven studies in genomics and metagenomics" talk by Jonathan Eise..."Phylogeny-driven studies in genomics and metagenomics" talk by Jonathan Eise...
"Phylogeny-driven studies in genomics and metagenomics" talk by Jonathan Eise...
 
Ashg grc workshop2015_tg
Ashg grc workshop2015_tgAshg grc workshop2015_tg
Ashg grc workshop2015_tg
 
20150601 bio sb_assembly_course
20150601 bio sb_assembly_course20150601 bio sb_assembly_course
20150601 bio sb_assembly_course
 
GROTESQUE - ANS 2012
GROTESQUE - ANS 2012GROTESQUE - ANS 2012
GROTESQUE - ANS 2012
 
生命を理解する道具としての計算機  SCSN@UCLA
生命を理解する道具としての計算機  SCSN@UCLA生命を理解する道具としての計算機  SCSN@UCLA
生命を理解する道具としての計算機  SCSN@UCLA
 
Linked Data for integrating life-science databases
Linked Data for integrating life-science databasesLinked Data for integrating life-science databases
Linked Data for integrating life-science databases
 
Edward Muge Presentation
Edward Muge  PresentationEdward Muge  Presentation
Edward Muge Presentation
 
identification of genes and gene-near regions related to active compounds in ...
identification of genes and gene-near regions related to active compounds in ...identification of genes and gene-near regions related to active compounds in ...
identification of genes and gene-near regions related to active compounds in ...
 
Edward Muge Presentation
Edward Muge  PresentationEdward Muge  Presentation
Edward Muge Presentation
 
Edward muge presentation
Edward muge  presentationEdward muge  presentation
Edward muge presentation
 

Recently uploaded

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 

Recently uploaded (20)

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 

20110602labseminar pub

  • 1. RNA-Seq (yag_ays) http://yag-ays.jp/pdf/20110602labseminar_pub.pdf
  • 2. r e d n s o c e usagi usamimi
  • 3. NGS (Next Generation Sequencing) RNA-Seq (Transcriptome Analysis) de novo Transcriptome Assembly
  • 4. Central Dogma A T G C DNA mRNA Protein
  • 5. Central Dogma A T G C DNA mRNA Protein
  • 6. Central Dogma A T G C DNA Transcriptome mRNA Protein
  • 7. NGS RNA-Seq A T G C NGS • illumina / Solexa GA • ABI / SOLiD • Roche / 454 • PacBio • Helicos / Heliscope • ion torrent etc... mRNA TTAGCCTTAGCTTCC GTCGCAACTTCCTTA TTCACGAGCTTGATG TTGCGGATCACTTTG
  • 8. NGS RNA-Seq A T G C NGS NGS • illumina / Solexa GA • • ABI / SOLiD • Roche / 454 • • PacBio • Helicos / Heliscope • ion torrent etc... • mRNA TTAGCCTTAGCTTCC GTCGCAACTTCCTTA TTCACGAGCTTGATG TTGCGGATCACTTTG
  • 9. RNA-Seq ʻalign-then-assembleʼ ʻassemble-then-alignʼ approach approach
  • 10. RNA-Seq • • • ʻalign-then-assembleʼ ʻassemble-then-aliignʼ approach approach
  • 11. RNA-Seq • 454 • • ʻalign-then-assembleʼ ʻassemble-then-alignʼ approach approach
  • 12. RNA-Seq ʻalign-then-assembleʼ ʻassemble-then-alignʼ approach approach
  • 13. RNA-Seq ʻalign-then-assembleʼ ʻassemble-then-alignʼ approach approach
  • 14. RNA-Seq ʻalign-then-assembleʼ ʻassemble-then-alignʼ approach approach
  • 15. RNA-Seq • • cDNA ʻalign-then-assembleʼ ʻassemble-then-alignʼ approach approach
  • 16. Sujai Kumar and Mark L Blaxter : Comparing de novo assemblers for 454 transcriptome data (2010) Newbler 2.5
  • 17. Sujai Kumar and Mark L Blaxter : Comparing de novo assemblers for 454 transcriptome data (2010) Newbler 2.5 ...
  • 18. Sujai Kumar and Mark L Blaxter : Comparing de novo assemblers for 454 transcriptome data (2010) Newbler 2.5 ... Trinity...!!
  • 19. 1. Newbler 2.5 • Roche 454 • 454 • 2. Trinity • Broad Institute • 454 ( ) • Nat Biotechnol. 2011 May * * Grabherr MG, Haas BJ,Yassour M et al. : Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011 May 15
  • 20. 1. Newbler 2.5 • Overlap-Layout-Consensus (OLC) 2. Trinity I. Inchworm : k-mer graph II. Chrysalis : Contig pool III.Butterfly : De Bruijn Graph 2
  • 21. Roche 454 pyrosequencing usamimi 0.3M reads (sff or fastq format) Newbler 2.5 Trinity (fasta format) (fasta format) GMAP with usagi CDS (gff format) (gff format)
  • 22. S. Kumar et al.(2010) • ( ) •
  • 23. Newbler 2.5 Trinity Newbler 2.5 Trinity Number of 19,753 20,758 contigs Total Bases 9,651,390 10,275,166 Max contig 2,878 2,151 length Mean contig 488.6 495 length N50 581 616
  • 24. Newbler 2.5 N = 19,753 Trinity N = 20,758 http://edwards.sdsu.edu/prinseq_beta/
  • 25. usagi CDS all usagi CDS 30,000 ≧ 80% alignment ≧ 90% alignment ≧ 95% alignment Newbler 2.5 Trinity 100% alignment 16000 all 15,498 15,524 12000 ≧ 80% 14,583 14,697 8000 ≧ 90% 8,466 8,665 ≧ 95% 1,059 1,191 4000 100% 66 30 0 Newbler 2.5 Trinity
  • 26. usagi Newbler 2.5 Trinity 12,417 10,433 genes genes 2,990 9,427 1,006
  • 27. ... S. Kumar et al.(2010) Poly(A/T) Poly(A/T) Poly(A/T)
  • 28. Poly(A/T) Trinity > Newbler 2.5 Newbler 2.5 Trinity 257 3,773 Poly T (1.30%) (18.18%) 20 bp 20 bp 539 2,349 Poly A (2.73%) (11.32%) 20 bp 20 bp http://edwards.sdsu.edu/prinseq_beta/ ()
  • 29. Poly(A/T) Trinity > Newbler 2.5 Newbler 2.5 Trinity 257 3,773 Poly T (1.30%) (18.18%) Poly(A/T) Quality Value 20 bp 20 bp →Newbler Quality trimming ...? 539 2,349 Poly A (2.73%) (11.32%) 20 bp 20 bp http://edwards.sdsu.edu/prinseq_beta/ ()
  • 30. Trinity Newbler 2.5 1 usagi CDS 2 Poly(A/T) Trinity Newbler 2.5
  • 31. Trinity Newbler 2.5 1 usagi CDS Trinity 454 2 Poly(A/T) Trinity Newbler 2.5
  • 32. Trinity Newbler 2.5 1 usagi CDS Trinity 454 2 Poly(A/T) Trinity Newbler 2.5
  • 33.
  • 34. Method : Parameters • Newbler 2.5 • Trinity (20110519 ver.) • -notrim • --seqType=fq • -urt • --single • --min_contig_length 50 • --run_butterfly • --CPU 4 • --bfly_opts "-- compatible_path_extensi on --stderr "