SlideShare une entreprise Scribd logo
1  sur  25
Yaoyu E. Wang, Ph.D
Center for Cancer Computational Biology, DFCI
SPECSII webinar
June 05, 2013
- Transcriptome profiling represents a static gene expression
state of a biological sample across the genome
- Allows for direct genomic comparisons with multiple samples
to determine genes that exhibit differential expression in
different state (i.e. normal vs. tumor)
- Allows for hypothesis generation on molecular abnormalities
and mechanisms that may contribute to the tumor phenotype
- Provides information on molecular subtypes, the development
of prognostic and predictive molecular signatures
- Two main technologies:
a. Microarray
b. RNA-Sequencing (RNASeq) using next generation
sequencing
Affymetrix
GeneChip
scanner
Blencowe B J et al. Genes Dev. 2009;23:1379-1386
Illumina HiSeq
.bcl files
CASAVA processing
•Demultiplexing
•Fastq file generation
•Sequencing filtering
Raw files containing base calls
and quality scores
Illumina defined
quality filters
Split into Project and Sample Folders
Jones_Lab
ChIP_A ChIP-B
Marcus_Lab
RNA-SeqA RNA-SeqB RNA-SeqC
Williams_Lab
Exome1 Exome2
Fastq Files Fastq Files Fastq Files
Haas & Zody. Nature Biotechnology 28, 421–423 (2010)
Using known
annotations
And compare to
known annotations
•Differential Expression
•Differential Isoform Abundance
•RNA editing
•SNP, indel detection
Technology RNASeq Microarray
High run-to run reproducibility Yes Yes
Dynamic Range Comparable to
actual transcript abundance
>8000-fold
Hundred
fold
Able to detect alternative splice site
and novel isoforms
Yes No
De novo analysis of samples without
reference genome
Yes No
Multiplexing Samples in one run Yes No
Required amount of total RNA >100 ng ~1 ug
Re-analyzable data Yes No
Technology RNASeq Microarray
Heterogeneity of read coverage
across an expressed region
Yes No
Well understood sources of
experimental bias
No Yes
Data portable on a flush drive (~4G) No Yes
Data is analyzable by any PC No Yes
Cheaper cost per sample No(?) Yes(?)
RNA-Seq Experiment
GEO Database
White paper, Illumina
White paper, Illumina
Comparing Expression Profiles from Microarrays to RNASeq
n=7532 n=4537
Mooney M, PloSOne (2013)
10 Lymphoma (3T-cell, 7 B-cell)
4 Normal lymph node
Total RNA
PE100 run
50-100 million
mapped reads
Compare 15,092 annotated genes on chip
Mooney M, PloSOne (2013)
T
NB
r=0.6; p<10-15
c. elegans
Biological Replicates for
L2 andYA stages
AffyTilingArrays* Illumina RNASeq
Agarwal, BMC Genomics (2010)
* Covers whole c.elegans genome
Differential Expression genes between the L2 andYA stage
Agarwal, BMC Genomics (2010)
RNA-Seq and tiling arrays
Tiling Array
Microarray
Maximum
Sensitivity
RNASeq 11-plex
RNASeq 6-plex
Agarwal, BMC Genomics (2010)
Per Sample Microarray Illumina HiSeq
1 per Chip/Lane $670 $4,010.00
2 plex NA $2,097.50
4-plex NA $1,141.25
6-plex NA $822.50
8-plex NA $663.13
6-plex
11-plex
Per Sample Microarray Illumina HiSeq
1 per Chip/Lane $670 $4,010.00
2 plex NA $2,097.50
4-plex NA $1,141.25
6-plex NA $822.50
8-plex NA $663.13
Data Per Sample
Time to
download 1
Sample
Time to download
100 samples
Cost to Store on the
Cloud per Month
RNASeq 30-65GB 1 Hr 6 days $270
Microarray 30MB 5 second 8 minutes $0.30
http://www.ncbi.nlm.nih.gov/genbank/statistics
-Application withUser Interface RNA-Seq analysis (i.e. Galaxy) can only
handle very few samples
-Knowledge of Linux server, scripting language, programming language is
absolutely REQUIRED
-Lack of detailed understanding in NGS technology and data leads to
diverse bioinformatics tools with different characteristics
LawWC ,Voom!, Bionconductor (2013)
The answer isYes
- Transcriptome profiles generated by microarray and RNASeq
are in strongly concordance
- Microarray data generated in the last decades is durable
- RNASeq is it offers more a lot more biological information
than microarray that is re-analyzable
- NGS is getting cheaper
However, the devil is in the data
- NGS data is a lot more expensive to store and analyze
- Specialized computing infrastructure and personnel are
required to take advantage of the information from NGS data

Contenu connexe

Tendances (20)

Gemome annotation
Gemome annotationGemome annotation
Gemome annotation
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Sequence file formats
Sequence file formatsSequence file formats
Sequence file formats
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
Genome Assembly
Genome AssemblyGenome Assembly
Genome Assembly
 
EMBL
EMBLEMBL
EMBL
 
Rnaseq basics ngs_application1
Rnaseq basics ngs_application1Rnaseq basics ngs_application1
Rnaseq basics ngs_application1
 
Genomic databases
Genomic databasesGenomic databases
Genomic databases
 
Genome assembly
Genome assemblyGenome assembly
Genome assembly
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 
Database Searching
Database SearchingDatabase Searching
Database Searching
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
Prosite
PrositeProsite
Prosite
 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resources
 
String.pptx
String.pptxString.pptx
String.pptx
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
 
Pymol
PymolPymol
Pymol
 
Presentation1
Presentation1Presentation1
Presentation1
 
RNA-Seq
RNA-SeqRNA-Seq
RNA-Seq
 
Protein 3 d structure prediction
Protein 3 d structure predictionProtein 3 d structure prediction
Protein 3 d structure prediction
 

Similaire à Comparison between RNASeq and Microarray for Gene Expression Analysis

FFPE Applications Solutions brochure
FFPE Applications Solutions brochureFFPE Applications Solutions brochure
FFPE Applications Solutions brochureAffymetrix
 
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Reid Robison
 
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...eventi-ITBbari
 
Whole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisWhole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisdrelamuruganvet
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGLong Pei
 
Microarrays;application
Microarrays;applicationMicroarrays;application
Microarrays;applicationFyzah Bashir
 
New Generation Sequencing Technologies: an overview
New Generation Sequencing Technologies: an overviewNew Generation Sequencing Technologies: an overview
New Generation Sequencing Technologies: an overviewPaolo Dametto
 
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsWhole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsThermo Fisher Scientific
 
Processing Amplicon Sequence Data for the Analysis of Microbial Communities
Processing Amplicon Sequence Data for the Analysis of Microbial CommunitiesProcessing Amplicon Sequence Data for the Analysis of Microbial Communities
Processing Amplicon Sequence Data for the Analysis of Microbial CommunitiesMartin Hartmann
 
Analytical performance of a novel next generation sequencing assay for Myeloi...
Analytical performance of a novel next generation sequencing assay for Myeloi...Analytical performance of a novel next generation sequencing assay for Myeloi...
Analytical performance of a novel next generation sequencing assay for Myeloi...Thermo Fisher Scientific
 
Unilag workshop complex genome analysis
Unilag workshop   complex genome analysisUnilag workshop   complex genome analysis
Unilag workshop complex genome analysisDr. Olusoji Adewumi
 
GMI proficiency testing- Progress report 2016
GMI proficiency testing- Progress report 2016GMI proficiency testing- Progress report 2016
GMI proficiency testing- Progress report 2016ExternalEvents
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNAUlises Urzua
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput SequencingMark Pallen
 
140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposalGenomeInABottle
 
Wellstein poster embl meeting nov 2018
Wellstein poster embl meeting nov 2018Wellstein poster embl meeting nov 2018
Wellstein poster embl meeting nov 2018Anne Deslattes Mays
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...VHIR Vall d’Hebron Institut de Recerca
 
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICSPROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICSLubna MRL
 

Similaire à Comparison between RNASeq and Microarray for Gene Expression Analysis (20)

FFPE Applications Solutions brochure
FFPE Applications Solutions brochureFFPE Applications Solutions brochure
FFPE Applications Solutions brochure
 
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
 
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
 
Whole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisWhole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysis
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEG
 
Gene expression profiling
Gene expression profilingGene expression profiling
Gene expression profiling
 
Microarrays;application
Microarrays;applicationMicroarrays;application
Microarrays;application
 
New Generation Sequencing Technologies: an overview
New Generation Sequencing Technologies: an overviewNew Generation Sequencing Technologies: an overview
New Generation Sequencing Technologies: an overview
 
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsWhole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
 
Dna microarray mehran- u of toronto
Dna microarray  mehran- u of torontoDna microarray  mehran- u of toronto
Dna microarray mehran- u of toronto
 
Processing Amplicon Sequence Data for the Analysis of Microbial Communities
Processing Amplicon Sequence Data for the Analysis of Microbial CommunitiesProcessing Amplicon Sequence Data for the Analysis of Microbial Communities
Processing Amplicon Sequence Data for the Analysis of Microbial Communities
 
Analytical performance of a novel next generation sequencing assay for Myeloi...
Analytical performance of a novel next generation sequencing assay for Myeloi...Analytical performance of a novel next generation sequencing assay for Myeloi...
Analytical performance of a novel next generation sequencing assay for Myeloi...
 
Unilag workshop complex genome analysis
Unilag workshop   complex genome analysisUnilag workshop   complex genome analysis
Unilag workshop complex genome analysis
 
GMI proficiency testing- Progress report 2016
GMI proficiency testing- Progress report 2016GMI proficiency testing- Progress report 2016
GMI proficiency testing- Progress report 2016
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNA
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput Sequencing
 
140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal
 
Wellstein poster embl meeting nov 2018
Wellstein poster embl meeting nov 2018Wellstein poster embl meeting nov 2018
Wellstein poster embl meeting nov 2018
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
 
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICSPROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
 

Plus de Yaoyu Wang

Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisYaoyu Wang
 
Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisYaoyu Wang
 
Request CCCB Services
Request CCCB ServicesRequest CCCB Services
Request CCCB ServicesYaoyu Wang
 
CCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformCCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformYaoyu Wang
 
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsBio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsYaoyu Wang
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment DesignYaoyu Wang
 

Plus de Yaoyu Wang (6)

Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysis
 
Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysis
 
Request CCCB Services
Request CCCB ServicesRequest CCCB Services
Request CCCB Services
 
CCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformCCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud Platform
 
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsBio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment Design
 

Dernier

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Dernier (20)

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Comparison between RNASeq and Microarray for Gene Expression Analysis

  • 1. Yaoyu E. Wang, Ph.D Center for Cancer Computational Biology, DFCI SPECSII webinar June 05, 2013
  • 2. - Transcriptome profiling represents a static gene expression state of a biological sample across the genome - Allows for direct genomic comparisons with multiple samples to determine genes that exhibit differential expression in different state (i.e. normal vs. tumor) - Allows for hypothesis generation on molecular abnormalities and mechanisms that may contribute to the tumor phenotype - Provides information on molecular subtypes, the development of prognostic and predictive molecular signatures - Two main technologies: a. Microarray b. RNA-Sequencing (RNASeq) using next generation sequencing
  • 4. Blencowe B J et al. Genes Dev. 2009;23:1379-1386 Illumina HiSeq
  • 5. .bcl files CASAVA processing •Demultiplexing •Fastq file generation •Sequencing filtering Raw files containing base calls and quality scores Illumina defined quality filters Split into Project and Sample Folders Jones_Lab ChIP_A ChIP-B Marcus_Lab RNA-SeqA RNA-SeqB RNA-SeqC Williams_Lab Exome1 Exome2 Fastq Files Fastq Files Fastq Files
  • 6. Haas & Zody. Nature Biotechnology 28, 421–423 (2010) Using known annotations And compare to known annotations •Differential Expression •Differential Isoform Abundance •RNA editing •SNP, indel detection
  • 7. Technology RNASeq Microarray High run-to run reproducibility Yes Yes Dynamic Range Comparable to actual transcript abundance >8000-fold Hundred fold Able to detect alternative splice site and novel isoforms Yes No De novo analysis of samples without reference genome Yes No Multiplexing Samples in one run Yes No Required amount of total RNA >100 ng ~1 ug Re-analyzable data Yes No
  • 8. Technology RNASeq Microarray Heterogeneity of read coverage across an expressed region Yes No Well understood sources of experimental bias No Yes Data portable on a flush drive (~4G) No Yes Data is analyzable by any PC No Yes Cheaper cost per sample No(?) Yes(?)
  • 12. Comparing Expression Profiles from Microarrays to RNASeq n=7532 n=4537
  • 13. Mooney M, PloSOne (2013) 10 Lymphoma (3T-cell, 7 B-cell) 4 Normal lymph node Total RNA PE100 run 50-100 million mapped reads Compare 15,092 annotated genes on chip
  • 14. Mooney M, PloSOne (2013) T NB r=0.6; p<10-15
  • 15. c. elegans Biological Replicates for L2 andYA stages AffyTilingArrays* Illumina RNASeq Agarwal, BMC Genomics (2010) * Covers whole c.elegans genome
  • 16. Differential Expression genes between the L2 andYA stage Agarwal, BMC Genomics (2010)
  • 17. RNA-Seq and tiling arrays Tiling Array Microarray Maximum Sensitivity RNASeq 11-plex RNASeq 6-plex Agarwal, BMC Genomics (2010)
  • 18.
  • 19. Per Sample Microarray Illumina HiSeq 1 per Chip/Lane $670 $4,010.00 2 plex NA $2,097.50 4-plex NA $1,141.25 6-plex NA $822.50 8-plex NA $663.13 6-plex 11-plex
  • 20. Per Sample Microarray Illumina HiSeq 1 per Chip/Lane $670 $4,010.00 2 plex NA $2,097.50 4-plex NA $1,141.25 6-plex NA $822.50 8-plex NA $663.13
  • 21.
  • 22. Data Per Sample Time to download 1 Sample Time to download 100 samples Cost to Store on the Cloud per Month RNASeq 30-65GB 1 Hr 6 days $270 Microarray 30MB 5 second 8 minutes $0.30 http://www.ncbi.nlm.nih.gov/genbank/statistics
  • 23. -Application withUser Interface RNA-Seq analysis (i.e. Galaxy) can only handle very few samples -Knowledge of Linux server, scripting language, programming language is absolutely REQUIRED -Lack of detailed understanding in NGS technology and data leads to diverse bioinformatics tools with different characteristics LawWC ,Voom!, Bionconductor (2013)
  • 24.
  • 25. The answer isYes - Transcriptome profiles generated by microarray and RNASeq are in strongly concordance - Microarray data generated in the last decades is durable - RNASeq is it offers more a lot more biological information than microarray that is re-analyzable - NGS is getting cheaper However, the devil is in the data - NGS data is a lot more expensive to store and analyze - Specialized computing infrastructure and personnel are required to take advantage of the information from NGS data

Notes de l'éditeur

  1. The basic concept behind the use of GeneChip arrays for gene expression is simple: labeled cDNA or cRNA targets derived from the mRNA of an experimental sample are hybridized to nucleic acid probes attached to the solid support. By monitoring the amount of dye label associated with each DNA location, it is possible to infer the abundance of each mRNA species represented. For transcriptome profiling, the input is usually about 1ug total RNA that are poly-A selected to ensure only mature mRNA is being assayed.
  2. Poly(A)+ mRNA is purified, fragmented, and then converted to a cDNA library with 5′ and 3′ adapter sequences. Short sequence reads are generated from the cDNA library. Normally, reads are mapped to previously annotate known transcripts and a pile un-mapped reads are kept. Reads that map to novel expressed sequences, including alternative exons and corresponding splice junction sequences
  3. Two RNA sample types MAQC brain and universal human Reference RNA were processed using 5 technical replicates on both microarray and RNA-Seq. Once teh data is generated, the microarray data was processed using MAQC. For RNA-Seq, the sample cDNA libraries were prepared with Illumina protocol and sequenced to a depth of ~30 million mapped reads.
  4. This is the scatter plot of technical replicates of the samples analyzed by RNA-Seq and microarray. The false positive rates are comparable between the two methods, and both methods have extremely high correlation between replicates (R&gt;0.99). The plots demonstrate that RNA-Seq identifies more genes and spans a wider dynamic range compared to the microarray.
  5. Scatterplot of fold change per gene as measured by RNASeq and microarray. Genes identified as differentially expressed by both platform are plotted in red, genes identified by RNASeq in blue, microarray in yellow and neither ins green. While the correlation between the two platforms in identifying differentially expressed genes is really high, this figure clearly indicates that a discrepancy between the platforms in the ability to identify genes as differentially expressed. The gene subset segmentation reveal that RNA-Seq counts identified significantly more differentially expressed genes. However, microarray does detect gene expression differences. Further valudation from a subset of 1000 genes for which PCR data is available, RNASeq data shows higher concordance with PCR results than microarray.
  6. A study by Mooney et al, use a paired RNA sequencing (RNA-Seq)/microarray analysis of a set of 4 normal canine lymph nodes and 10 canine lymphoma fine needle aspirates to identify technical biases and variation between the technologies. We use a paired RNA sequencing (RNA-Seq)/microarray analysis of a set of four normal canine lymph nodes and ten canine lymphoma fine needle aspirates to identify technical biases and variation between the technologies and compare the 15,092 annotated genes on chip.
  7. Both RNA-Seq and microarray observations provide present detection calls for 15,092 genes in each of the 14 samples. Thepercent present detection calls provided by the two technologies agreed with high frequency (73%) and were statistically associated(Table 3; p,10215, odds ratio .40). Among genes probed by both methods, percent present detection frequencies of 69% and 44%were obtained by RNA-Seq and microarray, respectively. Among genes called present using microarray over 97% were detectedusing RNA-Seq.Variation among expression profiles obtained using RNASeq is similar to that obtained using microarray after removing contributions of the first surrogate variable [42]. Each letter denotes a sample from a dog having a normal (N), B-cell (B), or T-cell (T) diagnosis as in the legend, with subscript ‘m’ run on the microarray platform and subscript ‘r’ run onthe NGS platform. a) Principal component scores b) Hierarchical clustering
  8. Here, we compare these two platforms using a matched sample of poly(A)-enriched RNA isolated from thesecond larval stage of C. elegansto Young adult (YA)for all genes Each point represents a gene from the composite model. RNA-Seq expression levels per gene were measured using RPKM, and tiling array levels were measured using the mean intensity of probes falling within composite exons. The Spearman&apos;s coefficient is 0.90, indicating that the platforms correlate well on identical samples. The disproportionate number of genes in the upper left likely represents cross-hybridization.
  9. Differential expression of genes between the L2 and YA stages. (a) Correlation of log2(YA/L2) ratios between RNA-Seq and tiling arrays.. Black: not significantly differentially expressed between samples.Blue: significantly differentially expressed (q ≤ 0.01). The ratio of expression levels is well-correlated, but RNA-Seq has a larger dynamic range. (b) Venn diagram of genes called differentially expressed by each platform. There is significant overlap (8,976) between the two platforms, but more genes were called differentially expressed by RNA-Seq (14,201) than by tiling arrays (10,283), likely reflecting its greater dynamic range. A total of 4,326 genes were not called differentially expressed by either technology. 
  10. ROC curve analysis. Black: tiling array. Red: RNA-Seq with all 32 million reads. It is evident that the RNA-Seq substantially outperforms the tiling array with consistently higher sensitivity at lower FPR. Remaining curves are for RNA-Seq with only a subset of reads utilized. At an FPR = 0.05, just 4 million reads (blue) are required to attain the same sensitivity as two tiling array replicates.