SlideShare une entreprise Scribd logo
1  sur  33
Télécharger pour lire hors ligne
Single-Cell RNA-seq analysis
Section 1:Overview of Single-Cell RNA-seq
Presented By Alireza Dosutmohammadi
M.Sc. in Bioinformatics, Tarbiat Modares University
Oct 2021
RNA-seq Vs Microarrays
RNA-seq allows profiling the transcripts in a sample in an efficient and cost-
effective way.
RNA-seq allows for an unbiased sampling of all transcripts in a sample, rather
than being limited to a pre-determined set of transcripts
2
with bulk RNA-seq:
we can only estimate the average expression level for each gene across a
population of cells, without regard for the heterogeneity in gene expression
across individual cells of that sample. Therefore, it is insufficient for studying
heterogeneous systems, e.g. early development studies or complex tissues
such as the brain.
Unlike with the bulk approach, with scRNA-seq we can estimate a distribution
of expression levels for each gene across a population of cells.
Bulk RNA-seq Vs Single-Cell RNA-seq
3
Bulk RNA-seq Vs Single-Cell RNA-seq
4
5
Bulk RNA-seq Vs Single-Cell RNA-seq
Single-Cell RNA-seq
6
• Human Cell Atlas (H. sapiens)
• Tabula Muris (M. musculus)
• Fly Cell Atlas (D. melanogaster)
• Cell Atlas of Worm (C. elegans)
• Arabidopsis Root Atlas (A. thaliana)
Single-cell Atlases
7
Sample Preparation Protocols
8
• Tissue dissection and cell dissociating to obtain a suspension of cells.
• Optionally cells may be selected (e.g. based on membrane markers,
fluorescent transgenes or staining dyes).
• Capture single cells into individual reaction containers (e.g. wells or oil
droplets).
• Extracting the RNA from each cell.
• Reverse-transcribing the RNA to more stable cDNA.
• Amplifying the cDNA (either by in vitro transcription or by PCR).
• Preparing the sequencing library with adequate molecular adapters.
• Sequencing, usually with paired-end Illumina protocols.
• Processing the raw data to obtain a count matrix of genes-by-cells
• Carrying several downstream analysis.
Sample Preparation Protocols
9
Sample Preparation Protocols
10
Sample Preparation Protocols
two most important aspects are: cell capture or isolation and transcript
quantification.
11
Sample Preparation Protocols
In tissues where cell dissociation is difficult or in frozen tissue samples, instead
of isolating whole single cells it is possible to instead isolate single nuclei. Apart
from the isolation step, the protocol to prepare single-nuclei sequencing
libraries is similar to that of single-cell protocols. However, nuclear RNA usually
contains a higher proportion of unprocessed RNA, with more of the sequenced
transcripts containing introns.
12
Sample Preparation Protocols
Cell Capturing methods:
• Microtitre-plate-based
• Microfluidic-array-based
• Microfluidic-droplet-based
The strategy determines the throughput of the experiment.
13
Sample Preparation Protocols
Cell Capturing methods:
14
Sample Preparation Protocols
Cell Capturing methods:
Microtitre-plate methods (well-based methods ):
• isolating cells into individual wells of the plate using, for example, pipetting,
microdissection or fluorescent activated cell sorting (FACS).
• Advantage: take pictures of the cells before library preparation.
• Can identify and discard damaged cells or find wells containing doublets
associate information such as cell size and the intensity of any used labels
with the well coordinates.
• The main drawback: they are often low-throughput.
15
Sample Preparation Protocols
Cell Capturing methods:
Microfluidic-array platforms:
• integrated system for capturing cells and for carrying out the reactions
necessary for the library preparations.
• they provide a higher throughput than microtitre-plate-based methods.
• only around 10% of cells are captured in a microfluidic platform
• they are not appropriate if one is dealing with rare cell-types or very small
amounts of input.
• has to be taken with the cell sizes captured by the arrays, as the nanowells
are customised for particular sizes. this may therefore affect the unbiased
sampling of cells in complex tissues.
• the chip is relatively expensive.
16
Sample Preparation Protocols
Cell Capturing methods:
Microfluidic-droplet methods:
• offer the highest throughput.
• They work by encapsulating individual cells inside a nanoliter-sized oil droplet,
together with a bead.
• The bead is loaded with enzymes and other components required to construct
the library.
• Each bead contains a unique barcode which is attached to all of the
sequencing reads originating from that cell.
• all of the droplets can be pooled, sequenced together and the reads can
subsequently be assigned to the cell of origin based on those barcodes.
• Droplet platforms have relatively cheap library preparation costs on the order of
0.05 USD/cell.
17
Sample Preparation Protocols
Transcript Quantification:
• full-length: uniform read coverage across the whole transcript
• tag-based: only capture either the 5’ or 3’ ends
18
Sample Preparation Protocols
Transcript Quantification:
• full-length Protocol:
• identical to what is done in bulk RNA-seq
• Although in theory full-length protocols should provide an even coverage of
transcripts, there can sometimes be biases in the coverage across the gene
body.
• Full-length protocols also allow the detection of splice variants.
19
Sample Preparation Protocols
Transcript Quantification:
• SMART-seq2 is a popular low-throughput method, providing full-length transcript
quantification. It is ideally suited for studying a smaller group of cells in greater
detail.
• 10x Chromium is a popular high-throughput method, using UMIs for transcript
quantification (from either 3’ or 5’ ends). It is ideally suited to study highly
heterogeneous tissues and sample large populations of cells at scale.
20
Sample Preparation Protocols
Transcript Quantification:
• tag-based protocols:
• only one of the ends (3’ or 5’) of the transcript is sequenced.
• 3’ protocols are more commonly used, many protocols now allow sequencing from
either end (e.g. 10x Chromium supports both).
• Advantage of 5’-end sequencing: obtain information about the transcription start site
(TSS), which allows to explore whether there is differential TSS usage across cells.
• Advantage: they can be combined with unique molecular identifiers (UMIs), which
can help improve the accuracy of transcript quantification.
• Unique molecular identifiers (UMIs) are a type of molecular barcoding that provides
error correction and increased accuracy during sequencing. These molecular
barcodes are short sequences used to uniquely tag each molecule in a sample
library.
• Disadvantage: being restricted to one end of the transcript only, it reduces our ability
to unambiguously align reads to a transcript, as well as making it difficult to distinguish
different isoforms.
21
Importance of Single-Cell RNA-seq
22
Comparing different protocols
23
24
Comparing different protocols
25
Comparing different protocols
• Sensitivity: how many genes are detected per cell
• accuracy (e.g. compared to bulk RNA-seq)
• recover all cell types present in a sample.
• low-throughput methods have higher sensitivity compared to high-
throughput methods, such as 10x Chromium.
• low-throughput methods did not capture some of the rarer cell types in their
samples, leading to an incomplete characterisation of the cell population.
26
Comparing different protocols
• if one is interested in characterizing the composition of a heterogeneous
tissue, then a droplet-based method is more appropriate, as it allows a very
large number of cells to be captured in a mostly unbiased manner.
• Full-length transcript quantification will be more appropriate if one is
interested in studying different isoforms, since tagged protocols are much
more limited in this regard. By contrast, UMIs can only be used with tagged
protocols and they can improve gene-level quantification.
• If one is interested in rare cell types (for which known markers are not
available), then more cells need to be sequenced, which will increase the
cost of the experiment.
27
What Protocol Should I Choose?
• take into account when performing scRNA-seq experiments Factors such as:
the cost per cell, how many cells one needs, or how much to sequence
each cell.
• Care has to be taken to avoid biases due to batches being processed at
different times.
Important !
28
How many cells do we need to sample so that we see at least n cells of each
type?
• This depends on the number of cell type present and the diversity, i.e. the
entropy.
• Assume that there are 10 rare cell types, each one present at a fraction of
2% of the total population. If we want to be 95% confident that our sample
contains at least 5 cells from each of those cell types, we need to sample
at least 619 cells in total.
29
Important !
• The main difference between bulk and single cell RNA-seq is that each
sequencing library represents a single cell, instead of a population of cells.
• Another important aspect to take into account are batch effects. These
can be observed even when sequencing the same material using different
technologies, and if not properly normalised, can lead to incorrect
conclusions.
Data challenges
30
31
Data challenges
• if planning an experiment to compare healthy and diseased tissues
from 10 patients each, if only 10 samples can be processed per day, it
is best to do 5 healthy + 5 diseased together each day, rather than
prepare all healthy samples one day and all diseased samples in
another.
• Another consideration is to ensure that there is replication of tissue
samples. For example, when collecting tissue from an organ, it may be
a good idea to take multiple samples from different parts of the organ.
Or consider the time of day when samples/replicates are collected
(due to possible circadian changes in gene expression).
• all the common best practices in experimental design should be taken
into account.
32
Data challenges
33
Data challenges

Contenu connexe

Tendances

Tendances (20)

Rna seq
Rna seqRna seq
Rna seq
 
RNA-seq Analysis
RNA-seq AnalysisRNA-seq Analysis
RNA-seq Analysis
 
Next generation sequencing technologies for crop improvement
Next generation sequencing technologies for crop improvementNext generation sequencing technologies for crop improvement
Next generation sequencing technologies for crop improvement
 
Pyrosequencing 454
Pyrosequencing 454Pyrosequencing 454
Pyrosequencing 454
 
Ngs intro_v6_public
 Ngs intro_v6_public Ngs intro_v6_public
Ngs intro_v6_public
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Roche Pyrosequencing 454 ; Next generation DNA Sequencing
Roche Pyrosequencing 454 ; Next generation DNA SequencingRoche Pyrosequencing 454 ; Next generation DNA Sequencing
Roche Pyrosequencing 454 ; Next generation DNA Sequencing
 
Advances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell TechnologyAdvances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell Technology
 
RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1
 
Rna seq pipeline
Rna seq pipelineRna seq pipeline
Rna seq pipeline
 
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
 
Data analysis pipelines for NGS applications
Data analysis pipelines for NGS applicationsData analysis pipelines for NGS applications
Data analysis pipelines for NGS applications
 
THIRD GEN SEQUENCING.pptx
THIRD GEN SEQUENCING.pptxTHIRD GEN SEQUENCING.pptx
THIRD GEN SEQUENCING.pptx
 
Exome sequence analysis
Exome sequence analysisExome sequence analysis
Exome sequence analysis
 
Conventional and next generation sequencing ppt
Conventional and next generation sequencing pptConventional and next generation sequencing ppt
Conventional and next generation sequencing ppt
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods
 
scRNA-Seq Workshop Presentation - Stem Cell Network 2018
scRNA-Seq Workshop Presentation - Stem Cell Network 2018scRNA-Seq Workshop Presentation - Stem Cell Network 2018
scRNA-Seq Workshop Presentation - Stem Cell Network 2018
 
Ion Torrent Sequencing
Ion Torrent SequencingIon Torrent Sequencing
Ion Torrent Sequencing
 
Rnaseq basics ngs_application1
Rnaseq basics ngs_application1Rnaseq basics ngs_application1
Rnaseq basics ngs_application1
 

Similaire à Overview of Single-Cell RNA-seq

MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptxMOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
AmosiRichard
 
Microarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyMicroarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome study
Bia Khan
 
212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis
SHAPE Society
 

Similaire à Overview of Single-Cell RNA-seq (20)

Shotgun (2) metagenomics
Shotgun (2) metagenomicsShotgun (2) metagenomics
Shotgun (2) metagenomics
 
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptxMOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
 
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCINGDNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
 
Sequencing genes and genomes
Sequencing genes and genomesSequencing genes and genomes
Sequencing genes and genomes
 
Gene Libarries
Gene LibarriesGene Libarries
Gene Libarries
 
Barker immemxi final March 2016
Barker immemxi final March 2016Barker immemxi final March 2016
Barker immemxi final March 2016
 
Flow cytometry for cell componenet analysis
Flow cytometry for cell componenet analysisFlow cytometry for cell componenet analysis
Flow cytometry for cell componenet analysis
 
genetic_engineering__unit_3__lecture_1.ppt
genetic_engineering__unit_3__lecture_1.pptgenetic_engineering__unit_3__lecture_1.ppt
genetic_engineering__unit_3__lecture_1.ppt
 
Microarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyMicroarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome study
 
Transcriptomics
TranscriptomicsTranscriptomics
Transcriptomics
 
Transcriptomics,techniqes, applications.pdf
Transcriptomics,techniqes, applications.pdfTranscriptomics,techniqes, applications.pdf
Transcriptomics,techniqes, applications.pdf
 
Systems biology for Medicine' is 'Experimental methods and the big datasets
Systems biology for Medicine' is 'Experimental methods and the big datasetsSystems biology for Medicine' is 'Experimental methods and the big datasets
Systems biology for Medicine' is 'Experimental methods and the big datasets
 
212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis
 
Molecular genetics
Molecular geneticsMolecular genetics
Molecular genetics
 
212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis
 
Basic molecular genetic studies in atherosclerosis
Basic molecular genetic studies in atherosclerosisBasic molecular genetic studies in atherosclerosis
Basic molecular genetic studies in atherosclerosis
 
Genomics seminar
Genomics seminarGenomics seminar
Genomics seminar
 
Third Generation Sequencing
Third Generation Sequencing Third Generation Sequencing
Third Generation Sequencing
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
Genomic library
Genomic libraryGenomic library
Genomic library
 

Plus de Alireza Doustmohammadi

Plus de Alireza Doustmohammadi (10)

Processing Raw scRNA-Seq Sequencing Data
Processing Raw scRNA-Seq Sequencing DataProcessing Raw scRNA-Seq Sequencing Data
Processing Raw scRNA-Seq Sequencing Data
 
Introduction to Applied Machine Learning
Introduction to Applied Machine LearningIntroduction to Applied Machine Learning
Introduction to Applied Machine Learning
 
OSPREY 3.0: Open-Source Protein Redesign for You
OSPREY 3.0: Open-Source Protein Redesign for YouOSPREY 3.0: Open-Source Protein Redesign for You
OSPREY 3.0: Open-Source Protein Redesign for You
 
WGCNA: an R package for weighted correlation network analysis
WGCNA: an R package for weighted  correlation network analysisWGCNA: an R package for weighted  correlation network analysis
WGCNA: an R package for weighted correlation network analysis
 
Introduction to Kaa IoT platform
Introduction to Kaa IoT platformIntroduction to Kaa IoT platform
Introduction to Kaa IoT platform
 
Speech processing and the induction of spoken language
Speech processing and the induction of spoken languageSpeech processing and the induction of spoken language
Speech processing and the induction of spoken language
 
Digital data storage technologies
Digital data storage technologiesDigital data storage technologies
Digital data storage technologies
 
DevOps
DevOpsDevOps
DevOps
 
differential expression genes (DEG)
differential expression genes (DEG)differential expression genes (DEG)
differential expression genes (DEG)
 
Lowest common ancestor (LCA) algorithm
Lowest common ancestor (LCA) algorithmLowest common ancestor (LCA) algorithm
Lowest common ancestor (LCA) algorithm
 

Dernier

Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
Health
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
HyderabadDolls
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
SayantanBiswas37
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
gajnagarg
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 

Dernier (20)

20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about them
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbers
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 

Overview of Single-Cell RNA-seq

  • 1. Single-Cell RNA-seq analysis Section 1:Overview of Single-Cell RNA-seq Presented By Alireza Dosutmohammadi M.Sc. in Bioinformatics, Tarbiat Modares University Oct 2021
  • 2. RNA-seq Vs Microarrays RNA-seq allows profiling the transcripts in a sample in an efficient and cost- effective way. RNA-seq allows for an unbiased sampling of all transcripts in a sample, rather than being limited to a pre-determined set of transcripts 2
  • 3. with bulk RNA-seq: we can only estimate the average expression level for each gene across a population of cells, without regard for the heterogeneity in gene expression across individual cells of that sample. Therefore, it is insufficient for studying heterogeneous systems, e.g. early development studies or complex tissues such as the brain. Unlike with the bulk approach, with scRNA-seq we can estimate a distribution of expression levels for each gene across a population of cells. Bulk RNA-seq Vs Single-Cell RNA-seq 3
  • 4. Bulk RNA-seq Vs Single-Cell RNA-seq 4
  • 5. 5 Bulk RNA-seq Vs Single-Cell RNA-seq
  • 7. • Human Cell Atlas (H. sapiens) • Tabula Muris (M. musculus) • Fly Cell Atlas (D. melanogaster) • Cell Atlas of Worm (C. elegans) • Arabidopsis Root Atlas (A. thaliana) Single-cell Atlases 7
  • 9. • Tissue dissection and cell dissociating to obtain a suspension of cells. • Optionally cells may be selected (e.g. based on membrane markers, fluorescent transgenes or staining dyes). • Capture single cells into individual reaction containers (e.g. wells or oil droplets). • Extracting the RNA from each cell. • Reverse-transcribing the RNA to more stable cDNA. • Amplifying the cDNA (either by in vitro transcription or by PCR). • Preparing the sequencing library with adequate molecular adapters. • Sequencing, usually with paired-end Illumina protocols. • Processing the raw data to obtain a count matrix of genes-by-cells • Carrying several downstream analysis. Sample Preparation Protocols 9
  • 11. Sample Preparation Protocols two most important aspects are: cell capture or isolation and transcript quantification. 11
  • 12. Sample Preparation Protocols In tissues where cell dissociation is difficult or in frozen tissue samples, instead of isolating whole single cells it is possible to instead isolate single nuclei. Apart from the isolation step, the protocol to prepare single-nuclei sequencing libraries is similar to that of single-cell protocols. However, nuclear RNA usually contains a higher proportion of unprocessed RNA, with more of the sequenced transcripts containing introns. 12
  • 13. Sample Preparation Protocols Cell Capturing methods: • Microtitre-plate-based • Microfluidic-array-based • Microfluidic-droplet-based The strategy determines the throughput of the experiment. 13
  • 14. Sample Preparation Protocols Cell Capturing methods: 14
  • 15. Sample Preparation Protocols Cell Capturing methods: Microtitre-plate methods (well-based methods ): • isolating cells into individual wells of the plate using, for example, pipetting, microdissection or fluorescent activated cell sorting (FACS). • Advantage: take pictures of the cells before library preparation. • Can identify and discard damaged cells or find wells containing doublets associate information such as cell size and the intensity of any used labels with the well coordinates. • The main drawback: they are often low-throughput. 15
  • 16. Sample Preparation Protocols Cell Capturing methods: Microfluidic-array platforms: • integrated system for capturing cells and for carrying out the reactions necessary for the library preparations. • they provide a higher throughput than microtitre-plate-based methods. • only around 10% of cells are captured in a microfluidic platform • they are not appropriate if one is dealing with rare cell-types or very small amounts of input. • has to be taken with the cell sizes captured by the arrays, as the nanowells are customised for particular sizes. this may therefore affect the unbiased sampling of cells in complex tissues. • the chip is relatively expensive. 16
  • 17. Sample Preparation Protocols Cell Capturing methods: Microfluidic-droplet methods: • offer the highest throughput. • They work by encapsulating individual cells inside a nanoliter-sized oil droplet, together with a bead. • The bead is loaded with enzymes and other components required to construct the library. • Each bead contains a unique barcode which is attached to all of the sequencing reads originating from that cell. • all of the droplets can be pooled, sequenced together and the reads can subsequently be assigned to the cell of origin based on those barcodes. • Droplet platforms have relatively cheap library preparation costs on the order of 0.05 USD/cell. 17
  • 18. Sample Preparation Protocols Transcript Quantification: • full-length: uniform read coverage across the whole transcript • tag-based: only capture either the 5’ or 3’ ends 18
  • 19. Sample Preparation Protocols Transcript Quantification: • full-length Protocol: • identical to what is done in bulk RNA-seq • Although in theory full-length protocols should provide an even coverage of transcripts, there can sometimes be biases in the coverage across the gene body. • Full-length protocols also allow the detection of splice variants. 19
  • 20. Sample Preparation Protocols Transcript Quantification: • SMART-seq2 is a popular low-throughput method, providing full-length transcript quantification. It is ideally suited for studying a smaller group of cells in greater detail. • 10x Chromium is a popular high-throughput method, using UMIs for transcript quantification (from either 3’ or 5’ ends). It is ideally suited to study highly heterogeneous tissues and sample large populations of cells at scale. 20
  • 21. Sample Preparation Protocols Transcript Quantification: • tag-based protocols: • only one of the ends (3’ or 5’) of the transcript is sequenced. • 3’ protocols are more commonly used, many protocols now allow sequencing from either end (e.g. 10x Chromium supports both). • Advantage of 5’-end sequencing: obtain information about the transcription start site (TSS), which allows to explore whether there is differential TSS usage across cells. • Advantage: they can be combined with unique molecular identifiers (UMIs), which can help improve the accuracy of transcript quantification. • Unique molecular identifiers (UMIs) are a type of molecular barcoding that provides error correction and increased accuracy during sequencing. These molecular barcodes are short sequences used to uniquely tag each molecule in a sample library. • Disadvantage: being restricted to one end of the transcript only, it reduces our ability to unambiguously align reads to a transcript, as well as making it difficult to distinguish different isoforms. 21
  • 26. • Sensitivity: how many genes are detected per cell • accuracy (e.g. compared to bulk RNA-seq) • recover all cell types present in a sample. • low-throughput methods have higher sensitivity compared to high- throughput methods, such as 10x Chromium. • low-throughput methods did not capture some of the rarer cell types in their samples, leading to an incomplete characterisation of the cell population. 26 Comparing different protocols
  • 27. • if one is interested in characterizing the composition of a heterogeneous tissue, then a droplet-based method is more appropriate, as it allows a very large number of cells to be captured in a mostly unbiased manner. • Full-length transcript quantification will be more appropriate if one is interested in studying different isoforms, since tagged protocols are much more limited in this regard. By contrast, UMIs can only be used with tagged protocols and they can improve gene-level quantification. • If one is interested in rare cell types (for which known markers are not available), then more cells need to be sequenced, which will increase the cost of the experiment. 27 What Protocol Should I Choose?
  • 28. • take into account when performing scRNA-seq experiments Factors such as: the cost per cell, how many cells one needs, or how much to sequence each cell. • Care has to be taken to avoid biases due to batches being processed at different times. Important ! 28
  • 29. How many cells do we need to sample so that we see at least n cells of each type? • This depends on the number of cell type present and the diversity, i.e. the entropy. • Assume that there are 10 rare cell types, each one present at a fraction of 2% of the total population. If we want to be 95% confident that our sample contains at least 5 cells from each of those cell types, we need to sample at least 619 cells in total. 29 Important !
  • 30. • The main difference between bulk and single cell RNA-seq is that each sequencing library represents a single cell, instead of a population of cells. • Another important aspect to take into account are batch effects. These can be observed even when sequencing the same material using different technologies, and if not properly normalised, can lead to incorrect conclusions. Data challenges 30
  • 32. • if planning an experiment to compare healthy and diseased tissues from 10 patients each, if only 10 samples can be processed per day, it is best to do 5 healthy + 5 diseased together each day, rather than prepare all healthy samples one day and all diseased samples in another. • Another consideration is to ensure that there is replication of tissue samples. For example, when collecting tissue from an organ, it may be a good idea to take multiple samples from different parts of the organ. Or consider the time of day when samples/replicates are collected (due to possible circadian changes in gene expression). • all the common best practices in experimental design should be taken into account. 32 Data challenges