SlideShare une entreprise Scribd logo
1  sur  5
Télécharger pour lire hors ligne
African Journal of Science and Research,2014,(3)5:08-12
ISSN: 2306-5877
Available Online: http://ajsr.rstpublishers.com/
PROBES SPECIFICITY IN ARRAY DESIGN INFLUENCES THE AGREEMENT
BETWEEN MICROARRAY AND RNA-Seq IN GENE EXPRESSION ANALYSIS.
Dago Dougba Noel 1,3*, Alberto Ferrarini 3, Diarassouba Nafan 1, Fofana Inza Jésus 1, Silué Souleymane 1,
Giovanni Malerba 2 andMassimo Delledonne 3
1)UPGC University of Korhogo Cote d’Ivoire Unité Formation Recherche Sciences Biologiques, Côte d'Ivoire.
2 )Department of Life and Reproduction sciences, Section of Biology and Genetics, University of Verona, Italy.
3) Department of Biotechnology, University of Verona, Italy Strada le Grazie 15, Cà vignal 1,Italy.
Email:dgnoel7@gmail.com
Received:04, Aug,2014 Accepted: 19 ,Sep,2014
Abstract
More than a decade, oligonucleotide microarrays have been the method of choice for transcriptional profiling studies, used to characterize
biological systems. The power of microarray platforms depends on the number, identity and specificity of the oligonucleotide probes for their
target gene models. In recent time, however, researchers have been increasingly focusing high-throughput sequencing RNA-Seq which offers
advantages when examining transcriptome fine structure; for example, in the detection of allele-specific expression and splice junctions. We are
investigating how the specificity of oligonucleotide probes in an array design influences the agreement between RNA-Seq and microarray
platforms in gene expression, differential analysis. Hence, we are essaying the agreement between a custom microarray platform, based on
multiple long oligonucleotide probes (60 mer) per gene model transcript (Grape custom microarray platform), and RNA-Seq, by removing
microarray, design less specific probes discriminating differential expressed genes (DEGs), analyzing two Vitis vinifera berry developmental
stages. We were able to demonstrate that the agreement between both RNA-Seq and microarray platforms calling DEGs, depend on the high
rate of probes set with specific oligonucleotide probes. Furthermore, this investigation confirmed the superiority of RNA-Seq next generation
sequencing (NGS) technology, as opposed to microarray in gene expression differential analysis.
Keywords: Microarray, Probe set, oligonucleotide probe, RNA-Seq.
INTRODUCTION
Microarray technology has revolutionized molecular biology. It
uses hundreds of millions of highly organized probes on a limited solid
surface to simultaneously interrogate the multiple RNA or DNA
molecules, defined as target within an individual sample [1]. The
microarray technologies are universal tools that can be applied
throughout the life sciences [2-4]. mRNA-expression profiling is the
most frequent application. The principle behind of a microarray
experiment, is that mRNAs from a given cell line or tissue is labeled
sample, and hybridized in parallel to a large number of DNA
sequences, it is then immobilized on a solid surface in an ordered
array. The number of immobilized probes can vary from hundreds
designed to monitor the expression of few specific genes to hundreds
of thousands representing complete transcriptomes. Microarray
expression technology is strongly influenced by the number, identity
and quality (specificity) of the oligonucleotide probes on the array.
Tracing and maintaining the identity of the thousands of probes
requires an informatic system throughout the fabrication process.
Gene-specific oligonucleotide probes are currently used in microarrays
to avoid cross-hybridization of highly similar sequences. If the probes
are not optimized for sequence specificity, microarrays can generate
false-positive data due to non-specific crosshybridization to highly
similar sequences, gene families [5], or alternatively spliced variants
[6]. Long oligonucleotide probes are prone to cross-hybridization and
thus often exhibit poor discrimination and hybridize to similar
sequences. Studies have suggested that the percentage sequence
homology is a reasonable predictor of cross-hybridization [7] and to
overcome this crosshybridization problem, several laboratories have
adopted the practice of designing oligonucleotide probes that target
regions of low sequence similarity [8]. The literature data [9] indicate
that longer oligonucleotides provide significantly better detection
sensitivity than shorter probes. Single or multiple probes per genes can
be designed. Cheng-Chung Chou et al. [10] demonstrated that a single
longer oligonucleotide probe for a gene model could be sufficient for
accurate expression measurement if the probe is validated
experimentally. However, oligonucleotide probes binding to different
regions of a gene yield different signal intensities [1], and it is difficult to
predict whether an oligonucleotide probe will bind efficiently to its
target sequence and yield a good hybridization signal on the basis of
sequence information alone [11].
As a result, multiple probes per gene model transcript have been
used in oligonucleotide array designs to obtain reliable quantitative
information of gene expression. Furthermore it has been shown that
the measurement bias decreased with an increase in the number of
probes per gene. Fewer probes per gene were required for the longer
probes to achieve the same bias reduction as shorter probes. As
reported by Dago N., 2012 work [12], Grape custom microarray
designs based on multiple longer probes (60 mer) per gene model
transcript exhibited a higher sensitivity than their corresponding
microarray designs based on shorter multiple probes (35-40 mer) per
gene model transcript in detecting DEGs. The same investigation and
unpublished data form the Functional Genomic Center of the university
of Verona-Italy showed that as regards this microarray design, several
probe sets from 6538 genes out of 29582 demonstrated a different
behavior among themselves; a considerable number of these genes
(4146 in total) claimed to be DEGs. Thus, in order to determine the
specificity of the analyzed oligonucleotide probes, a blast analysis with
no mismatch by setting Tm ≥ 85±15°C threshold to predict
oligonucleotide probes secondary structure was performed against all
represented gene model transcripts of Vitis vinifera grape 12x
assembly transcriptome [13].
Next, we selected the specific probes (oligonucleotide probes
and/or probe set that recognized only their gene model transcript at a
African Journal of Science and Research , 2014,(3)5:08-12
Tm ≥ 85±15°C with no mismatch) from each analyzed probe set and
compared the gene expression data of their corresponding genes with
those of RNA-Seq. In recent times, RNA-Seq has emerged as a
powerful new technology for transcriptome analysis [14-15]. Although
RNA-Seq is still a technology under active development, it offers
several key advantages over microarray technology. First, unlike
hybridization-based approaches, RNA-Seq is not limited to detecting
transcripts that correspond to existing genomic sequence. This makes
RNA-Seq particularly attractive for non model organisms with genomic
sequences that are yet to be determined. RNA-Seq can reveal the
precise location of transcription boundaries, to a single-base
resolution. Furthermore, several studies showed that RNA-Seq can
accurately reveal gene expression difference [16]. These factors make
RNA-Seq useful for studying complex transcriptomes [17]. Hence, we
explored the number of specific and less specific oligonucleotide
probes (i) when both RNA-Seq and microarray platforms agree in
discriminating DEGs; (ii) when DEGs have been discriminated
exclusively by RNA-Seq, and/or (iii) by microarray.
MATERIALS AND METHODS
Samples from two development stages of Vitis vinifera grape
(veraison and ripening) have been profiled for global gene
expression using Grape custom microarray platform based on
multiple (4) longer oligonucleotide probes (60 mer) for the genes
essayed [12]. The same samples were previously profiled using NGS
technology (RNA-Seq) as reported in Zenoni et al., 2010 [18].
Microarrays expression data have been pre-processed by RMA
module [19] and analyzed by limma moderated statistical t-test [12].
Agreement assessment between microarray and RNA-Seq platforms
in gene expression differential analysis have been measured in term
of oligonucleotide probes specificity. To that effect, we employed a
thermodynamic approach based on OligoArray 2.0 software [20] to
select probes that exhibited a melting temperature (Tm) ≥85±15°C
as threshold to predict oligonucleotide probes secondary structure.
Next, the selected probes were processed for local alignment
analysis (megaBLAST http://www.ncbi.nlm.nih.gov/blast/Blast)
against Vitis vinifera grape 12x assembly transcriptome [13]. Then
oligonucleotide probes that recognized exclusively their gene model
transcript with no mismatch at Tm ≥ 85±15 have been classified as
specific.
RNA preparation
Sample of Vitis vinifera (grape) berry at the development stages
of veraison and ripening were collected [18] and total RNA has been
extracted as described in Anita Zamboni et al., 2010 work [21]. The
RNA amount and integrity were essayed using a Nanodrop 2000
instrument (Thermo Scientific) and a Bio-analyzer Chip RNA 6000
(Agilent), respectively.
Hybridization of Grape custom array based on long multiple
probes per gene
Grape custom array hybridization experiment based on One
Color-DNA labeling system with Cy3 fluorescent performed by
processing the same total RNA amount (10μg) of the three analyzed
technical replicates of veraison and ripening development stage [12].
RNA processing, labeling, hybridization and slide wash step were
accomplished following the NimbleGen Arrays User’s Guide Gene
Expression Analysis version 3.1 protocol manufacturer’s instructions.
Hybridization image scanning have been performed with the aid of
the axon scanner Instruments GenePix 4200A at 535 wave length.
For Grape custom array design based on multiple long
9
oligonucleotide probes (60 mer) per gene model transcript, the
summarization of the normalized intensities of the different probes
per transcript was performed according to the Robust Multichip
Average (RMA) algorithm from NimbleScan software [19].
RESULTS
Rate of specific probes for DEGs detected by both microarray
and RNA-Seq platforms
In total 117999 probes from 29582 gene model transcripts of
Grape custom microarray design were analyzed. 85.66% of them
have been selected as specific for their respective gene model
transcript (oligonucleotide probes that recognized exclusively their
gene model transcript at a Tm ≥85±15°C with no mismatch).
However, the survey of specific oligonucleotide probes of DEGs
detected by (i) both microarray and RNA-Seq platforms, (ii)
microarray platform exclusively and (iii) RNA-Seq platform
exclusively have been performed on 17850 genes (expressed genes
called by both Grape custom microarray and RNA-Seq) [12].
Considering microarray data for which, internal replicate probes have
been summarized by RMA algorithm as described by Irizarry et al.
2003 work [19], 6542 genes have been detected as DEGs by both
Grape custom array design and RNA-Seq platforms [12]. Focusing
on these DEGs, 57.56% of the analyzed probe set exhibited 3 or 4
specific oligonucleotide probes against 42.44% with 2, 1 or 0 specific
oligonucleotide probes (see Figure 1). This result suggests that the
agreement between both microarray and RNA-Seq tools in gene
expression differential analysis is carried by the high number of
probe set with specific oligonucleotide probes.
Figure 1 Percentage of probe set with their corresponding specific
oligonucleotide probes for DEGs discriminated by RNA-Seq
microarray and/or by both microarray and RNA-Seq platforms in
gene expression differential analysis.
Rate of specific probes for DEGs detected exclusively by
microarray and/or RNA-Seq
We next analyzed the percentage of specific oligonucleotide
probes in Grape custom microarray design focusing on DEGs called
exclusively by (i) microarray (3518 genes) and (ii) RNA-Seq (3053
gene) [12]. The results of this analysis have been summarized in
Figure 1 (see above). In fact, for DEGs discriminated exclusively by
RNA-Seq platform, 62.28% of the analyzed probe set exhibited 4 or 3
specific oligonucleotide probes against 37.72% with 2, 1 or 0 specific
oligonucleotide probes as showed in Figure 1 (see above). These
results suppose a large involvement of specific oligonucleotide
probes in the discrimination of DEGs previously recognized as such
by RNA-Seq tool exclusively. Further, for DEGs wholly discriminated
by Grape custom microarray design, 53% of the analyzed probe set
display 2, 1 or 0 specific oligonucleotide probes against 47% with 4
or 3 specific oligonucleotide probes. Hence, for DEGs called
exclusively by Grape custom microarray design, the number of
10
specific oligonucleotide probes was comparable with those less
specific oligonucleotide probes, supporting an involvement of a
substantial number of less specific oligonucleotide probes in the
discrimination of DEGs previously recognized as such by Grape
custom microarray design exclusively. In view of the foregoing these
analysis suggest that disagreement between RNA-Seq and
microarray technologies in gene expression differential analysis
could be due to the considerable number of less specific
oligonucleotide probes in the Grape custom microarray design.
Moreover, these results suggest the high specificity and sensitivity of
RNA-Seq in detecting DEGs in gene expression differential analysis
[22].
Correlation between Grape custom microarray and RNA-Seq
Level of gene expression in microarray depend on both spot
intensity (transcript abundance) and oligonucleotide probes
specificity (cross-hybridization) in signal detection. Consequently,
oligonucleotide probes intensity in microarray experiment cannot
represent the absolute gene expression level and is not suitable for
analysis across different platforms. We then introduced a fold
change (FC) parameter. There, FC parameter has been calculated
as the ratio of the measured genes expression values of ripening
sample to the genes expression values of véraison sample. Pearson
correlation of the 17850 expressed genes between Grape custom
microarray design and RNA-Seq platforms in FC measurement have
been previously estimated at 0.72 (R2=0.72) [12]. The deletion of
less specific oligonucleotide probes out the set of analyzed probes of
Grape custom microarray design allows an increase in correlation
between both microarray and RNA-Seq platforms in FC
measurement analysis (from R2=0.72 to R2=0.90); this shows that
microarray designed oligonucleotide probes could be best selected
by the integration of both oligonucleotide probe design strategy
based on score and thermodynamic parameters. This analysis also
proves that a good selection of specific oligonucleotide probes in
Grape custom microarray design improves strongly its agreement
with RNA-Seq in gene expression differential analysis.
Assessment of microarray accuracy and specificity by using
specific probes in array design
To further explore the agreement between microarray and RNA-
Seq in gene expression differential analysis by selecting the Grape
custom microarray specific oligonucleotide probes, a Receiver
Operating Characteristics (ROC) curve was constructed for the
analyzed microarray design (Grape custom microarray design)
assuming RNA-Seq gene expression data set as reference. Each
point on the ROC curve of the Grape custom array represents the
sensitivity on Y-axis (True Positive Rate) and the specificity on X-
axis (False Positive Rate). Deletion of less specific oligonucleotide
probes out the Grape custom microarray design enlarged the area
under the curve (AUC from 0.70 to 90.4%) as showed in Figure 2.
These results demonstrated that less specific oligonucleotide
probes in a microarray design give false signal detection in gene
expression level measurement. By contrast, a high proportion of
specific oligonucleotide probes in microarray design increased it
specificity and accuracy in the discrimination of DEGs when RNA-
Seq gene expression data was assumed as reference. Taking
together, these analysis showed that Grape custom microarray
platform enriched by gene target specific oligonucleotide probes
exhibits a quite similar performance (specificity and accuracy) to
RNA-Seq discriminating DEGs in gene expression differential
analysis (see Figure 2).
Dago Dougba Noel et.al
Figure 2 ROC curve of 17850 genes of Grape custom array platform
in detecting DEGs with all oligonucleotide probes (left) and with only
specific oligonucleotide probes (right). RNA-Seq gene expression
data have been used as reference.
DISCUSSION AND CONCLUSION
The recent development of transcriptomic approaches based on
Next Generation Sequencing (NGS) is gaining popularity as they
provide a genome-wide, precise, quantitative measure of gene
expression. Although RNA-Seq is still a technology under active
development, it offers several key advantages over microarray
technology. Furthermore, several studies showed that RNA-Seq can
accurately reveal gene expression difference [16]. However, because
of the extensive legacy of the data and installed instrument base,
microarrays may still be widely used in the foreseeable future.
Although microarrays have been extensively used as discovery tools
for biological and biomedical studies, the challenge remains whether
this technology can be applied reliably in clinical practice and
regulatory decision making, where high precision and accuracy in
performance are required. Moreover, the power of this technology
depends on the number, identity and quality of the oligonucleotide
probes [23]. In the present paper we investigated how
oligonucleotide probes specificity in Grape custom microarray design
based on longer multiple oligonucleotide probes per gene model
transcript could influence its reliability in gene expression analysis.
Hence, we established a relationship between DEGs indentified by
(i) Grape custom microarray design, (ii) RNA-Seq or (iii) both
platforms and their respective specific oligonucleotide probes
number. However, oligonucleotide probes that recognized
exclusively their gene model transcript at a Tm ≥85±15°C with no
mismatch have been classified as specific.
Our findings showed an involvement of a substantial number of
specific oligonucleotide probes in the discrimination of DEGs
previously recognized as such by both microarray and RNA-Seq
platforms suggesting that accord between the two analyzed gene
expression platforms (Grape custom microarray design and RNA-
Seq) can be improved by increasing the ratio of specific
oligonucleotide probes in microarray design. Next, we showed that
more specific oligonucleotide probes were involved in the
discrimination of DEGs previously recognized as such by RNA-Seq
platform. By contrast a similar percentage of specific and less
specific oligonucleotide probes claimed to be associated with DEGs
exclusively detected by the Grape custom microarray design.
Considered as a whole, these results support a higher specificity of
RNA-Seq technology in the discrimination of DEGs in gene
expression differential analysis [22]. Furthermore, the deletion of less
specific oligonucleotide probes from the set of analyzed
oligonucleotide probes of Grape custom microarray design, improved
the correlation between both microarray and RNA-Seq platforms in
gene expression differential analysis FC measurement showing that
African Journal of Science and Research , 2014,(3)5:08-12
for oligonucleotide probes that are not optimized for sequence
specificity, microarrays can generate false positive data due to non
specific cross-hybridization to highly similar sequences gene families
[5-6].
The ROC curve is a two-dimensional depiction of classifier
performance. To compare classifiers we may want to reduce ROC
performance to a single scalar value representing expected
performance. A common method is to calculate the area under the
ROC curve, abbreviated AUC [24-25]. Indeed, ROC curve analysis
performed in this investigation suggested that an accurate selection
of oligonucleotide probes by integrating a thermodynamic parameter,
(selection of strongly specific oligonucleotide probe) allowed the
Grape custom microarray design to achieve a quite similar
performance to that of the RNA-Seq in calling DEGs in gene
expression differential analysis. Therefore, we demonstrated that
microarray designed oligonucleotide probes could be best selected
by the integration of thermodynamic parameters [20]. Moreover, our
investigation showed that discrepancy between RNA-Seq and
microarray platforms in gene expression differential analysis could
be due to the presence of less specific oligonucleotide probes in
microarray design. As expected, we also showed that a high rate of
specific oligonucleotide probes in a microarray design increase it
specificity and accuracy in discriminating DEGs in gene expression
differential analysis. Then, stringent selection of gene model target
specific oligonucleotide probes based on score and thermodynamic
parameters, improved strongly the performance of microarray
technology in gene expression measurement analysis.
In conclusion this investigation holds that microarray design enriched
by gene target specific oligonucleotide probes exhibits a quite similar
performances to that of RNA-Seq in gene expression differential
analysis, suggesting that (i) the accurate detection of DEGs by
microarray platform and that (ii) the agreement between both
microarray and RNA-Seq gene expression platforms is strongly
correlated with the best selection of gene target specific
oligonucleotide probes based on thermodynamic parameters.
References
1. Lockhart DJ, Dong H, Byrne MC, Follettie MT, Gallo MV, Chee MS,
Mittmann M, Wang C, Kobayashi M, Horton H, Brown EL. (1996
Dec). Expression monitoring by hybridization to high-density
oligonucleotide arrays. Nat Biotechnol.14 (13):1675-1680.
2. Brown PO, Botstein D. Exploring the new world of the genome
with DNA microarrays (1999 Jan 21). Nat Genet. (1 Suppl):33-37.
3. Lockhart DJ, Winzeler EA (2000 Jun15) Genomics, gene
expression and DNA arrays. Nature.405 (6788):827-836.
4. Young T, Peppard P. (2000 Jun15). Sleep-disordered breathing
and cardiovascular disease: epidemiologic evidence for a
relationship. Sleep.;23 Suppl 4:S122-1226.
5. Xu W, Chen H, Du K, Asahara H, Tini M, Emerson BM, Montminy
M, Evans RM. (2001) A transcriptional switch mediated by cofactor
methylation .Science. 294 (5551):2507-11. Epub.
6. Modrek, B. et al. (2001). Genome-wide detection of alternative
splicing in expressed sequences of human genes. Nucleic Acids Res.
29, 2850–2859.
7. Evertsz EM, Au-Young J, Ruvolo MV, Lim AC, Reynolds MA.
Hybridization cross-reactivity within homologous gene families on
glass cDNA microarrays (2001 Nov). Biotechniques. 31 (5): 1182-
1184, 1186 passim.
8. Kane MD, Jatkoe TA, Stumpf CR, Lu J, Thomas JD, Madore SJ
(2000 Nov). Assessment of the sensitivity and specificity of
11
oligonucleotide (50mer) microarrays. Nucleic Acids Res.15; 28
(22):4552-4557.
9. Hughes TR, Mao M, Jones AR, Burchard J, Marton MJ, Shannon
KW, Lefkowitz SM, Ziman M, Schelter JM, Meyer MR, Kobayashi S,
Davis C, Dai H, He YD, Stephaniants SB, Cavet G, Walker WL, West
A, Coffey E, Shoemaker DD, Stoughton R, Blanchard AP, Friend SH,
Linsley PS. Expression profiling using microarrays fabricated by an
ink-jet oligonucleotide synthesizer (2001 Apr). Nat 19 (4):342-347.
10. Cheng-Chung Chou, Chun-Houh Chen, Te-Tsui Lee and Konan
Peck. Optimization of probe length and the number of probes per
gene for optimal microarray analysis of gene expression (2004).
Nucleic Acids Research, Vol. 32, No. 12 e99 doi:10.1093/nar/gnh099.
11. Li,F. and Stormo,G.D. (2001). Selection of optimal DNA oligos for
gene expression arrays. Bioinformatics, 17, 1067–1076.
12. Dago Dougba Noel (2012). Performance assessment of different
microarray designs using RNA-Seq as reference Id prodotto: 67051;
Id Ugov: 404537.
13. Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A,
Choisne N, Aubourg S, Vitulo N, Jubin C, Vezzi A, Legeai F,
Hugueney P, Dasilva C, Horner D, Mica E, Jublot D, Poulain J,
Bruyère C, Billault A, Segurens B, Gouyvenoux M, Ugarte E,
Cattonaro F, Anthouard V, Vico V, Del Fabbro C, Alaux M, Di
Gaspero G, Dumas V, Felice N, Paillard S, Juman I, Moroldo M,
Scalabrin S, Canaguier A, Le Clainche I, Malacrida G, Durand E,
Pesole G, Laucou V, Chatelet P, Merdinoglu D, Delledonne M,
Pezzotti M, Lecharny A, Scarpelli C, Artiguenave F, Pè ME, Valle G,
Morgante M, Caboche M, Adam-Blondon AF, Weissenbach J, Quétier
F, Wincker P (2007 Sep 27); French-Italian Public Consortium for
Grapevine Genome Characterization: The grapevine genome
sequence suggests ancestral hexaploidization in major angiosperm
phyla; 449 (7161): 463-467. Epub 2007 Aug 26. PMID: 17721507
PubMed – indexed for MEDLINE. Nature.
14. Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C,
Kingsmore SF, Schroth GP, Burge CB (2008). Alternative isoform
regulation in human tissue transcriptomes. Nature; 456:470–47.
[PubMed].
15. Cloonan N, Forrest AR, Kolle G, Gardiner BB, Faulkner GJ,
Brown MK, Taylor DF, Steptoe AL, Wani S, Bethel G, et al. Stem cell
transcriptome profiling via massive-scale mRNA sequencing (2008).
Nat. Methods; 5:613–619.
16. Song Liu, Lan Lin, Peng Jiang, Dan Wang and Yi Xing (2011
Jan). A comparison of RNA-Seq and high-density exon array for
detecting differential gene expression between closely related
species Nucleic Acids Res; 39(2): 578–588. doi: 10.1093/nar/gkq817
PMCID: PMC3025565.
17. Zhong Wang, Mark Gerstein, and Michael Snyder (2009 Jan).
RNA-Seq: a revolutionary tool for transcriptomics Nat Rev Genet; 10
(1):57–63. doi: 10.1038/nrg2484 PMCID: PMC2949280 NIHMSID:
NIHMS229948.
18. Sara Zenoni, Alberto Ferrarini, Enrico Giacomelli, Luciano
Xumerle, Marianna Fasoli, Giovanni Malerba, Diana Bellin, Mario
Pezzotti, and Massimo Delledonne (2010 April) Characterization of
Transcriptional Complexity duringBerry Development in Vitis vinifera
Using RNA-Seq Plant Physiology vol. 152 n° 4 1787-1795.
19. Irizarry, R. A. Bolstad BM, Collin F, Cope LM, Hobbs B, Speed
TP. Summaries of Affymetrix Gene-Chip probe level data (2003).
Nucleic Acids Res. 31, e15 (4).
20. Jean-Marie Rouillard, Michael Zuker and Erdogan Gulari
OligoArray 2.0: design of oligonucleotide probes for DNA
microarrays using a thermodynamic approach (2003 Jun). Nucleic
12
Acids Res. 15;31(12):3057-3062.
21. Anita Zamboni, Mariasole Di Carli, Flavia Guzzo, Matteo
Stocchero, Sara Zenoni, Alberto Ferrarini, Paola Tononi, Ketti Tofalli,
Angiola Desiderio, Kathryn S. Lilley, M. Enrico Pè, Eugenio
Benvenuto, Massimo Delledonne and Mario Pezzotti (2010
November). Identification of putative stage-specific grapevine berry
biomarkers and omics data integration into networks. Plant
Physiology. vol. 154 N°3 1439-1459.
22. John C. Marioni, Christopher E. Mason, Shrikant M. Mane, et al.
(2008 Jun 11). RNA-Seq: an assessment of technical reproducibility
and comparison with gene expression arrays. Genome Res.
18:1509-1517 doi:10.1101/gr.079558108.
Dago Dougba Noel et.al
23. Joseph D. Clarke and Ton Zhu (2006). Microarray analysis of the
transcriptomes as a stepping stone towards understanding biology
system: pratical consideration and perspectives. The plant biology
journal 45, 630-650
24. Hanley JA, McNeil BJ The meaning and use of the area under a
receiver operating characteristic (ROC) curve (1982 Apr). Radiology.
143(1):29-36.
25. Bradley KA, Boyd-Wickizer J, Powell SH, Burman ML (1998 Jul
8). Alcohol screening questionnaires in women: a critical review
JAMA. 280 (2):166-71.

Contenu connexe

Tendances

Applications of Single Cell Analysis
Applications of Single  Cell AnalysisApplications of Single  Cell Analysis
Applications of Single Cell AnalysisQIAGEN
 
Microarrays;application
Microarrays;applicationMicroarrays;application
Microarrays;applicationFyzah Bashir
 
Aug2014 abrf interlaboratory study plans
Aug2014 abrf interlaboratory study plansAug2014 abrf interlaboratory study plans
Aug2014 abrf interlaboratory study plansGenomeInABottle
 
Introduction to Microarray in Gene Expression studies
Introduction to Microarray in Gene Expression studiesIntroduction to Microarray in Gene Expression studies
Introduction to Microarray in Gene Expression studiesSarbesh D. Dangol
 
Molecular QC: Using Reference Standards in NGS Pipelines
Molecular QC: Using Reference Standards in NGS PipelinesMolecular QC: Using Reference Standards in NGS Pipelines
Molecular QC: Using Reference Standards in NGS PipelinesCandy Smellie
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGLong Pei
 
Applications of microarray
Applications of microarrayApplications of microarray
Applications of microarraysana shakeel
 
Applications of microarray
Applications of microarrayApplications of microarray
Applications of microarrayprateek kumar
 
In Vitro Analog of the Primitive Streak (ANIMATED)
In Vitro Analog of the Primitive Streak (ANIMATED)In Vitro Analog of the Primitive Streak (ANIMATED)
In Vitro Analog of the Primitive Streak (ANIMATED)Nikolay Turovets
 
Sequencing the Human TCRβ Repertoire on the Ion S5TM
Sequencing the Human TCRβ Repertoire on the Ion S5TMSequencing the Human TCRβ Repertoire on the Ion S5TM
Sequencing the Human TCRβ Repertoire on the Ion S5TMThermo Fisher Scientific
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput SequencingMark Pallen
 
Next-generation sequencing course, part 1: technologies
Next-generation sequencing course, part 1: technologiesNext-generation sequencing course, part 1: technologies
Next-generation sequencing course, part 1: technologiesJan Aerts
 

Tendances (18)

Applications of Single Cell Analysis
Applications of Single  Cell AnalysisApplications of Single  Cell Analysis
Applications of Single Cell Analysis
 
Microarrays;application
Microarrays;applicationMicroarrays;application
Microarrays;application
 
Aug2014 abrf interlaboratory study plans
Aug2014 abrf interlaboratory study plansAug2014 abrf interlaboratory study plans
Aug2014 abrf interlaboratory study plans
 
Microarray Analysis
Microarray AnalysisMicroarray Analysis
Microarray Analysis
 
Introduction to Microarray in Gene Expression studies
Introduction to Microarray in Gene Expression studiesIntroduction to Microarray in Gene Expression studies
Introduction to Microarray in Gene Expression studies
 
Molecular QC: Using Reference Standards in NGS Pipelines
Molecular QC: Using Reference Standards in NGS PipelinesMolecular QC: Using Reference Standards in NGS Pipelines
Molecular QC: Using Reference Standards in NGS Pipelines
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEG
 
DNA MICROARRAY
DNA MICROARRAYDNA MICROARRAY
DNA MICROARRAY
 
Applications of microarray
Applications of microarrayApplications of microarray
Applications of microarray
 
Applications of microarray
Applications of microarrayApplications of microarray
Applications of microarray
 
Microarray
MicroarrayMicroarray
Microarray
 
In Vitro Analog of the Primitive Streak (ANIMATED)
In Vitro Analog of the Primitive Streak (ANIMATED)In Vitro Analog of the Primitive Streak (ANIMATED)
In Vitro Analog of the Primitive Streak (ANIMATED)
 
Sequencing the Human TCRβ Repertoire on the Ion S5TM
Sequencing the Human TCRβ Repertoire on the Ion S5TMSequencing the Human TCRβ Repertoire on the Ion S5TM
Sequencing the Human TCRβ Repertoire on the Ion S5TM
 
DNA microarray
DNA microarrayDNA microarray
DNA microarray
 
Microarray
MicroarrayMicroarray
Microarray
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput Sequencing
 
Microarray full detail
Microarray full detailMicroarray full detail
Microarray full detail
 
Next-generation sequencing course, part 1: technologies
Next-generation sequencing course, part 1: technologiesNext-generation sequencing course, part 1: technologies
Next-generation sequencing course, part 1: technologies
 

En vedette

COBOL DB2 BATCH EXAMPLE-RPR6520
COBOL DB2 BATCH EXAMPLE-RPR6520COBOL DB2 BATCH EXAMPLE-RPR6520
COBOL DB2 BATCH EXAMPLE-RPR6520Jon Fortman
 
Профессиональное управление распределенными проектами
Профессиональное управление распределенными проектамиПрофессиональное управление распределенными проектами
Профессиональное управление распределенными проектамиEvgeny Savitsky
 
Jacobijn van etten lessen luchtgroen
Jacobijn van etten lessen luchtgroenJacobijn van etten lessen luchtgroen
Jacobijn van etten lessen luchtgroenkrachtgroen
 
Principles of Social Media Marketing - Andrew Cavallaro
Principles of Social Media Marketing - Andrew CavallaroPrinciples of Social Media Marketing - Andrew Cavallaro
Principles of Social Media Marketing - Andrew CavallaroMarketing Success
 
Особенности продажи IT товаров и услуг
Особенности продажи IT товаров и услугОсобенности продажи IT товаров и услуг
Особенности продажи IT товаров и услугHanna Kuleshova
 
How to Standardize Organization-Wide Data Collection
How to Standardize Organization-Wide Data CollectionHow to Standardize Organization-Wide Data Collection
How to Standardize Organization-Wide Data CollectionQualtrics
 
PresentationPPT
PresentationPPTPresentationPPT
PresentationPPTPoonam_N
 

En vedette (12)

COBOL DB2 BATCH EXAMPLE-RPR6520
COBOL DB2 BATCH EXAMPLE-RPR6520COBOL DB2 BATCH EXAMPLE-RPR6520
COBOL DB2 BATCH EXAMPLE-RPR6520
 
Gr lrw102015
Gr lrw102015Gr lrw102015
Gr lrw102015
 
IFC_BLF
IFC_BLFIFC_BLF
IFC_BLF
 
Infantil c
Infantil cInfantil c
Infantil c
 
Updated Resume
Updated ResumeUpdated Resume
Updated Resume
 
Nfl classificação
Nfl classificaçãoNfl classificação
Nfl classificação
 
Профессиональное управление распределенными проектами
Профессиональное управление распределенными проектамиПрофессиональное управление распределенными проектами
Профессиональное управление распределенными проектами
 
Jacobijn van etten lessen luchtgroen
Jacobijn van etten lessen luchtgroenJacobijn van etten lessen luchtgroen
Jacobijn van etten lessen luchtgroen
 
Principles of Social Media Marketing - Andrew Cavallaro
Principles of Social Media Marketing - Andrew CavallaroPrinciples of Social Media Marketing - Andrew Cavallaro
Principles of Social Media Marketing - Andrew Cavallaro
 
Особенности продажи IT товаров и услуг
Особенности продажи IT товаров и услугОсобенности продажи IT товаров и услуг
Особенности продажи IT товаров и услуг
 
How to Standardize Organization-Wide Data Collection
How to Standardize Organization-Wide Data CollectionHow to Standardize Organization-Wide Data Collection
How to Standardize Organization-Wide Data Collection
 
PresentationPPT
PresentationPPTPresentationPPT
PresentationPPT
 

Similaire à Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression)

Corrected 2e-5
Corrected 2e-5Corrected 2e-5
Corrected 2e-5Dago Noel
 
Corrected 2e-5
Corrected 2e-5Corrected 2e-5
Corrected 2e-5Dago Noel
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationIJAEMSJORNAL
 
Systems biology for Medicine' is 'Experimental methods and the big datasets
Systems biology for Medicine' is 'Experimental methods and the big datasetsSystems biology for Medicine' is 'Experimental methods and the big datasets
Systems biology for Medicine' is 'Experimental methods and the big datasetsimprovemed
 
DNA micro array by kk sahu
DNA micro array by kk sahuDNA micro array by kk sahu
DNA micro array by kk sahuKAUSHAL SAHU
 
Nucleic acid microarray.pptx
Nucleic acid microarray.pptxNucleic acid microarray.pptx
Nucleic acid microarray.pptxashharnomani
 
DNA microarray ppt
DNA microarray pptDNA microarray ppt
DNA microarray pptMohit Kohli
 
DNA microarray
DNA microarrayDNA microarray
DNA microarrayS Rasouli
 
Biotechnological approaches in aquatic animal health management
Biotechnological approaches in aquatic animal health managementBiotechnological approaches in aquatic animal health management
Biotechnological approaches in aquatic animal health managementThavasimuthu citarasu
 
Restriction mapping
Restriction mappingRestriction mapping
Restriction mappingArdraArdra1
 
RNA Sequencing Research
RNA Sequencing ResearchRNA Sequencing Research
RNA Sequencing ResearchTanmay Ghai
 
Molecular marker and its application in breed improvement and conservation.docx
Molecular marker and its application in breed improvement and conservation.docxMolecular marker and its application in breed improvement and conservation.docx
Molecular marker and its application in breed improvement and conservation.docxTrilokMandal2
 

Similaire à Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression) (20)

Corrected 2e-5
Corrected 2e-5Corrected 2e-5
Corrected 2e-5
 
Corrected 2e-5
Corrected 2e-5Corrected 2e-5
Corrected 2e-5
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferation
 
Seminar
SeminarSeminar
Seminar
 
Systems biology for Medicine' is 'Experimental methods and the big datasets
Systems biology for Medicine' is 'Experimental methods and the big datasetsSystems biology for Medicine' is 'Experimental methods and the big datasets
Systems biology for Medicine' is 'Experimental methods and the big datasets
 
DNA micro array by kk sahu
DNA micro array by kk sahuDNA micro array by kk sahu
DNA micro array by kk sahu
 
Nucleic acid microarray.pptx
Nucleic acid microarray.pptxNucleic acid microarray.pptx
Nucleic acid microarray.pptx
 
DNA microarray ppt
DNA microarray pptDNA microarray ppt
DNA microarray ppt
 
DNA microarray
DNA microarrayDNA microarray
DNA microarray
 
Microarray
Microarray Microarray
Microarray
 
project
projectproject
project
 
DNA microarray
DNA microarrayDNA microarray
DNA microarray
 
Transcriptomics approaches
Transcriptomics approachesTranscriptomics approaches
Transcriptomics approaches
 
Biotechnological approaches in aquatic animal health management
Biotechnological approaches in aquatic animal health managementBiotechnological approaches in aquatic animal health management
Biotechnological approaches in aquatic animal health management
 
Restriction mapping
Restriction mappingRestriction mapping
Restriction mapping
 
RNA Sequencing Research
RNA Sequencing ResearchRNA Sequencing Research
RNA Sequencing Research
 
Molecular marker and its application in breed improvement and conservation.docx
Molecular marker and its application in breed improvement and conservation.docxMolecular marker and its application in breed improvement and conservation.docx
Molecular marker and its application in breed improvement and conservation.docx
 
MICROARRAY
MICROARRAYMICROARRAY
MICROARRAY
 
MGG2003-cDNA-AFLP
MGG2003-cDNA-AFLPMGG2003-cDNA-AFLP
MGG2003-cDNA-AFLP
 
RapportHicham
RapportHichamRapportHicham
RapportHicham
 

Plus de Dago Noel

s12864-015-1541-1
s12864-015-1541-1s12864-015-1541-1
s12864-015-1541-1Dago Noel
 
translational_medecine_56
translational_medecine_56translational_medecine_56
translational_medecine_56Dago Noel
 
Original_published_Paper_Dago_et_al_2015_IJRSR_3395
Original_published_Paper_Dago_et_al_2015_IJRSR_3395Original_published_Paper_Dago_et_al_2015_IJRSR_3395
Original_published_Paper_Dago_et_al_2015_IJRSR_3395Dago Noel
 
15_paper_ijcas
15_paper_ijcas15_paper_ijcas
15_paper_ijcasDago Noel
 
translational_medecine_56
translational_medecine_56translational_medecine_56
translational_medecine_56Dago Noel
 
12864_2015_1541_OnlinePDF
12864_2015_1541_OnlinePDF12864_2015_1541_OnlinePDF
12864_2015_1541_OnlinePDFDago Noel
 
Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression)
Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression)Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression)
Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression)Dago Noel
 
2944_IJDR_final_version
2944_IJDR_final_version2944_IJDR_final_version
2944_IJDR_final_versionDago Noel
 
2944_IJDR_final_version
2944_IJDR_final_version2944_IJDR_final_version
2944_IJDR_final_versionDago Noel
 
12864_2015_1541_OnlinePDF
12864_2015_1541_OnlinePDF12864_2015_1541_OnlinePDF
12864_2015_1541_OnlinePDFDago Noel
 

Plus de Dago Noel (10)

s12864-015-1541-1
s12864-015-1541-1s12864-015-1541-1
s12864-015-1541-1
 
translational_medecine_56
translational_medecine_56translational_medecine_56
translational_medecine_56
 
Original_published_Paper_Dago_et_al_2015_IJRSR_3395
Original_published_Paper_Dago_et_al_2015_IJRSR_3395Original_published_Paper_Dago_et_al_2015_IJRSR_3395
Original_published_Paper_Dago_et_al_2015_IJRSR_3395
 
15_paper_ijcas
15_paper_ijcas15_paper_ijcas
15_paper_ijcas
 
translational_medecine_56
translational_medecine_56translational_medecine_56
translational_medecine_56
 
12864_2015_1541_OnlinePDF
12864_2015_1541_OnlinePDF12864_2015_1541_OnlinePDF
12864_2015_1541_OnlinePDF
 
Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression)
Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression)Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression)
Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression)
 
2944_IJDR_final_version
2944_IJDR_final_version2944_IJDR_final_version
2944_IJDR_final_version
 
2944_IJDR_final_version
2944_IJDR_final_version2944_IJDR_final_version
2944_IJDR_final_version
 
12864_2015_1541_OnlinePDF
12864_2015_1541_OnlinePDF12864_2015_1541_OnlinePDF
12864_2015_1541_OnlinePDF
 

Dago et al. 2014 (Probes Specificity, Microarray, RNA-Seq, Gene Expression)

  • 1. African Journal of Science and Research,2014,(3)5:08-12 ISSN: 2306-5877 Available Online: http://ajsr.rstpublishers.com/ PROBES SPECIFICITY IN ARRAY DESIGN INFLUENCES THE AGREEMENT BETWEEN MICROARRAY AND RNA-Seq IN GENE EXPRESSION ANALYSIS. Dago Dougba Noel 1,3*, Alberto Ferrarini 3, Diarassouba Nafan 1, Fofana Inza Jésus 1, Silué Souleymane 1, Giovanni Malerba 2 andMassimo Delledonne 3 1)UPGC University of Korhogo Cote d’Ivoire Unité Formation Recherche Sciences Biologiques, Côte d'Ivoire. 2 )Department of Life and Reproduction sciences, Section of Biology and Genetics, University of Verona, Italy. 3) Department of Biotechnology, University of Verona, Italy Strada le Grazie 15, Cà vignal 1,Italy. Email:dgnoel7@gmail.com Received:04, Aug,2014 Accepted: 19 ,Sep,2014 Abstract More than a decade, oligonucleotide microarrays have been the method of choice for transcriptional profiling studies, used to characterize biological systems. The power of microarray platforms depends on the number, identity and specificity of the oligonucleotide probes for their target gene models. In recent time, however, researchers have been increasingly focusing high-throughput sequencing RNA-Seq which offers advantages when examining transcriptome fine structure; for example, in the detection of allele-specific expression and splice junctions. We are investigating how the specificity of oligonucleotide probes in an array design influences the agreement between RNA-Seq and microarray platforms in gene expression, differential analysis. Hence, we are essaying the agreement between a custom microarray platform, based on multiple long oligonucleotide probes (60 mer) per gene model transcript (Grape custom microarray platform), and RNA-Seq, by removing microarray, design less specific probes discriminating differential expressed genes (DEGs), analyzing two Vitis vinifera berry developmental stages. We were able to demonstrate that the agreement between both RNA-Seq and microarray platforms calling DEGs, depend on the high rate of probes set with specific oligonucleotide probes. Furthermore, this investigation confirmed the superiority of RNA-Seq next generation sequencing (NGS) technology, as opposed to microarray in gene expression differential analysis. Keywords: Microarray, Probe set, oligonucleotide probe, RNA-Seq. INTRODUCTION Microarray technology has revolutionized molecular biology. It uses hundreds of millions of highly organized probes on a limited solid surface to simultaneously interrogate the multiple RNA or DNA molecules, defined as target within an individual sample [1]. The microarray technologies are universal tools that can be applied throughout the life sciences [2-4]. mRNA-expression profiling is the most frequent application. The principle behind of a microarray experiment, is that mRNAs from a given cell line or tissue is labeled sample, and hybridized in parallel to a large number of DNA sequences, it is then immobilized on a solid surface in an ordered array. The number of immobilized probes can vary from hundreds designed to monitor the expression of few specific genes to hundreds of thousands representing complete transcriptomes. Microarray expression technology is strongly influenced by the number, identity and quality (specificity) of the oligonucleotide probes on the array. Tracing and maintaining the identity of the thousands of probes requires an informatic system throughout the fabrication process. Gene-specific oligonucleotide probes are currently used in microarrays to avoid cross-hybridization of highly similar sequences. If the probes are not optimized for sequence specificity, microarrays can generate false-positive data due to non-specific crosshybridization to highly similar sequences, gene families [5], or alternatively spliced variants [6]. Long oligonucleotide probes are prone to cross-hybridization and thus often exhibit poor discrimination and hybridize to similar sequences. Studies have suggested that the percentage sequence homology is a reasonable predictor of cross-hybridization [7] and to overcome this crosshybridization problem, several laboratories have adopted the practice of designing oligonucleotide probes that target regions of low sequence similarity [8]. The literature data [9] indicate that longer oligonucleotides provide significantly better detection sensitivity than shorter probes. Single or multiple probes per genes can be designed. Cheng-Chung Chou et al. [10] demonstrated that a single longer oligonucleotide probe for a gene model could be sufficient for accurate expression measurement if the probe is validated experimentally. However, oligonucleotide probes binding to different regions of a gene yield different signal intensities [1], and it is difficult to predict whether an oligonucleotide probe will bind efficiently to its target sequence and yield a good hybridization signal on the basis of sequence information alone [11]. As a result, multiple probes per gene model transcript have been used in oligonucleotide array designs to obtain reliable quantitative information of gene expression. Furthermore it has been shown that the measurement bias decreased with an increase in the number of probes per gene. Fewer probes per gene were required for the longer probes to achieve the same bias reduction as shorter probes. As reported by Dago N., 2012 work [12], Grape custom microarray designs based on multiple longer probes (60 mer) per gene model transcript exhibited a higher sensitivity than their corresponding microarray designs based on shorter multiple probes (35-40 mer) per gene model transcript in detecting DEGs. The same investigation and unpublished data form the Functional Genomic Center of the university of Verona-Italy showed that as regards this microarray design, several probe sets from 6538 genes out of 29582 demonstrated a different behavior among themselves; a considerable number of these genes (4146 in total) claimed to be DEGs. Thus, in order to determine the specificity of the analyzed oligonucleotide probes, a blast analysis with no mismatch by setting Tm ≥ 85±15°C threshold to predict oligonucleotide probes secondary structure was performed against all represented gene model transcripts of Vitis vinifera grape 12x assembly transcriptome [13]. Next, we selected the specific probes (oligonucleotide probes and/or probe set that recognized only their gene model transcript at a
  • 2. African Journal of Science and Research , 2014,(3)5:08-12 Tm ≥ 85±15°C with no mismatch) from each analyzed probe set and compared the gene expression data of their corresponding genes with those of RNA-Seq. In recent times, RNA-Seq has emerged as a powerful new technology for transcriptome analysis [14-15]. Although RNA-Seq is still a technology under active development, it offers several key advantages over microarray technology. First, unlike hybridization-based approaches, RNA-Seq is not limited to detecting transcripts that correspond to existing genomic sequence. This makes RNA-Seq particularly attractive for non model organisms with genomic sequences that are yet to be determined. RNA-Seq can reveal the precise location of transcription boundaries, to a single-base resolution. Furthermore, several studies showed that RNA-Seq can accurately reveal gene expression difference [16]. These factors make RNA-Seq useful for studying complex transcriptomes [17]. Hence, we explored the number of specific and less specific oligonucleotide probes (i) when both RNA-Seq and microarray platforms agree in discriminating DEGs; (ii) when DEGs have been discriminated exclusively by RNA-Seq, and/or (iii) by microarray. MATERIALS AND METHODS Samples from two development stages of Vitis vinifera grape (veraison and ripening) have been profiled for global gene expression using Grape custom microarray platform based on multiple (4) longer oligonucleotide probes (60 mer) for the genes essayed [12]. The same samples were previously profiled using NGS technology (RNA-Seq) as reported in Zenoni et al., 2010 [18]. Microarrays expression data have been pre-processed by RMA module [19] and analyzed by limma moderated statistical t-test [12]. Agreement assessment between microarray and RNA-Seq platforms in gene expression differential analysis have been measured in term of oligonucleotide probes specificity. To that effect, we employed a thermodynamic approach based on OligoArray 2.0 software [20] to select probes that exhibited a melting temperature (Tm) ≥85±15°C as threshold to predict oligonucleotide probes secondary structure. Next, the selected probes were processed for local alignment analysis (megaBLAST http://www.ncbi.nlm.nih.gov/blast/Blast) against Vitis vinifera grape 12x assembly transcriptome [13]. Then oligonucleotide probes that recognized exclusively their gene model transcript with no mismatch at Tm ≥ 85±15 have been classified as specific. RNA preparation Sample of Vitis vinifera (grape) berry at the development stages of veraison and ripening were collected [18] and total RNA has been extracted as described in Anita Zamboni et al., 2010 work [21]. The RNA amount and integrity were essayed using a Nanodrop 2000 instrument (Thermo Scientific) and a Bio-analyzer Chip RNA 6000 (Agilent), respectively. Hybridization of Grape custom array based on long multiple probes per gene Grape custom array hybridization experiment based on One Color-DNA labeling system with Cy3 fluorescent performed by processing the same total RNA amount (10μg) of the three analyzed technical replicates of veraison and ripening development stage [12]. RNA processing, labeling, hybridization and slide wash step were accomplished following the NimbleGen Arrays User’s Guide Gene Expression Analysis version 3.1 protocol manufacturer’s instructions. Hybridization image scanning have been performed with the aid of the axon scanner Instruments GenePix 4200A at 535 wave length. For Grape custom array design based on multiple long 9 oligonucleotide probes (60 mer) per gene model transcript, the summarization of the normalized intensities of the different probes per transcript was performed according to the Robust Multichip Average (RMA) algorithm from NimbleScan software [19]. RESULTS Rate of specific probes for DEGs detected by both microarray and RNA-Seq platforms In total 117999 probes from 29582 gene model transcripts of Grape custom microarray design were analyzed. 85.66% of them have been selected as specific for their respective gene model transcript (oligonucleotide probes that recognized exclusively their gene model transcript at a Tm ≥85±15°C with no mismatch). However, the survey of specific oligonucleotide probes of DEGs detected by (i) both microarray and RNA-Seq platforms, (ii) microarray platform exclusively and (iii) RNA-Seq platform exclusively have been performed on 17850 genes (expressed genes called by both Grape custom microarray and RNA-Seq) [12]. Considering microarray data for which, internal replicate probes have been summarized by RMA algorithm as described by Irizarry et al. 2003 work [19], 6542 genes have been detected as DEGs by both Grape custom array design and RNA-Seq platforms [12]. Focusing on these DEGs, 57.56% of the analyzed probe set exhibited 3 or 4 specific oligonucleotide probes against 42.44% with 2, 1 or 0 specific oligonucleotide probes (see Figure 1). This result suggests that the agreement between both microarray and RNA-Seq tools in gene expression differential analysis is carried by the high number of probe set with specific oligonucleotide probes. Figure 1 Percentage of probe set with their corresponding specific oligonucleotide probes for DEGs discriminated by RNA-Seq microarray and/or by both microarray and RNA-Seq platforms in gene expression differential analysis. Rate of specific probes for DEGs detected exclusively by microarray and/or RNA-Seq We next analyzed the percentage of specific oligonucleotide probes in Grape custom microarray design focusing on DEGs called exclusively by (i) microarray (3518 genes) and (ii) RNA-Seq (3053 gene) [12]. The results of this analysis have been summarized in Figure 1 (see above). In fact, for DEGs discriminated exclusively by RNA-Seq platform, 62.28% of the analyzed probe set exhibited 4 or 3 specific oligonucleotide probes against 37.72% with 2, 1 or 0 specific oligonucleotide probes as showed in Figure 1 (see above). These results suppose a large involvement of specific oligonucleotide probes in the discrimination of DEGs previously recognized as such by RNA-Seq tool exclusively. Further, for DEGs wholly discriminated by Grape custom microarray design, 53% of the analyzed probe set display 2, 1 or 0 specific oligonucleotide probes against 47% with 4 or 3 specific oligonucleotide probes. Hence, for DEGs called exclusively by Grape custom microarray design, the number of
  • 3. 10 specific oligonucleotide probes was comparable with those less specific oligonucleotide probes, supporting an involvement of a substantial number of less specific oligonucleotide probes in the discrimination of DEGs previously recognized as such by Grape custom microarray design exclusively. In view of the foregoing these analysis suggest that disagreement between RNA-Seq and microarray technologies in gene expression differential analysis could be due to the considerable number of less specific oligonucleotide probes in the Grape custom microarray design. Moreover, these results suggest the high specificity and sensitivity of RNA-Seq in detecting DEGs in gene expression differential analysis [22]. Correlation between Grape custom microarray and RNA-Seq Level of gene expression in microarray depend on both spot intensity (transcript abundance) and oligonucleotide probes specificity (cross-hybridization) in signal detection. Consequently, oligonucleotide probes intensity in microarray experiment cannot represent the absolute gene expression level and is not suitable for analysis across different platforms. We then introduced a fold change (FC) parameter. There, FC parameter has been calculated as the ratio of the measured genes expression values of ripening sample to the genes expression values of véraison sample. Pearson correlation of the 17850 expressed genes between Grape custom microarray design and RNA-Seq platforms in FC measurement have been previously estimated at 0.72 (R2=0.72) [12]. The deletion of less specific oligonucleotide probes out the set of analyzed probes of Grape custom microarray design allows an increase in correlation between both microarray and RNA-Seq platforms in FC measurement analysis (from R2=0.72 to R2=0.90); this shows that microarray designed oligonucleotide probes could be best selected by the integration of both oligonucleotide probe design strategy based on score and thermodynamic parameters. This analysis also proves that a good selection of specific oligonucleotide probes in Grape custom microarray design improves strongly its agreement with RNA-Seq in gene expression differential analysis. Assessment of microarray accuracy and specificity by using specific probes in array design To further explore the agreement between microarray and RNA- Seq in gene expression differential analysis by selecting the Grape custom microarray specific oligonucleotide probes, a Receiver Operating Characteristics (ROC) curve was constructed for the analyzed microarray design (Grape custom microarray design) assuming RNA-Seq gene expression data set as reference. Each point on the ROC curve of the Grape custom array represents the sensitivity on Y-axis (True Positive Rate) and the specificity on X- axis (False Positive Rate). Deletion of less specific oligonucleotide probes out the Grape custom microarray design enlarged the area under the curve (AUC from 0.70 to 90.4%) as showed in Figure 2. These results demonstrated that less specific oligonucleotide probes in a microarray design give false signal detection in gene expression level measurement. By contrast, a high proportion of specific oligonucleotide probes in microarray design increased it specificity and accuracy in the discrimination of DEGs when RNA- Seq gene expression data was assumed as reference. Taking together, these analysis showed that Grape custom microarray platform enriched by gene target specific oligonucleotide probes exhibits a quite similar performance (specificity and accuracy) to RNA-Seq discriminating DEGs in gene expression differential analysis (see Figure 2). Dago Dougba Noel et.al Figure 2 ROC curve of 17850 genes of Grape custom array platform in detecting DEGs with all oligonucleotide probes (left) and with only specific oligonucleotide probes (right). RNA-Seq gene expression data have been used as reference. DISCUSSION AND CONCLUSION The recent development of transcriptomic approaches based on Next Generation Sequencing (NGS) is gaining popularity as they provide a genome-wide, precise, quantitative measure of gene expression. Although RNA-Seq is still a technology under active development, it offers several key advantages over microarray technology. Furthermore, several studies showed that RNA-Seq can accurately reveal gene expression difference [16]. However, because of the extensive legacy of the data and installed instrument base, microarrays may still be widely used in the foreseeable future. Although microarrays have been extensively used as discovery tools for biological and biomedical studies, the challenge remains whether this technology can be applied reliably in clinical practice and regulatory decision making, where high precision and accuracy in performance are required. Moreover, the power of this technology depends on the number, identity and quality of the oligonucleotide probes [23]. In the present paper we investigated how oligonucleotide probes specificity in Grape custom microarray design based on longer multiple oligonucleotide probes per gene model transcript could influence its reliability in gene expression analysis. Hence, we established a relationship between DEGs indentified by (i) Grape custom microarray design, (ii) RNA-Seq or (iii) both platforms and their respective specific oligonucleotide probes number. However, oligonucleotide probes that recognized exclusively their gene model transcript at a Tm ≥85±15°C with no mismatch have been classified as specific. Our findings showed an involvement of a substantial number of specific oligonucleotide probes in the discrimination of DEGs previously recognized as such by both microarray and RNA-Seq platforms suggesting that accord between the two analyzed gene expression platforms (Grape custom microarray design and RNA- Seq) can be improved by increasing the ratio of specific oligonucleotide probes in microarray design. Next, we showed that more specific oligonucleotide probes were involved in the discrimination of DEGs previously recognized as such by RNA-Seq platform. By contrast a similar percentage of specific and less specific oligonucleotide probes claimed to be associated with DEGs exclusively detected by the Grape custom microarray design. Considered as a whole, these results support a higher specificity of RNA-Seq technology in the discrimination of DEGs in gene expression differential analysis [22]. Furthermore, the deletion of less specific oligonucleotide probes from the set of analyzed oligonucleotide probes of Grape custom microarray design, improved the correlation between both microarray and RNA-Seq platforms in gene expression differential analysis FC measurement showing that
  • 4. African Journal of Science and Research , 2014,(3)5:08-12 for oligonucleotide probes that are not optimized for sequence specificity, microarrays can generate false positive data due to non specific cross-hybridization to highly similar sequences gene families [5-6]. The ROC curve is a two-dimensional depiction of classifier performance. To compare classifiers we may want to reduce ROC performance to a single scalar value representing expected performance. A common method is to calculate the area under the ROC curve, abbreviated AUC [24-25]. Indeed, ROC curve analysis performed in this investigation suggested that an accurate selection of oligonucleotide probes by integrating a thermodynamic parameter, (selection of strongly specific oligonucleotide probe) allowed the Grape custom microarray design to achieve a quite similar performance to that of the RNA-Seq in calling DEGs in gene expression differential analysis. Therefore, we demonstrated that microarray designed oligonucleotide probes could be best selected by the integration of thermodynamic parameters [20]. Moreover, our investigation showed that discrepancy between RNA-Seq and microarray platforms in gene expression differential analysis could be due to the presence of less specific oligonucleotide probes in microarray design. As expected, we also showed that a high rate of specific oligonucleotide probes in a microarray design increase it specificity and accuracy in discriminating DEGs in gene expression differential analysis. Then, stringent selection of gene model target specific oligonucleotide probes based on score and thermodynamic parameters, improved strongly the performance of microarray technology in gene expression measurement analysis. In conclusion this investigation holds that microarray design enriched by gene target specific oligonucleotide probes exhibits a quite similar performances to that of RNA-Seq in gene expression differential analysis, suggesting that (i) the accurate detection of DEGs by microarray platform and that (ii) the agreement between both microarray and RNA-Seq gene expression platforms is strongly correlated with the best selection of gene target specific oligonucleotide probes based on thermodynamic parameters. References 1. Lockhart DJ, Dong H, Byrne MC, Follettie MT, Gallo MV, Chee MS, Mittmann M, Wang C, Kobayashi M, Horton H, Brown EL. (1996 Dec). Expression monitoring by hybridization to high-density oligonucleotide arrays. Nat Biotechnol.14 (13):1675-1680. 2. Brown PO, Botstein D. Exploring the new world of the genome with DNA microarrays (1999 Jan 21). Nat Genet. (1 Suppl):33-37. 3. Lockhart DJ, Winzeler EA (2000 Jun15) Genomics, gene expression and DNA arrays. Nature.405 (6788):827-836. 4. Young T, Peppard P. (2000 Jun15). Sleep-disordered breathing and cardiovascular disease: epidemiologic evidence for a relationship. Sleep.;23 Suppl 4:S122-1226. 5. Xu W, Chen H, Du K, Asahara H, Tini M, Emerson BM, Montminy M, Evans RM. (2001) A transcriptional switch mediated by cofactor methylation .Science. 294 (5551):2507-11. Epub. 6. Modrek, B. et al. (2001). Genome-wide detection of alternative splicing in expressed sequences of human genes. Nucleic Acids Res. 29, 2850–2859. 7. Evertsz EM, Au-Young J, Ruvolo MV, Lim AC, Reynolds MA. Hybridization cross-reactivity within homologous gene families on glass cDNA microarrays (2001 Nov). Biotechniques. 31 (5): 1182- 1184, 1186 passim. 8. Kane MD, Jatkoe TA, Stumpf CR, Lu J, Thomas JD, Madore SJ (2000 Nov). Assessment of the sensitivity and specificity of 11 oligonucleotide (50mer) microarrays. Nucleic Acids Res.15; 28 (22):4552-4557. 9. Hughes TR, Mao M, Jones AR, Burchard J, Marton MJ, Shannon KW, Lefkowitz SM, Ziman M, Schelter JM, Meyer MR, Kobayashi S, Davis C, Dai H, He YD, Stephaniants SB, Cavet G, Walker WL, West A, Coffey E, Shoemaker DD, Stoughton R, Blanchard AP, Friend SH, Linsley PS. Expression profiling using microarrays fabricated by an ink-jet oligonucleotide synthesizer (2001 Apr). Nat 19 (4):342-347. 10. Cheng-Chung Chou, Chun-Houh Chen, Te-Tsui Lee and Konan Peck. Optimization of probe length and the number of probes per gene for optimal microarray analysis of gene expression (2004). Nucleic Acids Research, Vol. 32, No. 12 e99 doi:10.1093/nar/gnh099. 11. Li,F. and Stormo,G.D. (2001). Selection of optimal DNA oligos for gene expression arrays. Bioinformatics, 17, 1067–1076. 12. Dago Dougba Noel (2012). Performance assessment of different microarray designs using RNA-Seq as reference Id prodotto: 67051; Id Ugov: 404537. 13. Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, Vezzi A, Legeai F, Hugueney P, Dasilva C, Horner D, Mica E, Jublot D, Poulain J, Bruyère C, Billault A, Segurens B, Gouyvenoux M, Ugarte E, Cattonaro F, Anthouard V, Vico V, Del Fabbro C, Alaux M, Di Gaspero G, Dumas V, Felice N, Paillard S, Juman I, Moroldo M, Scalabrin S, Canaguier A, Le Clainche I, Malacrida G, Durand E, Pesole G, Laucou V, Chatelet P, Merdinoglu D, Delledonne M, Pezzotti M, Lecharny A, Scarpelli C, Artiguenave F, Pè ME, Valle G, Morgante M, Caboche M, Adam-Blondon AF, Weissenbach J, Quétier F, Wincker P (2007 Sep 27); French-Italian Public Consortium for Grapevine Genome Characterization: The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla; 449 (7161): 463-467. Epub 2007 Aug 26. PMID: 17721507 PubMed – indexed for MEDLINE. Nature. 14. Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C, Kingsmore SF, Schroth GP, Burge CB (2008). Alternative isoform regulation in human tissue transcriptomes. Nature; 456:470–47. [PubMed]. 15. Cloonan N, Forrest AR, Kolle G, Gardiner BB, Faulkner GJ, Brown MK, Taylor DF, Steptoe AL, Wani S, Bethel G, et al. Stem cell transcriptome profiling via massive-scale mRNA sequencing (2008). Nat. Methods; 5:613–619. 16. Song Liu, Lan Lin, Peng Jiang, Dan Wang and Yi Xing (2011 Jan). A comparison of RNA-Seq and high-density exon array for detecting differential gene expression between closely related species Nucleic Acids Res; 39(2): 578–588. doi: 10.1093/nar/gkq817 PMCID: PMC3025565. 17. Zhong Wang, Mark Gerstein, and Michael Snyder (2009 Jan). RNA-Seq: a revolutionary tool for transcriptomics Nat Rev Genet; 10 (1):57–63. doi: 10.1038/nrg2484 PMCID: PMC2949280 NIHMSID: NIHMS229948. 18. Sara Zenoni, Alberto Ferrarini, Enrico Giacomelli, Luciano Xumerle, Marianna Fasoli, Giovanni Malerba, Diana Bellin, Mario Pezzotti, and Massimo Delledonne (2010 April) Characterization of Transcriptional Complexity duringBerry Development in Vitis vinifera Using RNA-Seq Plant Physiology vol. 152 n° 4 1787-1795. 19. Irizarry, R. A. Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP. Summaries of Affymetrix Gene-Chip probe level data (2003). Nucleic Acids Res. 31, e15 (4). 20. Jean-Marie Rouillard, Michael Zuker and Erdogan Gulari OligoArray 2.0: design of oligonucleotide probes for DNA microarrays using a thermodynamic approach (2003 Jun). Nucleic
  • 5. 12 Acids Res. 15;31(12):3057-3062. 21. Anita Zamboni, Mariasole Di Carli, Flavia Guzzo, Matteo Stocchero, Sara Zenoni, Alberto Ferrarini, Paola Tononi, Ketti Tofalli, Angiola Desiderio, Kathryn S. Lilley, M. Enrico Pè, Eugenio Benvenuto, Massimo Delledonne and Mario Pezzotti (2010 November). Identification of putative stage-specific grapevine berry biomarkers and omics data integration into networks. Plant Physiology. vol. 154 N°3 1439-1459. 22. John C. Marioni, Christopher E. Mason, Shrikant M. Mane, et al. (2008 Jun 11). RNA-Seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 18:1509-1517 doi:10.1101/gr.079558108. Dago Dougba Noel et.al 23. Joseph D. Clarke and Ton Zhu (2006). Microarray analysis of the transcriptomes as a stepping stone towards understanding biology system: pratical consideration and perspectives. The plant biology journal 45, 630-650 24. Hanley JA, McNeil BJ The meaning and use of the area under a receiver operating characteristic (ROC) curve (1982 Apr). Radiology. 143(1):29-36. 25. Bradley KA, Boyd-Wickizer J, Powell SH, Burman ML (1998 Jul 8). Alcohol screening questionnaires in women: a critical review JAMA. 280 (2):166-71.