SlideShare une entreprise Scribd logo
1  sur  14
Télécharger pour lire hors ligne
A call for the creation of a



   Human long ncRNA Clone Set

   Matthias Harbers - January 2012




Matthias Harbers                     1
Long non-coding mRNAs
  The dogma that an “mRNA has to encode a Protein” directed the creation
   of protein coding clone collections

  The Mammalian Gene Collection (MGC) and ORFeome Collaboration
   (OC) focused on the cloning of protein coding transcripts

  However, by now it is generally accepted many mRNAs are non-coding
   transcripts that exercise their functions by other, mostly unknown,
   mechanisms

  All clone collections are incomplete because of the use of oligo-dT priming

  However, may be up to 40% or more of the mRNAs could lack
   polyadenylation and hence are not covered by classical cDNA libraries

  Many non-polyadenylated mRNAs could be non-coding mRNAs
Matthias Harbers                                                                 2
Research is driven by sequencing
   High-speed sequencing changed the way genomic research is done

   High-speed sequencing has an unmatched power in “discovery”

   However, expressed sequences need confirmation by other means

   New transcripts have to be analyzed for their functions

   Loss-of-function studies became easier with RNAi knock-down
    experiments

   However, gain-of-function experiments are essential to understand
    mechanisms and functions

   We should have the resources needed to study the functions of
    ncRNAs!
Matthias Harbers                                                        3
“Real” versus “predicted”
   Public databases like NCBI and others use “reference sequences” for
    transcripts

   This implies that there is only one transcript per gene!

   Reference sequences ignore splicing!

   Actual cDNA clones in the public domain often do not match reference
    sequences

   High-speed sequencing data provide more “predicted sequences” based on
    the assembly of short reads into contigs or alike

   Predicted sequences are not experimentally verified!

   Cloned cDNAs have at least an “experimental origin”!

Matthias Harbers                                                             4
The “Knowledge Cycle”
                                           Creation of genomic resources and data:
                                           Public databases



Functional Studies:
Gene annotation




                                                                Find the clones you need:
                                                            Clone Distribution Services


     Physical resources are needed to bring “life” to in silico data!

Matthias Harbers                                                                      5
What is needed?
    Define what are long non-coding mRNAs (ongoing in the community)

    Description of human non-coding mRNA set

    Consent on the features of human non-coding mRNA set

    Starting materials available in the community?

    New starting materials required?

    Build consortium to build human non-coding mRNA clone set

    Consider non-coding mRNA collections from other organisms

    Small non-coding RNA are not considered here because they are in part
     already covered by some public collections (e.g. Netherlands Cancer
     Institute)
Matthias Harbers                                                             6
Features of non-coding mRNA set
   ORFeome Collaboration committed to Invitrogen Gateway system

   Broad Institute also uses Invitrogen Gateway system

   Suggestion to stay with Invitrogen Gateway system for ncRNA set?

   However, many clone customers do not want Invitrogen Gateway clones!

   Addition of restrictions sites could enable sub-cloning without use of the
    Invitrogen Gateway system

   For example Promega offers Flexi® Vectors using SgfI and PmeI

   Should the parental clones from cDNA libraries made available?

   Should the collection include splice variants?

   Are there special requirements we do not know of?
Matthias Harbers                                                                 7
Available starting materials
   Want to use high-quality full-length cDNA clones and libraries where
    possible!

   Human cDNA collections:
     • RIKEN ~311,000 human end-sequenced human cDNA clones in NCBI?
     • Other human cDNA clones: e.g. ORIGENE?

   Human full-length cDNA libraries in the public domain?

   I do not see gene synthesis based on predicted sequences as “general
    option”

   I prefer starting from cDNA libraries using “real transcripts”

   Classical cDNA libraries have not been sequenced deep enough to cover
    rare genes! There is an option to find more unique clones in old libraries

   Need new cDNA libraries/pools to cover important biological samples?
Matthias Harbers                                                                 8
New technologies will help
   In the past sequencing cost limiting factor for building clone collections

   Many clones in the public domain are not full-length sequenced

   Lack of sequence information limits clone annotation

   New high-speed sequencing methods can overcome limitation on
    sequencing cost

   Use high-speed sequencing instead of end-sequencing of individual clones
    to screen cDNA libraries more deeply

   Use high-speed sequencing to obtain full-length sequences of all clones
    within ncRNA collection

   Use high-speed sequencing to assure high quality standards of entire
    collection!
Matthias Harbers                                                                 9
RNA                       RNA


            cDNA Library           cDNA Library/cDNA Pool


            Clone Picking            Shotgun sequencing


          End Sequencing              Library Screening


         Annotated Clone              Clones for Targets
           Collection

    Limited by sequencing cost        Much higher coverage
  Redundancy in clone collection      Focus on new targets

Matthias Harbers                                             10
New starting materials required
   Many mRNAs lack polyadenylation and required new cloning method
                               Total RNA from cell


                             Removal of polyA mRNA


                          Ligation of 3’ adapter to mRNA


                         cDNA synthesis using 3’ adapter


                         Cap selection using Cap Trapper


                            Cloning into cDNA library

   Size does matter: Classical cDNA projects had a size cutoff
Matthias Harbers                                                      11
Build internet presence
   Any clone collection requires an internet presence with a database!

   Clone related information can only be provided by a database

   Annotation of the clones by reference to other resources is important

   Application notes and references could be a great capture for users

   Good documentation of the project needed

   Provide all clones to community without limitations on the rights to use
    (follow example of “Good Faith Agreement” of ORFeome Collaboration)

   ncRNAs may require “more” for a better understanding on how to
    study new mechanisms and functions!

   Become the “home” for research on ncRNAs!
Matthias Harbers                                                               12
Conclusion
   MGC and OC set standards for human clone resources!

   We should to build on the great record of MGC and OC to move from
    coding mRNAs to long non-coding mRNAs

   After most coding genes have been covered by at least one cDNA clone, we
    need to work on the non-coding transcripts to move forward

   Non-coding genes are essential players in life and we want to provide
    comprehensive resources for their study and analysis

   Starting with human ncRNAs will greatly benefit medical research

   Including ncRNAs from other organisms could be an option where those
    are key model systems to study ncRNA functions (RIKEN FANTOM clone
    set from mouse includes many long ncRNAs)

Matthias Harbers                                                               13
Matthias Harbers   14

Contenu connexe

Tendances

Tendances (20)

Ngs part i 2013
Ngs part i 2013Ngs part i 2013
Ngs part i 2013
 
NGS: bioinformatic challenges
NGS: bioinformatic challengesNGS: bioinformatic challenges
NGS: bioinformatic challenges
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential Expression
 
Ngs intro_v6_public
 Ngs intro_v6_public Ngs intro_v6_public
Ngs intro_v6_public
 
ECCMID 2015 Meet-The-Expert: Bioinformatics Tools
ECCMID 2015 Meet-The-Expert: Bioinformatics ToolsECCMID 2015 Meet-The-Expert: Bioinformatics Tools
ECCMID 2015 Meet-The-Expert: Bioinformatics Tools
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
 
Computational infrastructure for NGS data analysis
Computational infrastructure for NGS data analysisComputational infrastructure for NGS data analysis
Computational infrastructure for NGS data analysis
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
 
Exploring new frontiers with next-generation sequencing
Exploring new frontiers with next-generation sequencingExploring new frontiers with next-generation sequencing
Exploring new frontiers with next-generation sequencing
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment Design
 
Annotating nc-RNAs with Rfam
Annotating nc-RNAs with RfamAnnotating nc-RNAs with Rfam
Annotating nc-RNAs with Rfam
 
20150601 bio sb_assembly_course
20150601 bio sb_assembly_course20150601 bio sb_assembly_course
20150601 bio sb_assembly_course
 
Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
 Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ... Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
 
White Paper: Next-Generation Genome Sequencing Using EMC Isilon Scale-Out NAS...
White Paper: Next-Generation Genome Sequencing Using EMC Isilon Scale-Out NAS...White Paper: Next-Generation Genome Sequencing Using EMC Isilon Scale-Out NAS...
White Paper: Next-Generation Genome Sequencing Using EMC Isilon Scale-Out NAS...
 
Rna seq pipeline
Rna seq pipelineRna seq pipeline
Rna seq pipeline
 
Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2
 
Innovative NGS Library Construction Technology
Innovative NGS Library Construction TechnologyInnovative NGS Library Construction Technology
Innovative NGS Library Construction Technology
 
Knowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and VariantsKnowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and Variants
 
QIAseq Targeted DNA, RNA and Fusion Gene Panels
QIAseq Targeted DNA, RNA and Fusion Gene PanelsQIAseq Targeted DNA, RNA and Fusion Gene Panels
QIAseq Targeted DNA, RNA and Fusion Gene Panels
 
Analyzing Fusion Genes Using Next-Generation Sequencing
Analyzing Fusion Genes Using Next-Generation SequencingAnalyzing Fusion Genes Using Next-Generation Sequencing
Analyzing Fusion Genes Using Next-Generation Sequencing
 

Similaire à Call for non-coding mRNA resource

2012 10-24 - ngs webinar
2012 10-24 - ngs webinar2012 10-24 - ngs webinar
2012 10-24 - ngs webinar
Elsa von Licy
 
Dna library lecture-Gene libraries and screening
Dna library lecture-Gene libraries and screening  Dna library lecture-Gene libraries and screening
Dna library lecture-Gene libraries and screening
Abdullah Abobakr
 

Similaire à Call for non-coding mRNA resource (20)

Gene Libarries
Gene LibarriesGene Libarries
Gene Libarries
 
Approaches to cDNA Cloning and Analysis
Approaches to cDNA Cloning and AnalysisApproaches to cDNA Cloning and Analysis
Approaches to cDNA Cloning and Analysis
 
3 dna libraries
3 dna libraries3 dna libraries
3 dna libraries
 
genetic_engineering__unit_3__lecture_1.ppt
genetic_engineering__unit_3__lecture_1.pptgenetic_engineering__unit_3__lecture_1.ppt
genetic_engineering__unit_3__lecture_1.ppt
 
Gene library
Gene libraryGene library
Gene library
 
Genomic library
Genomic libraryGenomic library
Genomic library
 
Genomic library construction
Genomic library constructionGenomic library construction
Genomic library construction
 
Genomic and c dna library by Kailash Sontakke
Genomic and c dna library by Kailash SontakkeGenomic and c dna library by Kailash Sontakke
Genomic and c dna library by Kailash Sontakke
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
DNA Libraries PPT
DNA Libraries PPTDNA Libraries PPT
DNA Libraries PPT
 
Genomic and c dna library
Genomic and c dna libraryGenomic and c dna library
Genomic and c dna library
 
DNA library
DNA libraryDNA library
DNA library
 
Screening methods for cloned libraries
Screening methods for cloned librariesScreening methods for cloned libraries
Screening methods for cloned libraries
 
DNA library ppt.pptx
DNA library ppt.pptxDNA library ppt.pptx
DNA library ppt.pptx
 
Gene Libraries
Gene LibrariesGene Libraries
Gene Libraries
 
2012 10-24 - ngs webinar
2012 10-24 - ngs webinar2012 10-24 - ngs webinar
2012 10-24 - ngs webinar
 
Rna seq and chip seq
Rna seq and chip seqRna seq and chip seq
Rna seq and chip seq
 
construction of genomicc dna libraries
construction of genomicc dna librariesconstruction of genomicc dna libraries
construction of genomicc dna libraries
 
Dna library lecture-Gene libraries and screening
Dna library lecture-Gene libraries and screening  Dna library lecture-Gene libraries and screening
Dna library lecture-Gene libraries and screening
 
Bioinformatics.pptx
Bioinformatics.pptxBioinformatics.pptx
Bioinformatics.pptx
 

Dernier

Dernier (20)

VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
 
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
 
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Bareilly Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Bareilly Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 8250077686 Top Class Call Girl Service Available
 
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
 
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
 
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
 
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
 
O963O942363 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
O963O942363 Call Girls In Ahmedabad Escort Service Available 24×7 In AhmedabadO963O942363 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
O963O942363 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
 
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
 
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
 
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
 
Call Girls Varanasi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 8250077686 Top Class Call Girl Service Available
 

Call for non-coding mRNA resource

  • 1. A call for the creation of a Human long ncRNA Clone Set Matthias Harbers - January 2012 Matthias Harbers 1
  • 2. Long non-coding mRNAs  The dogma that an “mRNA has to encode a Protein” directed the creation of protein coding clone collections  The Mammalian Gene Collection (MGC) and ORFeome Collaboration (OC) focused on the cloning of protein coding transcripts  However, by now it is generally accepted many mRNAs are non-coding transcripts that exercise their functions by other, mostly unknown, mechanisms  All clone collections are incomplete because of the use of oligo-dT priming  However, may be up to 40% or more of the mRNAs could lack polyadenylation and hence are not covered by classical cDNA libraries  Many non-polyadenylated mRNAs could be non-coding mRNAs Matthias Harbers 2
  • 3. Research is driven by sequencing  High-speed sequencing changed the way genomic research is done  High-speed sequencing has an unmatched power in “discovery”  However, expressed sequences need confirmation by other means  New transcripts have to be analyzed for their functions  Loss-of-function studies became easier with RNAi knock-down experiments  However, gain-of-function experiments are essential to understand mechanisms and functions  We should have the resources needed to study the functions of ncRNAs! Matthias Harbers 3
  • 4. “Real” versus “predicted”  Public databases like NCBI and others use “reference sequences” for transcripts  This implies that there is only one transcript per gene!  Reference sequences ignore splicing!  Actual cDNA clones in the public domain often do not match reference sequences  High-speed sequencing data provide more “predicted sequences” based on the assembly of short reads into contigs or alike  Predicted sequences are not experimentally verified!  Cloned cDNAs have at least an “experimental origin”! Matthias Harbers 4
  • 5. The “Knowledge Cycle” Creation of genomic resources and data: Public databases Functional Studies: Gene annotation Find the clones you need: Clone Distribution Services  Physical resources are needed to bring “life” to in silico data! Matthias Harbers 5
  • 6. What is needed?  Define what are long non-coding mRNAs (ongoing in the community)  Description of human non-coding mRNA set  Consent on the features of human non-coding mRNA set  Starting materials available in the community?  New starting materials required?  Build consortium to build human non-coding mRNA clone set  Consider non-coding mRNA collections from other organisms  Small non-coding RNA are not considered here because they are in part already covered by some public collections (e.g. Netherlands Cancer Institute) Matthias Harbers 6
  • 7. Features of non-coding mRNA set  ORFeome Collaboration committed to Invitrogen Gateway system  Broad Institute also uses Invitrogen Gateway system  Suggestion to stay with Invitrogen Gateway system for ncRNA set?  However, many clone customers do not want Invitrogen Gateway clones!  Addition of restrictions sites could enable sub-cloning without use of the Invitrogen Gateway system  For example Promega offers Flexi® Vectors using SgfI and PmeI  Should the parental clones from cDNA libraries made available?  Should the collection include splice variants?  Are there special requirements we do not know of? Matthias Harbers 7
  • 8. Available starting materials  Want to use high-quality full-length cDNA clones and libraries where possible!  Human cDNA collections: • RIKEN ~311,000 human end-sequenced human cDNA clones in NCBI? • Other human cDNA clones: e.g. ORIGENE?  Human full-length cDNA libraries in the public domain?  I do not see gene synthesis based on predicted sequences as “general option”  I prefer starting from cDNA libraries using “real transcripts”  Classical cDNA libraries have not been sequenced deep enough to cover rare genes! There is an option to find more unique clones in old libraries  Need new cDNA libraries/pools to cover important biological samples? Matthias Harbers 8
  • 9. New technologies will help  In the past sequencing cost limiting factor for building clone collections  Many clones in the public domain are not full-length sequenced  Lack of sequence information limits clone annotation  New high-speed sequencing methods can overcome limitation on sequencing cost  Use high-speed sequencing instead of end-sequencing of individual clones to screen cDNA libraries more deeply  Use high-speed sequencing to obtain full-length sequences of all clones within ncRNA collection  Use high-speed sequencing to assure high quality standards of entire collection! Matthias Harbers 9
  • 10. RNA RNA cDNA Library cDNA Library/cDNA Pool Clone Picking Shotgun sequencing End Sequencing Library Screening Annotated Clone Clones for Targets Collection Limited by sequencing cost Much higher coverage Redundancy in clone collection Focus on new targets Matthias Harbers 10
  • 11. New starting materials required  Many mRNAs lack polyadenylation and required new cloning method Total RNA from cell Removal of polyA mRNA Ligation of 3’ adapter to mRNA cDNA synthesis using 3’ adapter Cap selection using Cap Trapper Cloning into cDNA library  Size does matter: Classical cDNA projects had a size cutoff Matthias Harbers 11
  • 12. Build internet presence  Any clone collection requires an internet presence with a database!  Clone related information can only be provided by a database  Annotation of the clones by reference to other resources is important  Application notes and references could be a great capture for users  Good documentation of the project needed  Provide all clones to community without limitations on the rights to use (follow example of “Good Faith Agreement” of ORFeome Collaboration)  ncRNAs may require “more” for a better understanding on how to study new mechanisms and functions!  Become the “home” for research on ncRNAs! Matthias Harbers 12
  • 13. Conclusion  MGC and OC set standards for human clone resources!  We should to build on the great record of MGC and OC to move from coding mRNAs to long non-coding mRNAs  After most coding genes have been covered by at least one cDNA clone, we need to work on the non-coding transcripts to move forward  Non-coding genes are essential players in life and we want to provide comprehensive resources for their study and analysis  Starting with human ncRNAs will greatly benefit medical research  Including ncRNAs from other organisms could be an option where those are key model systems to study ncRNA functions (RIKEN FANTOM clone set from mouse includes many long ncRNAs) Matthias Harbers 13