SlideShare une entreprise Scribd logo
1  sur  59
ENCODE
             Encyclopedia of DNA Elements



                          Outline

                 What and who is ENCODE

Key ENCODE topics and most important papers for our research

       ENCODE data – make use of the encyclopedia…




                      Maté Ongenaert
What and who is ENCODE
        Main aims, funding and the institutions/labs behind the 200 M $

                           Who?
                 International consortium
Funded by NHGRI – National Human Genome Research Institute
                     200 million dollar

           Main collaborators (for human data)
                 Broad Institute (ChIP-seq)
    HudsonAlpha Institute for Biotechnology (methylation)
                Sanger Institute (RNA-seq)
                 Duke University (DNAse)
                   Yale University (Pol II)
                     EBI (data analysis)

                         Main aims
“Build a comprehensive parts list of functional elements in the
human genome, including elements that act at the protein and
  RNA levels, and regulatory elements that control cells and
           circumstances in which a gene is active”
What and who is ENCODE
         Main aims, funding and the institutions/labs behind the 200 M $

         What’s so hot… It has been running for years?

                 Started in 2007 – pilot project
                       1% of the genome

                         2007-2012
        Since then, introduction of new technologies
                     Higher throughput
                       Genome-wide
 Much more samples and different tissues (different ‘tiers’ – see
                              later)
            Better data analysis and integration
What and who is ENCODE
Main aims, funding and the institutions/labs behind the 200 M $

What’s so hot… It has been running for years?

         World wide press attention
What and who is ENCODE
          Main aims, funding and the institutions/labs behind the 200 M $

What’s so hot… It has been running for
               years?

     World wide press attention…
            and criticisms

“Popular” media focus on the “junk DNA
               aspect”

  The authors also claim in their press-
   release that > 80% of the genome is
‘biologically active’ (<> may be involved
 in regulation in one way or another <>
                junk DNA)

ENCODE reveals for the fist time a lot of
 factors of the very complex switching
    board controlling expression / …
What and who is ENCODE
          Main aims, funding and the institutions/labs behind the 200 M $

          What’s so hot… It has been running for years?

30 (!) research papers published in three journals at the same time
ENCODE
             Encyclopedia of DNA Elements



                          Outline

                 What and who is ENCODE

Key ENCODE topics and most important papers for our research

       ENCODE data – make use of the encyclopedia…
Key ENCODE topics
Main ENCODE topics and selection of most important papers

                          Key topics

               Transcription factor binding motifs
    Chromatin patterns at transcription factor binding sites
   Characterization of intergenic regions and gene definitions
  RNA and chromatin modification patterns around promoters
            Epigenetic regulation of RNA processing
                Non-coding RNA characterisation
                        DNA-methylation
            Enhancer discovery and characterization
              3D connections across the genome
             Characterisation of network topology
          Machine learning approaches to genomics
  Impact of functional information on understanding variation
     Impact of evolutionary selection on functional regions
Key ENCODE topics
            Main ENCODE topics and selection of most important papers

                                        Main paper

        95% of the genome lies within 8 kilobases (kb) of a DNA–protein interaction

Classifying the genome into seven chromatin states indicates an initial set of 399,124 regions
          with enhancer-like features and 70,292 regions with promoter-like features

It is possible to correlate quantitatively RNA sequence production and processing with both
  chromatin marks and transcription factor binding at promoters, indicating that promoter
                functionality can explain most of the variation in RNA expression

  Single nucleotide polymorphisms (SNPs) associated with disease by GWAS are enriched
within non-coding functional elements, with a majority residing in or near ENCODE-defined
 regions that are outside of protein-coding genes. In many cases, the disease phenotypes
              can be associated with a specific cell type or transcription factor
Key ENCODE topics
Main ENCODE topics and selection of most important papers

                          Main paper

                     Techniques used:
                          RNA-seq
                          ChIP-seq
                        DNAse-seq
           DNA-methylation arrays and bisulfite seq
                         FAIRE-seq

      Tier 1: three cell lines (K652 – GM12878 – H1 hESC)
      Tier 2: cell line panel (HeLa-S3 – HepG2 – HUVECs)
                     Tier 3 (all other cell types)

         Total: 1640 datasets / 147 different cell types
Key ENCODE topics
Main ENCODE topics and selection of most important papers

                       Main paper
Key ENCODE topics
            Main ENCODE topics and selection of most important papers

                                         Main paper

        95% of the genome lies within 8 kilobases (kb) of a DNA–protein interaction

Classifying the genome into seven chromatin states indicates an initial set of 399,124 regions
          with enhancer-like features and 70,292 regions with promoter-like features

It is possible to correlate quantitatively RNA sequence production and processing with both
  chromatin marks and transcription factor binding at promoters, indicating that promoter
                functionality can explain most of the variation in RNA expression

  Single nucleotide polymorphisms (SNPs) associated with disease by GWAS are enriched
within non-coding functional elements, with a majority residing in or near ENCODE-defined
 regions that are outside of protein-coding genes. In many cases, the disease phenotypes
              can be associated with a specific cell type or transcription factor
Key ENCODE topics
     Main ENCODE topics and selection of most important papers

Expression – chromatin state               Expression – transcription factors
Key ENCODE topics
Main ENCODE topics and selection of most important papers




            Expression – transcription factors
Key ENCODE topics
Main ENCODE topics and selection of most important papers



                                          Chromatin state patterns at
                                          transcription-factor binding
                                                      sites
Key ENCODE topics
Main ENCODE topics and selection of most important papers




         Co-association between transcription factors (K562)
Key ENCODE topics
     Main ENCODE topics and selection of most important papers




Insight in genomic variation – allele specific variation
Key ENCODE topics
            Main ENCODE topics and selection of most important papers

                                        Main paper

        95% of the genome lies within 8 kilobases (kb) of a DNA–protein interaction

Classifying the genome into seven chromatin states indicates an initial set of 399,124 regions
          with enhancer-like features and 70,292 regions with promoter-like features

It is possible to correlate quantitatively RNA sequence production and processing with both
  chromatin marks and transcription factor binding at promoters, indicating that promoter
                functionality can explain most of the variation in RNA expression

 Single nucleotide polymorphisms (SNPs) associated with disease by GWAS are enriched
within non-coding functional elements, with a majority residing in or near ENCODE-defined
 regions that are outside of protein-coding genes. In many cases, the disease phenotypes
             can be associated with a specific cell type or transcription factor
Key ENCODE topics
 Main ENCODE topics and selection of most important papers



 Overlap SNPs with
regulatory elements
Key ENCODE topics
Main ENCODE topics and selection of most important papers




  Overlap SNPs with regulatory elements and ‘open’ chromatin
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Other important papers to us
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Accessible chromatin landscape

                    DNAseI treatment
          Combined analysis with TFs and H3K4me3

       Identification of “accessible” chromatin regions
Key ENCODE topics
Main ENCODE topics and selection of most important papers

 Accessible chromatin landscape – location of accessible regions
Key ENCODE topics
Main ENCODE topics and selection of most important papers

Accessible chromatin landscape – association with ChIP-seq and TFs
Key ENCODE topics
Main ENCODE topics and selection of most important papers

      Accessible chromatin landscape – novel transcripts
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Other important papers to us
Key ENCODE topics
   Main ENCODE topics and selection of most important papers

                       Landscape of transcription

                                RNA-seq

 Get a grip on what is transcribed, including novel transcripts and RNAs
Key ENCODE topics
Main ENCODE topics and selection of most important papers

  Landscape of transcription – nucleolar fraction vs. whole cell
Key ENCODE topics
Main ENCODE topics and selection of most important papers

                 Landscape of transcription
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Other important papers to us
Key ENCODE topics
Main ENCODE topics and selection of most important papers

             Long-range interaction of promoters

    5C mapping (chromatin interaction mapping technology)

         Long-range interactions of promoter regions
Key ENCODE topics
Main ENCODE topics and selection of most important papers

            Long-range interaction of promoters
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Other important papers to us
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Other important papers to us
Key ENCODE topics
Main ENCODE topics and selection of most important papers

                  Transcriptional regulation

              ChIP-seq <> expression detection

              Predict transcriptional regulation
Key ENCODE topics
Main ENCODE topics and selection of most important papers

       Transcriptional regulation – predict transcription
Key ENCODE topics
Main ENCODE topics and selection of most important papers

       Transcriptional regulation – expression prediction
Key ENCODE topics
   Main ENCODE topics and selection of most important papers

Transcriptional regulation – TFs predict location of histone modifications
Key ENCODE topics
Main ENCODE topics and selection of most important papers

             Transcriptional regulation – model
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Other important papers to us
Key ENCODE topics
Main ENCODE topics and selection of most important papers

 Cell-type specific gene expression from open chromatin regions
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Other important papers to us
Key ENCODE topics
Main ENCODE topics and selection of most important papers

                Cell-type specific TF binding
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Other important papers to us
Key ENCODE topics
Main ENCODE topics and selection of most important papers

                 SNPs in regulatory regions
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Other important papers to us
Key ENCODE topics
Main ENCODE topics and selection of most important papers

                  TF binding - interactions
Key ENCODE topics
Main ENCODE topics and selection of most important papers

              TF binding – cell-type specificity
Key ENCODE topics
Main ENCODE topics and selection of most important papers

               Other important papers to us
Key ENCODE topics
Main ENCODE topics and selection of most important papers

              Classification of genomic regions
Key ENCODE topics
Main ENCODE topics and selection of most important papers

              Classification of genomic regions
Key ENCODE topics
Main ENCODE topics and selection of most important papers

              Classification of genomic regions
ENCODE
             Encyclopedia of DNA Elements



                          Outline

                 What and who is ENCODE

Key ENCODE topics and most important papers for our research

       ENCODE data – make use of the encyclopedia…
ENCODE data
                    Data availability

                     Data availability

All data is available, from raw data to final processed data

                   For end-level users:

- Tracks in the UCSC browser with desired level of detail
     Visualize tracks and explore genomic context

           For end-level users and bio-IT:
  - In UCSC “Table browser” and other UCSC tools
Export genomic information, including processed data

          For high end-level users and Bio-IT:
 - Raw data and semi-processed data in GEO and others
ENCODE data
                  Data availability

Tracks in the UCSC browser with desired level of detail
ENCODE data
       Data availability

Tracks in the UCSC table browser
ENCODE data
Data availability

    Raw data
ENCODE data
Data availability

    Raw data
Blok
de   Van…
       ETER

Contenu connexe

Tendances

Functional genomics
Functional genomicsFunctional genomics
Functional genomicsPawan Kumar
 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesMelanie Courtot
 
Single Nucleotide Polymorphism (SNP)
Single Nucleotide Polymorphism (SNP)Single Nucleotide Polymorphism (SNP)
Single Nucleotide Polymorphism (SNP)amna munir
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomicsajay301
 
Studying gene expression and function
Studying gene expression and functionStudying gene expression and function
Studying gene expression and functionMd Murad Khan
 
An Introduction to Genomics
An Introduction to GenomicsAn Introduction to Genomics
An Introduction to GenomicsDr NEETHU ASOKAN
 
second generation of DNA Sequencing
second generation of DNA Sequencingsecond generation of DNA Sequencing
second generation of DNA SequencingSidra Shaffique
 
Microarray Data Analysis
Microarray Data AnalysisMicroarray Data Analysis
Microarray Data Analysisyuvraj404
 
Zinc Finger Nuclease.
Zinc Finger Nuclease.Zinc Finger Nuclease.
Zinc Finger Nuclease.Zeeshan Awan
 
Introduction to Next Generation Sequencing
Introduction to Next Generation SequencingIntroduction to Next Generation Sequencing
Introduction to Next Generation SequencingFarid MUSA
 
Study of Transcriptome
Study of TranscriptomeStudy of Transcriptome
Study of TranscriptomeBOTANYWith
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSHEETHUMOLKS
 
ZINC FINGER NUCLEASE TECHNOLOGY
ZINC FINGER NUCLEASE TECHNOLOGYZINC FINGER NUCLEASE TECHNOLOGY
ZINC FINGER NUCLEASE TECHNOLOGYPriyesh Waghmare
 
ویرایش ژنوم Genome editing tools
ویرایش ژنوم Genome editing toolsویرایش ژنوم Genome editing tools
ویرایش ژنوم Genome editing toolsshahnam azizi
 
NGS data formats and analyses
NGS data formats and analysesNGS data formats and analyses
NGS data formats and analysesrjorton
 

Tendances (20)

functional genomics.ppt
functional genomics.pptfunctional genomics.ppt
functional genomics.ppt
 
Genomics
GenomicsGenomics
Genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Protein-protein interaction networks
Protein-protein interaction networksProtein-protein interaction networks
Protein-protein interaction networks
 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resources
 
Single Nucleotide Polymorphism (SNP)
Single Nucleotide Polymorphism (SNP)Single Nucleotide Polymorphism (SNP)
Single Nucleotide Polymorphism (SNP)
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Sequence assembly
Sequence assemblySequence assembly
Sequence assembly
 
Studying gene expression and function
Studying gene expression and functionStudying gene expression and function
Studying gene expression and function
 
An Introduction to Genomics
An Introduction to GenomicsAn Introduction to Genomics
An Introduction to Genomics
 
second generation of DNA Sequencing
second generation of DNA Sequencingsecond generation of DNA Sequencing
second generation of DNA Sequencing
 
Microarray Data Analysis
Microarray Data AnalysisMicroarray Data Analysis
Microarray Data Analysis
 
Genome Mapping
Genome MappingGenome Mapping
Genome Mapping
 
Zinc Finger Nuclease.
Zinc Finger Nuclease.Zinc Finger Nuclease.
Zinc Finger Nuclease.
 
Introduction to Next Generation Sequencing
Introduction to Next Generation SequencingIntroduction to Next Generation Sequencing
Introduction to Next Generation Sequencing
 
Study of Transcriptome
Study of TranscriptomeStudy of Transcriptome
Study of Transcriptome
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
 
ZINC FINGER NUCLEASE TECHNOLOGY
ZINC FINGER NUCLEASE TECHNOLOGYZINC FINGER NUCLEASE TECHNOLOGY
ZINC FINGER NUCLEASE TECHNOLOGY
 
ویرایش ژنوم Genome editing tools
ویرایش ژنوم Genome editing toolsویرایش ژنوم Genome editing tools
ویرایش ژنوم Genome editing tools
 
NGS data formats and analyses
NGS data formats and analysesNGS data formats and analyses
NGS data formats and analyses
 

Similaire à ENCODE Data Encyclopedia

New insights into the human genome by encode 14.12.12
New insights into the human genome by encode 14.12.12New insights into the human genome by encode 14.12.12
New insights into the human genome by encode 14.12.12Ranjani Reddy
 
Quality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic ModelingQuality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic ModelingStuti Nayak
 
1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymes1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymesGetachew Birhanu
 
617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptxAroojSheikh12
 
Encode jc 20130412
Encode jc 20130412Encode jc 20130412
Encode jc 20130412Dake Zhang
 
Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...
Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...
Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...IBM India Smarter Computing
 
Organellar genome and its composition
Organellar genome and its compositionOrganellar genome and its composition
Organellar genome and its compositionShilpa C
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyMelanie Courtot
 
UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG: connecting the knowledge community
 
Omdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital AgeOmdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital Agepetermurrayrust
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesElia Brodsky
 
Metadata-based tools at the ENCODE Portal
Metadata-based tools at the ENCODE PortalMetadata-based tools at the ENCODE Portal
Metadata-based tools at the ENCODE PortalENCODE-DCC
 
Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030GenomeInABottle
 
The Human Genome Project - Part III
The Human Genome Project - Part IIIThe Human Genome Project - Part III
The Human Genome Project - Part IIIhhalhaddad
 
Biotechnology Principles and Processes
Biotechnology Principles and ProcessesBiotechnology Principles and Processes
Biotechnology Principles and ProcessesMuralidhar Shingri
 

Similaire à ENCODE Data Encyclopedia (20)

New insights into the human genome by encode 14.12.12
New insights into the human genome by encode 14.12.12New insights into the human genome by encode 14.12.12
New insights into the human genome by encode 14.12.12
 
Quality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic ModelingQuality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic Modeling
 
lecture 1.pptx
lecture 1.pptxlecture 1.pptx
lecture 1.pptx
 
1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymes1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymes
 
617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx
 
Encode jc 20130412
Encode jc 20130412Encode jc 20130412
Encode jc 20130412
 
Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...
Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...
Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...
 
Organellar genome and its composition
Organellar genome and its compositionOrganellar genome and its composition
Organellar genome and its composition
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontology
 
UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...
 
Omdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital AgeOmdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital Age
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and Approaches
 
T-bioinfo overview
T-bioinfo overviewT-bioinfo overview
T-bioinfo overview
 
Metadata-based tools at the ENCODE Portal
Metadata-based tools at the ENCODE PortalMetadata-based tools at the ENCODE Portal
Metadata-based tools at the ENCODE Portal
 
Dr Robert Hanner - Barcode Data standards for animals, plants & fungi
Dr Robert Hanner - Barcode Data standards for animals, plants & fungiDr Robert Hanner - Barcode Data standards for animals, plants & fungi
Dr Robert Hanner - Barcode Data standards for animals, plants & fungi
 
Recombinanant dna technology
Recombinanant dna technologyRecombinanant dna technology
Recombinanant dna technology
 
Human Genome Project
Human Genome ProjectHuman Genome Project
Human Genome Project
 
Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030
 
The Human Genome Project - Part III
The Human Genome Project - Part IIIThe Human Genome Project - Part III
The Human Genome Project - Part III
 
Biotechnology Principles and Processes
Biotechnology Principles and ProcessesBiotechnology Principles and Processes
Biotechnology Principles and Processes
 

Plus de Maté Ongenaert

Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...Maté Ongenaert
 
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...Maté Ongenaert
 
Ecobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis LokerenEcobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis LokerenMaté Ongenaert
 
Workshop NGS data analysis - 3
Workshop NGS data analysis - 3Workshop NGS data analysis - 3
Workshop NGS data analysis - 3Maté Ongenaert
 
Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Maté Ongenaert
 
Workshop NGS data analysis - 1
Workshop NGS data analysis - 1Workshop NGS data analysis - 1
Workshop NGS data analysis - 1Maté Ongenaert
 
Exploring the neuroblastoma epigenome: perspectives for improved prognosis
Exploring the neuroblastoma epigenome: perspectives for improved prognosisExploring the neuroblastoma epigenome: perspectives for improved prognosis
Exploring the neuroblastoma epigenome: perspectives for improved prognosisMaté Ongenaert
 
High-throughput proteomics: from understanding data to predicting them
High-throughput proteomics: from understanding data to predicting themHigh-throughput proteomics: from understanding data to predicting them
High-throughput proteomics: from understanding data to predicting themMaté Ongenaert
 
Microarray data and pathway analysis: example from the bench
Microarray data and pathway analysis: example from the benchMicroarray data and pathway analysis: example from the bench
Microarray data and pathway analysis: example from the benchMaté Ongenaert
 
Large scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyLarge scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyMaté Ongenaert
 
Integrative transcriptomics to study non-coding RNA functions
Integrative transcriptomics to study non-coding RNA functionsIntegrative transcriptomics to study non-coding RNA functions
Integrative transcriptomics to study non-coding RNA functionsMaté Ongenaert
 
Race against the sequencing machine: processing of raw DNA sequence data at t...
Race against the sequencing machine: processing of raw DNA sequence data at t...Race against the sequencing machine: processing of raw DNA sequence data at t...
Race against the sequencing machine: processing of raw DNA sequence data at t...Maté Ongenaert
 
Bringing the data back to the researchers
Bringing the data back to the researchersBringing the data back to the researchers
Bringing the data back to the researchersMaté Ongenaert
 
The post-genomic era: epigenetic sequencing applications and data integration
The post-genomic era: epigenetic sequencing applications and data integrationThe post-genomic era: epigenetic sequencing applications and data integration
The post-genomic era: epigenetic sequencing applications and data integrationMaté Ongenaert
 
Literature managment training
Literature managment trainingLiterature managment training
Literature managment trainingMaté Ongenaert
 
Scientific literature managment - exercises
Scientific literature managment - exercisesScientific literature managment - exercises
Scientific literature managment - exercisesMaté Ongenaert
 

Plus de Maté Ongenaert (18)

Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...
 
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
 
Ecobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis LokerenEcobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis Lokeren
 
Workshop NGS data analysis - 3
Workshop NGS data analysis - 3Workshop NGS data analysis - 3
Workshop NGS data analysis - 3
 
Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Workshop NGS data analysis - 2
Workshop NGS data analysis - 2
 
Workshop NGS data analysis - 1
Workshop NGS data analysis - 1Workshop NGS data analysis - 1
Workshop NGS data analysis - 1
 
Bots & spiders
Bots & spidersBots & spiders
Bots & spiders
 
Exploring the neuroblastoma epigenome: perspectives for improved prognosis
Exploring the neuroblastoma epigenome: perspectives for improved prognosisExploring the neuroblastoma epigenome: perspectives for improved prognosis
Exploring the neuroblastoma epigenome: perspectives for improved prognosis
 
High-throughput proteomics: from understanding data to predicting them
High-throughput proteomics: from understanding data to predicting themHigh-throughput proteomics: from understanding data to predicting them
High-throughput proteomics: from understanding data to predicting them
 
Microarray data and pathway analysis: example from the bench
Microarray data and pathway analysis: example from the benchMicroarray data and pathway analysis: example from the bench
Microarray data and pathway analysis: example from the bench
 
Large scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyLarge scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biology
 
Integrative transcriptomics to study non-coding RNA functions
Integrative transcriptomics to study non-coding RNA functionsIntegrative transcriptomics to study non-coding RNA functions
Integrative transcriptomics to study non-coding RNA functions
 
Race against the sequencing machine: processing of raw DNA sequence data at t...
Race against the sequencing machine: processing of raw DNA sequence data at t...Race against the sequencing machine: processing of raw DNA sequence data at t...
Race against the sequencing machine: processing of raw DNA sequence data at t...
 
Bringing the data back to the researchers
Bringing the data back to the researchersBringing the data back to the researchers
Bringing the data back to the researchers
 
The post-genomic era: epigenetic sequencing applications and data integration
The post-genomic era: epigenetic sequencing applications and data integrationThe post-genomic era: epigenetic sequencing applications and data integration
The post-genomic era: epigenetic sequencing applications and data integration
 
Introduction
IntroductionIntroduction
Introduction
 
Literature managment training
Literature managment trainingLiterature managment training
Literature managment training
 
Scientific literature managment - exercises
Scientific literature managment - exercisesScientific literature managment - exercises
Scientific literature managment - exercises
 

Dernier

Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptxMusic 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptxleah joy valeriano
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 

Dernier (20)

Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptxMusic 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 

ENCODE Data Encyclopedia

  • 1. ENCODE Encyclopedia of DNA Elements Outline What and who is ENCODE Key ENCODE topics and most important papers for our research ENCODE data – make use of the encyclopedia… Maté Ongenaert
  • 2. What and who is ENCODE Main aims, funding and the institutions/labs behind the 200 M $ Who? International consortium Funded by NHGRI – National Human Genome Research Institute 200 million dollar Main collaborators (for human data) Broad Institute (ChIP-seq) HudsonAlpha Institute for Biotechnology (methylation) Sanger Institute (RNA-seq) Duke University (DNAse) Yale University (Pol II) EBI (data analysis) Main aims “Build a comprehensive parts list of functional elements in the human genome, including elements that act at the protein and RNA levels, and regulatory elements that control cells and circumstances in which a gene is active”
  • 3. What and who is ENCODE Main aims, funding and the institutions/labs behind the 200 M $ What’s so hot… It has been running for years? Started in 2007 – pilot project 1% of the genome 2007-2012 Since then, introduction of new technologies  Higher throughput  Genome-wide  Much more samples and different tissues (different ‘tiers’ – see later)  Better data analysis and integration
  • 4. What and who is ENCODE Main aims, funding and the institutions/labs behind the 200 M $ What’s so hot… It has been running for years? World wide press attention
  • 5. What and who is ENCODE Main aims, funding and the institutions/labs behind the 200 M $ What’s so hot… It has been running for years? World wide press attention… and criticisms “Popular” media focus on the “junk DNA aspect” The authors also claim in their press- release that > 80% of the genome is ‘biologically active’ (<> may be involved in regulation in one way or another <> junk DNA) ENCODE reveals for the fist time a lot of factors of the very complex switching board controlling expression / …
  • 6. What and who is ENCODE Main aims, funding and the institutions/labs behind the 200 M $ What’s so hot… It has been running for years? 30 (!) research papers published in three journals at the same time
  • 7. ENCODE Encyclopedia of DNA Elements Outline What and who is ENCODE Key ENCODE topics and most important papers for our research ENCODE data – make use of the encyclopedia…
  • 8. Key ENCODE topics Main ENCODE topics and selection of most important papers Key topics Transcription factor binding motifs Chromatin patterns at transcription factor binding sites Characterization of intergenic regions and gene definitions RNA and chromatin modification patterns around promoters Epigenetic regulation of RNA processing Non-coding RNA characterisation DNA-methylation Enhancer discovery and characterization 3D connections across the genome Characterisation of network topology Machine learning approaches to genomics Impact of functional information on understanding variation Impact of evolutionary selection on functional regions
  • 9. Key ENCODE topics Main ENCODE topics and selection of most important papers Main paper 95% of the genome lies within 8 kilobases (kb) of a DNA–protein interaction Classifying the genome into seven chromatin states indicates an initial set of 399,124 regions with enhancer-like features and 70,292 regions with promoter-like features It is possible to correlate quantitatively RNA sequence production and processing with both chromatin marks and transcription factor binding at promoters, indicating that promoter functionality can explain most of the variation in RNA expression Single nucleotide polymorphisms (SNPs) associated with disease by GWAS are enriched within non-coding functional elements, with a majority residing in or near ENCODE-defined regions that are outside of protein-coding genes. In many cases, the disease phenotypes can be associated with a specific cell type or transcription factor
  • 10. Key ENCODE topics Main ENCODE topics and selection of most important papers Main paper Techniques used: RNA-seq ChIP-seq DNAse-seq DNA-methylation arrays and bisulfite seq FAIRE-seq Tier 1: three cell lines (K652 – GM12878 – H1 hESC) Tier 2: cell line panel (HeLa-S3 – HepG2 – HUVECs) Tier 3 (all other cell types) Total: 1640 datasets / 147 different cell types
  • 11. Key ENCODE topics Main ENCODE topics and selection of most important papers Main paper
  • 12. Key ENCODE topics Main ENCODE topics and selection of most important papers Main paper 95% of the genome lies within 8 kilobases (kb) of a DNA–protein interaction Classifying the genome into seven chromatin states indicates an initial set of 399,124 regions with enhancer-like features and 70,292 regions with promoter-like features It is possible to correlate quantitatively RNA sequence production and processing with both chromatin marks and transcription factor binding at promoters, indicating that promoter functionality can explain most of the variation in RNA expression Single nucleotide polymorphisms (SNPs) associated with disease by GWAS are enriched within non-coding functional elements, with a majority residing in or near ENCODE-defined regions that are outside of protein-coding genes. In many cases, the disease phenotypes can be associated with a specific cell type or transcription factor
  • 13. Key ENCODE topics Main ENCODE topics and selection of most important papers Expression – chromatin state Expression – transcription factors
  • 14. Key ENCODE topics Main ENCODE topics and selection of most important papers Expression – transcription factors
  • 15. Key ENCODE topics Main ENCODE topics and selection of most important papers Chromatin state patterns at transcription-factor binding sites
  • 16. Key ENCODE topics Main ENCODE topics and selection of most important papers Co-association between transcription factors (K562)
  • 17. Key ENCODE topics Main ENCODE topics and selection of most important papers Insight in genomic variation – allele specific variation
  • 18. Key ENCODE topics Main ENCODE topics and selection of most important papers Main paper 95% of the genome lies within 8 kilobases (kb) of a DNA–protein interaction Classifying the genome into seven chromatin states indicates an initial set of 399,124 regions with enhancer-like features and 70,292 regions with promoter-like features It is possible to correlate quantitatively RNA sequence production and processing with both chromatin marks and transcription factor binding at promoters, indicating that promoter functionality can explain most of the variation in RNA expression Single nucleotide polymorphisms (SNPs) associated with disease by GWAS are enriched within non-coding functional elements, with a majority residing in or near ENCODE-defined regions that are outside of protein-coding genes. In many cases, the disease phenotypes can be associated with a specific cell type or transcription factor
  • 19. Key ENCODE topics Main ENCODE topics and selection of most important papers Overlap SNPs with regulatory elements
  • 20. Key ENCODE topics Main ENCODE topics and selection of most important papers Overlap SNPs with regulatory elements and ‘open’ chromatin
  • 21. Key ENCODE topics Main ENCODE topics and selection of most important papers Other important papers to us
  • 22. Key ENCODE topics Main ENCODE topics and selection of most important papers Accessible chromatin landscape DNAseI treatment Combined analysis with TFs and H3K4me3  Identification of “accessible” chromatin regions
  • 23. Key ENCODE topics Main ENCODE topics and selection of most important papers Accessible chromatin landscape – location of accessible regions
  • 24. Key ENCODE topics Main ENCODE topics and selection of most important papers Accessible chromatin landscape – association with ChIP-seq and TFs
  • 25. Key ENCODE topics Main ENCODE topics and selection of most important papers Accessible chromatin landscape – novel transcripts
  • 26. Key ENCODE topics Main ENCODE topics and selection of most important papers Other important papers to us
  • 27. Key ENCODE topics Main ENCODE topics and selection of most important papers Landscape of transcription RNA-seq  Get a grip on what is transcribed, including novel transcripts and RNAs
  • 28. Key ENCODE topics Main ENCODE topics and selection of most important papers Landscape of transcription – nucleolar fraction vs. whole cell
  • 29. Key ENCODE topics Main ENCODE topics and selection of most important papers Landscape of transcription
  • 30. Key ENCODE topics Main ENCODE topics and selection of most important papers Other important papers to us
  • 31. Key ENCODE topics Main ENCODE topics and selection of most important papers Long-range interaction of promoters 5C mapping (chromatin interaction mapping technology)  Long-range interactions of promoter regions
  • 32. Key ENCODE topics Main ENCODE topics and selection of most important papers Long-range interaction of promoters
  • 33. Key ENCODE topics Main ENCODE topics and selection of most important papers Other important papers to us
  • 34. Key ENCODE topics Main ENCODE topics and selection of most important papers Other important papers to us
  • 35. Key ENCODE topics Main ENCODE topics and selection of most important papers Transcriptional regulation ChIP-seq <> expression detection  Predict transcriptional regulation
  • 36. Key ENCODE topics Main ENCODE topics and selection of most important papers Transcriptional regulation – predict transcription
  • 37. Key ENCODE topics Main ENCODE topics and selection of most important papers Transcriptional regulation – expression prediction
  • 38. Key ENCODE topics Main ENCODE topics and selection of most important papers Transcriptional regulation – TFs predict location of histone modifications
  • 39. Key ENCODE topics Main ENCODE topics and selection of most important papers Transcriptional regulation – model
  • 40. Key ENCODE topics Main ENCODE topics and selection of most important papers Other important papers to us
  • 41. Key ENCODE topics Main ENCODE topics and selection of most important papers Cell-type specific gene expression from open chromatin regions
  • 42. Key ENCODE topics Main ENCODE topics and selection of most important papers Other important papers to us
  • 43. Key ENCODE topics Main ENCODE topics and selection of most important papers Cell-type specific TF binding
  • 44. Key ENCODE topics Main ENCODE topics and selection of most important papers Other important papers to us
  • 45. Key ENCODE topics Main ENCODE topics and selection of most important papers SNPs in regulatory regions
  • 46. Key ENCODE topics Main ENCODE topics and selection of most important papers Other important papers to us
  • 47. Key ENCODE topics Main ENCODE topics and selection of most important papers TF binding - interactions
  • 48. Key ENCODE topics Main ENCODE topics and selection of most important papers TF binding – cell-type specificity
  • 49. Key ENCODE topics Main ENCODE topics and selection of most important papers Other important papers to us
  • 50. Key ENCODE topics Main ENCODE topics and selection of most important papers Classification of genomic regions
  • 51. Key ENCODE topics Main ENCODE topics and selection of most important papers Classification of genomic regions
  • 52. Key ENCODE topics Main ENCODE topics and selection of most important papers Classification of genomic regions
  • 53. ENCODE Encyclopedia of DNA Elements Outline What and who is ENCODE Key ENCODE topics and most important papers for our research ENCODE data – make use of the encyclopedia…
  • 54. ENCODE data Data availability Data availability All data is available, from raw data to final processed data For end-level users: - Tracks in the UCSC browser with desired level of detail  Visualize tracks and explore genomic context For end-level users and bio-IT: - In UCSC “Table browser” and other UCSC tools Export genomic information, including processed data For high end-level users and Bio-IT: - Raw data and semi-processed data in GEO and others
  • 55. ENCODE data Data availability Tracks in the UCSC browser with desired level of detail
  • 56. ENCODE data Data availability Tracks in the UCSC table browser
  • 59. Blok de Van… ETER