SlideShare a Scribd company logo
1 of 70
FBW
               25-09-2012




Wim Van Criekinge
What is Bioinformatics ?

                   • Application of information technology to the
                     storage, management and analysis of biological
                     information (Facilitated by the use of
                     computers)
                       –   Sequence analysis?
                       –   Molecular modeling (HTX) ?
                       –   Phylogeny/evolution?
                       –   Ecology and population studies?
                       –   Medical informatics?
                       –   Image Analysis ?
                       –   Statistics ? AI ?
                       –   Sterkstroom of zwakstroom ?
Promises of genomics and bioinformatics

• Medicine (Pharma)
   – Genome analysis allows the targeting of genetic
     diseases
   – The effect of a disease or of a therapeutic on RNA and
     protein levels can be elucidated
   – Knowledge of protein structure facilitates drug design
   – Understanding of genomic variation allows the tailoring
     of medical treatment to the individual’s genetic make-
     up
• The same techniques can be applied to crop (Agro) and
  livestock improvement (Animal Health)
Bioinformatics: What’s in a name ?



           • Begin 1990’s
           • “Bio-informatics”:


  Computing Power
  Genbank
  (Log)




                                     Time (years)
Bioinformatics: What’s in a name ?



   • Begin 1990’s
   • “Bio-informatics”:
       – convergence of explosive growth in
         biotechnology, paralled by the explosive growth
         in information technology
   • Not new: > 30 years that people use
     “computers” in biology
   • In silico biology, database biology, ...
Time (years)
Happy Birthday …
PCR + dye termination

  Suddenly, a flash of insight caused him to pull the car
    off the road and stop. He awakened his friend
    dozing in the passenger seat and excitedly
    explained to her that he had hit upon a solution -
    not to his original problem, but to one of even
    greater significance. Kary Mullis had just conceived
    of a simple method for producing virtually unlimited
    copies of a specific DNA sequence in a test tube -
    the polymerase chain reaction (PCR)
Bioinformatics, a scientific discipline …


                                               Math




       Computer Science                                      Theoretical Biology



                                            Bioinformatics



                                                                    (Molecular)
     Informatics
                                                                      Biology
                                     Computational Biology
Bioinformatics, a scientific discipline …


                                               Math
                                                Algorithm Development




       Computer Science                                        Theoretical Biology
                           NP                         AI, Image Analysis
                           Datamining                 structure prediction (HTX)
                                            Bioinformatics

                   Interface Design                     Expert Annotation
                                     Sequence Analysis                  (Molecular)
     Informatics
                                                                          Biology
                                     Computational Biology
Bioinformatics, a scientific discipline …


                                            Math
                                             Algorithm Development




       Computer Science                                      Theoretical Biology
                           NP                      AI, Image Analysis
                           Datamining              structure prediction (HTX)
                                 Bioinformatics
                Discovery Informatics – Computational Genomics
                   Interface Design                  Expert Annotation
                                     Sequence Analysis               (Molecular)
     Informatics
                                                                       Biology
                                     Computational Biology
Doel van de cursus

                     • Meer dan een inleiding tot ... het is de
                       bedoeling van de cursus een onderliggend
                       inzicht te verschaffen achter de
                       verschillende technieken.
                     • Naast het gebruik van recepten, wat terug
                       te vinden is in delen van de syllabus laat
                       een inzicht in
                       – de werking van databanken
                       – en de achterliggende algoritmen
                     • toe
                       – om wisselende interfaces op nieuwe
                         problemen toe te passen.
Inhoud Lessen: Bioinformatica
Examen

         • Theorie
           – Deel rond een zelf te kiezen publicatie die in verband
             staat met de cursus
              • Bv Bioinformatics of Computational Biology
           – Drie inzichtsvragen over de cursus (inclusief  !!)


         • Practicum (“open-book”)
           – Viertal oefeningen die meestal het schrijven van een
             programma veronderstellen


         • Puntenverdeling 50/50
Cursus


         • 25 Euro
           – Syllabus
           – Hand-outs van Les/Practicum 1
           – V|Podcasts
           – Weblems – Screencasts
           – Flash Drive
             • Image to be installed
• Timelin: Magaret
  Dayhoff …
Nexus > FAQ > Bioinformatics Milestones
• http://www.sciencemag.org/cgi/content/fu
  ll/291/5507/1195
• Printed version in cursus
Setting the stage …



                      nature
                           the
                           Human
                           genome
Genome Meters



 • Genomes Online Database (GOLD 1.0)
     – http://geta.life.uiuc.edu/~nikos/genomes.html
     – http://www.ebi.ac.uk/research/cgg/genomes.html
 • NCBI
     – http://www.ncbi.nlm.nih.gov/PMGifs/Genomes/bact.h
       tml
 • INFOBIOGEN
     – http://www.infobiogen.fr/doc/data/complete_genome.
       html
Genome Size

              E. coli = 4.2 x 106
              Yeast = 18 x 106
              Arabidopsis = 80 x 106
              C.elegans = 100 x 106
              Drosophila = 180 x 106
              Human/Rat/Mouse = 3000 x 106
              Lily = 300 000 x 106

                                       With ... : 99.9 %
                                       To primates: 99%

                DOGS: Database Of Genome Sizes
Biological Research




                      Adapted from John McPherson, OICR
And this is just the beginning ….

Next Generation Sequencing is here
Basics of the “old” technology


• Clone the DNA.
• Generate a ladder of labeled (colored) molecules
  that are different by 1 nucleotide.
• Separate mixture on some matrix.
• Detect fluorochrome by laser.
• Interpret peaks as string of DNA.
• Strings are 500 to 1,000 letters long
• 1 machine generates 57,000 nucleotides/run
• Assemble all strings into a genome.
Basics of the “new” technology


• Get DNA.
• Attach it to something.
• Extend and amplify signal with some color
  scheme.
• Detect fluorochrome by microscopy.
• Interpret series of spots as short strings of DNA.
• Strings are 30-300 letters long
• Multiple images are interpreted as 0.4 to 1.2
  GB/run (1,200,000,000 letters/day).
• Map or align strings to one or many genome.
Next Generation Technologies

• 454
   –Emulsion PCR
   –Polymerase
   –Natural Nucleotides
• 20-100Mb for 5-15k
   –1% error rate
   –Homopolymers
One additional insight ...
Read Length is Not As Important For Resequencing


                                                  100%
               % of Paired K-mers with Uniquely   90%
                     Assignable Location          80%
                                                  70%
                                                  60%
                                                                                           E.COLI
                                                  50%
                                                                                           HUMAN
                                                  40%
                                                  30%
                                                  20%
                                                  10%
                                                   0%
                                                         8   10   12   14   16   18   20
                                                         Length of K-mer Reads (bp)
Jay Shendure
Two Short Read Techologies


• Illumina GA



• ABI SOLID
Technology Overview: Solexa/Illumina Sequencing
ABI Solid




            Dressman 2003
ABI SOLID
ABI SOLID
Paired End Reads are Important!

                                 Known Distance

                           Read 1          Read 2

                 Repetitive DNA
                               Unique DNA

                                                  Paired read maps uniquely




           Single read maps to
           multiple positions
Adapted from: Barak Cohen, Washington University, Bio5488   http://tinyurl.com/6zttuq http://tinyurl.com/6k26nh




             Single Molecule Sequencing

Microscope slide
                                             *                          *                              *


Single DNA
molecule
                                                                                      Super-cooled
 primer                                                                               TIRF microscope

 dNTP-Cy3 *

                                                                          Helicos Biosciences Corp.
Introducing
NXT GNT DXS
Next Generation Diagnostics




                              18th september 2009
                                Wim Van Criekinge
develop in shortest time frame
 best assay for most relevant
      clinical application
NXT GNT DXS
              • GNT
                – Dedicated Team & Network
                – Operational: Location
                – Professionalized

              • DXS
                – Content engine
                – Product 1 established
                – Pipeline for n+1

              • NXT
                – Workflow management
                – Bioinformatics
                – Epigenetics
Next next generation sequencing
 Third generation sequencing
       Now sequencing
Complete genomics
Complete genomics
Pacific Biosciences: A Third Generation Sequencing Technology




                                                          Eid et al 2008
Pacific Biosciences: A Third Generation Sequencing Technology
Nanopore Sequencing
NCBI (educational resources)
Weblems

          • What ?
            – Web-based problemes (over de huidige les
              en/of voorbereiding op volgende les)
          • When ?
            – Einde van elke les
          • How ?
            – Oplossingen online via screencasts
            – Practicum
            – Voorbedereiding op het practicum examen ...
              Niet alle problemen vereisen noodzakelijk
              programmacode ...
Weblems

          W1.1: To which phyla do the following species belong (a)
           starfish (b) ginko tree (c) scorpion
          W1.2: What are the common names for the following
           species (a) Orycterophus afer (b) Beta vulagaris (c)
           macrocystis pyrifera
          W1.3: What species has the smallest known genome ? And
           is genome size related to number of genes ?
          W1.4: What are the 5 latest genomes published ? How
           complete is “coverage” ?
          W1.5: For approximately 10% of europeans, the painkiller
           codeine is ineffective because the patients lack the
           enzyme that converts codeine into the active molecule,
           morphine. What is the most common mutation that
           causes this condition ?

More Related Content

Similar to Bioinformatica t1-bioinformatics

Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsProf. Wim Van Criekinge
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfkigaruantony
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsAznaShihab
 
2005: Natural Computing - Concepts and Applications
2005: Natural Computing - Concepts and Applications2005: Natural Computing - Concepts and Applications
2005: Natural Computing - Concepts and ApplicationsLeandro de Castro
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Sijo A
 
bioinfo_00-introduction.ppt
bioinfo_00-introduction.pptbioinfo_00-introduction.ppt
bioinfo_00-introduction.pptAmmr2
 
Uses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsUses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsPragya Pai
 
BITS: Basics of sequence databases
BITS: Basics of sequence databasesBITS: Basics of sequence databases
BITS: Basics of sequence databasesBITS
 
BIOINFO unit 1.pptx
BIOINFO unit 1.pptxBIOINFO unit 1.pptx
BIOINFO unit 1.pptxrnath286
 
What_is_Bioinformatics_Dr_Sudha.pdf
What_is_Bioinformatics_Dr_Sudha.pdfWhat_is_Bioinformatics_Dr_Sudha.pdf
What_is_Bioinformatics_Dr_Sudha.pdfVishwanathAvanti
 
Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1Elia Brodsky
 
BEACON 101: Sequencing tech
BEACON 101: Sequencing techBEACON 101: Sequencing tech
BEACON 101: Sequencing techc.titus.brown
 
Computer for Biological Research
Computer for Biological ResearchComputer for Biological Research
Computer for Biological ResearchChakard Chalayut
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Iddo
 
In search of a cyberspace …to launch Biologically-Inspired Advanced Computing...
In search of a cyberspace …to launch Biologically-Inspired Advanced Computing...In search of a cyberspace …to launch Biologically-Inspired Advanced Computing...
In search of a cyberspace …to launch Biologically-Inspired Advanced Computing...ICAC09
 

Similar to Bioinformatica t1-bioinformatics (20)

Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformatics
 
T1 2018 bioinformatics
T1 2018 bioinformaticsT1 2018 bioinformatics
T1 2018 bioinformatics
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdf
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
2012 12 12_adam_v_final
2012 12 12_adam_v_final2012 12 12_adam_v_final
2012 12 12_adam_v_final
 
2005: Natural Computing - Concepts and Applications
2005: Natural Computing - Concepts and Applications2005: Natural Computing - Concepts and Applications
2005: Natural Computing - Concepts and Applications
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)
 
bioinfo_00-introduction.ppt
bioinfo_00-introduction.pptbioinfo_00-introduction.ppt
bioinfo_00-introduction.ppt
 
Uses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsUses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in Bioinformatics
 
BITS: Basics of sequence databases
BITS: Basics of sequence databasesBITS: Basics of sequence databases
BITS: Basics of sequence databases
 
BIOINFO unit 1.pptx
BIOINFO unit 1.pptxBIOINFO unit 1.pptx
BIOINFO unit 1.pptx
 
What_is_Bioinformatics_Dr_Sudha.pdf
What_is_Bioinformatics_Dr_Sudha.pdfWhat_is_Bioinformatics_Dr_Sudha.pdf
What_is_Bioinformatics_Dr_Sudha.pdf
 
Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1
 
BEACON 101: Sequencing tech
BEACON 101: Sequencing techBEACON 101: Sequencing tech
BEACON 101: Sequencing tech
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Computer for Biological Research
Computer for Biological ResearchComputer for Biological Research
Computer for Biological Research
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013
 
In search of a cyberspace …to launch Biologically-Inspired Advanced Computing...
In search of a cyberspace …to launch Biologically-Inspired Advanced Computing...In search of a cyberspace …to launch Biologically-Inspired Advanced Computing...
In search of a cyberspace …to launch Biologically-Inspired Advanced Computing...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 

More from Prof. Wim Van Criekinge

2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_upload2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_uploadProf. Wim Van Criekinge
 
2019 03 05_biological_databases_part4_v_upload
2019 03 05_biological_databases_part4_v_upload2019 03 05_biological_databases_part4_v_upload
2019 03 05_biological_databases_part4_v_uploadProf. Wim Van Criekinge
 
2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_uploadProf. Wim Van Criekinge
 
2019 02 21_biological_databases_part2_v_upload
2019 02 21_biological_databases_part2_v_upload2019 02 21_biological_databases_part2_v_upload
2019 02 21_biological_databases_part2_v_uploadProf. Wim Van Criekinge
 
2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_upload2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_uploadProf. Wim Van Criekinge
 
Bio ontologies and semantic technologies[2]
Bio ontologies and semantic technologies[2]Bio ontologies and semantic technologies[2]
Bio ontologies and semantic technologies[2]Prof. Wim Van Criekinge
 
2018 03 27_biological_databases_part4_v_upload
2018 03 27_biological_databases_part4_v_upload2018 03 27_biological_databases_part4_v_upload
2018 03 27_biological_databases_part4_v_uploadProf. Wim Van Criekinge
 
2018 02 20_biological_databases_part2_v_upload
2018 02 20_biological_databases_part2_v_upload2018 02 20_biological_databases_part2_v_upload
2018 02 20_biological_databases_part2_v_uploadProf. Wim Van Criekinge
 
2018 02 20_biological_databases_part1_v_upload
2018 02 20_biological_databases_part1_v_upload2018 02 20_biological_databases_part1_v_upload
2018 02 20_biological_databases_part1_v_uploadProf. Wim Van Criekinge
 

More from Prof. Wim Van Criekinge (20)

2020 02 11_biological_databases_part1
2020 02 11_biological_databases_part12020 02 11_biological_databases_part1
2020 02 11_biological_databases_part1
 
2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_upload2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_upload
 
2019 03 05_biological_databases_part4_v_upload
2019 03 05_biological_databases_part4_v_upload2019 03 05_biological_databases_part4_v_upload
2019 03 05_biological_databases_part4_v_upload
 
2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload
 
2019 02 21_biological_databases_part2_v_upload
2019 02 21_biological_databases_part2_v_upload2019 02 21_biological_databases_part2_v_upload
2019 02 21_biological_databases_part2_v_upload
 
2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_upload2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_upload
 
P7 2018 biopython3
P7 2018 biopython3P7 2018 biopython3
P7 2018 biopython3
 
P6 2018 biopython2b
P6 2018 biopython2bP6 2018 biopython2b
P6 2018 biopython2b
 
P4 2018 io_functions
P4 2018 io_functionsP4 2018 io_functions
P4 2018 io_functions
 
P3 2018 python_regexes
P3 2018 python_regexesP3 2018 python_regexes
P3 2018 python_regexes
 
P1 2018 python
P1 2018 pythonP1 2018 python
P1 2018 python
 
Bio ontologies and semantic technologies[2]
Bio ontologies and semantic technologies[2]Bio ontologies and semantic technologies[2]
Bio ontologies and semantic technologies[2]
 
2018 05 08_biological_databases_no_sql
2018 05 08_biological_databases_no_sql2018 05 08_biological_databases_no_sql
2018 05 08_biological_databases_no_sql
 
2018 03 27_biological_databases_part4_v_upload
2018 03 27_biological_databases_part4_v_upload2018 03 27_biological_databases_part4_v_upload
2018 03 27_biological_databases_part4_v_upload
 
2018 03 20_biological_databases_part3
2018 03 20_biological_databases_part32018 03 20_biological_databases_part3
2018 03 20_biological_databases_part3
 
2018 02 20_biological_databases_part2_v_upload
2018 02 20_biological_databases_part2_v_upload2018 02 20_biological_databases_part2_v_upload
2018 02 20_biological_databases_part2_v_upload
 
2018 02 20_biological_databases_part1_v_upload
2018 02 20_biological_databases_part1_v_upload2018 02 20_biological_databases_part1_v_upload
2018 02 20_biological_databases_part1_v_upload
 
P7 2017 biopython3
P7 2017 biopython3P7 2017 biopython3
P7 2017 biopython3
 
P6 2017 biopython2
P6 2017 biopython2P6 2017 biopython2
P6 2017 biopython2
 
Van criekinge 2017_11_13_rodebiotech
Van criekinge 2017_11_13_rodebiotechVan criekinge 2017_11_13_rodebiotech
Van criekinge 2017_11_13_rodebiotech
 

Recently uploaded

Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 

Recently uploaded (20)

Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 

Bioinformatica t1-bioinformatics

  • 1.
  • 2. FBW 25-09-2012 Wim Van Criekinge
  • 3.
  • 4. What is Bioinformatics ? • Application of information technology to the storage, management and analysis of biological information (Facilitated by the use of computers) – Sequence analysis? – Molecular modeling (HTX) ? – Phylogeny/evolution? – Ecology and population studies? – Medical informatics? – Image Analysis ? – Statistics ? AI ? – Sterkstroom of zwakstroom ?
  • 5. Promises of genomics and bioinformatics • Medicine (Pharma) – Genome analysis allows the targeting of genetic diseases – The effect of a disease or of a therapeutic on RNA and protein levels can be elucidated – Knowledge of protein structure facilitates drug design – Understanding of genomic variation allows the tailoring of medical treatment to the individual’s genetic make- up • The same techniques can be applied to crop (Agro) and livestock improvement (Animal Health)
  • 6. Bioinformatics: What’s in a name ? • Begin 1990’s • “Bio-informatics”: Computing Power Genbank (Log) Time (years)
  • 7. Bioinformatics: What’s in a name ? • Begin 1990’s • “Bio-informatics”: – convergence of explosive growth in biotechnology, paralled by the explosive growth in information technology • Not new: > 30 years that people use “computers” in biology • In silico biology, database biology, ...
  • 9.
  • 11. PCR + dye termination Suddenly, a flash of insight caused him to pull the car off the road and stop. He awakened his friend dozing in the passenger seat and excitedly explained to her that he had hit upon a solution - not to his original problem, but to one of even greater significance. Kary Mullis had just conceived of a simple method for producing virtually unlimited copies of a specific DNA sequence in a test tube - the polymerase chain reaction (PCR)
  • 12. Bioinformatics, a scientific discipline … Math Computer Science Theoretical Biology Bioinformatics (Molecular) Informatics Biology Computational Biology
  • 13. Bioinformatics, a scientific discipline … Math Algorithm Development Computer Science Theoretical Biology NP AI, Image Analysis Datamining structure prediction (HTX) Bioinformatics Interface Design Expert Annotation Sequence Analysis (Molecular) Informatics Biology Computational Biology
  • 14. Bioinformatics, a scientific discipline … Math Algorithm Development Computer Science Theoretical Biology NP AI, Image Analysis Datamining structure prediction (HTX) Bioinformatics Discovery Informatics – Computational Genomics Interface Design Expert Annotation Sequence Analysis (Molecular) Informatics Biology Computational Biology
  • 15. Doel van de cursus • Meer dan een inleiding tot ... het is de bedoeling van de cursus een onderliggend inzicht te verschaffen achter de verschillende technieken. • Naast het gebruik van recepten, wat terug te vinden is in delen van de syllabus laat een inzicht in – de werking van databanken – en de achterliggende algoritmen • toe – om wisselende interfaces op nieuwe problemen toe te passen.
  • 17.
  • 18. Examen • Theorie – Deel rond een zelf te kiezen publicatie die in verband staat met de cursus • Bv Bioinformatics of Computational Biology – Drie inzichtsvragen over de cursus (inclusief  !!) • Practicum (“open-book”) – Viertal oefeningen die meestal het schrijven van een programma veronderstellen • Puntenverdeling 50/50
  • 19. Cursus • 25 Euro – Syllabus – Hand-outs van Les/Practicum 1 – V|Podcasts – Weblems – Screencasts – Flash Drive • Image to be installed
  • 20.
  • 21. • Timelin: Magaret Dayhoff …
  • 22. Nexus > FAQ > Bioinformatics Milestones
  • 23. • http://www.sciencemag.org/cgi/content/fu ll/291/5507/1195 • Printed version in cursus
  • 24. Setting the stage … nature the Human genome
  • 25.
  • 26.
  • 27.
  • 28. Genome Meters • Genomes Online Database (GOLD 1.0) – http://geta.life.uiuc.edu/~nikos/genomes.html – http://www.ebi.ac.uk/research/cgg/genomes.html • NCBI – http://www.ncbi.nlm.nih.gov/PMGifs/Genomes/bact.h tml • INFOBIOGEN – http://www.infobiogen.fr/doc/data/complete_genome. html
  • 29. Genome Size E. coli = 4.2 x 106 Yeast = 18 x 106 Arabidopsis = 80 x 106 C.elegans = 100 x 106 Drosophila = 180 x 106 Human/Rat/Mouse = 3000 x 106 Lily = 300 000 x 106 With ... : 99.9 % To primates: 99% DOGS: Database Of Genome Sizes
  • 30.
  • 31. Biological Research Adapted from John McPherson, OICR
  • 32. And this is just the beginning …. Next Generation Sequencing is here
  • 33. Basics of the “old” technology • Clone the DNA. • Generate a ladder of labeled (colored) molecules that are different by 1 nucleotide. • Separate mixture on some matrix. • Detect fluorochrome by laser. • Interpret peaks as string of DNA. • Strings are 500 to 1,000 letters long • 1 machine generates 57,000 nucleotides/run • Assemble all strings into a genome.
  • 34. Basics of the “new” technology • Get DNA. • Attach it to something. • Extend and amplify signal with some color scheme. • Detect fluorochrome by microscopy. • Interpret series of spots as short strings of DNA. • Strings are 30-300 letters long • Multiple images are interpreted as 0.4 to 1.2 GB/run (1,200,000,000 letters/day). • Map or align strings to one or many genome.
  • 35. Next Generation Technologies • 454 –Emulsion PCR –Polymerase –Natural Nucleotides • 20-100Mb for 5-15k –1% error rate –Homopolymers
  • 36.
  • 37.
  • 38.
  • 39.
  • 40.
  • 42. Read Length is Not As Important For Resequencing 100% % of Paired K-mers with Uniquely 90% Assignable Location 80% 70% 60% E.COLI 50% HUMAN 40% 30% 20% 10% 0% 8 10 12 14 16 18 20 Length of K-mer Reads (bp) Jay Shendure
  • 43. Two Short Read Techologies • Illumina GA • ABI SOLID
  • 45.
  • 46.
  • 47.
  • 48.
  • 49.
  • 50. ABI Solid Dressman 2003
  • 53.
  • 54.
  • 55.
  • 56. Paired End Reads are Important! Known Distance Read 1 Read 2 Repetitive DNA Unique DNA Paired read maps uniquely Single read maps to multiple positions
  • 57. Adapted from: Barak Cohen, Washington University, Bio5488 http://tinyurl.com/6zttuq http://tinyurl.com/6k26nh Single Molecule Sequencing Microscope slide * * * Single DNA molecule Super-cooled primer TIRF microscope dNTP-Cy3 * Helicos Biosciences Corp.
  • 58. Introducing NXT GNT DXS Next Generation Diagnostics 18th september 2009 Wim Van Criekinge
  • 59. develop in shortest time frame best assay for most relevant clinical application
  • 60.
  • 61. NXT GNT DXS • GNT – Dedicated Team & Network – Operational: Location – Professionalized • DXS – Content engine – Product 1 established – Pipeline for n+1 • NXT – Workflow management – Bioinformatics – Epigenetics
  • 62. Next next generation sequencing Third generation sequencing Now sequencing
  • 65. Pacific Biosciences: A Third Generation Sequencing Technology Eid et al 2008
  • 66. Pacific Biosciences: A Third Generation Sequencing Technology
  • 69. Weblems • What ? – Web-based problemes (over de huidige les en/of voorbereiding op volgende les) • When ? – Einde van elke les • How ? – Oplossingen online via screencasts – Practicum – Voorbedereiding op het practicum examen ... Niet alle problemen vereisen noodzakelijk programmacode ...
  • 70. Weblems W1.1: To which phyla do the following species belong (a) starfish (b) ginko tree (c) scorpion W1.2: What are the common names for the following species (a) Orycterophus afer (b) Beta vulagaris (c) macrocystis pyrifera W1.3: What species has the smallest known genome ? And is genome size related to number of genes ? W1.4: What are the 5 latest genomes published ? How complete is “coverage” ? W1.5: For approximately 10% of europeans, the painkiller codeine is ineffective because the patients lack the enzyme that converts codeine into the active molecule, morphine. What is the most common mutation that causes this condition ?