SlideShare une entreprise Scribd logo
1  sur  21
Genome exploration in  A-T G-C space introducing   Icarus a DNA walking program Jonathan Blakes MSc Biotechnology and Computation Department of Biosciences Faculty of Science, Technology and Medical Studies
Problem too much information!
EnsEMBL UCSC Genome Browsers
Hypothesis Can DNA sequences be plotted in such a way that long sequences can be easily interpreted by humans without  a prior i knowledge? “ It seems that the simplest method of visualizing some properties of genomes is to send a virtual walker for a genomic walk, ask "it" to talk about what it has seen and note its observations. If our walker doesn't move with a Brownian-like motion, it is possible to extract from its walk a lot of information . ” Stanislaw Cebrat , the principal Polish proponent of DNA walks Assigning a cardinal coordinate ( north ,  south ,  east  or  west ) to each of the four nucleotide bases ( A ,  T ,  G ,  C ) and taking steps in those directions as a sequence is read sequentially will produce a ‘walk’ of the sequence in which repetitive DNA elements will be seen as repetitive 2-dimensional ‘structures’.
DNA walks are plots of DNA or RNA sequences where each of the four nucleotide bases is assigned a direction and distance, the sequence is read off one nucleotide at a time and for each nucleotide the virtual walker takes a step in the designated direction creating a 'walk' of the sequence that reveals elements of structure in the nucleotide composition. DNA walking From  Comparative Genometrics website,  L'Université de Lausanne
Icarus Live Demonstration Could someone please suggest a mammalian gene to walk?
Mapping 24  possible combinations of cardinal vectors: 4 rotations for each of the 3 above mappings, and  4 rotations of each of their reflections about the x or y plane. Choosing which  3  ‘unique’ mappings of those 24 is a matter of parsimony.
A-T G-C
A-G C-T
A-C G-T
A-T G-C
A-T G-C is consistently smallest Smaller pictures can contain more information in less space and are therefore more amenable to publication, hence  Genome Exploration in  A-T G-C space
Duplications exons   introns a  7  fold contiguous duplication in the male Y chromosome. Members of the TSPY (Testis-specific Y-encoded proteins) family identified by Skaletsky et al 1  using a combination of a whole chromosome dotplot with a 2-kb window and a custom Perl script running BLAST alignments of all 5-kb sequence segments, in 2-kb steps, of the entire MSY (Male Specific Y).  In contrast I stumbled upon this purely by accident. 1. Skaletsky et al. Nature 2003 423.
DNA walks for phylogenetics ,[object Object],[object Object],[object Object],Imagine a 1-dimensional textual DNA sequence. The distance from the first base to the last is simply the number of bases in the sequence. A comparison of aligned sequences on the basis of spatial distance (a much simpler measure than the Jukes-Cantor definition of evolutionary distance) will be unable to discriminate between them. 7  previously aligned 1798-nucleotide long  small ribosomal subunit sequences  of Candida and Saccharomyces species as detailed in Gilfillan 1  were walked and their total  euclidean  distances used to produce a phylogeny, which was compared to Gilfillan’s. 1.  Gilfillan GD, et. al. Microbiology. 1998. 144: 829-838.
Phylogeny algorithms neighbour joining Icarus’ UPGMA Distance Matrix
Phylogeny Demonstration
Newick format    Distance Matrix Output Newick format string representation of a tree: (Bovine:0.69395, (Gibbon:0.36079, (Orang:0.33636, (Gorilla:0.17147, (Chimp:0.19268, Human:0.11927) :0.08386):0.06124):0.15057):0.54939, Mouse:1.21460);
Phylogenies with DNA walks
Does summing distances from 3 mappings eliminate bias and produce a better phylogeny? NO. A better distance measure is needed.
Conclusion ,[object Object],[object Object],[object Object]
Acknowledgements I would like to thank: Dr. Gary Robinson Dr. Colin Johnson Dr. Anthony Baines And everyone I have met during the  Biotechnology and Computation MSc.

Contenu connexe

Tendances

Tendances (20)

How to quantify hierarchy?
How to quantify hierarchy?How to quantify hierarchy?
How to quantify hierarchy?
 
Gene Mapping Methods:Linkage Maps & Mapping with Molecular Markers
Gene  Mapping  Methods:Linkage Maps & Mapping with Molecular MarkersGene  Mapping  Methods:Linkage Maps & Mapping with Molecular Markers
Gene Mapping Methods:Linkage Maps & Mapping with Molecular Markers
 
Genome mapping
Genome mappingGenome mapping
Genome mapping
 
Gene mapping
Gene mappingGene mapping
Gene mapping
 
Gene mapping
Gene mappingGene mapping
Gene mapping
 
Chromosome or gene mapping &Linkage analysis
Chromosome or gene mapping &Linkage analysisChromosome or gene mapping &Linkage analysis
Chromosome or gene mapping &Linkage analysis
 
Human genome
Human genomeHuman genome
Human genome
 
Difference between genetic linkage and physical map
Difference between genetic  linkage and physical  mapDifference between genetic  linkage and physical  map
Difference between genetic linkage and physical map
 
genome mapping
genome mappinggenome mapping
genome mapping
 
Linkage analysis and genome mapping
Linkage analysis and genome mappingLinkage analysis and genome mapping
Linkage analysis and genome mapping
 
Unilag workshop complex genome analysis
Unilag workshop   complex genome analysisUnilag workshop   complex genome analysis
Unilag workshop complex genome analysis
 
Genetic mapping
Genetic mappingGenetic mapping
Genetic mapping
 
Location and mapping of chromosomes using conventional and cytological means.
Location and mapping of chromosomes using conventional and cytological means.Location and mapping of chromosomes using conventional and cytological means.
Location and mapping of chromosomes using conventional and cytological means.
 
Gene Mapping; By: Lauren Mary
Gene Mapping; By: Lauren MaryGene Mapping; By: Lauren Mary
Gene Mapping; By: Lauren Mary
 
Concept of genome mapping
Concept of genome mappingConcept of genome mapping
Concept of genome mapping
 
Human genome
Human genomeHuman genome
Human genome
 
Gene mapping
Gene mappingGene mapping
Gene mapping
 
Genomics
GenomicsGenomics
Genomics
 
Gene mapping
Gene mappingGene mapping
Gene mapping
 
Gene mapping
Gene  mappingGene  mapping
Gene mapping
 

En vedette

Powerpoint presentation in DNA of living organisms
Powerpoint presentation in DNA of living organismsPowerpoint presentation in DNA of living organisms
Powerpoint presentation in DNA of living organismsuniversity of johannesburg
 
DNA structure, genes and its chemical composition
DNA structure, genes and its chemical compositionDNA structure, genes and its chemical composition
DNA structure, genes and its chemical compositionValentina Duque
 
Physical and chemical mutagen copy
Physical and chemical mutagen   copyPhysical and chemical mutagen   copy
Physical and chemical mutagen copyFizza Naeem
 

En vedette (6)

Powerpoint presentation in DNA of living organisms
Powerpoint presentation in DNA of living organismsPowerpoint presentation in DNA of living organisms
Powerpoint presentation in DNA of living organisms
 
Lecture 4 winter 2012
Lecture 4 winter 2012Lecture 4 winter 2012
Lecture 4 winter 2012
 
DNA structure, genes and its chemical composition
DNA structure, genes and its chemical compositionDNA structure, genes and its chemical composition
DNA structure, genes and its chemical composition
 
Chemical composition of dna
Chemical composition of dnaChemical composition of dna
Chemical composition of dna
 
Physical and chemical mutagen copy
Physical and chemical mutagen   copyPhysical and chemical mutagen   copy
Physical and chemical mutagen copy
 
A complete PPT on DNA
A complete PPT on DNA A complete PPT on DNA
A complete PPT on DNA
 

Similaire à Exploring Genomes Using DNA Walks in A-T G-C Space

Human Genome 2009
Human Genome 2009Human Genome 2009
Human Genome 2009lyonja
 
Validating and improving the D. melanogaster reference genome sequence using ...
Validating and improving the D. melanogaster reference genome sequence using ...Validating and improving the D. melanogaster reference genome sequence using ...
Validating and improving the D. melanogaster reference genome sequence using ...Casey Bergman
 
A statistical physics approach to system biology
A statistical physics approach to system biologyA statistical physics approach to system biology
A statistical physics approach to system biologySamir Suweis
 
Apollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityApollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityMonica Munoz-Torres
 
Marzillier_09052014.pdf
Marzillier_09052014.pdfMarzillier_09052014.pdf
Marzillier_09052014.pdf7006ASWATHIRR
 
Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...QBiC_Tue
 
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...Sérgio Sacani
 
Karen miga centromere sequence characterization and variant detection
Karen miga centromere sequence characterization and variant detectionKaren miga centromere sequence characterization and variant detection
Karen miga centromere sequence characterization and variant detectionGenomeInABottle
 
Kulakova sbb2014
Kulakova sbb2014Kulakova sbb2014
Kulakova sbb2014Ek_Kul
 
Genome Informatics 2016 poster
Genome Informatics 2016 posterGenome Informatics 2016 poster
Genome Informatics 2016 posterWilliam Chow
 
Genomic mapping by kk sahu
Genomic mapping by kk sahuGenomic mapping by kk sahu
Genomic mapping by kk sahuKAUSHAL SAHU
 

Similaire à Exploring Genomes Using DNA Walks in A-T G-C Space (20)

Human Genome 2009
Human Genome 2009Human Genome 2009
Human Genome 2009
 
Validating and improving the D. melanogaster reference genome sequence using ...
Validating and improving the D. melanogaster reference genome sequence using ...Validating and improving the D. melanogaster reference genome sequence using ...
Validating and improving the D. melanogaster reference genome sequence using ...
 
New generation Sequencing
New generation Sequencing New generation Sequencing
New generation Sequencing
 
Basics of Genome Assembly
Basics of Genome Assembly Basics of Genome Assembly
Basics of Genome Assembly
 
A statistical physics approach to system biology
A statistical physics approach to system biologyA statistical physics approach to system biology
A statistical physics approach to system biology
 
Apollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityApollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research community
 
Marzillier_09052014.pdf
Marzillier_09052014.pdfMarzillier_09052014.pdf
Marzillier_09052014.pdf
 
Gene mapping and its sequence
Gene mapping and its sequenceGene mapping and its sequence
Gene mapping and its sequence
 
Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...
 
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
 
Karen miga centromere sequence characterization and variant detection
Karen miga centromere sequence characterization and variant detectionKaren miga centromere sequence characterization and variant detection
Karen miga centromere sequence characterization and variant detection
 
HGP, the human genome project
HGP, the human genome projectHGP, the human genome project
HGP, the human genome project
 
A tutorial in Connectome Analysis (1) - Marcus Kaiser
A tutorial in Connectome Analysis (1) - Marcus KaiserA tutorial in Connectome Analysis (1) - Marcus Kaiser
A tutorial in Connectome Analysis (1) - Marcus Kaiser
 
Predicting Functional Regions in Genomic DNA Sequences Using Artificial Neur...
Predicting Functional Regions in Genomic DNA Sequences Using  Artificial Neur...Predicting Functional Regions in Genomic DNA Sequences Using  Artificial Neur...
Predicting Functional Regions in Genomic DNA Sequences Using Artificial Neur...
 
Synthetic biology
Synthetic biologySynthetic biology
Synthetic biology
 
Kulakova sbb2014
Kulakova sbb2014Kulakova sbb2014
Kulakova sbb2014
 
Genome Informatics 2016 poster
Genome Informatics 2016 posterGenome Informatics 2016 poster
Genome Informatics 2016 poster
 
A tutorial in Connectome Analysis (3) - Marcus Kaiser
A tutorial in Connectome Analysis (3) - Marcus KaiserA tutorial in Connectome Analysis (3) - Marcus Kaiser
A tutorial in Connectome Analysis (3) - Marcus Kaiser
 
Genomic mapping by kk sahu
Genomic mapping by kk sahuGenomic mapping by kk sahu
Genomic mapping by kk sahu
 
Genetic mapping
Genetic mappingGenetic mapping
Genetic mapping
 

Plus de Jonathan Blakes

20080516 Spontaneous separation of bi-stable biochemical systems
20080516 Spontaneous separation of bi-stable biochemical systems20080516 Spontaneous separation of bi-stable biochemical systems
20080516 Spontaneous separation of bi-stable biochemical systemsJonathan Blakes
 
20090608 Abstraction and reusability in the biological modelling process
20090608 Abstraction and reusability in the biological modelling process20090608 Abstraction and reusability in the biological modelling process
20090608 Abstraction and reusability in the biological modelling processJonathan Blakes
 
20090918 Agile Computer Control of a Complex Experiment
20090918 Agile Computer Control of a Complex Experiment20090918 Agile Computer Control of a Complex Experiment
20090918 Agile Computer Control of a Complex ExperimentJonathan Blakes
 
20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environment20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environmentJonathan Blakes
 
20080620 Formal systems/synthetic biology modelling re-engineered
20080620 Formal systems/synthetic biology modelling re-engineered20080620 Formal systems/synthetic biology modelling re-engineered
20080620 Formal systems/synthetic biology modelling re-engineeredJonathan Blakes
 

Plus de Jonathan Blakes (6)

20101026 ASAP Seminar
20101026 ASAP Seminar20101026 ASAP Seminar
20101026 ASAP Seminar
 
20080516 Spontaneous separation of bi-stable biochemical systems
20080516 Spontaneous separation of bi-stable biochemical systems20080516 Spontaneous separation of bi-stable biochemical systems
20080516 Spontaneous separation of bi-stable biochemical systems
 
20090608 Abstraction and reusability in the biological modelling process
20090608 Abstraction and reusability in the biological modelling process20090608 Abstraction and reusability in the biological modelling process
20090608 Abstraction and reusability in the biological modelling process
 
20090918 Agile Computer Control of a Complex Experiment
20090918 Agile Computer Control of a Complex Experiment20090918 Agile Computer Control of a Complex Experiment
20090918 Agile Computer Control of a Complex Experiment
 
20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environment20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environment
 
20080620 Formal systems/synthetic biology modelling re-engineered
20080620 Formal systems/synthetic biology modelling re-engineered20080620 Formal systems/synthetic biology modelling re-engineered
20080620 Formal systems/synthetic biology modelling re-engineered
 

Dernier

PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 

Dernier (20)

PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 

Exploring Genomes Using DNA Walks in A-T G-C Space

  • 1. Genome exploration in A-T G-C space introducing Icarus a DNA walking program Jonathan Blakes MSc Biotechnology and Computation Department of Biosciences Faculty of Science, Technology and Medical Studies
  • 2. Problem too much information!
  • 4. Hypothesis Can DNA sequences be plotted in such a way that long sequences can be easily interpreted by humans without a prior i knowledge? “ It seems that the simplest method of visualizing some properties of genomes is to send a virtual walker for a genomic walk, ask "it" to talk about what it has seen and note its observations. If our walker doesn't move with a Brownian-like motion, it is possible to extract from its walk a lot of information . ” Stanislaw Cebrat , the principal Polish proponent of DNA walks Assigning a cardinal coordinate ( north , south , east or west ) to each of the four nucleotide bases ( A , T , G , C ) and taking steps in those directions as a sequence is read sequentially will produce a ‘walk’ of the sequence in which repetitive DNA elements will be seen as repetitive 2-dimensional ‘structures’.
  • 5. DNA walks are plots of DNA or RNA sequences where each of the four nucleotide bases is assigned a direction and distance, the sequence is read off one nucleotide at a time and for each nucleotide the virtual walker takes a step in the designated direction creating a 'walk' of the sequence that reveals elements of structure in the nucleotide composition. DNA walking From Comparative Genometrics website, L'Université de Lausanne
  • 6. Icarus Live Demonstration Could someone please suggest a mammalian gene to walk?
  • 7. Mapping 24 possible combinations of cardinal vectors: 4 rotations for each of the 3 above mappings, and 4 rotations of each of their reflections about the x or y plane. Choosing which 3 ‘unique’ mappings of those 24 is a matter of parsimony.
  • 12. A-T G-C is consistently smallest Smaller pictures can contain more information in less space and are therefore more amenable to publication, hence Genome Exploration in A-T G-C space
  • 13. Duplications exons introns a 7 fold contiguous duplication in the male Y chromosome. Members of the TSPY (Testis-specific Y-encoded proteins) family identified by Skaletsky et al 1 using a combination of a whole chromosome dotplot with a 2-kb window and a custom Perl script running BLAST alignments of all 5-kb sequence segments, in 2-kb steps, of the entire MSY (Male Specific Y). In contrast I stumbled upon this purely by accident. 1. Skaletsky et al. Nature 2003 423.
  • 14.
  • 15. Phylogeny algorithms neighbour joining Icarus’ UPGMA Distance Matrix
  • 17. Newick format  Distance Matrix Output Newick format string representation of a tree: (Bovine:0.69395, (Gibbon:0.36079, (Orang:0.33636, (Gorilla:0.17147, (Chimp:0.19268, Human:0.11927) :0.08386):0.06124):0.15057):0.54939, Mouse:1.21460);
  • 19. Does summing distances from 3 mappings eliminate bias and produce a better phylogeny? NO. A better distance measure is needed.
  • 20.
  • 21. Acknowledgements I would like to thank: Dr. Gary Robinson Dr. Colin Johnson Dr. Anthony Baines And everyone I have met during the Biotechnology and Computation MSc.