Karen Cranston
National Evolutionary Synthesis Center
@kcranstn / @opentreeoflife
http://www.slideshare.net/kcranstn
opentreeoflife.org
Gordon Burleigh
Keith Crandall
Karen Cranston
Karl Gude
David Hibbett
Mark Holder
Laura Katz
Rick Ree
Stephen Smith
Doug Soltis
Tiffani Williams
What does it mean to “have” the tree of life?
complete & dynamic
browse, download, query
use for research questions
implies digital access
Open Tree of Life
Taxonomy +
Source trees
•filter / weight input trees
•combine into synthetic tree
•feedback
•input new data sets
Inputs: Phylogenetic data
Archiving sequence data is a community norm
~4% of all published phylogenetic trees (Stoltzfus et al. 2012)
Heroic data collection efforts
Surveyed >7000 phylogenetic studies in plants, fungi, animals, and unicellular organisms
Result: data for >2300 studies, >4800 trees
Poster P133003 tonight!
Inputs: Taxonomies
Large fraction of species not represented in phylogenies
taxonomy provides backbone & coverage at tips
2,644,685 names: NCBI (structure) + GBIF (completeness)
https://github.com/OpenTreeOfLife/opentree/wiki/Open-Tree-Taxonomy
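One way to picture combining a well-structured taxonomy with a more complete one is sketched below. This is a minimal illustration, not the actual Open Tree Taxonomy build: the `merge_taxonomies` helper, the child-to-parent dictionaries, and the toy names are all hypothetical, and the merge rule (keep the first source's parent wherever both sources know a name, take the second source's entry only for names the first lacks) is an assumption.

```python
def merge_taxonomies(backbone, supplement):
    """Merge two child -> parent maps, preferring the backbone's structure.

    Hypothetical helper: names present in both sources keep the backbone's
    parent; names found only in the supplement are added as-is.
    """
    merged = dict(backbone)
    for name, parent in supplement.items():
        if name not in merged:
            merged[name] = parent
    return merged


# Toy data invented for illustration (not real NCBI or GBIF records).
structure_source = {"Homo": "Hominidae", "Homo sapiens": "Homo"}
complete_source = {
    "Homo": "Primates",
    "Homo sapiens": "Homo",
    "Homo neanderthalensis": "Homo",
}

merged = merge_taxonomies(structure_source, complete_source)
```

Here the placement of `Homo` comes from the structure source, while `Homo neanderthalensis` is only gained from the more complete source.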
Synthesis process
Source trees (Phylografter)
Taxonomies (taxomachine)
Data storage & synthesis (treemachine)
OpenTree: visualize, comment, search, download
Source tree management
phylografter.opentreeoflife.org
Synthesizing trees and taxonomies
Graph database for phylogenies (treemachine) and taxonomy (taxomachine)
Allows for extremely efficient storage and retrieval
Rules to extract binary tree from highly conflicting graph
More details? Stephen Smith 8:30 am Monday!
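The idea of applying rules to pull a single tree out of conflicting inputs can be sketched with a simple greedy rule. This is not treemachine's algorithm; it is a swapped-in stand-in that assumes each source tree is given as a list of clades (sets of tip names) and that sources are already ranked, best first. The function names and toy data are hypothetical.

```python
def compatible(a, b):
    """Two clades can sit in one tree if they are nested or disjoint."""
    return a <= b or b <= a or not (a & b)


def greedy_synthesis(ranked_trees):
    """Accept clades in rank order, skipping any that conflict.

    ranked_trees: list of clade lists, highest-ranked source first.
    Returns the accepted clades, which are pairwise compatible and
    therefore describe a single tree.
    """
    accepted = []
    for clades in ranked_trees:
        for clade in clades:
            clade = frozenset(clade)
            if clade in accepted:
                continue
            if all(compatible(clade, other) for other in accepted):
                accepted.append(clade)
    return accepted


# Toy source trees invented for illustration.
tree_1 = [{"human", "chimp"}, {"human", "chimp", "gorilla"}]
tree_2 = [{"chimp", "gorilla"}, {"human", "chimp", "gorilla", "orangutan"}]

synthetic = greedy_synthesis([tree_1, tree_2])
```

In this toy case the lower-ranked clade `{chimp, gorilla}` conflicts with the accepted `{human, chimp}` and is dropped, while the larger clade from the second tree is kept because it contains everything accepted so far.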
OpenTree browser
dev.opentreeoflife.org/opentree
Public tree of life
publictreeoflife.com/tree
“Open” Tree of Life
Collaborations
providing images and text for public tree
developing methods for subtree extraction
summer student providing links to ToLWeb pages
treeviz project from U Indiana MOOC, GNOME summer intern
partner for data archiving / harvest
August 2013 release
Year 2 & 3 goals
Refine draft tree based on user feedback / new data
Research into phylogenetic synthesis
User features
How does my tree compare with others?
Synthesis on demand
Quantifying / visualizing conflict
Suggestions?
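As a rough sketch of what comparing one tree with another might look like, the snippet below counts clades that appear in one tree but not the other (a rooted analogue of the Robinson-Foulds distance). It assumes both trees cover the same tips and are given as lists of clades; the `clade_conflict` name and the toy trees are hypothetical and say nothing about how the planned OpenTree feature would work.

```python
def clade_conflict(tree_a, tree_b):
    """Count clades present in exactly one of the two trees."""
    clades_a = {frozenset(c) for c in tree_a}
    clades_b = {frozenset(c) for c in tree_b}
    return len(clades_a ^ clades_b)


# Toy trees invented for illustration.
my_tree = [{"human", "chimp"}, {"human", "chimp", "gorilla"}]
other_tree = [{"chimp", "gorilla"}, {"human", "chimp", "gorilla"}]

distance = clade_conflict(my_tree, other_tree)
```

A distance of zero means the two trees contain exactly the same clades; larger values mean more disagreement.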
Gordon Burleigh
Keith Crandall
Karen Cranston
Karl Gude
David Hibbett
Mark Holder
Laura Katz
Rick Ree
Stephen Smith
Doug Soltis
Tiffani Williams

Cranston Evolution 2013
