SlideShare une entreprise Scribd logo
1  sur  57
Towards a Hierarchical Classification of All Life – the IRMNG data assembly project Tony Rees – CSIRO Marine and Atmospheric Research, Australia October 2011
Why a hierarchical classification? Tony Rees: Hierarchical Classification of All Life
[object Object],Why a hierarchical classification? Tony Rees: Hierarchical Classification of All Life “ borrowed” from R. Page presentation, 2011 ,[object Object],[object Object]
[object Object],Why a hierarchical classification? Tony Rees: Hierarchical Classification of All Life Functional view The system Structural view genus / species name “X” useful information on taxon “X”
What should “the system” ideally hold? – something like… Tony Rees: Hierarchical Classification of All Life (etc.)
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],What should “the system” ideally hold? Tony Rees: Hierarchical Classification of All Life x 50+…
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],System is based on scientific names of taxa Tony Rees: Hierarchical Classification of All Life 2+ million ~250k ~10k ~2k Kingdoms (5/6/7/8) ~400 ~140 Phyla Classes Orders Families Genera Species
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Availability of comprehensive treatments Tony Rees: Hierarchical Classification of All Life
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Availability of comprehensive treatments – cont’d Tony Rees: Hierarchical Classification of All Life
[object Object],[object Object],[object Object],[object Object],Can we use Catalogue of Life as a comprehensive resource? Tony Rees: Hierarchical Classification of All Life ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What about “names aggregator” activities Tony Rees: Hierarchical Classification of All Life ,[object Object],[object Object],[object Object],[object Object],(etc.)
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Genus level compilations are much more complete, can we use those? Tony Rees: Hierarchical Classification of All Life
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],The IRMNG concept Tony Rees: Hierarchical Classification of All Life
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],IRMNG desired content Tony Rees: Hierarchical Classification of All Life
Family placement – editorial decisions may be needed Tony Rees: Hierarchical Classification of All Life ,[object Object]
Data aggregation complicated by genus level homonyms e.g.: Tony Rees: Hierarchical Classification of All Life ,[object Object],[object Object]
Perseverance produces the following (subset of genus table, 453k names as at Oct 2011): Tony Rees: Hierarchical Classification of All Life
A glimpse of the IRMNG “master genus” table (currently 452,827 records) Tony Rees: Hierarchical Classification of All Life
A glimpse of the IRMNG “master genus” table (currently 452,827 records) Tony Rees: Hierarchical Classification of All Life (Mabberley plant names list)
Detail showing example source/s used Tony Rees: Hierarchical Classification of All Life
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Services / views this currently supports Tony Rees: Hierarchical Classification of All Life
IRMNG-generated statistics for “all life” (web query 6 Oct 2011) Tony Rees: Hierarchical Classification of All Life ,[object Object]
Other services / products e.g. full hierarchical lists  Tony Rees: Hierarchical Classification of All Life however with caveat: some / many genera may still be classified only at higher level (e.g. “Mammalia – unallocated”) at this time (more work to do).
Check batches of entered names Tony Rees: Hierarchical Classification of All Life (1,406 genus names…)
Check batches of entered names Tony Rees: Hierarchical Classification of All Life (start of IRMNG search result)
Check batches of entered names Tony Rees: Hierarchical Classification of All Life
Check batches of entered names Tony Rees: Hierarchical Classification of All Life ?
Query by taxon name (correctly spelled or misspelled) Tony Rees: Hierarchical Classification of All Life
Check batches of entered names Tony Rees: Hierarchical Classification of All Life ,[object Object]
Linking names with literature Tony Rees: Hierarchical Classification of All Life
Tony Rees: Hierarchical Classification of All Life The “microcitation” (Nomenclator’s favourite…) ,[object Object],[object Object],[object Object],Name plus page in work List of all works as data objects
Expanded citation info in IRMNG - example  Tony Rees: Hierarchical Classification of All Life
Expanded citation info in IRMNG - example  Tony Rees: Hierarchical Classification of All Life
Expanded citation info in IRMNG - example  Tony Rees: Hierarchical Classification of All Life
IP issues regarding bibliographies, etc. Tony Rees: Hierarchical Classification of All Life ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
IRMNG content – recent missing genera… Tony Rees: Hierarchical Classification of All Life
IRMNG content – genus names published by year, 1995-current (as at Oct 2011), excluding virus names (which are undated) Tony Rees: Hierarchical Classification of All Life (NB could disaggregate further as desired, e.g. by detailed tax. group, or extant vs. fossil…) …  also would expect a small number of residual names missed for ostensibly “complete” years presumed missing names
IRMNG 2011 content cf. Cat. of Life 2011 Tony Rees: Hierarchical Classification of All Life Note, Chapman, 2009 estimates c.1.9m described extant species (see earlier slide) On that basis, CoL has 70% of valid extant species names, maybe 70% of valid extant genera (with subset of  genus-level synonyms) IRMNG is missing est. 10k genera from 2004-2011 (from last slide), maybe further 2-3% overall (say 10k-15k), “complete” list would thus be ~475k at this time (increasing at ~2k/year). Cat. of Life - 2011 edition % with auth's IRMNG – Oct 2011 - extant + fossil % with auth's IRMNG – Oct 2011 - fossil only           Kingdoms 8   7   0 Phyla 111   153   12 Classes 288   509   64 Orders 1,233   2,645   715 Families 8,071 0% 19,639 22.1% 6,542 Subfamilies           Genera 178,515 0% 452,848 97.1% 90,278 Subgenera           Species (valid) 1,347,224 ~100% 1,020,519 ~100% 16,792 Species (synonyms) 895,441 ~100% 440,738 ~100% 100
Many unfinished tasks Tony Rees: Hierarchical Classification of All Life ,[object Object],[object Object],[object Object],[object Object],[object Object]
Potential integration / replacement with “GN” components… Tony Rees: Hierarchical Classification of All Life ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Potential integration / replacement with “GN” components… Tony Rees: Hierarchical Classification of All Life ,[object Object],[object Object]
Thank you Thanks to: - OBIS, GBIF and Atlas of Living Australia for financial support, numerous data providers for data - CSIRO for salary and in-kind support, 2006-present - D. Patterson / MBL / NSF (this trip funding + hosting) Tony Rees: Hierarchical Classification of All Life Contact details Phone: +61 3 6232 5318 Email: Tony.Rees@csiro.au  Web: www.cmar.csiro.au/datacentre/
Supplementary slides Tony Rees: Hierarchical Classification of All Life
Tony Rees: Hierarchical Classification of All Life ,[object Object],[object Object]
New names: potential discovery paths Tony Rees: Hierarchical Classification of All Life new virus names new prokaryote names new botanical names – algae & fungi (except fossils) new botanical names – bryophytes through angiosperms (except fossils) new zoological names publication discovery official registers taxon-specific DB’s integrated DB’s “ all names” Botany Zoology Newly published names – primary literature (print, electronic) ICTV Viruses DB LPSN (Prokaryote names) ICBN Decisions ICZN Decisions Journal TOC’s, RSS feeds, text mining Abstracting services Subject bibliographies Reviews, secondary literature Zoological Record ION (Index of Organism Names) ChecklistBank GNI GNUB ZooBank? Catalogue of Life annual editions ITIS NCBI Taxonomy WoRMS etc. CyanoDB Index Fungorum MycoBank AlgaeBase Plant GSD’s PaleoDB Animal GSD’s other compilations e.g. regional lists, Wikispecies, Wikipedia, more… IRMNG
New names: potential discovery paths Tony Rees: Hierarchical Classification of All Life new virus names new prokaryote names new botanical names – algae & fungi (except fossils) new botanical names – bryophytes through angiosperms (except fossils) new zoological names publication discovery official registers taxon-specific DB’s integrated DB’s “ all names” Botany Zoology Newly published names – primary literature (print, electronic) ICTV Viruses DB LPSN (Prokaryote names) ICBN Decisions ICZN Decisions Journal TOC’s, RSS feeds, text mining Abstracting services Subject bibliographies Reviews, secondary literature Zoological Record ION (Index of Organism Names) ChecklistBank GNI GNUB ZooBank? Catalogue of Life annual editions ITIS NCBI Taxonomy WoRMS etc. CyanoDB Index Fungorum MycoBank AlgaeBase Plant GSD’s PaleoDB Animal GSD’s other compilations e.g. regional lists, Wikispecies, Wikipedia, more… IRMNG Lots of manual effort
New names: potential discovery paths Tony Rees: Hierarchical Classification of All Life new virus names new prokaryote names new botanical names – algae & fungi (except fossils) new botanical names – bryophytes through angiosperms (except fossils) new zoological names publication discovery official registers taxon-specific DB’s integrated DB’s “ all names” Botany Zoology Newly published names – primary literature (print, electronic) ICTV Viruses DB LPSN (Prokaryote names) ICBN Decisions ICZN Decisions Journal TOC’s, RSS feeds, text mining Abstracting services Subject bibliographies Reviews, secondary literature Zoological Record ION (Index of Organism Names) ChecklistBank GNI GNUB ZooBank? Catalogue of Life annual editions ITIS NCBI Taxonomy WoRMS etc. CyanoDB Index Fungorum MycoBank AlgaeBase Plant GSD’s PaleoDB Animal GSD’s other compilations e.g. regional lists, Wikispecies, Wikipedia, more… IRMNG Lots of automated feeds + expert curation
New names: potential discovery paths Tony Rees: Hierarchical Classification of All Life new virus names new prokaryote names new botanical names – algae & fungi (except fossils) new botanical names – bryophytes through angiosperms (except fossils) new zoological names publication discovery official registers taxon-specific DB’s integrated DB’s “ all names” Botany Zoology Newly published names – primary literature (print, electronic) ICTV Viruses DB LPSN (Prokaryote names) ICBN Decisions ICZN Decisions Journal TOC’s, RSS feeds, text mining Abstracting services Subject bibliographies Reviews, secondary literature Zoological Record ION (Index of Organism Names) ChecklistBank GNI GNUB ZooBank? Catalogue of Life annual editions ITIS NCBI Taxonomy WoRMS etc. CyanoDB Index Fungorum MycoBank AlgaeBase Plant GSD’s PaleoDB Animal GSD’s other compilations e.g. regional lists, Wikispecies, Wikipedia, more… IRMNG Lots of automated feeds + expert curation Lots of useful services
How many taxa? Tony Rees: Hierarchical Classification of All Life valid extant + fossil taxa (est.) How many species? estimates according to Chapman, 2009 (valid, extant taxa only); “others” comprise c. 54k protists, 10k prokaryotes, 2k viruses NB inverts. includes “~1,000,000” for Insects – probably +/- 60k Fossil species – no published estimates – maybe 500k names, 300k valid 2+ million ~250k ~10k ~2k Kingdoms (5/6/7/8) ~400 ~140 Phyla Classes Orders Families Genera Species
Relevant information domain: all life Tony Rees: Hierarchical Classification of All Life PROTISTS Fig. i-1 in Margulis & Schwartz, 1998
How many kingdoms… Tony Rees: Hierarchical Classification of All Life PROTISTS Fig. i-1 in Margulis & Schwartz, 1998 7 kingdoms (5 in Margulis & Schwartz, 8 in Cat. of Life…): Animals, Fungi, Plants : 3 kingdoms Protists : 1 (or 2 if Stramenopiles [Heterokonts] recognized, = Cavalier-Smith’s Kingdom “Chromista”) Bacteria + Archaea : 2 (=1 in Margulis & Schwartz) Viruses : 1 (not in Margulis & Schwartz)
Nomenclature governed by four separate  Codes , i.e. Zoological, Botanical, Bacteriological, Viruses Tony Rees: Hierarchical Classification of All Life PROTISTS Zoo. Code Bact. Code Bot. Code Vir. Code: viruses (not shown) Fig. i-1 in Margulis & Schwartz, 1998
CiteBank as a remote references repository? Tony Rees: Hierarchical Classification of All Life ,[object Object],[object Object],[object Object],[object Object],[object Object]
Parker, 1982 content example Tony Rees: Hierarchical Classification of All Life
Benton, 1993 content example Tony Rees: Hierarchical Classification of All Life
Rees TAXAMATCH fuzzy matching poster (start) Tony Rees: Hierarchical Classification of All Life
Schematic of TAXAMATCH operation Tony Rees: Hierarchical Classification of All Life

Contenu connexe

Tendances

Chapter 18.2
Chapter 18.2Chapter 18.2
Chapter 18.2
fj560
 
Comparing the Codes: Zoological and Botantical Nomenclature
Comparing the Codes: Zoological and Botantical NomenclatureComparing the Codes: Zoological and Botantical Nomenclature
Comparing the Codes: Zoological and Botantical Nomenclature
ICZN
 
Taxonomy Biology Notes
Taxonomy Biology NotesTaxonomy Biology Notes
Taxonomy Biology Notes
Fred Phillips
 
Nomenclature for the Future: The power and challenges for stable and sensible...
Nomenclature for the Future: The power and challenges for stable and sensible...Nomenclature for the Future: The power and challenges for stable and sensible...
Nomenclature for the Future: The power and challenges for stable and sensible...
ICZN
 
05 phylogeny modern taxonomy
05   phylogeny modern taxonomy05   phylogeny modern taxonomy
05 phylogeny modern taxonomy
mrtangextrahelp
 
Taxonomy
TaxonomyTaxonomy
Taxonomy
zqc
 
Classification system
Classification systemClassification system
Classification system
Syed Shah
 

Tendances (20)

Intro to biodiversity and taxonomy
Intro to biodiversity and taxonomyIntro to biodiversity and taxonomy
Intro to biodiversity and taxonomy
 
Historical resume of systematics by VISHAL BHOJYAWAL
Historical  resume of  systematics by VISHAL BHOJYAWALHistorical  resume of  systematics by VISHAL BHOJYAWAL
Historical resume of systematics by VISHAL BHOJYAWAL
 
Chapter 18.2
Chapter 18.2Chapter 18.2
Chapter 18.2
 
Comparing the Codes: Zoological and Botantical Nomenclature
Comparing the Codes: Zoological and Botantical NomenclatureComparing the Codes: Zoological and Botantical Nomenclature
Comparing the Codes: Zoological and Botantical Nomenclature
 
Taxonomy Biology Notes
Taxonomy Biology NotesTaxonomy Biology Notes
Taxonomy Biology Notes
 
Nomenclature for the Future: The power and challenges for stable and sensible...
Nomenclature for the Future: The power and challenges for stable and sensible...Nomenclature for the Future: The power and challenges for stable and sensible...
Nomenclature for the Future: The power and challenges for stable and sensible...
 
Binomial nomenclature
Binomial nomenclatureBinomial nomenclature
Binomial nomenclature
 
Biodiversity 12
Biodiversity 12Biodiversity 12
Biodiversity 12
 
05 phylogeny modern taxonomy
05   phylogeny modern taxonomy05   phylogeny modern taxonomy
05 phylogeny modern taxonomy
 
Classification
ClassificationClassification
Classification
 
Classification
ClassificationClassification
Classification
 
TAXONOMICAL CATEGORIES
TAXONOMICAL CATEGORIESTAXONOMICAL CATEGORIES
TAXONOMICAL CATEGORIES
 
Taxonomy
TaxonomyTaxonomy
Taxonomy
 
Classification system
Classification systemClassification system
Classification system
 
Type method
Type methodType method
Type method
 
SYSTEMATICS: Based on Evolutionary Relationships
SYSTEMATICS: Based on Evolutionary RelationshipsSYSTEMATICS: Based on Evolutionary Relationships
SYSTEMATICS: Based on Evolutionary Relationships
 
Unit 17b Domains and kingdoms
Unit 17b  Domains and kingdomsUnit 17b  Domains and kingdoms
Unit 17b Domains and kingdoms
 
Taxonomic order
Taxonomic orderTaxonomic order
Taxonomic order
 
History of classification
History of classificationHistory of classification
History of classification
 
Botanical nomenclature
Botanical nomenclatureBotanical nomenclature
Botanical nomenclature
 

En vedette (7)

Classification 1211248247479135 8
Classification 1211248247479135 8Classification 1211248247479135 8
Classification 1211248247479135 8
 
5.5 classification
5.5 classification5.5 classification
5.5 classification
 
Unit 8 Fourth Grade 2012 2013
Unit 8 Fourth Grade 2012 2013Unit 8 Fourth Grade 2012 2013
Unit 8 Fourth Grade 2012 2013
 
S v 20 rules
S v 20 rulesS v 20 rules
S v 20 rules
 
Taxonomic keys
Taxonomic keysTaxonomic keys
Taxonomic keys
 
Classification and Keys
Classification and KeysClassification and Keys
Classification and Keys
 
06 6 kingdoms and 3 domains
06   6 kingdoms and 3 domains06   6 kingdoms and 3 domains
06 6 kingdoms and 3 domains
 

Similaire à Tony Rees: Towards a Hierarchical Classification of All Life

Classification of living things
Classification of living thingsClassification of living things
Classification of living things
joygtablante
 
IRMNG presentation March 2012
IRMNG presentation March 2012IRMNG presentation March 2012
IRMNG presentation March 2012
Tony Rees
 
Introduction to Taxonomy, Components and Major Plant Taxonomist
Introduction to Taxonomy, Components and Major Plant TaxonomistIntroduction to Taxonomy, Components and Major Plant Taxonomist
Introduction to Taxonomy, Components and Major Plant Taxonomist
Krissa Gatan
 
The Living World
The Living WorldThe Living World
The Living World
shivrajrath
 
KOUSIK_GHOSHPhenetics and Cladistics2020-04-05Phenetics and Cladistics.pptx
KOUSIK_GHOSHPhenetics and Cladistics2020-04-05Phenetics and Cladistics.pptxKOUSIK_GHOSHPhenetics and Cladistics2020-04-05Phenetics and Cladistics.pptx
KOUSIK_GHOSHPhenetics and Cladistics2020-04-05Phenetics and Cladistics.pptx
PriyankaChakraborty95
 
230-Classification-Systematics animal phylogeny.ppt
230-Classification-Systematics animal phylogeny.ppt230-Classification-Systematics animal phylogeny.ppt
230-Classification-Systematics animal phylogeny.ppt
v3wfcbase
 
vdocument.in_taxonomy-taxonomy-science-of-classifying-living-things-based-on-...
vdocument.in_taxonomy-taxonomy-science-of-classifying-living-things-based-on-...vdocument.in_taxonomy-taxonomy-science-of-classifying-living-things-based-on-...
vdocument.in_taxonomy-taxonomy-science-of-classifying-living-things-based-on-...
RaniaElwatidy1
 
Classification of life taxonomy
Classification of life taxonomyClassification of life taxonomy
Classification of life taxonomy
tas11244
 
Classification
ClassificationClassification
Classification
ilanasaxe
 
Zoo Bank Talk Ms Ccourse09 Compressed Test
Zoo Bank Talk Ms Ccourse09 Compressed TestZoo Bank Talk Ms Ccourse09 Compressed Test
Zoo Bank Talk Ms Ccourse09 Compressed Test
ICZN
 

Similaire à Tony Rees: Towards a Hierarchical Classification of All Life (20)

Tony Rees: An All Genera Index
Tony Rees: An All Genera IndexTony Rees: An All Genera Index
Tony Rees: An All Genera Index
 
10 years of global biodiversity databases: are we there yet?
10 years of global biodiversity databases: are we there yet?10 years of global biodiversity databases: are we there yet?
10 years of global biodiversity databases: are we there yet?
 
Classification of living things
Classification of living thingsClassification of living things
Classification of living things
 
IRMNG presentation March 2012
IRMNG presentation March 2012IRMNG presentation March 2012
IRMNG presentation March 2012
 
Introduction to Taxonomy, Components and Major Plant Taxonomist
Introduction to Taxonomy, Components and Major Plant TaxonomistIntroduction to Taxonomy, Components and Major Plant Taxonomist
Introduction to Taxonomy, Components and Major Plant Taxonomist
 
Plant taxonomy
Plant taxonomyPlant taxonomy
Plant taxonomy
 
The Living World
The Living WorldThe Living World
The Living World
 
Remsen Lect04
Remsen Lect04Remsen Lect04
Remsen Lect04
 
KOUSIK_GHOSHPhenetics and Cladistics2020-04-05Phenetics and Cladistics.pptx
KOUSIK_GHOSHPhenetics and Cladistics2020-04-05Phenetics and Cladistics.pptxKOUSIK_GHOSHPhenetics and Cladistics2020-04-05Phenetics and Cladistics.pptx
KOUSIK_GHOSHPhenetics and Cladistics2020-04-05Phenetics and Cladistics.pptx
 
230-Classification-Systematics animal phylogeny.ppt
230-Classification-Systematics animal phylogeny.ppt230-Classification-Systematics animal phylogeny.ppt
230-Classification-Systematics animal phylogeny.ppt
 
Sharing information between projects
Sharing information between projectsSharing information between projects
Sharing information between projects
 
vdocument.in_taxonomy-taxonomy-science-of-classifying-living-things-based-on-...
vdocument.in_taxonomy-taxonomy-science-of-classifying-living-things-based-on-...vdocument.in_taxonomy-taxonomy-science-of-classifying-living-things-based-on-...
vdocument.in_taxonomy-taxonomy-science-of-classifying-living-things-based-on-...
 
Classification of life taxonomy
Classification of life taxonomyClassification of life taxonomy
Classification of life taxonomy
 
Tony Rees IRMNG 2015 presentation
Tony Rees IRMNG 2015 presentationTony Rees IRMNG 2015 presentation
Tony Rees IRMNG 2015 presentation
 
Classification
ClassificationClassification
Classification
 
Writing The Encyclopedia Of Life (not EoL.org)
Writing The Encyclopedia Of Life (not EoL.org)Writing The Encyclopedia Of Life (not EoL.org)
Writing The Encyclopedia Of Life (not EoL.org)
 
Chapter_2_Systematics.pptx
Chapter_2_Systematics.pptxChapter_2_Systematics.pptx
Chapter_2_Systematics.pptx
 
Class xi ch 1
Class xi ch 1Class xi ch 1
Class xi ch 1
 
Class xi ch 1
Class xi ch 1Class xi ch 1
Class xi ch 1
 
Zoo Bank Talk Ms Ccourse09 Compressed Test
Zoo Bank Talk Ms Ccourse09 Compressed TestZoo Bank Talk Ms Ccourse09 Compressed Test
Zoo Bank Talk Ms Ccourse09 Compressed Test
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Dernier (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 

Tony Rees: Towards a Hierarchical Classification of All Life

  • 1. Towards a Hierarchical Classification of All Life – the IRMNG data assembly project Tony Rees – CSIRO Marine and Atmospheric Research, Australia October 2011
  • 2. Why a hierarchical classification? Tony Rees: Hierarchical Classification of All Life
  • 3.
  • 4.
  • 5. What should “the system” ideally hold? – something like… Tony Rees: Hierarchical Classification of All Life (etc.)
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17. Perseverance produces the following (subset of genus table, 453k names as at Oct 2011): Tony Rees: Hierarchical Classification of All Life
  • 18. A glimpse of the IRMNG “master genus” table (currently 452,827 records) Tony Rees: Hierarchical Classification of All Life
  • 19. A glimpse of the IRMNG “master genus” table (currently 452,827 records) Tony Rees: Hierarchical Classification of All Life (Mabberley plant names list)
  • 20. Detail showing example source/s used Tony Rees: Hierarchical Classification of All Life
  • 21.
  • 22.
  • 23. Other services / products e.g. full hierarchical lists Tony Rees: Hierarchical Classification of All Life however with caveat: some / many genera may still be classified only at higher level (e.g. “Mammalia – unallocated”) at this time (more work to do).
  • 24. Check batches of entered names Tony Rees: Hierarchical Classification of All Life (1,406 genus names…)
  • 25. Check batches of entered names Tony Rees: Hierarchical Classification of All Life (start of IRMNG search result)
  • 26. Check batches of entered names Tony Rees: Hierarchical Classification of All Life
  • 27. Check batches of entered names Tony Rees: Hierarchical Classification of All Life ?
  • 28. Query by taxon name (correctly spelled or misspelled) Tony Rees: Hierarchical Classification of All Life
  • 29.
  • 30. Linking names with literature Tony Rees: Hierarchical Classification of All Life
  • 31.
  • 32. Expanded citation info in IRMNG - example Tony Rees: Hierarchical Classification of All Life
  • 33. Expanded citation info in IRMNG - example Tony Rees: Hierarchical Classification of All Life
  • 34. Expanded citation info in IRMNG - example Tony Rees: Hierarchical Classification of All Life
  • 35.
  • 36. IRMNG content – recent missing genera… Tony Rees: Hierarchical Classification of All Life
  • 37. IRMNG content – genus names published by year, 1995-current (as at Oct 2011), excluding virus names (which are undated) Tony Rees: Hierarchical Classification of All Life (NB could disaggregate further as desired, e.g. by detailed tax. group, or extant vs. fossil…) … also would expect a small number of residual names missed for ostensibly “complete” years presumed missing names
  • 38. IRMNG 2011 content cf. Cat. of Life 2011 Tony Rees: Hierarchical Classification of All Life Note, Chapman, 2009 estimates c.1.9m described extant species (see earlier slide) On that basis, CoL has 70% of valid extant species names, maybe 70% of valid extant genera (with subset of genus-level synonyms) IRMNG is missing est. 10k genera from 2004-2011 (from last slide), maybe further 2-3% overall (say 10k-15k), “complete” list would thus be ~475k at this time (increasing at ~2k/year). Cat. of Life - 2011 edition % with auth's IRMNG – Oct 2011 - extant + fossil % with auth's IRMNG – Oct 2011 - fossil only           Kingdoms 8   7   0 Phyla 111   153   12 Classes 288   509   64 Orders 1,233   2,645   715 Families 8,071 0% 19,639 22.1% 6,542 Subfamilies           Genera 178,515 0% 452,848 97.1% 90,278 Subgenera           Species (valid) 1,347,224 ~100% 1,020,519 ~100% 16,792 Species (synonyms) 895,441 ~100% 440,738 ~100% 100
  • 39.
  • 40.
  • 41.
  • 42. Thank you Thanks to: - OBIS, GBIF and Atlas of Living Australia for financial support, numerous data providers for data - CSIRO for salary and in-kind support, 2006-present - D. Patterson / MBL / NSF (this trip funding + hosting) Tony Rees: Hierarchical Classification of All Life Contact details Phone: +61 3 6232 5318 Email: Tony.Rees@csiro.au Web: www.cmar.csiro.au/datacentre/
  • 43. Supplementary slides Tony Rees: Hierarchical Classification of All Life
  • 44.
  • 45. New names: potential discovery paths Tony Rees: Hierarchical Classification of All Life new virus names new prokaryote names new botanical names – algae & fungi (except fossils) new botanical names – bryophytes through angiosperms (except fossils) new zoological names publication discovery official registers taxon-specific DB’s integrated DB’s “ all names” Botany Zoology Newly published names – primary literature (print, electronic) ICTV Viruses DB LPSN (Prokaryote names) ICBN Decisions ICZN Decisions Journal TOC’s, RSS feeds, text mining Abstracting services Subject bibliographies Reviews, secondary literature Zoological Record ION (Index of Organism Names) ChecklistBank GNI GNUB ZooBank? Catalogue of Life annual editions ITIS NCBI Taxonomy WoRMS etc. CyanoDB Index Fungorum MycoBank AlgaeBase Plant GSD’s PaleoDB Animal GSD’s other compilations e.g. regional lists, Wikispecies, Wikipedia, more… IRMNG
  • 46. New names: potential discovery paths Tony Rees: Hierarchical Classification of All Life new virus names new prokaryote names new botanical names – algae & fungi (except fossils) new botanical names – bryophytes through angiosperms (except fossils) new zoological names publication discovery official registers taxon-specific DB’s integrated DB’s “ all names” Botany Zoology Newly published names – primary literature (print, electronic) ICTV Viruses DB LPSN (Prokaryote names) ICBN Decisions ICZN Decisions Journal TOC’s, RSS feeds, text mining Abstracting services Subject bibliographies Reviews, secondary literature Zoological Record ION (Index of Organism Names) ChecklistBank GNI GNUB ZooBank? Catalogue of Life annual editions ITIS NCBI Taxonomy WoRMS etc. CyanoDB Index Fungorum MycoBank AlgaeBase Plant GSD’s PaleoDB Animal GSD’s other compilations e.g. regional lists, Wikispecies, Wikipedia, more… IRMNG Lots of manual effort
  • 47. New names: potential discovery paths Tony Rees: Hierarchical Classification of All Life new virus names new prokaryote names new botanical names – algae & fungi (except fossils) new botanical names – bryophytes through angiosperms (except fossils) new zoological names publication discovery official registers taxon-specific DB’s integrated DB’s “ all names” Botany Zoology Newly published names – primary literature (print, electronic) ICTV Viruses DB LPSN (Prokaryote names) ICBN Decisions ICZN Decisions Journal TOC’s, RSS feeds, text mining Abstracting services Subject bibliographies Reviews, secondary literature Zoological Record ION (Index of Organism Names) ChecklistBank GNI GNUB ZooBank? Catalogue of Life annual editions ITIS NCBI Taxonomy WoRMS etc. CyanoDB Index Fungorum MycoBank AlgaeBase Plant GSD’s PaleoDB Animal GSD’s other compilations e.g. regional lists, Wikispecies, Wikipedia, more… IRMNG Lots of automated feeds + expert curation
  • 48. New names: potential discovery paths Tony Rees: Hierarchical Classification of All Life new virus names new prokaryote names new botanical names – algae & fungi (except fossils) new botanical names – bryophytes through angiosperms (except fossils) new zoological names publication discovery official registers taxon-specific DB’s integrated DB’s “ all names” Botany Zoology Newly published names – primary literature (print, electronic) ICTV Viruses DB LPSN (Prokaryote names) ICBN Decisions ICZN Decisions Journal TOC’s, RSS feeds, text mining Abstracting services Subject bibliographies Reviews, secondary literature Zoological Record ION (Index of Organism Names) ChecklistBank GNI GNUB ZooBank? Catalogue of Life annual editions ITIS NCBI Taxonomy WoRMS etc. CyanoDB Index Fungorum MycoBank AlgaeBase Plant GSD’s PaleoDB Animal GSD’s other compilations e.g. regional lists, Wikispecies, Wikipedia, more… IRMNG Lots of automated feeds + expert curation Lots of useful services
  • 49. How many taxa? Tony Rees: Hierarchical Classification of All Life valid extant + fossil taxa (est.) How many species? estimates according to Chapman, 2009 (valid, extant taxa only); “others” comprise c. 54k protists, 10k prokaryotes, 2k viruses NB inverts. includes “~1,000,000” for Insects – probably +/- 60k Fossil species – no published estimates – maybe 500k names, 300k valid 2+ million ~250k ~10k ~2k Kingdoms (5/6/7/8) ~400 ~140 Phyla Classes Orders Families Genera Species
  • 50. Relevant information domain: all life Tony Rees: Hierarchical Classification of All Life PROTISTS Fig. i-1 in Margulis & Schwartz, 1998
  • 51. How many kingdoms… Tony Rees: Hierarchical Classification of All Life PROTISTS Fig. i-1 in Margulis & Schwartz, 1998 7 kingdoms (5 in Margulis & Schwartz, 8 in Cat. of Life…): Animals, Fungi, Plants : 3 kingdoms Protists : 1 (or 2 if Stramenopiles [Heterokonts] recognized, = Cavalier-Smith’s Kingdom “Chromista”) Bacteria + Archaea : 2 (=1 in Margulis & Schwartz) Viruses : 1 (not in Margulis & Schwartz)
  • 52. Nomenclature governed by four separate Codes , i.e. Zoological, Botanical, Bacteriological, Viruses Tony Rees: Hierarchical Classification of All Life PROTISTS Zoo. Code Bact. Code Bot. Code Vir. Code: viruses (not shown) Fig. i-1 in Margulis & Schwartz, 1998
  • 53.
  • 54. Parker, 1982 content example Tony Rees: Hierarchical Classification of All Life
  • 55. Benton, 1993 content example Tony Rees: Hierarchical Classification of All Life
  • 56. Rees TAXAMATCH fuzzy matching poster (start) Tony Rees: Hierarchical Classification of All Life
  • 57. Schematic of TAXAMATCH operation Tony Rees: Hierarchical Classification of All Life