SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
Interoperability of Taxon 
Treatments 
Donat Agosti 
Plazi 
Brussels, June 2, 2014 
Supported by the European Commission through its FP7 research funding programme
The big question 
What is the future of the biological world? 
Imagine if we could: 
…Predict community level dynamics of ecosystems at 
scales from local to global, based on the ecology and 
biology of all individual organisms 
Harfoot, BIH2013, Rome, 2013 
Hardisty, Nature 502, 171 (2013) 
BUT: predictive ecology has substantial data needs
Biodiversity libraries 
200,000,000+ printed pages 
1,900,000 species described 
20,000,000+ species treatments 
17,000 new species per year 
BUT: The data are hidden 
Incomplete digitization 
Publications are not 
semantically enhanced 
Collections are incomplete 
Data is not linked 
Most data are not open
Interoperability of taxa 
Can we build a system (e.g. Open Biodiversity Knowledge 
Management System) that includes a component that extracts, 
stores and serves and serves information on taxa in a system that 
is agnostic of Biota? 
Traditionally Floras, Faunas, Mycotas are dealt with by different communities
Pro‐iBiosphere project is to develop a blue print of an Open 
Knowledge Management System 
It is not building a system 
Pilots to demonstrate specific issues 
interoperability of taxa 
explore workflows to produce recommendations of «best» 
practices 
interoperability of infrastructures 
registration of names 
advanced publishing 
Do not expect production level products
Treatment 
Formica obsoleta Linnaeus, 1758: 580 
Each taxonomic name usage has it’s treatment
Treatment as standard containers 
http://en.wikipedia.org
Pilot 1: Taxa used for markup 
Taxa Documents Treatments 
Mistletoes 3 124 
Chenopodium 15 174 
Fungi 5 5 
Bryophyta 2 25 
Nephrolepis 1 35 
Centipedes 50 154 
Ants 40 486 
Spiders 30 219 
TOTAL ca. 140 ca. 1500
Chenopodium pilot
Spider pilot: machine access to content through markup 
Pardosa logunovi
Spider pilot: overview of 34 OA Zootaxa publications 
5170 specimens 
4062 plottable specimens from 
1138 unique locations
melanoceras 
chiapensis 
cookii 
sphaerocephala 
allenii 
collinsii 
ruddiae 
cornigera 
globulifera 
hindsii 
janzenii 
mayana 
boopis 
Pseudomyrmex ants and Vachellia ant‐acacias 
are a classic example of mutualism in biology. 
hesperius 
flavicornis 
Treatment: redescription 
opaciceps 
ita 
janzeni 
kuenckeli 
mixtecus 
nigrocinctus 
nigropilosus 
particeps 
peperi 
reconditus 
satanicus 
simulans 
spinicola 
subtilissimus 
veneficus 
ferrugineus 
gentlei 
gracilis 
Transbiotic link network 
Associated species linked through 
references in taxonomic treatments 
Acacia‐ant species: Pseudomyrmex gracili 
Treatment: original description 
Associated ant‐acacia: Acacia gentlei 
Ants Plants 
Photocredits: Alex Wild 
Treatment 
Treatments linked 
through citations 
Transbiotic interoperability
Pro‐iBiosphere 
1,000 treatements 
Plazi 
10,000 treatments 
Pensoft 
23,000 
Total 
34,000 treatments 
Legacy 
literature 
Prospective 
literature
0°
All data in Plazi 
14,590 specimens 
8900 plottable specimens from 
1138 unique locations
Brazil 
5170 specimens 
4062 plottable specimens from 
1138 unique locations
Brasil
Journal of Hymenoptera Research 
5170 specimens 
4062 plottable specimens from 
1138 unique locations
Interoperability of taxa 
Can we build a system (e.g. Open Biodiversity Knowledge 
Management System) that includes a component that extracts, 
stores and serves and serves information on taxa in a system that 
is agnostic of Biota? 
Yes, we can.
Isssues and Recommendations 
Legacy Prospective 
Digitization √ 
OCR / Text capture √ 
Markup √ (√) 
Standardization √ √ 
Strategies to markup √ 
External links √ (√) 
Semantic 
√ (√) 
enhancment 
Create content √ (√)
Plazi 
SRS 
Digitization and Markup Workflow: 
$$$$ ? 
find scan «OCR» markup store 
? 
domain generic domain 
Find the right mix of generic and domain specific solutions
Create Content: selection strategy 
200,000 Taxonomic Articles in Zoological Record Since 1864
Markup / data extraction strategies 
Dedicated external services, bulk 
Applications for individual contributor, small scale 
Involve community / crowd / wikimedia 
Ad hoc Web Services, individual 
Mixed strategies 
Combination with re‐publishing, small scale 
Create market for treatments, large scale
Variation in status labels 
Quality Control and Standardization 
TaxStatus ctd. Total ctd 
REVISED STATUS 10 
s. str. 1 
sp. n. 130 
sp. nov. 4057 
sp.n. 3 
spec. nov. 34 
stat. nov. 56 
Status revised 9 
subsp. nov. 26 
var. nov. 80 
(blank) 
Grand Total 5965 
TaxStatus Total 
comb. nov. 246 
G. N. 65 
gen. nov. 19 
gen.nov. 10 
hybr. nov. «sp.nov.» 
13 
n sp 12 
n. comb. 2 
n. nom. 6 
n. sp. 267 
n. stat. 5 
n. subg. 3 
new combination 139 
new species 651 
NEW STATUS 114 
nomen novum 6 
nov. spec. 1 
Standardize and apply in prospective publishing …
Standardization of markup 
Formica rufa Linnaeus 1758: 426 
Genus name year of pub. 
Species 
epithet page of 
publicat 
Name 
Authority 
Bibliographic reference 
Treatment citation
Linking of treatment as an example for external links 
Treatment 
citation 
Treatment 
identifier
Conclusions 
• Biodiversity literature is very rich in data 
• BL has a basic structure (treatments) across all Biota 
• Legacy literature should be strategically marked up 
• Prospective literature should be semantically enhanced 
• Markup tools exist and should be optimized 
• Identifiers for treatments exist to link to treatments
Thank you very much! 
Donat Agosti 
Plazi 
agosti@plazi.org

Contenu connexe

Tendances

Biological databases: Challenges in organization and usability
Biological databases: Challenges in organization and usabilityBiological databases: Challenges in organization and usability
Biological databases: Challenges in organization and usability
Lars Juhl Jensen
 
Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02
Sreekanth Gali
 
Bioinformatics Databases
Bioinformatics DatabasesBioinformatics Databases
Bioinformatics Databases
cschlos2
 
Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...
nolmar01
 
databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformatics
nadeem akhter
 

Tendances (12)

Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databases
 
Biological databases: Challenges in organization and usability
Biological databases: Challenges in organization and usabilityBiological databases: Challenges in organization and usability
Biological databases: Challenges in organization and usability
 
Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02
 
pro-iBiosphere Towards Open Biodiversity Knowledge COOPEUS 2013
pro-iBiosphere Towards Open Biodiversity Knowledge COOPEUS 2013pro-iBiosphere Towards Open Biodiversity Knowledge COOPEUS 2013
pro-iBiosphere Towards Open Biodiversity Knowledge COOPEUS 2013
 
Bioinformatics Databases
Bioinformatics DatabasesBioinformatics Databases
Bioinformatics Databases
 
Darwin Core extension for genebanks (germplasm), at Kansas University (May 2012)
Darwin Core extension for genebanks (germplasm), at Kansas University (May 2012)Darwin Core extension for genebanks (germplasm), at Kansas University (May 2012)
Darwin Core extension for genebanks (germplasm), at Kansas University (May 2012)
 
Gen bank
Gen bankGen bank
Gen bank
 
NCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesNCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners Slides
 
B.sc biochem i bobi u 2 database
B.sc biochem i bobi u 2 databaseB.sc biochem i bobi u 2 database
B.sc biochem i bobi u 2 database
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...
 
databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformatics
 

En vedette

Amerikiin emgenelt yavdal 2
Amerikiin emgenelt yavdal 2Amerikiin emgenelt yavdal 2
Amerikiin emgenelt yavdal 2
bayarankh
 
Sou Crente... E Agora, o Que Eu Faço?
Sou Crente... E Agora, o Que Eu Faço?Sou Crente... E Agora, o Que Eu Faço?
Sou Crente... E Agora, o Que Eu Faço?
Jonas Martins Olímpio
 
Virtuelle Techniken in textilen Anwendungen: VDC-Whitepaper
Virtuelle Techniken in textilen Anwendungen: VDC-WhitepaperVirtuelle Techniken in textilen Anwendungen: VDC-Whitepaper
Virtuelle Techniken in textilen Anwendungen: VDC-Whitepaper
Virtual Dimension Center (VDC) Fellbach
 
Das antiquarium, diemer
Das antiquarium, diemerDas antiquarium, diemer
Das antiquarium, diemer
3153657
 
Presentacion carrera derecho
Presentacion carrera derechoPresentacion carrera derecho
Presentacion carrera derecho
ortizcarlos99
 
Referencial curricular nacional para ed. infantil vol 1
Referencial curricular nacional para ed. infantil vol 1Referencial curricular nacional para ed. infantil vol 1
Referencial curricular nacional para ed. infantil vol 1
Maria Galdino
 

En vedette (17)

Elementos de automatización y control sesion 2 para ingeniería electromecánica
Elementos de automatización y control sesion 2 para ingeniería electromecánicaElementos de automatización y control sesion 2 para ingeniería electromecánica
Elementos de automatización y control sesion 2 para ingeniería electromecánica
 
Amerikiin emgenelt yavdal 2
Amerikiin emgenelt yavdal 2Amerikiin emgenelt yavdal 2
Amerikiin emgenelt yavdal 2
 
Trabajo final proceso
Trabajo final procesoTrabajo final proceso
Trabajo final proceso
 
Aravindhmc-cv
Aravindhmc-cvAravindhmc-cv
Aravindhmc-cv
 
CyberquêTe Ii
CyberquêTe IiCyberquêTe Ii
CyberquêTe Ii
 
Manual comenius
Manual comeniusManual comenius
Manual comenius
 
Sou Crente... E Agora, o Que Eu Faço?
Sou Crente... E Agora, o Que Eu Faço?Sou Crente... E Agora, o Que Eu Faço?
Sou Crente... E Agora, o Que Eu Faço?
 
Measuring IPv6 Adoption
Measuring IPv6 AdoptionMeasuring IPv6 Adoption
Measuring IPv6 Adoption
 
Virtuelle Techniken in textilen Anwendungen: VDC-Whitepaper
Virtuelle Techniken in textilen Anwendungen: VDC-WhitepaperVirtuelle Techniken in textilen Anwendungen: VDC-Whitepaper
Virtuelle Techniken in textilen Anwendungen: VDC-Whitepaper
 
Das antiquarium, diemer
Das antiquarium, diemerDas antiquarium, diemer
Das antiquarium, diemer
 
Presentacion carrera derecho
Presentacion carrera derechoPresentacion carrera derecho
Presentacion carrera derecho
 
Referencial curricular nacional para ed. infantil vol 1
Referencial curricular nacional para ed. infantil vol 1Referencial curricular nacional para ed. infantil vol 1
Referencial curricular nacional para ed. infantil vol 1
 
Interruptores magnetotérmicos interruptores diferenciales
Interruptores magnetotérmicos   interruptores diferencialesInterruptores magnetotérmicos   interruptores diferenciales
Interruptores magnetotérmicos interruptores diferenciales
 
ADMINISTRACION DE VENTAS 1
ADMINISTRACION DE VENTAS 1ADMINISTRACION DE VENTAS 1
ADMINISTRACION DE VENTAS 1
 
Elementos De Protección Y Comando
Elementos De Protección Y ComandoElementos De Protección Y Comando
Elementos De Protección Y Comando
 
Bienvenue dans l'ère du vieillissement
Bienvenue dans l'ère du vieillissementBienvenue dans l'ère du vieillissement
Bienvenue dans l'ère du vieillissement
 
Control de-motores-electricos
Control de-motores-electricosControl de-motores-electricos
Control de-motores-electricos
 

Similaire à 2 donat agosti-1

Setting the Scene for ViBRANT – Strategy, Philosophy and Communication
Setting the Scene for ViBRANT – Strategy, Philosophy and CommunicationSetting the Scene for ViBRANT – Strategy, Philosophy and Communication
Setting the Scene for ViBRANT – Strategy, Philosophy and Communication
vbrant
 
DNA Bar-code to Distinguish the Species
DNA Bar-code to Distinguish the SpeciesDNA Bar-code to Distinguish the Species
DNA Bar-code to Distinguish the Species
Roya Shariati
 
20140327 rda plazi_final
20140327 rda plazi_final20140327 rda plazi_final
20140327 rda plazi_final
agosti
 

Similaire à 2 donat agosti-1 (20)

Nothing in taxonomy makes sense except in the light of Open Access
Nothing in taxonomy makes sense except in the light of Open Access Nothing in taxonomy makes sense except in the light of Open Access
Nothing in taxonomy makes sense except in the light of Open Access
 
BioDIP - a proposed infrastructure to link the taxonomic to the genomic and o...
BioDIP - a proposed infrastructure to link the taxonomic to the genomic and o...BioDIP - a proposed infrastructure to link the taxonomic to the genomic and o...
BioDIP - a proposed infrastructure to link the taxonomic to the genomic and o...
 
A Step Towards (From) Read to Write Access to Taxonomic Publications
A Step Towards  (From) Read to Write Access to Taxonomic PublicationsA Step Towards  (From) Read to Write Access to Taxonomic Publications
A Step Towards (From) Read to Write Access to Taxonomic Publications
 
20140317 pi b_nmbe_journal_club
20140317 pi b_nmbe_journal_club20140317 pi b_nmbe_journal_club
20140317 pi b_nmbe_journal_club
 
Setting the Scene for ViBRANT – Strategy, Philosophy and Communication
Setting the Scene for ViBRANT – Strategy, Philosophy and CommunicationSetting the Scene for ViBRANT – Strategy, Philosophy and Communication
Setting the Scene for ViBRANT – Strategy, Philosophy and Communication
 
20110122 vibrant final
20110122 vibrant final20110122 vibrant final
20110122 vibrant final
 
Agosti 20140813 icd8_agosti_global_dipterology-2
Agosti 20140813 icd8_agosti_global_dipterology-2Agosti 20140813 icd8_agosti_global_dipterology-2
Agosti 20140813 icd8_agosti_global_dipterology-2
 
20140623 swets agosti_final
20140623 swets agosti_final20140623 swets agosti_final
20140623 swets agosti_final
 
Global patterns of insect diiversity, distribution and evolutionary distinctness
Global patterns of insect diiversity, distribution and evolutionary distinctnessGlobal patterns of insect diiversity, distribution and evolutionary distinctness
Global patterns of insect diiversity, distribution and evolutionary distinctness
 
A Logical Model for Taxonomic Concepts for Expanding Knowledge using Linked O...
A Logical Model for Taxonomic Concepts for Expanding Knowledge using Linked O...A Logical Model for Taxonomic Concepts for Expanding Knowledge using Linked O...
A Logical Model for Taxonomic Concepts for Expanding Knowledge using Linked O...
 
Text-mining and ontologies - new approaches to knowledge discovery of microbi...
Text-mining and ontologies - new approaches to knowledge discovery of microbi...Text-mining and ontologies - new approaches to knowledge discovery of microbi...
Text-mining and ontologies - new approaches to knowledge discovery of microbi...
 
Molecular Systematics and Biodiversity
Molecular Systematics and BiodiversityMolecular Systematics and Biodiversity
Molecular Systematics and Biodiversity
 
FAIR data requires FAIR ontologies, how do we do?
FAIR data requires FAIR ontologies, how do we do?FAIR data requires FAIR ontologies, how do we do?
FAIR data requires FAIR ontologies, how do we do?
 
DNA Bar-code to Distinguish the Species
DNA Bar-code to Distinguish the SpeciesDNA Bar-code to Distinguish the Species
DNA Bar-code to Distinguish the Species
 
20140327 rda plazi_final
20140327 rda plazi_final20140327 rda plazi_final
20140327 rda plazi_final
 
Plant Pathology Seminar
Plant Pathology SeminarPlant Pathology Seminar
Plant Pathology Seminar
 
David Cooke wp1 14 Nov 19
David Cooke wp1 14 Nov 19David Cooke wp1 14 Nov 19
David Cooke wp1 14 Nov 19
 
Visualizing Primary Data form Taxonomic Literature
Visualizing Primary Data form Taxonomic LiteratureVisualizing Primary Data form Taxonomic Literature
Visualizing Primary Data form Taxonomic Literature
 
David cooke wp1 13 Nov 19
David cooke wp1 13 Nov 19David cooke wp1 13 Nov 19
David cooke wp1 13 Nov 19
 
Rapid Impact Assessment of Climatic and Physio-graphic Changes on Flagship G...
Rapid Impact Assessment of Climatic and Physio-graphic Changes  on Flagship G...Rapid Impact Assessment of Climatic and Physio-graphic Changes  on Flagship G...
Rapid Impact Assessment of Climatic and Physio-graphic Changes on Flagship G...
 

Plus de agosti

Plus de agosti (15)

DOI and the Mitteilungen: communicating scientific results in the future
DOI and the Mitteilungen: communicating scientific results in the futureDOI and the Mitteilungen: communicating scientific results in the future
DOI and the Mitteilungen: communicating scientific results in the future
 
Data Sharing Principles and Legal Interoperability for Essential Biodiversity...
Data Sharing Principles and Legal Interoperability for Essential Biodiversity...Data Sharing Principles and Legal Interoperability for Essential Biodiversity...
Data Sharing Principles and Legal Interoperability for Essential Biodiversity...
 
Revolutionizing the Research on Ants through new Methods and Technologies: th...
Revolutionizing the Research on Ants through new Methods and Technologies: th...Revolutionizing the Research on Ants through new Methods and Technologies: th...
Revolutionizing the Research on Ants through new Methods and Technologies: th...
 
Open Research Data: Taxonomy
Open Research Data: TaxonomyOpen Research Data: Taxonomy
Open Research Data: Taxonomy
 
20150701 opendata bern_agosti_2
20150701 opendata bern_agosti_220150701 opendata bern_agosti_2
20150701 opendata bern_agosti_2
 
Plazi or the challenge to free biodiversity data caught in hundreds of millio...
Plazi or the challenge to free biodiversity data caught in hundreds of millio...Plazi or the challenge to free biodiversity data caught in hundreds of millio...
Plazi or the challenge to free biodiversity data caught in hundreds of millio...
 
20141027 bouchout declaration
20141027 bouchout declaration20141027 bouchout declaration
20141027 bouchout declaration
 
20140924 rda _bouchout
20140924 rda _bouchout20140924 rda _bouchout
20140924 rda _bouchout
 
20140922 rda codata_legal_ig_plazi_final
20140922 rda codata_legal_ig_plazi_final20140922 rda codata_legal_ig_plazi_final
20140922 rda codata_legal_ig_plazi_final
 
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
 
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
 
20140523 swiss curators_bouchout_2
20140523 swiss curators_bouchout_220140523 swiss curators_bouchout_2
20140523 swiss curators_bouchout_2
 
20110725 ibc xml
20110725 ibc xml20110725 ibc xml
20110725 ibc xml
 
20110222 behesty monitoring and measuring biodiversity
20110222 behesty monitoring and measuring biodiversity20110222 behesty monitoring and measuring biodiversity
20110222 behesty monitoring and measuring biodiversity
 
20090921 Art Databanken Agosti Final
20090921 Art Databanken Agosti Final20090921 Art Databanken Agosti Final
20090921 Art Databanken Agosti Final
 

Dernier

Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 

Dernier (20)

Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to Viruses
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfchemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdf
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 

2 donat agosti-1

  • 1. Interoperability of Taxon Treatments Donat Agosti Plazi Brussels, June 2, 2014 Supported by the European Commission through its FP7 research funding programme
  • 2. The big question What is the future of the biological world? Imagine if we could: …Predict community level dynamics of ecosystems at scales from local to global, based on the ecology and biology of all individual organisms Harfoot, BIH2013, Rome, 2013 Hardisty, Nature 502, 171 (2013) BUT: predictive ecology has substantial data needs
  • 3. Biodiversity libraries 200,000,000+ printed pages 1,900,000 species described 20,000,000+ species treatments 17,000 new species per year BUT: The data are hidden Incomplete digitization Publications are not semantically enhanced Collections are incomplete Data is not linked Most data are not open
  • 4. Interoperability of taxa Can we build a system (e.g. Open Biodiversity Knowledge Management System) that includes a component that extracts, stores and serves and serves information on taxa in a system that is agnostic of Biota? Traditionally Floras, Faunas, Mycotas are dealt with by different communities
  • 5. Pro‐iBiosphere project is to develop a blue print of an Open Knowledge Management System It is not building a system Pilots to demonstrate specific issues interoperability of taxa explore workflows to produce recommendations of «best» practices interoperability of infrastructures registration of names advanced publishing Do not expect production level products
  • 6. Treatment Formica obsoleta Linnaeus, 1758: 580 Each taxonomic name usage has it’s treatment
  • 7. Treatment as standard containers http://en.wikipedia.org
  • 8.
  • 9. Pilot 1: Taxa used for markup Taxa Documents Treatments Mistletoes 3 124 Chenopodium 15 174 Fungi 5 5 Bryophyta 2 25 Nephrolepis 1 35 Centipedes 50 154 Ants 40 486 Spiders 30 219 TOTAL ca. 140 ca. 1500
  • 11. Spider pilot: machine access to content through markup Pardosa logunovi
  • 12. Spider pilot: overview of 34 OA Zootaxa publications 5170 specimens 4062 plottable specimens from 1138 unique locations
  • 13. melanoceras chiapensis cookii sphaerocephala allenii collinsii ruddiae cornigera globulifera hindsii janzenii mayana boopis Pseudomyrmex ants and Vachellia ant‐acacias are a classic example of mutualism in biology. hesperius flavicornis Treatment: redescription opaciceps ita janzeni kuenckeli mixtecus nigrocinctus nigropilosus particeps peperi reconditus satanicus simulans spinicola subtilissimus veneficus ferrugineus gentlei gracilis Transbiotic link network Associated species linked through references in taxonomic treatments Acacia‐ant species: Pseudomyrmex gracili Treatment: original description Associated ant‐acacia: Acacia gentlei Ants Plants Photocredits: Alex Wild Treatment Treatments linked through citations Transbiotic interoperability
  • 14.
  • 15. Pro‐iBiosphere 1,000 treatements Plazi 10,000 treatments Pensoft 23,000 Total 34,000 treatments Legacy literature Prospective literature
  • 16.
  • 17. All data in Plazi 14,590 specimens 8900 plottable specimens from 1138 unique locations
  • 18. Brazil 5170 specimens 4062 plottable specimens from 1138 unique locations
  • 20. Journal of Hymenoptera Research 5170 specimens 4062 plottable specimens from 1138 unique locations
  • 21. Interoperability of taxa Can we build a system (e.g. Open Biodiversity Knowledge Management System) that includes a component that extracts, stores and serves and serves information on taxa in a system that is agnostic of Biota? Yes, we can.
  • 22. Isssues and Recommendations Legacy Prospective Digitization √ OCR / Text capture √ Markup √ (√) Standardization √ √ Strategies to markup √ External links √ (√) Semantic √ (√) enhancment Create content √ (√)
  • 23. Plazi SRS Digitization and Markup Workflow: $$$$ ? find scan «OCR» markup store ? domain generic domain Find the right mix of generic and domain specific solutions
  • 24. Create Content: selection strategy 200,000 Taxonomic Articles in Zoological Record Since 1864
  • 25. Markup / data extraction strategies Dedicated external services, bulk Applications for individual contributor, small scale Involve community / crowd / wikimedia Ad hoc Web Services, individual Mixed strategies Combination with re‐publishing, small scale Create market for treatments, large scale
  • 26. Variation in status labels Quality Control and Standardization TaxStatus ctd. Total ctd REVISED STATUS 10 s. str. 1 sp. n. 130 sp. nov. 4057 sp.n. 3 spec. nov. 34 stat. nov. 56 Status revised 9 subsp. nov. 26 var. nov. 80 (blank) Grand Total 5965 TaxStatus Total comb. nov. 246 G. N. 65 gen. nov. 19 gen.nov. 10 hybr. nov. «sp.nov.» 13 n sp 12 n. comb. 2 n. nom. 6 n. sp. 267 n. stat. 5 n. subg. 3 new combination 139 new species 651 NEW STATUS 114 nomen novum 6 nov. spec. 1 Standardize and apply in prospective publishing …
  • 27. Standardization of markup Formica rufa Linnaeus 1758: 426 Genus name year of pub. Species epithet page of publicat Name Authority Bibliographic reference Treatment citation
  • 28. Linking of treatment as an example for external links Treatment citation Treatment identifier
  • 29. Conclusions • Biodiversity literature is very rich in data • BL has a basic structure (treatments) across all Biota • Legacy literature should be strategically marked up • Prospective literature should be semantically enhanced • Markup tools exist and should be optimized • Identifiers for treatments exist to link to treatments
  • 30. Thank you very much! Donat Agosti Plazi agosti@plazi.org