Rod Page @rdmpage
http://iphylo.blogspot.com
Knowledge graphs
Holly Bik @hollybik
Let’s rise up to unite taxonomy and technology
10.1371/journal.pbio.2002231
http://ispecies.org
Simple Javascript mashup
DBpedia
GBIF
CrossRef
EOL
Open Tree of Life
TreeBASE
https://doi.org/10.7717/peerj.190
The Semantic web:
“The future of the web…
and always will be” –
Peter Norvig (Google)
Obstacles to building knowledge graphs
•Technical
•Social
Obstacles to building knowledge graphs
• Need globally unique, persistent identifiers
(how to label the nodes of the graph)
• Need to create and agree on vocabularies
(how to label the edges of the graph)
• Need to agree how to transmit the graph
• Who stores the global graph?
A new hope
• The identifier wars are (nearly) over (DOIs FTW)
• Lots of domain-specific vocabularies, but
schema.org is “good enough” for most things
• XML becoming a bedtime story to frighten the
children, JSON is everywhere (JSON-LD FTW).
• Wikidata
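A minimal sketch of what "DOIs + schema.org + JSON-LD" can look like for a single node of the graph. The article, DOI and author are the ones cited at the start of this talk; the helper name `article_jsonld` and the choice of the schema.org type `ScholarlyArticle` are assumptions made for illustration.

```python
import json


def article_jsonld(doi: str, title: str, author_name: str) -> dict:
    """Describe one article as schema.org JSON-LD, using its DOI URL as the node identifier."""
    return {
        "@context": "https://schema.org",
        "@id": f"https://doi.org/{doi}",   # globally unique, persistent identifier (labels the node)
        "@type": "ScholarlyArticle",       # vocabulary term from schema.org
        "name": title,
        "author": {"@type": "Person", "name": author_name},  # an edge labelled "author"
    }


doc = article_jsonld(
    "10.1371/journal.pbio.2002231",
    "Let's rise up to unite taxonomy and technology",
    "Holly Bik",
)
print(json.dumps(doc, indent=2))
```

The identifier answers "how to label the nodes", the schema.org terms answer "how to label the edges", and the JSON serialisation answers "how to transmit the graph".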
Obstacles to building knowledge graphs
•Technical
•Social Economic
Identifiers, identifiers, identifiers, identifiers
How do we measure progress?
before
now
now
before
Linear growth (easy) Connectivity (hard)
Need network effects
One is useless Two is “meh” Many is better
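One possible way to put a number on "connectivity" rather than raw growth, offered as an assumption and not as the measure used in this talk: the fraction of nodes that sit in the largest connected component. The function name and the toy node labels are hypothetical.

```python
from collections import defaultdict


def largest_component_fraction(nodes, edges):
    """Fraction of nodes in the largest connected component (edges treated as undirected).

    Assumes every edge endpoint also appears in `nodes`.
    """
    neighbours = defaultdict(set)
    for a, b in edges:
        neighbours[a].add(b)
        neighbours[b].add(a)

    seen = set()
    best = 0
    for start in nodes:
        if start in seen:
            continue
        # Depth-first walk over one component, counting its members.
        stack = [start]
        seen.add(start)
        size = 0
        while stack:
            node = stack.pop()
            size += 1
            for nxt in neighbours[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        best = max(best, size)
    return best / len(nodes) if nodes else 0.0


nodes = ["paper", "author", "name", "specimen"]
before = largest_component_fraction(nodes, [])
now = largest_component_fraction(nodes, [("paper", "author"), ("paper", "name")])
print(before, now)
```

Adding records grows `len(nodes)` linearly; only adding links between them moves this fraction towards 1.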
The Semantic web:
“The future of the web…
and always will be” –
Peter Norvig (Google)
The knowledge graph is
already here (it’s just
not evenly distributed)
William Gibson @GreatDismal
Google’s Knowledge Graph
PREFIX wdt: <http://www.wikidata.org/prop/direct/>
PREFIX wd: <http://www.wikidata.org/entity/>
SELECT ?root_name ?parent_name ?child_name WHERE
{
VALUES ?root_name {"Hominini"}
?root wdt:P225 ?root_name .
?child wdt:P171+ ?root .
?child wdt:P171 ?parent .
?child wdt:P225 ?child_name .
?parent wdt:P225 ?parent_name .
}
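The key step in the query is the property path `wdt:P171+`: follow "parent taxon" one or more times until the root is reached, then report each child with its direct parent. A rough in-memory sketch of the same traversal follows. The `toy` classification is simplified illustrative data, not Wikidata's actual records, and the function name `descendants` is hypothetical.

```python
def descendants(parent_of: dict, root: str) -> list:
    """Return (parent, child) rows for every taxon below `root`.

    Mirrors `?child P171+ ?root . ?child P171 ?parent` on a child -> parent map.
    Assumes the map is a tree (no cycles).
    """
    # Invert the child -> parent map so we can walk downwards from the root.
    children = {}
    for child, parent in parent_of.items():
        children.setdefault(parent, []).append(child)

    rows = []
    queue = [root]
    while queue:
        parent = queue.pop(0)
        for child in sorted(children.get(parent, [])):
            rows.append((parent, child))
            queue.append(child)
    return rows


# Simplified toy data: child name -> parent name.
toy = {
    "Homo": "Hominini",
    "Pan": "Hominini",
    "Homo sapiens": "Homo",
    "Pan paniscus": "Pan",
    "Pan troglodytes": "Pan",
}

for parent_name, child_name in descendants(toy, "Hominini"):
    print(parent_name, "->", child_name)
```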
http://biohackathon.org/d3sparql/
Toshiaki Katayama @tktym
http://iphylo.blogspot.ca/2017/01/displaying-taxonomic-classifications.html
“Citations for the sum of
human knowledge”
WikiCite @WikiCite
Goal 1: Every citation in the Wikipedias should be in Wikidata
Goal 2: Every citation should be in Wikidata (!?)
Small knowledge graphs (hexastores)
Very simple
ontology
Tom Scott @derivadow
Leigh Dodds @ldodds
Hexastore
• A triple is [s, p, o]
• Find all statements [s, ?, ?] is simple array lookup (all elements with key “s”)
• Find all statements [?, ?, o] is slow (scan all triples)…
• …unless we add array of [o, s, p] triples, then simple array lookup (all elements with key “o”)
• Six variations cover all queries: [s,p,o], [s,o,p], [p, s, o], [p, o, s], [o, s, p], [o, p, s] (hence “hexastore”)
• In-memory graph database in Javascript (think offline apps)
http://crubier.github.io/Hexastore/
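The idea above can be sketched in a few lines. This is a toy illustration in Python, not the API of the Javascript library linked above. The class name, the `find` signature and the identifiers such as `pub:1` are hypothetical.

```python
from collections import defaultdict


class Hexastore:
    """Toy in-memory triple store that indexes every triple in all six orderings."""

    ORDERS = ("spo", "sop", "pso", "pos", "osp", "ops")

    def __init__(self):
        # index[order][first][second] -> set of third terms
        self.index = {
            order: defaultdict(lambda: defaultdict(set)) for order in self.ORDERS
        }

    def add(self, s, p, o):
        term = {"s": s, "p": p, "o": o}
        for order in self.ORDERS:
            a, b, c = (term[k] for k in order)
            self.index[order][a][b].add(c)

    def find(self, s=None, p=None, o=None):
        """Return sorted (s, p, o) triples matching the pattern; None is a wildcard."""
        term = {"s": s, "p": p, "o": o}
        bound = [k for k in "spo" if term[k] is not None]
        free = [k for k in "spo" if term[k] is None]
        # Pick the ordering that puts the bound terms first, so they become key lookups.
        order = "".join(bound + free)
        level1 = self.index[order]
        results = []
        firsts = [term[order[0]]] if order[0] in bound else list(level1)
        for a in firsts:
            level2 = level1.get(a, {})
            seconds = [term[order[1]]] if order[1] in bound else list(level2)
            for b in seconds:
                for c in level2.get(b, set()):
                    if order[2] in bound and c != term[order[2]]:
                        continue
                    found = dict(zip(order, (a, b, c)))
                    results.append((found["s"], found["p"], found["o"]))
        return sorted(results)


store = Hexastore()
store.add("name:1", "publishedIn", "pub:1")
store.add("pub:1", "author", "person:1")
store.add("pub:2", "author", "person:1")

print(store.find(s="pub:1"))      # [s, ?, ?] via the "spo" index
print(store.find(o="person:1"))   # [?, ?, o] via the "osp" index, no scan of all triples
```

The price is storage: every triple is held six times, which is acceptable for the small, offline graphs this slide has in mind.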
Xanadu,
the web that wasn’t
Ted Nelson
Hyperlinks and hypermedia
Two-way links and “transclusion”
= Xanadu
Tim Berners-Lee
HTTP, URL, HTML
One-way links
= world wide web
Web page Other web
page
Web linking, one way, document-level, “target”
doesn’t know that it is linked to (“cited”),
link can break (404)
text
Work Source
text
Xanadu linking, two way, fragment-level,
“source” knows it is linked to, source content
is embedded, links don’t break
Xanadu
A New Account of the Genus
Horsfieldia (Myristicaceae), Pt 2
W J J O De Wilde
The Gardens' bulletin, Singapore 38(1): 55-144 (1985)
http://biostor.org/reference/175018
Horsfieldia lancifolia
BioStor @biostor_org
Biodiversity Heritage Library @biodivlibrary
Flora Malesiana. Series I - Seed Plants,
Volume 14. Myristicaceae
https://doi.org/10.3897/ab.e1141
Description Description
Flora Article
Embedded markup (bad)…
Crocidura absconditus, new species
<i>Crocidura absconditus</i>, new species
0 20
{ [0,20], “italics” }
…versus annotation (good)
(think NLM JATS XML markup versus Substance JSON used by Lens viewer, https://lens.elifesciences.org/about/)
Crocidura absconditus, new species
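A small sketch of the annotation approach: the text stays untouched and the formatting lives in a separate list of ranges, from which the embedded markup can be regenerated on demand. It assumes the slide's [0,20] means inclusive character offsets, and the dictionary layout is hypothetical (it is not the actual Substance JSON format).

```python
text = "Crocidura absconditus, new species"

# Stand-off annotation: inclusive [start, end] character offsets plus a type.
annotations = [{"range": [0, 20], "type": "italics"}]


def render_html(text, annotations):
    """Turn plain text plus non-overlapping range annotations into embedded markup."""
    tags = {"italics": "i"}
    out = []
    cursor = 0
    for ann in sorted(annotations, key=lambda a: a["range"][0]):
        start, end = ann["range"]
        tag = tags[ann["type"]]
        out.append(text[cursor:start])                        # untouched text before the range
        out.append(f"<{tag}>{text[start:end + 1]}</{tag}>")   # annotated span
        cursor = end + 1
    out.append(text[cursor:])                                 # untouched tail
    return "".join(out)


print(render_html(text, annotations))
```

Because the source string is never modified, further annotations (a taxonomic name, a specimen code, a locality) can be layered on the same offsets without touching one another.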
@hypothes_is
Annotating a
scientific paper
Aggregating annotations (iPhylo)
http://iphylo.blogspot.co.uk/2016/06/aggregating-annotations-on-scientific_30.html
Taxonomic names, specimen codes, geographic localities, references are all annotations
Taxonomic databases
are not lists of names…
…they are lists of annotations
(“this name occurs on this page”)
Annotations are retrospective nanopublications
Annotating existing content
(extracting “facts”)
Today
Publishing “facts” as nanopublications
Stream of “facts”
Social design and the
knowledge graph
Obstacles to building knowledge graphs
•Technical
•Social Economic
Nico Franz @taxonbytes
ORCID
(person)
DOI
(publication)
LSID
(plant name)
Find my papers that
published new species
@SandyKnapp
ORCID
(person)
DOI
(publication)
LSID
(plant name)
#Iamataxonomist
(claim/demonstrate expertise)
specimen plant name
What Sandy really wants
collected type for
publication
person
“Which specimens that I collected have been described as new species by other people?”
Published in
author
other person
not the same person
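Once people, specimens, names and publications share one graph, this question is a short walk across four edges. A sketch under stated assumptions: the predicate names (`collected`, `typeFor`, `publishedIn`, `author`) and all identifiers are hypothetical stand-ins for real ORCIDs, DOIs and LSIDs, and "by other people" is read as "the collector is not among the authors".

```python
# Toy graph as (subject, predicate, object) triples.
triples = [
    ("person:A", "collected", "specimen:1"),
    ("person:A", "collected", "specimen:2"),
    ("specimen:1", "typeFor", "name:X"),
    ("specimen:2", "typeFor", "name:Y"),
    ("name:X", "publishedIn", "pub:1"),
    ("name:Y", "publishedIn", "pub:2"),
    ("pub:1", "author", "person:B"),
    ("pub:2", "author", "person:A"),
]


def objects(triples, subject, predicate):
    """All objects o such that (subject, predicate, o) is in the graph."""
    return {o for s, p, o in triples if s == subject and p == predicate}


def types_described_by_others(triples, collector):
    """Specimens the collector gathered that are types of names published by other authors."""
    hits = []
    for specimen in sorted(objects(triples, collector, "collected")):
        for name in sorted(objects(triples, specimen, "typeFor")):
            for pub in sorted(objects(triples, name, "publishedIn")):
                authors = objects(triples, pub, "author")
                # "not the same person": the collector must not be an author.
                if authors and collector not in authors:
                    hits.append((specimen, name, pub))
    return hits


print(types_described_by_others(triples, "person:A"))
```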
Knowledge graphs
considered harmful
(remember Impact Factors?)
http://www.museum-analytics.org/
Cited, linkable specimens
NMNH Vertebrate Zoology Herpetology Collections 11194
CAS Herpetology Collection Catalog 9619
MCZ Herpetology Collection 6720
Herpetology Collection (University of Kansas Biodiversity Research Center) 5818
http://iphylo.blogspot.co.uk/2012/02/gbif-specimens-in-biostor-who-are-top.html
We will need to ensure our knowledge graph is
free, open, and used for good

Towards a biodiversity knowledge graph

Editor's Notes

  1. https://doi.org/10.1371/journal.pbio.2002231
  2. http://ispecies.org
  3. https://doi.org/10.7717/peerj.190
  4. https://www.wikidata.org
  5. http://iphylo.blogspot.ca/2017/01/displaying-taxonomic-classifications.html
  6. http://www.bbc.co.uk/nature/life/Steller's_Sea_Eagle
  7. http://crubier.github.io/Hexastore/
  8. Ted Nelson’s Xanadu project, linking and microcredit
  9. http://biostor.org/reference/175018
  10. https://doi.org/10.3897/ab.e1141
  11. https://lens.elifesciences.org/about/
  12. http://iphylo.blogspot.co.uk/2016/06/aggregating-annotations-on-scientific_30.html
  13. https://doi.org/10.1101/157214
  14. https://ontotext.com/knowledgehub/case-studies/sn-scigraph-uses-graphdb/. Springer SciGraph https://twitter.com/OntotextGraphDB/status/898143878724935681