1. 10 years Agricultural Ontology Initiative: Building Blocks for a Linked Data Infrastructure
Dr. Johannes Keizer
FAO of the United Nations
Office of Knowledge Exchange, Research and Extension
Team Leader “Knowledge Standards and Services”
2. dr johannes keizer - FAO of the United Nations - knowledge and capacity for development
Chinese Agricultural Ontology Service WS, Beijing 2010-10-30
The Internet!
4.
Aggregation States of Knowledge
5.
Data Flows and Repositories in Research
6.
“... FAO’s principal task is to work to ensure that the world’s knowledge of food and agriculture is available to those who need it when they need it and in a form which they can access and use ...”
7.
AOS Vision in 2001
8.
... from Thesaurus to Ontologies ...
9.
Lessons Learned
• Our push of AGROVOC to the Semantic Web had enormous positive effects, among others:
o From 4 to 20 language versions
o De facto standard for indexing in many areas
o More than 2,000 downloads in 2009 alone
o SKOS incorporated all our requirements
• For many purposes we need semantics on a lower level than that of a fully elaborated ontology, but we need them urgently
• The development of specific ontologies should always be application driven: a demand economy
10.
AOS - Today
Semantics
Tools
Linked Data
Community
11.
Community
12.
The AOS Community
13.
http://aims.fao.org/community/home
14.
Better Semantics
15.
AGROVOC concept scheme
• Around 30,000 concepts
• 600,000 labels in around 20 languages
• One-stop shop for terminological knowledge related to agriculture in general
• A knowledge base of related concepts organized in ontological relationships (hierarchical, associative, equivalence)
• Is a concept/term/string based system
• Concepts may be organized in multiple categories
16.
The AGROVOC concept scheme
[Diagram: AGROVOC concepts (numbered nodes such as 12332, 6211, 8171 and 1474) are typed as SKOS Concept via rdf:type, arranged hierarchically with skos:broader, and tied to the AGROVOC Concept Scheme through skos:inScheme and skos:topConceptOf; other FAO schemes attach to concepts through skos:inScheme. Labels are separate SKOS-XL resources (SKOS Label) reached via skosxl:prefLabel and skosxl:altLabel, each carrying a skosxl:literalForm such as “maize”, “corn” or “maïs” (fr), and related to one another by has_synonym and has_translation.]
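The labelled-concept model of this slide can be sketched with plain tuples standing in for RDF triples. This is only a minimal illustration, not the AGROVOC data itself: the label identifiers (`xl_en_maize` and so on) are hypothetical, the choice of which label is preferred is an assumption, and `has_synonym` / `has_translation` are used unprefixed as printed on the slide. The concept URI for maize is the one shown on the later linking slides.

```python
# Triples as (subject, predicate, object) tuples; a stand-in for a real RDF store.
AGROVOC = "http://aims.fao.org/aos/agrovoc/"
MAIZE = AGROVOC + "c_12332"  # maize concept URI, as shown on the linking slides

# Hypothetical label resources: SKOS-XL treats labels as first-class nodes.
L_MAIZE, L_CORN, L_MAIS = "xl_en_maize", "xl_en_corn", "xl_fr_mais"

TRIPLES = [
    (MAIZE, "rdf:type", "skos:Concept"),
    (MAIZE, "skos:inScheme", "AGROVOC Concept Scheme"),
    (MAIZE, "skosxl:prefLabel", L_MAIZE),   # assumed preferred English label
    (MAIZE, "skosxl:altLabel", L_CORN),
    (MAIZE, "skosxl:prefLabel", L_MAIS),    # assumed preferred French label
    (L_MAIZE, "skosxl:literalForm", ("maize", "en")),
    (L_CORN, "skosxl:literalForm", ("corn", "en")),
    (L_MAIS, "skosxl:literalForm", ("maïs", "fr")),
    (L_MAIZE, "has_synonym", L_CORN),       # label-to-label relations
    (L_MAIZE, "has_translation", L_MAIS),
]


def objects(subject, predicate):
    """All objects of triples matching the given subject and predicate."""
    return [o for s, p, o in TRIPLES if s == subject and p == predicate]


def labels(concept, lang):
    """Literal forms of a concept in one language, preferred labels first."""
    found = []
    for prop in ("skosxl:prefLabel", "skosxl:altLabel"):
        for label in objects(concept, prop):
            for text, label_lang in objects(label, "skosxl:literalForm"):
                if label_lang == lang:
                    found.append(text)
    return found


print(labels(MAIZE, "en"))
print(labels(MAIZE, "fr"))
```

Because labels are nodes rather than plain strings, relations such as synonymy or translation can be stated between two labels without touching the concept.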
17.
FAO FRBR Model & Authority Data
FRBR entities: Work, Expression, Manifestation, Item
Authority data: Subjects, Corporate Bodies, Conferences, Journals, Series, FAO Projects
18.
EXAMPLE: JOURNAL CONTENT MODEL
isSpatiallyIncludedIn
isPublishedBy
isOtherLanguageEditionOf
isFollowedBy/Follows
19.
A Model to create Linked Data
• Rich set of relationships
• Easier implementation of concept-based thesauri and authority data
20.
Geopolitical Ontology
21.
Fishery Ontologies
22.
What Partners have done ...
• Rice Knowledge Models
• Rice Production Ontology
• ASFA Thesaurus
• VIVO Ontologies
See the many presentations at this workshop
23.
Tools
24.
The Concept Scheme Work Bench
25.
Drupal
AgriDrupal is a “suite of solutions” for agricultural information management and dissemination, built on the Drupal CMS by different institutions and individuals who are now sharing their experiences in the AgriDrupal community
Drupal, a semantic web enabled CMS
• A general-purpose CMS is necessary
• Drupal has the flexibility to manage all information types
• Drupal has a strong user community
• Version 7 comes natively with an RDF backbone, but semantic applications are already possible with version 6
• Drupal can be a producer and consumer of Linked Data
26.
Drupal Query run on a Drupal website from a Virtuoso test environment at http://demo.openlinksw.com/sparql_demo/
[Diagram labels: SPARQL endpoint, RDF triples, Drupal website]
27.
AgriOcean DSpace: a tool for Repositories
• Objectives:
o Assure quality in metadata creation
o Sharing information in a standardized manner
o Use of common semantics and interoperable syntaxes
o Use of more sophisticated and specialized metadata
o Use of controlled, multilingual vocabularies
• Requirements:
o AGRIS AP compliance
o AGROVOC
28.
AgriOcean DSpace: Thesaurus plug-in
• Developed by Kasetsart University (Bangkok, Thailand)
• Thesaurus plug-in
o Web services: use local or remote version of AGROVOC thesaurus/SKOS
29.
Linking Data
30.
The Linked Data Universe: http://www.linkeddata.org (July 2010)
31.
http://agris.fao.org/agris-search/search/display.do?f=2004/ZA/ZA04002.xml;ZA2004000049
32.
Linking vocabularies
AGROVOC | EUROVOC | UNBIS | Relationship
http://aims.fao.org/aos/agrovoc/c_207 | http://eurovoc.europa.eu/219055 | agroforestry | skos:exactMatch / owl:sameAs
http://aims.fao.org/aos/agrovoc/c_4826 | http://eurovoc.europa.eu/220018 | MILK | skos:exactMatch / owl:sameAs
http://aims.fao.org/aos/agrovoc/c_12332 | http://eurovoc.europa.eu/219871 | MAIZE | skos:exactMatch / owl:sameAs
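The three AGROVOC to EUROVOC correspondences on this slide could be published as machine-readable statements. The sketch below is an assumption about how that might look, not the way FAO actually serialises its mappings: it writes each pair as one N-Triples line using skos:exactMatch, and the predicate can be swapped for owl:sameAs since the slide leaves that choice open. UNBIS is left out because the slide gives only its terms, not URIs.

```python
AGROVOC = "http://aims.fao.org/aos/agrovoc/"
EUROVOC = "http://eurovoc.europa.eu/"
SKOS_EXACT_MATCH = "http://www.w3.org/2004/02/skos/core#exactMatch"
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"

# (term, AGROVOC id, EUROVOC id) as listed on the slide.
MAPPINGS = [
    ("agroforestry", "c_207", "219055"),
    ("milk", "c_4826", "220018"),
    ("maize", "c_12332", "219871"),
]


def to_ntriples(mappings, predicate=SKOS_EXACT_MATCH):
    """Serialise each mapping as one N-Triples line."""
    return [
        f"<{AGROVOC}{agrovoc_id}> <{predicate}> <{EUROVOC}{eurovoc_id}> ."
        for _term, agrovoc_id, eurovoc_id in mappings
    ]


for line in to_ntriples(MAPPINGS):
    print(line)
```

Passing `predicate=OWL_SAME_AS` produces the stronger identity statement instead of the thesaurus-level match.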
33.
http://aims.fao.org/aos/agrovoc/c_7825
http://eurovoc.europa.eu/218754
34.
Linking data through common URIs
[Diagram: the concept “Maize” in three vocabularies, each with skosxl:literalForm “Maize” and each pointing to a resource indexed with it; the concepts are connected by owl:sameAs/exactMatch.]
AGROVOC: http://aims.fao.org/aos/agrovoc/c_12332, linked to http://agris.fao.org/agris-search/search/display.do?f=1996/TR/TR96001.xml;TR9600026
Eurovoc: http://eurovoc.europa.eu/219871, linked to http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2010:202:0011:0015:EN:PDF
UNBIS: Maize, linked to http://unbisnet.un.org:8080/ipac20/ipac.jsp?session=128F308557F34.283092&profile=bib&uri=full=3100001~!685149~!1&ri=1&aspect=subtab124&menu=search&source=~!horizon
http://aims.fao.org/aos/agrovoc/c_12332 owl:sameAs http://eurovoc.europa.eu/219871
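How equivalence links let a machine gather resources across catalogues can be sketched as follows. This is an illustration under assumptions, not the AGRIS implementation: the resource names are abbreviated stand-ins for the AGRIS and EUR-Lex URLs on this slide, it is assumed (as the diagram suggests) that each is indexed with the maize concept of its own vocabulary, and the equivalence link is treated as symmetric.

```python
from collections import defaultdict

MAIZE_AGROVOC = "http://aims.fao.org/aos/agrovoc/c_12332"
MAIZE_EUROVOC = "http://eurovoc.europa.eu/219871"

# Equivalence statements between concept URIs (owl:sameAs or skos:exactMatch).
SAME_AS = [(MAIZE_AGROVOC, MAIZE_EUROVOC)]

# Which concept each resource is indexed with (resource names abbreviated).
INDEXED_WITH = {
    "AGRIS record TR9600026": MAIZE_AGROVOC,
    "EUR-Lex OJ:L:2010:202:0011:0015": MAIZE_EUROVOC,
}


def equivalents(concept, same_as):
    """Return the concept plus everything reachable via equivalence links."""
    neighbours = defaultdict(set)
    for a, b in same_as:
        neighbours[a].add(b)
        neighbours[b].add(a)  # treat the link as symmetric
    seen, todo = {concept}, [concept]
    while todo:
        for nxt in neighbours[todo.pop()] - seen:
            seen.add(nxt)
            todo.append(nxt)
    return seen


def linked_resources(concept):
    """Resources indexed with the concept or any equivalent concept."""
    cluster = equivalents(concept, SAME_AS)
    return sorted(r for r, c in INDEXED_WITH.items() if c in cluster)


print(linked_resources(MAIZE_AGROVOC))
```

Starting from either vocabulary's URI yields the same set of resources, which is the point of linking through common URIs rather than through label strings.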
35.
What are we doing with unstructured data?
• We have enormous amounts of unstructured material
• Most of the documents that we produce are still semantically unstructured
• Human work to catalogue and index is becoming ever rarer
• We need machines to do automatic semantic markup of text
• If machines are trained and based on concept schemes, they are able to do so
37.
AgroTagger
• Identifies concepts in unstructured texts
• Uses AGROVOC as a controlled vocabulary
• Prototype under testing with excellent results (entire repository of ICARDA indexed)
• Will in future produce structured RDF files that can be used to link data, like “Open Calais”
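The idea of concept identification against a controlled vocabulary can be illustrated with a plain dictionary lookup. This is not AgroTagger's algorithm, which the slides do not describe: it is a minimal sketch assuming a tiny hand-made label table, using the three AGROVOC concept URIs shown elsewhere in this talk and “corn” as a synonym of maize.

```python
import re

AGROVOC = "http://aims.fao.org/aos/agrovoc/"

# Tiny label -> concept lookup; a real vocabulary has hundreds of thousands of labels.
LABELS = {
    "maize": AGROVOC + "c_12332",
    "corn": AGROVOC + "c_12332",   # synonym resolves to the same concept
    "milk": AGROVOC + "c_4826",
    "agroforestry": AGROVOC + "c_207",
}


def tag(text, labels=LABELS):
    """Return {concept URI: count} for every vocabulary label found in text."""
    counts = {}
    for word in re.findall(r"[a-zA-Z]+", text.lower()):
        uri = labels.get(word)
        if uri:
            counts[uri] = counts.get(uri, 0) + 1
    return counts


sample = "Corn yields rose; maize and milk prices followed."
print(tag(sample))
```

The output is keyed by concept URI rather than by word, so “corn” and “maize” count towards the same concept; that is what makes the result usable for linking.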
41.
Live Demo: Semantic markup
http://viewer.opencalais.com/
http://agropedialabs.iitk.ac.in/Tagger/Agrotagger_text.php
42.
The CIARD RING
43.
AGRIS Linked Data
This is the AGROVOC SKOS model, which was developed and agreed in April 2010 in active collaboration with Tom Baker, who was a member of the W3C SKOS working group.
• AgriOcean DSpace release: 09/2010
• Setting up of the AgriOcean DSpace community on AIMS: 09-10/2010
• Exchange of experiences
• Pilot testing: “Bangladesh Agricultural Universities’ Institutional Repository”
• Cooperation with DuraSpace
• Informing about the new AgriOcean DSpace implementations
• Collaborating in the DSpace Ambassadors Program
This is a snapshot one year later. The growth is enormous. A central point is DBpedia, “triplified” information from Wikipedia. The different colours represent the different information types, with “life sciences” and “publications” the most populated areas, and the area “government” growing strongly. Interesting newcomers in the last months are the two VIVO datasets from the United States describing expertise in science. VIVO is actually a project that started at the agricultural library of Cornell University.
In a bibliographical record there is much more hidden information than is displayed with the metadata. Many of the highly structured data link to other information on the web. In AGRIS we have now introduced something that we call “naive linking”. An AGRIS record links automatically to Google Maps for the location of the center, and to Google to retrieve the full text of the resource, citation lists or other publications from the authors. This often works, but clearly not always, as it is not controlled by semantics but only through identity of strings. For an uneducated machine, unfortunately, COW and C.O.W. are the same, whereas peanuts and groundnuts are something different.
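The weakness of matching by identity of strings can be shown in a few lines. This is a sketch under assumptions: the normalisation rule is one plausible form of “naive” matching, not what AGRIS does, and the concept identifiers are hypothetical placeholders.

```python
import re


def naive_key(term):
    """Identity-of-strings matching: lowercase, drop punctuation and spaces."""
    return re.sub(r"[^a-z0-9]", "", term.lower())


# Hypothetical concept identifiers, for illustration only.
CONCEPT_OF = {
    "COW": "concept:cattle",
    "C.O.W.": "concept:some-acronym",
    "peanuts": "concept:groundnut",
    "groundnuts": "concept:groundnut",
}


def same_by_string(a, b):
    """True when two terms collapse to the same normalised string."""
    return naive_key(a) == naive_key(b)


def same_by_concept(a, b):
    """True when two terms are attached to the same concept."""
    return CONCEPT_OF[a] == CONCEPT_OF[b]


for pair in [("COW", "C.O.W."), ("peanuts", "groundnuts")]:
    print(pair, same_by_string(*pair), same_by_concept(*pair))
```

String matching merges the animal with the acronym and separates the two names of one crop; matching on concepts gives the opposite, intended result in both cases.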
The table shows 3 descriptors that are in AGROVOC, EUROVOC and UNBIS. In AGROVOC and EUROVOC they are already encoded as URIs. We could easily establish relationships like owl:sameAs between the concepts or skos:exactMatch between labels.
If resources are marked up with semantically defined and machine-readable concepts, they can be linked and mashed up precisely, as we have seen in the example from the BBC. In this example we start with an AGRIS record on hazardous waste, which is indexed with AGROVOC. Already now we can easily link to material indexed with EUROVOC, here an example from EUR-Lex. If the UNBIS thesaurus were restructured into a concept scheme and published as LOD, related UN documents could be attached automatically by the machine.
How does this work? A resource is connected with each concept URI on the web. The concepts in the three vocabularies have the same literal and are connected by an owl:sameAs/exactMatch relationship. As we are speaking about thesauri and not ontologies, we purposely kept the choice of relation vague. The concepts could be matched with owl:sameAs, or the terms could be matched with skos:exactMatch. A lot of discussion on this is ongoing.
One of the groundbreaking enterprises in this area is Thomson Reuters’ “Open Calais”. This is a web service that provides semantic markup for any unstructured text that you feed into it. The service is free of charge. Why? I will show you later.
My team, in collaboration with the Indian Institute of Technology in Kanpur, is developing a similar service for our subject area.
We have here a text from 1964, without a bibliographic record at hand, about a plant protection issue.
Open Calais is very good in those areas in which they have their own elaborated concept scheme against which the texts are analyzed: “Places”, “Persons”, “Business Processes”, “Industry Terms”. It is weak, however, in the specific topic analysis, what they call “social tags”.
AgroTagger still lacks many of the sophisticated features of “Open Calais”, but is much, much better in the subject analysis of the text.