A museum collection search system called Linked
Open Data for Academia (LODAC) Museum has been developed that uses Linked Data. The LODAC Museum identifies and associates artists, artworks, and museum information from some different museums to provide integrated data that are published as Linked Data with the SPARQL endpoint.
(This side used at Culture and Computing 2011)
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
Study Support and Integration of Cultural Information Resources with Linked Data
1. Study Support and Integration of
Cultural Information Resources with Linked Data
Tetsuro, KAMURA
Doctoral Student
School of Multidisciplinary Sciences, Department of Informatics
The Graduate University for Advanced Studies(SOKENDAI)
Fumihiro, KATO (National Institute of Informatics)
Toru,TAKAHASHI (ATR-Promotions.Inc)
Hiroshi,UEDA (ATR-Promotions.Inc)
Ikki, OHMUKAI (National Institute of Informatics)
Hideaki, TAKEDA (National Institute of Informatics)
11年12月4日日曜日
2. Agenda
Introduction Approach Applications
・Next Generation Web ・Gathering ・Yokohama Art Spot
・Linked Data ・Standardization ・Photo BURARI(LODAC version)
・Integration & Association
・Publishing & Sharing
11年12月4日日曜日
4. Up until now, lots of Japanese museums have
been built Online DB and Digitized collection.
11年12月4日日曜日
5. Up until now, lots of Japanese museums have
been built Online DB and Digitized collection.
Each museum has developed separate collection
management system with original metadata.
It is difficult to retrieve relevant information
by searching multiple museum information.
11年12月4日日曜日
6. If, we could be search and use the
multiple information on the Web....
Museum Museum Create original
Infomation Publish Museum Data
× = service to the
Web service museum
Publish
GIS,Facilities
GIS Publish and
Data Museum
Share
× Event
GIS = recommendation
Local Publish Event Data
Information × application
Local events
Using Open Data
We can find new frontier, new study
and create Web service so on...
5
11年12月4日日曜日
7. The Museum field in particular.
We challenge to solve Japanese Arts and Culture fields
propositions with next generation Web technology
11年12月4日日曜日
8. A growing new way to distribute
information on the Web
11年12月4日日曜日
9. Next Generation of Distribute Information
Existing Web = Web of Document
ex) PDF, HTML, Image format information.
Already processed data.
if, want to use data, you have to extract data from
pdf data or strip HTML Tags from HTML data.
11年12月4日日曜日
10. Next Generation of Distribute Information
Existing Web = Web of Document
ex) PDF, HTML, Image format information.
Already processed data.
if, want to use data, you have to extract data from
pdf data or strip HTML Tags from HTML data.
Next Generation Web = Web of Data
ex) SPARQL Endpoint, RDF format data.
Directly refer to open data.
Available to use RAW data immediately.
The platform called...
11年12月4日日曜日
14. Basic Structure of the Linked Data
The address is described
somethings about information.
http://lod.ac./id/359
10
11年12月4日日曜日
15. Basic Structure of the Linked Data
The address is described Access to the URL,
somethings about information. you can look-up following string
http://lod.ac./id/359 KANZAN
What s mean?
10
11年12月4日日曜日
16. Basic Structure of the Linked Data
Creator s name
http://lod.ac./id/359 KANZAN
Understand http://lod.ac/id/359 is described about creator s name KANZAN
10
11年12月4日日曜日
17. Basic Structure of the Linked Data
Predicate
Creator s name
http://lod.ac./id/359 KANZAN
Subject
Object
Understand http://lod.ac/id/359 is described about creator s name KANZAN
The Linked Data consist of the three parts of resource.
10
This structure is called RDF model. (Resource Description Frameworl)
11年12月4日日曜日
18. Linking Data
KANZAN Autumn among Tre
Title of Artwork @en
Creator s name is a
秋の木の間
Title of Artwork @ja
Link to Artwork
http://lod.ac./id/359 http://lod.ac./id/20029 http://
Collected
Link to Creator s Reference
Job is a Japanese style painter
http://lod.ac./ref/359
1873
was born in
Link node, Contains other information links.
String node. Represent string information,(string,number,date )
11
11年12月4日日曜日
19. Linked Data represents information
as node and arc labeled directed graph
Autumn among Trees
of Artwork @en
秋の木の間
Title of Artwork @ja
0029 http://lod.ac./id/912
Museum
nese style painter
1873
12
11年12月4日日曜日
20. Linked Data represents information
as node and arc labeled directed graph
http://lod.ac./id/16510
Autumn among Trees
Link to Artwork
of Artwork @en http://lod.ac./id/17327
秋の木の間 Link to Artwork
Title of Artwork @ja
Link to Artwork
0029 http://lod.ac./id/17412
http://lod.ac./id/912
Museum
Link to Facilities Reference
Phone number is
nese style painter
http://lod.ac./ref/912 03-5777-8600
1873
Museum name is
TheTokyo National Modern Museum
12
11年12月4日日曜日
21. If, user wants look-up data.
Current Web VS Linked Data
11年12月4日日曜日
22. If, user wants look-up data.
Current Web VS Linked Data
Processed
Query
Query
Converted Current
Query Search and extract data
Processed with several websites every
Query
Converted time.
Irritated User
Distribute Information
11年12月4日日曜日
23. If, user wants look-up data.
Current Web VS Linked Data
Processed
Query
Query
Converted Current
Query Search and extract data
Processed with several websites every
Query
Converted time.
Irritated User
Distribute Information
Linked Data
Query
Querying integrated data.
Integrate Information Happy User
11年12月4日日曜日
27. Gathering data
Museums Source Uses Data Amount of Data
Catalog of the collections of 3 National Art Museum 25,180
National Museum of Western Art 4,373
Kyoto National Museum 5,819
Nara National Museum 431
Fukushima Pref. Art Museum 20
Tochigi Pref. Art Museum 32
Artwork
Akita Pref. Art Museum 22
Iwate Pref. Art Museum 1,588
Tokushima Pref. Art Museum 18,482
Yamanashi Pref. Art Museum 5.416
Kagawa Higashitama Kaii Setouchi Art Museum 5.416
Yokohama Art Museum 6,286
These are not official authorized use... 17
11年12月4日日曜日
28. Relevant Sources Use Data Amount of Data
Database for National Treasure & Important Cultural
Artwork 915
Property of National Designated
Cultural Heritage Online Facilities 648
DBpedia Japanese (Referred to DBpedia) WikiPedia -
Geographical
GIS data National and Regional Planning Bureau
Facilities
Artwork 266
Creator 3,800
The Japanese Art Thesaurus Association for Arts 1,332
Facilities 289
Overall 109,382
Covers a wide range of content types as already structured concept.
Contains several metadata such as creator name,work title, era,
owner, current location, facilities.
18
11年12月4日日曜日
29. Scraping and processing sources
Museums website
(HTML, Perl, PHP)
Processed Raw Data
Relevant source website
(HTML, Perl)
Extract contents data, as Raw Data
The Japanese Art Thesaurus
(MS-EXCEL Sheets)
19
11年12月4日日曜日
30. Standardization of data
Re-organized common metadata.
dc:title
crm:P45_consistOf
skos:preflabel
Raw Data .... lodac:era
Re-organized Metadata
Current organized policies
・Use existing metadata.(Use string as data only)
・Define own metadata.
20
11年12月4日日曜日
31. Prefix Metadata Name
crm CIDOC-CRM
dc11 Dublin Core 1.1
dc DCMI Terms
Simple Knowledge
skos
Organization System
Resource Description
rdfs
Frame Work Schema
foaf Friend of a Friend
Resource Description
rda2
and Access
lodac LODAC Project
21
11年12月4日日曜日
32. Integrating Data
Integration Data
dc:references dc:references
(Ref-resource) (ID-resource) (Ref-resource)
Creator s reference Creator s information Creator s reference
Generated from Generate RDF and
RAW data to RDF assigned LODAC ID
11年12月4日日曜日
33. Integrating Creator s Information
SHOMOMURA,
DBpedia (Wikipedia) 1873
Kanzan@en
foaf:name
dc:references 下村観山@ja crm:P98I_was_born
foaf:name foaf:name
dc:references dc:source
LODAC ID-resource LODAC Ref-resource Japanese Art Thesaurus
Integrated
lodac:creates External link
External Link
creator resource
dc:references dc:source
National Museum of
LODAC ID-resource LODAC Ref-resource Modern Art
dc:title dc:title
dc:created
木の間の秋@ja
dc:title dc:title
Autumn Among
Trees@en 1907 23
11年12月4日日曜日
34. Associating data
Associate Creator and Artwork
A. Japanese Art Thesaurus - 1,332 creators
B. All of artwork - 61,861 titles
Using string match method
A. Creator of artwork
Matching KEY
B. Creator s Name
24
11年12月4日日曜日
35. Amount Integration
Integrate Item Source of Data Data
A.Japanese Art Thesaurus 648
Facilities 77
B.Cultural Heritage Online 915
Title of important A.Japanese Art Thesaurus (Art work) 3,800
74
cultural properties B.DB for National Treasure (Art work)
10,115
Creator information
A.Japanese Art Thesaurus (Creator) 1,332
15,020
and Work Title
B.All of art work (Work title string) 61,861
A.Japanese Art Thesaurus (Creator) 1,332
Creator name 615
B.All of art work title(using creator name) 61,861
25
11年12月4日日曜日
36. Publishing & Sharing
We build a Linked Data infrastructure for
for the museum information
26
11年12月4日日曜日
37. Publish data as RDF
<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:ns0="http://purl.org/dc/terms/"
xmlns:ns1="http://xmlns.com/foaf/0.1/" ID-resource URI
xmlns:ns2="http://lod.ac/ns/lodac#"
(Own address)
xmlns:ns3="http://www.w3.org/2000/01/rdf-schema#"
xmlns:ns4="http://www.w3.org/2004/02/skos/core#">
http://lod.ac/id/359
<rdf:Description rdf:about="http://lod.ac/id/359.json">
<ns0:title>JSON representation for http://lod.ac/id/359</ns0:title>
External link <ns1:primaryTopic>
DBpedia Japanese <ns1:Person rdf:about="http://lod.ac/id/359">
<ns2:creates rdf:resource="http://lod.ac/id/20029"/>
<ns0:references rdf:resource="http://dbpedia.jp/resource/%E4%B8%8B
%E6%9D%91%E8%A6%B3%E5%B1%B1"/>
<ns0:references rdf:resource="http://lod.ac/ref/359"/>
<ns3:label xml:lang="ja">下村観山</ns3:label>
<ns4:prefLabel xml:lang="ja">下村観山</ns4:prefLabel>
<ns1:name xml:lang="ja">下村観山</ns1:name>
</ns1:Person>
Ref-resource URI
</ns1:primaryTopic>
http://lod.ac/ref/359
</rdf:Description>
</rdf:RDF> 27
11年12月4日日曜日
38. SPARQL Query
SPARQL query language is widely
used for querying RDF data.
How many duplicate titles?
WHERE
Pull an artwork resources
out of the RDF dataset
An artwork resources SELECT
Pulled data, count duplicate work title. 28
11年12月4日日曜日
39. SPARQL Query TOP20 s Duplicate Titles
SPARQL query language is widely
used for querying RDF data.
How many duplicate titles?
WHERE
Pull an artwork resources
out of the RDF dataset
An artwork resources SELECT
Pulled data, count duplicate work title. 28
11年12月4日日曜日
43. Photo BURARI (LODAC.Ver)
(C)ATR-Promotions,Inc
GIS and Facilities information
through the SPARQL 32
11年12月4日日曜日
44. Summary
• Organizing
We tried to integrating distributed information as Linked Data.
In consequence, approximately 11 million information available
for common platform.
• Publishing
We published an RDF data on a LODAC Museum website. These
are everybody can use for free!
• Using
Currently, the two applications use LODAC Museum s Data. We
are more consider how to use these resources.
(We have a plan to use for the purpose of study)
11年12月4日日曜日
45. http://lod.ac
LODAC Project
Linked Open Data for Academia
11年12月4日日曜日