Presentation in Chinese on: Understanding and Using Linked Data – Libraries, Archives, and Museums (LAM) as the contributors and consumers of Linked Data
4. 1 个目的:关联
将数据从 silo 中解放出来,通过关联得到最大
利用,再利用,产生新的数据、信息、知识
Image: Slide from T.B.Lee at TED 2009 conference, "The Great Unveiling" in Long Beach, CA. USA, 4, Feb 2009
11. 图示 1. thing 的
描述和无限联接
1 2 3
predicates objects
1
2 3
subjects predicates objects subjects
Backgroud image borrowed from Andrea Kosavic: The Semantic Web, (some of) what you need
to know. OLA Superconference 2009.01.30. Compiled by mzeng 2009-03-06.
17
14. 四项基本原则:
1. 使用 URI 作为任何事物的标识名称
– Use URIs as names for things
1. 使用 HTTP URI 使任何人都可以访问名称
– Use HTTP URIs so that people can look up those names
1. 当有人访问名称时,提供有用的 [rdf] 信息
– When someone looks up a name, provide useful
information
1. 尽可能提供相关的 URI 以使人们发现更多的信息
– Include links to other URIs so that they can discover
more things
http://www.w3.org/DesignIssues/LinkedData.html
Translated by Liu Wei, 2008 12
15. 1 1. 使用 URI 作为任何事
物的标识名称
2. 使用 HTTP URI 使任何
人都可以访问名称
2 3
3. 当有人访问名称时,
提供有用的 [rdf] 信息
4. 尽可能提供相关的
URI 以使人们发现更多
的信息
Source: dbpedia
16. RDF triples
取出其中一截,其结构是:
is president of http://dbpedia.org/page/
http://dbpedia.org/page/Yao_Ming Shanghai_Sharks
2
1
properties 3 values
thing
1
<RDF:RDF>
<RDF:Description RDF:HREF=”http://dbpedia.org/page/Yao_Ming”>
2 < dbpprop:president
rdf:resource= "http://dbpedia.org/page/Shanghai_Sharks">
3
</RDF:Description>
</RDF:RDF>
机器可理解、可处理
17. 只要遵循有 URI 命名, HTTP 途径,凡是有可能的地方对联
接的 properties 合理管理,则可以被考虑为关联数据。
•这是一种为 Web 所用的含义更丰富的联结方式,使
我们从超文本链接(文献到文献)发展到超数据的联
结(文献中所讨论的内容资源( thing) 的联结) ;
•人们可以通过 HTTP/URI 机制,直接获得数字资源
(Thing) 。
Tim Burners-Lee: http://www.w3.org/DesignIssues/LinkedData.html
参考刘炜 2008-12 的翻译
18. 相对于传统的图书馆数据来说,
关联的图书馆数据有如下优点
• Shareable 可分享
– 不管谁都能解析的唯一标识- URI
– 可信赖的数据和元数据供大家使用
• Extensible 可无限伸展
– “ 无边无际的世界” - 没有什么描述是完成了的,任何人都可以从
他自己发布的空间添加描述信息
• Re-usable 可再利用
– 各种来源的描述谈的是同样的 thing
– 完善、加注、等等
• Internationalizable 国际化
– 多语种、翻译等都没有问题
– 自然语言词串( strings )不被用来指代 things 。
This slide is based on a summary prepared by Antoine Isaac, for Talis
Linked Data and Libraries day, London, July 14th 2011;
built on the W3C Library Linked Data Incubator Group Final Report, draft
20. 李爵士在提出关联数据时借用过这样一个五星排行表:
From InkDroid, a nice summary (and CSS) of the star scheme
★ make your stuff available on the web (whatever format)
把你的东西发布在网上(以任何格式)
★★ make it available as structured data (e.g. excel instead of image
scan of a table)
把你的东西变成有结构的数据 ( excel 格式的表格而不是扫描的
图像)
★★★ non-proprietary format (e.g. csv instead of excel)
非商家格式,(如 csv 而不是 excel )
★★★★ use URLs to identify things, so that people can point at your
stuff
用 URLs 来指代和辨识事物,这样别人就能指点到你的东西
★★★★★ link your data to other people’s data to provide context 将
你的数据联到其他人的 数据
-- http://www.w3.org/DesignIssues/LinkedData.html. TBL, 2006
08/14/12 20
26. “ 关联”是有不同层次的,从简单到复杂
-- 以值词汇 (value vocabularies) 为例
相联的各类
用结构化的
同指 ‘ 关于’相似的主题 RDF 图表
达 '类 '
References: Linked Data: Evolving the Web into a Global Data Space, by Tom
Heath and Christian Bizer.
Linked Data FAQ, by Structured Dynamics.
http://structureddynamics.com/linked_data.html#question_8
34. Xu Bing:Background
Story 7
徐冰
《背后的故事: 7 》
British Museum, 2011
大英博物馆, 2011
Image source: An installation by Xu Bing:
Background Story 7
By British Museum (Albums)
http://www.facebook.com/media/set/?
set=a.10150184112629723.318031.7222852972
2
35. Image source: An installation by Xu Bing:
Background Story 7
By British Museum (Albums)
http://www.facebook.com/media/set/?
set=a.10150184112629723.318031.7222852972
2
40. :sh85066731
prefLabel Installations (Art)
•RDF 图的结构能
bro
表征概念之间的关 note
"……"
ad
系
er
sh85007805
bro
•其中有些固定的 Art, Modern--20th century
a de
可联接的节点, "……"
r
通过它们能将四
related
narr
处分散的数据系 sh85044164
o
在一起
we r
Environment (Art)
Environment (Art)
"……"
"……"
sh2001001631
Site-specific installations (Art)
" ……"
" ……"
41. 知识组织系统里定义的概念能够起到‘汇合点’的作用,
令分散的资源很容易地通过已发布的三段式用 URI 来关联起来
Slide from Tom Baker "The concepts of knowledge organization systems as hubs in the
Web of data". UDC 2011 Seminar. http://seminar.udcc.org/2011/index.htm
This slide set was presented at the TED 2009 conference, "The Great Unveiling" in Long Beach, CA. USA, 4, Feb 2009. The current Web is still a Web of documents, a web of un-typed documents. Datasets exist in silos, and the web is like a global file system. Now we want to turn this into a global database, so we can share, expose, and connect data in those silos. From a global file system to a global database From a Web of documents to a Web of data Break the silos Link data This slide set was presented at the TED 2009 conference, "The Great Unveiling" in Long Beach, CA. USA, 4, Feb 2009.
When there are lots of triples together, they are like this… The first subject here really has quite a few predicates, and the objects can be URIrefs or literals. Those objects can also be the things, so they also may be the subjects of further statements. Backgroud image borrowed from Andrea Kosavic: The Semantic Web, (some of) what you need to know . OLA Superconference 2009.01.30. Compiled by mzeng 2009-03-06.
Tim Burners-Lee: http://www.w3.org/DesignIssues/LinkedData.html Linked Open Data 关联数据的四项基本原则: 原则一:使用 URI 作为任何事物的标识名称 Use URIs as names for things ; 原则二:使用 HTTP URI 使任何人都可以访问名称 Use HTTP URIs so that people can look up those names ; 原则三:当有人访问名称时,提供有用的[ RDF ]信息 When someone looks up a name, provide useful [ RDF ] information 原则四:尽可能提供相关的 URI 以使人们发现更多的信息 Include links to other URIs so that they can discover more things
If we transfer the graph using the XML syntax, the graph will become the code illustrated in this slide. Now the statement becomes machine understandable and processable.
From: Kingsley Idehen, Creating, Deploying, and Exploiting Linked Data
Sept.. 2011 draft version: http://www.w3.org/2005/Incubator/lld/wiki/DraftReportWithTransclusion#Benefits_of_the_Linked_Data_Approach The Linked Data approach offers significant advantages over current practices for creating and delivering library data while providing a natural extension to the collaborative sharing models historically employed by libraries. Linked Data and especially Linked Open Data is sharable, extensible, and easily re-usable. It supports multilingual functionality for data and user services, such as the labeling of concepts identified by a language-agnostic URIs. These characteristics are inherent in the Linked Data standards and are supported by the use of Web-friendly identifiers for data and concepts. Resources can be described in collaboration with other libraries and linked to data contributed by other communities or even by individuals.
So long as the principles of URI naming, HTTP access, and linking predicates where possible are maintained, the approach qualifies as linked data. Source: http://vocab.umbel.org/
Linking Open Data cloud diagram by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/ There must be resolvable http:// (or https://) URIs. They must resolve, with or without content negotiation, to RDF data in one of the popular RDF formats (RDFa, RDF/XML, Turtle, N-Triples). The dataset must contain at least 1000 triples. (Hence, your FOAF file most likely does not qualify.) The dataset must be connected via RDF links to a dataset that is already in the diagram. This means, either your dataset must use URIs from the other dataset, or vice versam. We arbitrarily require at least 50 links. Access of the entire dataset must be possible via RDF crawling, via an RDF dump, or via a SPARQL endpoint.
The simple case is where two data sources refer to the exact same entity or instance (individual) with the same identity. The standard sameAs predicate is used to assert the equivalence in such cases.
Data sources aggregated at DBpedia, Freebase, and New York Times for the {Apple Inc.} are all referring to the same company, even though the labelsmay be different. Their unique URIs are connected by the “owl:sameAs” property.
The more important case is where the data sources are about similar subjects or concepts, in which case a structure of well-defined reference classes is employed.
ID: 300182935 Record Type: concept installations (exhibitions) For example, books, catalogs, museum exhibitions about {installation arts} may use different terms, however they are about similar subjects or concepts. In a VRA record it might give an AAT concept {ID: 300047896@aat} with preferred term “installations (visual works)”, while a MARC record may have used a LCSH subject heading “Installations (Art)” { http://id.loc.gov/authorities/subjects/sh85066731}. In both AAT and LCSH, the concept is well defined, with notes, semantic relationships with other concepts, reference sources, preferred and alternative labels. The concept in AAT {ID:300047896@aat} and in LCSH {http://id.loc.gov/authorities/subjects/sh85066731} can be considered closely matched (coded as “ skos:closeMatch ” ). (Note they are not considered as the same (coded as “ owl:sameAs ” )).
Furthermore, these classes can themselves be expressed in a graph structure capturing the relationships amongst the concepts. For example SKOSified LCSH captured the relationships of the {Installations (Art)}with the broader concept (coded as “skos:broader”) {Art, Modern--20th century} and a number of narrower concepts. The concept in LCSH is also mapped to that in the French RAMEAU vocabulary {Installations (art)} . Following the RDF graphs (statements) encoded by “skos:closeMatch”, concepts in the national authorities of the U.S., France, and Germany are connected. RDF graphs can mix different descriptions, for example, through properties such as “foaf:primaryFocus” or “dcterms:subject” to connect a thesaurus concept to the documents about the concept. Using the German resource as an example, SWD’s concept {Installation <Kunst>} is linked with all publications that have this subject heading (showing as “Thema in 1154 Publikationen”). The same process can be applied to the resources about an agent, a place, an object, etc.
If these classes can themselves be expressed in a graph structure capturing the relationships amongst the concepts, we now have some fixed points in the conceptual information space for relating and tying together disparate data.
Still further, such a conceptual structure also provides the means to relate the people, places, things, organizations, events, etc., of the individual instances of the world to one another as well. Reference: http://structureddynamics.com/linked_data.html#question_8