DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
LOD2: State of Play WP7: Use Case Publishers and Media
1. Creating Knowledge out of Interlinked Data
Overview WP7
State-of-Play
Paris Meeting 24/25 March 2011
Christian Dirschl
Wolters Kluwer Deutschland GmbH
LOD2 Presentation . 02.09.2010 . Page http://lod2.eu
2. Creating Knowledge out of Interlinked Data
WP 7 – Wolters Kluwer Deutschland (WKD) Company Profile
„Semantic Technologies and Standards are an enabler for the media and publishing industry to create
added-value for their customers with reasonable costs“
WKD Legal & Regulatory
Companies/Brands Products (Examples)
- Carl Heymanns Verlag - IP, Administrative Law WKD is part of Wolters Kluwer B.V.
- Luchterhand - Civil, Family, Labor Law
- Werner Verlag - Construction Law Customer orientation Worldwide reach
- Carl Link - Publications for Schools/KiTas - Lawyers - Europe
- CW Haarfeld - Public Health Insurance - Tax Accountants - North America
- Deutscher - Magazin „Personalwirtschaft“ - Corporations and SMEs - Asia/Pacific
Wirtschaftsdienst (HR Management) - Fincancial institutions
- AnNoText - SW for Lawyers and Notaries - Health Providers Economic success
- Trigon Data - Public Sector - Revenue 2009 EUR 3,4 bln.
- 18.000 Employees
WKD Tax & Accounting - Listed Amsterdam SE
Companies/Brands Products (Examples)
- Akademische Arbeits- - Tax SW for Consumers
gemeinschaft Verlag
- Addison Group - SW for Tax Accountants
- Schleupen Tax - SW for SMEs with focus
- Wago Curadata Controlling and Accounting
LOD2 Event . 06.10.2010 . 2
2 http://lod2.eu
3. Creating Knowledge out of Interlinked Data
WP 7 - WKD Content Supply Chain As Is
Content Supply
Chain of
Content Composing Publishing Customer
Wolters Kluwer Editing Sales Customer
Acquisition Bundling Interfacing Service
Deutschland
(WKD)
Content Akquisition Editing Publishing Sales
Composing/Bundling Interfacing Customer Service
Manual collecting
data from different Online libraries as
Using internal Publishing mainly in isolated applications
sources taxonomies and the context of a distinct
Most information is thesauri product Hardly any
publicly not available integration with Web
Mainly manual Publishing of texts, content
1:1 contractual enrichment not information
relationships with Only first steps in
Linking of WK integration of client
authors content only software and content
LOD2 Event . 06.10.2010 . 3
3 http://lod2.eu
4. Creating Knowledge out of Interlinked Data
WP 7 - WKD as a Consumer of LOD Data
Content Supply
Chain of
Content Composing Publishing Customer
Wolters Kluwer Editing Sales Customer
Acquisition Bundling Interfacing Service
Deutschland
(WKD)
Content Acquisition Content Enrichment Enterprise Applications
Acquisition of LOD governmental data Enrichment of WKD data Data integration in Enterprise and other
Costumer Applications
- Laws & Regulations - Enrichment with additional metadata
from the LOD cloud - Integration of customer and WKD data
- Court cases
with data from the LOD cloud
- Automatic Interlinking within WKD data,
- Administrative Rulings
but also into the LOD cloud - Development of new services, e.g.
- Statistical information around metadata economics
Based on:
Based on: Based on:
- Adequate delivery format
- Adequate delivery format - Adequate functionality
- Adequate metadata
- Adequate metadata - Adequate APIs
- Adequate functionality
- Adequate Licensing and IPR - Adequate Licensing and IPR
- Adequate Licensing and IPR
LOD2 Event . 06.10.2010 . 4
4 http://lod2.eu
5. Creating Knowledge out of Interlinked Data
WP 7 - WKD as a Publisher of LOD Data
Content Supply
Chain of
Content Composing Publishing Customer
Wolters Kluwer Editing Sales Customer
Acquisition Bundling Interfacing Service
Deutschland
(WKD)
Cloud - Publishing Marketing measures
Development of WKpedia Integration in overall marketing
strategy of WKD
- Publishing of enriched governmental
information - Dissemination of LOD2 in media and
publishing sector
- Publishing of legal domain thesauri
- Launching surveys
- Motivating contextualisation in LOD
cloud - Permanent information of customers
Based on: - Sponsoring of conferences
- Adequate functionality Based on:
-Adequate APIs - Clear scope of LOD2 project to support
future publishing paradigms
- Adequate Licensing and IPR
LOD2 Event . 06.10.2010 . 5
5 http://lod2.eu
6. Creating Knowledge out of Interlinked Data
WP 7 – Task Description of Task 7.1
Task 7.1: Adoption and Deployment of the LOD2 Stack for Media and Publishing (SWC):
This task is dedicated to adopting and deploying the LOD2 Stack to the data sets of Wolters
Kluwer.
These data sets cover all document types being normally used in legal publishing (laws and
regulations, court decisions, legal commentary, handbooks and journals).
The documents cover all main legal fields of law like labour law, penal law, construction law,
administration law, tax law, etc.
The data sets also cover existing legal taxonomies and thesauri, covering each a specific
field of law, e.g. labour law, family law or social law. The overall amount of data (e.g. 300.000
court decisions, 200 books) is large enough to make sure that realistic operational tasks of a
publisher can be executed with the data format and tools developed within the project in order
to support the respective use case.
The data sets will be catalogued according to various dimensions, e.g. actors,
origin,geographical coverage, temporal coverage, type of data etc. relevant to the domain of
legal information.
All data sets will be available in formats adhering to open-standards, in particular RDF and
Linked Data.
For convenience we will deploy automatic conversion tools to other data formats, such as
CSV, Excel, KML.
The data set already exists in XML format on a central server at WKD and can therefore be
contributed to the project from M1 on.
LOD2 Event . 06.10.2010 . 6
6 http://lod2.eu
7. Creating Knowledge out of Interlinked Data
WP 7 – Description of first Deliverable in Task 7.1
D7.1.1) First release of the news & media data sets
The first release will provide well structured data sets according to the
specifications of the LOD2 Stack.
It will include basic functionalities for publishing, searching, browsing
and exploring this data.
This includes facet-based browsing of data set metadata along various
dimensions (data set type, spatial/temporal coverage, origin etc.).
[month 20]
LOD2 Event . 06.10.2010 . 7
7 http://lod2.eu
8. Creating Knowledge out of Interlinked Data
WP 7 – Main challenges coming from Legal Domain
Linking
Special constructs like „§7 Abs. 1 Lit. 2 BGB“ or „BauR 2007, 124“ or „Coca-
Cola“
Structure
Special structure like in a German court decision „Leitsatz – Normenkette –
Rubrum – Tenor - Tatbestand – Gründe“
Wording
Wide usage of specialised legal terms, that are not covered with normal
dictionary like „grundsätzlich“, „Heilung“
Local world
Legislative structures are dependant on the language and country.
Differences are severe and solutions to cover whole Europe need to be
generic
LOD2 Event . 06.10.2010 . 8
8 http://lod2.eu
9. Creating Knowledge out of Interlinked Data
WP 7 - WKD content delivery
Delivered content in one format
XML with DTD WKDSC-DTD 2.8
Different document types:
- Legislation
- Jurisdiction
- Journal article
- books (commentaries and handbooks)
LOD2 Event . 06.10.2010 . 9
9 http://lod2.eu
10. Creating Knowledge out of Interlinked Data
WP 7 – Background around Use Cases
Background Information in WP1.1
Requirements Elicitation Document
LOD2 Event . 06.10.2010 . 10
10 http://lod2.eu