Rinke Hoekstra and Adianto Wibisono
VU University Amsterdam/University of Amsterdam
               rinke.hoekstra@vu.nl
What is Data2Semantics?
Next Steps...




... first a bit of background
Data2Semantics
From Data to Semantics for Scientific Data Publishers
http://www.data2semantics.org




TabLinker
Semi-Automatic RDF Converter for Eccentric Excel Files

Yasgui

Provenance Reconstruction

Complexity vs. Interestingness

Data Analysis

Partial Replication

PROV-O-Matic™
•   Python wrapper script for shell commands
    https://github.com/Data2Semantics/data/blob/master/src/d2s/prov.py
•   Output in PROV-O & W3C Time vocabulary
•   Timestamped URIs for files/resources
•   ... integrate with Git?
•   Provenance trail for conversion, loading and linking

Monday, February 27, 12

HUBBLE
Linked Data Hub for Clinical Decision Support
•   Hubble demonstrates three ‘sales pitches’ of linked data: inter-operability, interlinking and tool availability.
•   From patient to: relevant publications, related adverse events, clinical trials, drug information, known side effects, statistical analysis
•   AERS-LD: serious adverse event reports exposed as linked data
•   Google WebToolkit
•   BioPortal: Mesh, MedDRA, SnomedCT, etc.
•   SILK link specification language and PROV-O
•   Papers & Guidelines
•   BioPortal Annotator with Annotation Ontology and PROV-O
•   LOD Cloud: UMLS, DBPedia, Sider, Drugbank, LinkedCT
•   4Store

[Figure: Data2Semantics pipeline]
•   Acquiring data from text? Semi-Automatic Annotation, e.g. GATE, OpenCalais
•   Semi-Automatic Conversion (“tablinker”); xml2rdf, d2rq, rdb2rdf
•   RDF Conversion, RDF Cleaning, Internal Linking, Link to Other Data
•   Graph Rewriting; Amalgame, SILK
•   Cloud: Analysis/Metrics, Querying and Ranking
•   Visualization: sgvizler
•   User Interfaces: AIDA Browser, Poseidon (Pirates/Maps), …
•   Provenance Enrichment
•   RDF Feedback
•   Provenance
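
The PROV-O-Matic idea above (a Python wrapper around shell commands that emits PROV-O with timestamped URIs) can be illustrated with a minimal sketch. This is not the code in prov.py; the base namespace, the function names and the URI layout are assumptions made for illustration, and the W3C Time vocabulary mentioned on the slide is left out for brevity. Only standard PROV-O terms (prov:Activity, prov:Entity, prov:used, prov:wasGeneratedBy, prov:startedAtTime, prov:endedAtTime) are used.

```python
import json
import subprocess
import sys
from datetime import datetime, timezone
from urllib.parse import quote

# Hypothetical base namespace for minted URIs; not taken from the slides.
BASE = "http://example.org/d2s/"


def _stamp(when):
    """Compact UTC timestamp used inside minted URIs."""
    return when.strftime("%Y%m%dT%H%M%S%fZ")


def timestamped_uri(path, when):
    """Mint a URI for a file that embeds the time it was observed."""
    return f"{BASE}file/{quote(path, safe='')}/{_stamp(when)}"


def run_with_provenance(command, inputs=(), outputs=()):
    """Run `command` (a list of arguments); return (exit code, Turtle text)."""
    started = datetime.now(timezone.utc)
    completed = subprocess.run(command)
    ended = datetime.now(timezone.utc)

    activity = f"{BASE}activity/{_stamp(started)}"
    # json.dumps yields a quoted, escaped string that is also a valid
    # Turtle literal for ordinary command lines.
    label = json.dumps(" ".join(command), ensure_ascii=False)
    lines = [
        "@prefix prov: <http://www.w3.org/ns/prov#> .",
        "@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .",
        "@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .",
        "",
        f"<{activity}> a prov:Activity ;",
        f"    rdfs:label {label} ;",
        f'    prov:startedAtTime "{started.isoformat()}"^^xsd:dateTime ;',
        f'    prov:endedAtTime "{ended.isoformat()}"^^xsd:dateTime .',
    ]
    # Inputs are stamped with the start time, outputs with the end time.
    for path in inputs:
        uri = timestamped_uri(path, started)
        lines.append(f"<{uri}> a prov:Entity .")
        lines.append(f"<{activity}> prov:used <{uri}> .")
    for path in outputs:
        uri = timestamped_uri(path, ended)
        lines.append(f"<{uri}> a prov:Entity ; prov:wasGeneratedBy <{activity}> .")
    return completed.returncode, "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Wrap a trivial command; the file names are placeholders.
    code, turtle = run_with_provenance(
        [sys.executable, "-c", "pass"], inputs=["in.xls"], outputs=["out.ttl"]
    )
    print(turtle)
```

Because each file URI carries a timestamp, two runs over the same path yield distinct entities, which is one way to keep a provenance trail across conversion, loading and linking steps.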
Key Points

•   Build useful services and tools for data publishers ...
•   ... that maintain provenance information ...
•   ... and cater for the entire research cycle ...
•   ... including a feedback loop to new research
One of our use cases ...
•   Public-private research community
•   Emphasis on applications of IT
•   Emphasis on knowledge transfer
•   15 projects
•   Collaboration with EIT ICT-Labs
    http://www.eitictlabs.eu/




http://www.commit-nl.nl
Why VIVO?
•   Demonstrate collaboration within COMMIT/
    between projects (synergy), between organizations

•   Integrate project results with collaboration network
    shared publications, deliverables




Image: Linked Data Rubik’s Cube by Duncan Hull
Why   ?
•   Most Dutch universities
•   Large companies
•   Government organizations
The Data

•   COMMIT Website
    http://www.commit-nl.nl

•   All project plans (buzzword mining)
•   All public deliverables (~200 per year)
•   All participating persons (not just researchers)
“Pilot”
•   Scraping
•   Web Karma
    http://bit.ly/WebKarma
Future Work
•   Improve people scraper
    first name, family name, affiliation

•   Ingest other content
    deliverables, plans etc.

•   Shared ontology amongst Dutch VIVO installations
•   Shared identifiers for researchers in NL (and VIVO)
    ORCID, ResearcherID, Digital Author ID
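
As a rough illustration of the "people scraper" item above, the sketch below splits a scraped string into first name, family name and affiliation. The input layout ("Name, Affiliation"), the prefix list and the function name are assumptions made for illustration, not the project's actual scraper; "Jan van der Berg" is a made-up name. Dutch family-name prefixes such as "van der" are kept with the family name.

```python
# Common Dutch family-name prefixes (tussenvoegsels); an illustrative, incomplete list.
PREFIXES = {"van", "de", "der", "den", "het", "ten", "ter", "te", "'t"}


def parse_person(raw):
    """Split 'First [prefix] Family, Affiliation' into its three parts."""
    name, _, affiliation = raw.partition(",")
    tokens = name.split()
    if not tokens:
        raise ValueError(f"no name found in {raw!r}")

    # Default: the last token is the family name (or nothing for a single token).
    split_at = max(len(tokens) - 1, 1)
    # If a prefix occurs after the first token, the family name starts there.
    for i in range(1, len(tokens)):
        if tokens[i].lower() in PREFIXES:
            split_at = i
            break

    return {
        "first_name": " ".join(tokens[:split_at]),
        "family_name": " ".join(tokens[split_at:]),
        "affiliation": affiliation.strip(),
    }


if __name__ == "__main__":
    print(parse_person("Rinke Hoekstra, VU University Amsterdam"))
    print(parse_person("Jan van der Berg"))
```

A heuristic like this will misjudge some names, which is one reason the shared identifiers listed above (ORCID, ResearcherID, Digital Author ID) are attractive.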
Event


•   Yearly event for all COMMIT people
•   Tap into registration process to get detailed info
•   Wireless sensor networks to capture “synergy”
•   Prizes and whatnot...
VIVO Pitfalls



•   Very “institutional” perspective
•   How to actively engage individual researchers?
    Reward mechanisms, integrate with Web 2.0 practices...


http://oreilly.com/web2/archive/what-is-web-20.html (2005)
Web 2.0
•   Web applications generate your data
•   Rich user experience
•   You control your own data
•   Immediate reward
•   Quality increases by usage
Linkitup
•   Lightweight Web Application
•   Interface to API of existing data repositories
•   Enrich metadata by linking to Linked Data resources
•   Provide annotation services for data files
•   Plugin based architecture
•   Publish RDF metadata as new data publication
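
The plugin-based enrichment described in the bullets above could look roughly like the following sketch. It is not Linkitup's code: the registry, the plugin name, the dataset URI and the choice of dcterms:subject as the linking property are all assumptions made for illustration, and the example plugin only guesses a DBpedia-style URI from a tag without doing any lookup.

```python
from urllib.parse import quote

# Registry mapping plugin names to functions: tag -> list of candidate URIs.
PLUGINS = {}


def plugin(name):
    """Decorator that registers a link-suggestion function under `name`."""
    def register(func):
        PLUGINS[name] = func
        return func
    return register


@plugin("dbpedia-guess")
def dbpedia_guess(tag):
    """Guess a DBpedia-style resource URI from a tag (no network lookup)."""
    return ["http://dbpedia.org/resource/" + quote(tag.strip().replace(" ", "_"))]


def enrich(dataset_uri, tags):
    """Ask every registered plugin for links and collect them as triples."""
    triples = []
    for tag in tags:
        for name in sorted(PLUGINS):
            for target in PLUGINS[name](tag):
                triples.append(
                    (dataset_uri, "http://purl.org/dc/terms/subject", target)
                )
    return triples


def to_ntriples(triples):
    """Serialise (subject, predicate, object) URI triples as N-Triples."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


if __name__ == "__main__":
    # Hypothetical dataset URI and tag.
    print(to_ntriples(enrich("http://example.org/dataset/1", ["Linked data"])))
```

Keeping each source behind a small function makes it easy to add or drop link providers, and the serialised triples are the kind of RDF metadata that could then be published as a new data publication.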
http://linkitup.data2semantics.org
Where to publish the RDF?








Send me more!


Future Work
•   Improve people scraper
    first name, family name, affiliation

•   Ingest other content
    deliverables, plans etc.

•   Shared ontology amongst Dutch VIVO installations
•   Shared identifiers for researchers in NL
    ORCID, ResearcherID, Digital Author ID

•   ... reward mechanisms for individual authors!

Next week      COMMIT/ Data
Early March    COMMIT/ VIVO
Early April    COMMIT/ Days

http://www.data2semantics.org

More Related Content

Similar to COMMIT/VIVO

제1회 Korea Community Day 발표자료 Bigdata
제1회 Korea Community Day 발표자료 Bigdata 제1회 Korea Community Day 발표자료 Bigdata
제1회 Korea Community Day 발표자료 Bigdata Gruter
 
On demand access to Big Data through Semantic Technologies
 On demand access to Big Data through Semantic Technologies On demand access to Big Data through Semantic Technologies
On demand access to Big Data through Semantic TechnologiesPeter Haase
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET Journal
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET Journal
 
AUTOMATIC TRANSFER OF DATA USING SERVICE-ORIENTED ARCHITECTURE TO NoSQL DATAB...
AUTOMATIC TRANSFER OF DATA USING SERVICE-ORIENTED ARCHITECTURE TO NoSQL DATAB...AUTOMATIC TRANSFER OF DATA USING SERVICE-ORIENTED ARCHITECTURE TO NoSQL DATAB...
AUTOMATIC TRANSFER OF DATA USING SERVICE-ORIENTED ARCHITECTURE TO NoSQL DATAB...IRJET Journal
 
KESW2012 Linked Data for Enterprises and Governments (5 Oct 2012)
KESW2012 Linked Data for Enterprises and Governments (5 Oct 2012)KESW2012 Linked Data for Enterprises and Governments (5 Oct 2012)
KESW2012 Linked Data for Enterprises and Governments (5 Oct 2012)AI4BD GmbH
 
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...Stichting ePortfolio Support
 
Data-intensive profile for the VAMDC
Data-intensive profile for the VAMDCData-intensive profile for the VAMDC
Data-intensive profile for the VAMDCAstroAtom
 
Soeren okfn greece meetup
Soeren okfn greece meetupSoeren okfn greece meetup
Soeren okfn greece meetupOKFN-GR
 
ISWC 2012 - Industry Track: "Linked Enterprise Data: leveraging the Semantic ...
ISWC 2012 - Industry Track: "Linked Enterprise Data: leveraging the Semantic ...ISWC 2012 - Industry Track: "Linked Enterprise Data: leveraging the Semantic ...
ISWC 2012 - Industry Track: "Linked Enterprise Data: leveraging the Semantic ...Antidot
 
Jena based implementation of a iso 11179 meta data registry
Jena based implementation of a iso 11179 meta data registryJena based implementation of a iso 11179 meta data registry
Jena based implementation of a iso 11179 meta data registryA. Anil Sinaci
 
Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)Stefan Dietze
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNRDatiGovIT
 
Meeting today’s dissemination challenges – Implementing International Standar...
Meeting today’s dissemination challenges – Implementing International Standar...Meeting today’s dissemination challenges – Implementing International Standar...
Meeting today’s dissemination challenges – Implementing International Standar...Jonathan Challener
 
Introduction of big data unit 1
Introduction of big data unit 1Introduction of big data unit 1
Introduction of big data unit 1RojaT4
 
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...Gezim Sejdiu
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016StampedeCon
 
LinkedUp - Linked Data & Education
LinkedUp - Linked Data & EducationLinkedUp - Linked Data & Education
LinkedUp - Linked Data & EducationStefan Dietze
 
Enterprise linked data clouds
Enterprise linked data cloudsEnterprise linked data clouds
Enterprise linked data cloudsdamienjoyce
 
What's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked DatasetsWhat's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked DatasetsStefan Dietze
 

Similar to COMMIT/VIVO (20)

제1회 Korea Community Day 발표자료 Bigdata
제1회 Korea Community Day 발표자료 Bigdata 제1회 Korea Community Day 발표자료 Bigdata
제1회 Korea Community Day 발표자료 Bigdata
 
On demand access to Big Data through Semantic Technologies
 On demand access to Big Data through Semantic Technologies On demand access to Big Data through Semantic Technologies
On demand access to Big Data through Semantic Technologies
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
 
AUTOMATIC TRANSFER OF DATA USING SERVICE-ORIENTED ARCHITECTURE TO NoSQL DATAB...
AUTOMATIC TRANSFER OF DATA USING SERVICE-ORIENTED ARCHITECTURE TO NoSQL DATAB...AUTOMATIC TRANSFER OF DATA USING SERVICE-ORIENTED ARCHITECTURE TO NoSQL DATAB...
AUTOMATIC TRANSFER OF DATA USING SERVICE-ORIENTED ARCHITECTURE TO NoSQL DATAB...
 
KESW2012 Linked Data for Enterprises and Governments (5 Oct 2012)
KESW2012 Linked Data for Enterprises and Governments (5 Oct 2012)KESW2012 Linked Data for Enterprises and Governments (5 Oct 2012)
KESW2012 Linked Data for Enterprises and Governments (5 Oct 2012)
 
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
 
Data-intensive profile for the VAMDC
Data-intensive profile for the VAMDCData-intensive profile for the VAMDC
Data-intensive profile for the VAMDC
 
Soeren okfn greece meetup
Soeren okfn greece meetupSoeren okfn greece meetup
Soeren okfn greece meetup
 
ISWC 2012 - Industry Track: "Linked Enterprise Data: leveraging the Semantic ...
ISWC 2012 - Industry Track: "Linked Enterprise Data: leveraging the Semantic ...ISWC 2012 - Industry Track: "Linked Enterprise Data: leveraging the Semantic ...
ISWC 2012 - Industry Track: "Linked Enterprise Data: leveraging the Semantic ...
 
Jena based implementation of a iso 11179 meta data registry
Jena based implementation of a iso 11179 meta data registryJena based implementation of a iso 11179 meta data registry
Jena based implementation of a iso 11179 meta data registry
 
Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNR
 
Meeting today’s dissemination challenges – Implementing International Standar...
Meeting today’s dissemination challenges – Implementing International Standar...Meeting today’s dissemination challenges – Implementing International Standar...
Meeting today’s dissemination challenges – Implementing International Standar...
 
Introduction of big data unit 1
Introduction of big data unit 1Introduction of big data unit 1
Introduction of big data unit 1
 
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 
LinkedUp - Linked Data & Education
LinkedUp - Linked Data & EducationLinkedUp - Linked Data & Education
LinkedUp - Linked Data & Education
 
Enterprise linked data clouds
Enterprise linked data cloudsEnterprise linked data clouds
Enterprise linked data clouds
 
What's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked DatasetsWhat's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked Datasets
 

More from Rinke Hoekstra

Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the WebRinke Hoekstra
 
Managing Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseManaging Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseRinke Hoekstra
 
An Ecosystem for Linked Humanities Data
An Ecosystem for Linked Humanities DataAn Ecosystem for Linked Humanities Data
An Ecosystem for Linked Humanities DataRinke Hoekstra
 
QBer - Connect your data to the cloud
QBer - Connect your data to the cloudQBer - Connect your data to the cloud
QBer - Connect your data to the cloudRinke Hoekstra
 
Jurix 2014 welcome presentation
Jurix 2014 welcome presentationJurix 2014 welcome presentation
Jurix 2014 welcome presentationRinke Hoekstra
 
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Rinke Hoekstra
 
Prov-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationProv-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationRinke Hoekstra
 
Linkitup: Link Discovery for Research Data
Linkitup: Link Discovery for Research DataLinkitup: Link Discovery for Research Data
Linkitup: Link Discovery for Research DataRinke Hoekstra
 
A Network Analysis of Dutch Regulations - Using the Metalex Document Server
A Network Analysis of Dutch Regulations - Using the Metalex Document ServerA Network Analysis of Dutch Regulations - Using the Metalex Document Server
A Network Analysis of Dutch Regulations - Using the Metalex Document ServerRinke Hoekstra
 
Linked (Open) Data - But what does it buy me?
Linked (Open) Data - But what does it buy me?Linked (Open) Data - But what does it buy me?
Linked (Open) Data - But what does it buy me?Rinke Hoekstra
 
Linked Science - Building a Web of Research Data
Linked Science - Building a Web of Research DataLinked Science - Building a Web of Research Data
Linked Science - Building a Web of Research DataRinke Hoekstra
 
Semantic Representations for Research
Semantic Representations for ResearchSemantic Representations for Research
Semantic Representations for ResearchRinke Hoekstra
 
A Slightly Different Web of Data
A Slightly Different Web of DataA Slightly Different Web of Data
A Slightly Different Web of DataRinke Hoekstra
 
The Knowledge Reengineering Bottleneck
The Knowledge Reengineering BottleneckThe Knowledge Reengineering Bottleneck
The Knowledge Reengineering BottleneckRinke Hoekstra
 
Concept- en Definitie Extractie
Concept- en Definitie ExtractieConcept- en Definitie Extractie
Concept- en Definitie ExtractieRinke Hoekstra
 
SIKS 2011 Semantic Web Languages
SIKS 2011 Semantic Web LanguagesSIKS 2011 Semantic Web Languages
SIKS 2011 Semantic Web LanguagesRinke Hoekstra
 
The MetaLex Document Server - Legal Documents as Versioned Linked Data
The MetaLex Document Server - Legal Documents as Versioned Linked DataThe MetaLex Document Server - Legal Documents as Versioned Linked Data
The MetaLex Document Server - Legal Documents as Versioned Linked DataRinke Hoekstra
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of DataRinke Hoekstra
 
History of Knowledge Representation (SIKS Course 2010)
History of Knowledge Representation (SIKS Course 2010)History of Knowledge Representation (SIKS Course 2010)
History of Knowledge Representation (SIKS Course 2010)Rinke Hoekstra
 

More from Rinke Hoekstra (20)

Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the Web
 
Managing Metadata for Science and Technology Studies: the RISIS case
An Ecosystem for Linked Humanities Data
QBer - Connect your data to the cloud
Jurix 2014 welcome presentation
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Prov-O-Viz: Interactive Provenance Visualization
Linkitup: Link Discovery for Research Data
A Network Analysis of Dutch Regulations - Using the Metalex Document Server
Linked (Open) Data - But what does it buy me?
Linked Science - Building a Web of Research Data
Semantic Representations for Research
A Slightly Different Web of Data
The Knowledge Reengineering Bottleneck
Linked Census Data
Concept- en Definitie Extractie
SIKS 2011 Semantic Web Languages
The MetaLex Document Server - Legal Documents as Versioned Linked Data
Querying the Web of Data
History of Knowledge Representation (SIKS Course 2010)


COMMIT/VIVO

  • 1. Rinke Hoekstra and Adianto Wibisono VU University Amsterdam/University of Amsterdam rinke.hoekstra@vu.nl
  • 2. What is Data2Semantics?
  • 3. What is Data2Semantics? What is
  • 4. What is Data2Semantics? What is
  • 5. What is Data2Semantics? What is
  • 6. Next Steps... What is Data2Semantics? What is
  • 7. ... first a bit of background
  • 8. Data2Semantics: From Data to Semantics for Scientific Data Publishers http://www.data2semantics.org (poster overview, February 27, 12) • TabLinker: Semi-Automatic RDF Converter for Eccentric Excel Files • Yasgui • Provenance Reconstruction • Data Analysis: COMPLEXITY vs. INTERESTINGNESS? • PROV-O-Matic • HUBBLE
  • 9. PROV-O-Matic: Python wrapper script for shell commands https://github.com/Data2Semantics/data/blob/master/src/d2s/prov.py • Output in PROV-O & W3C Time vocabulary • Timestamped URIs for files/resources • ... integrate with GIT? • Provenance trail for conversion, loading and linking. HUBBLE: Linked Data Hub for Clinical Decision Support. Hubble demonstrates three ‘sales pitches’ of linked data: inter-operability, interlinking and tool availability. AERS-LD: serious adverse event reports exposed as linked data. From patient to: relevant publications, related adverse events, clinical trials, drug information, known side effects, statistical analysis. (Monday, February 27, 12) [Diagram labels: acquiring data from text?; semi-automatic annotation (e.g. GATE, OpenCalais); semi-automatic conversion (“tablinker”); RDF conversion (xml2rdf, d2rq, rdb2rdf); RDF cleaning; internal linking; link to other data (Amalgame, SILK); graph rewriting and ranking; querying; cloud analysis/metrics; visualization (sgvizler); user interfaces (AIDA Browser, Poseidon (Pirates/Maps)); provenance; RDF feedback; BioPortal Annotator (MeSH, MedDRA, SnomedCT); SILK link specification language; Google WebToolkit; LOD Cloud enrichment with UMLS, DBPedia, Sider, Drugbank, LinkedCT; papers & guidelines annotation; PROV-O provenance ontology; partial replication; 4Store]
  • 10. Key Points • Build useful services and tools for data publishers ... • ... that maintain provenance information ... • ... and cater for the entire research cycle ... • ... including a feedback loop to new research
  • 11. One of our use cases ...
  • 12.
  • 13. Public-private research community • Emphasis on applications of IT • Emphasis on knowledge transfer • 15 projects • Collaboration with EIT ICT-Labs http://www.eitictlabs.eu/ http://www.commit-nl.nl
  • 14. Why VIVO? • Demonstrate collaboration within COMMIT/: between projects (synergy), between organizations • Integrate project results with collaboration network: shared publications, deliverables (Image: Linked Data Rubik’s Cube by Duncan Hull)
  • 15. Why ?
  • 16. Why ? Most Dutch universities Large companies Government organizations
  • 17.
  • 18.
  • 19. The Data • COMMIT Website http://www.commit-nl.nl • All project plans (buzzword mining) • All public deliverables (~200 per year) • All participating persons (not just researchers)
  • 20. “Pilot” • Scraping • Web Karma http://bit.ly/WebKarma
  • 21.
  • 22.
  • 23.
  • 24.
  • 25. Future Work • Improve people scraper: first name, family name, affiliation • Ingest other content: deliverables, plans, etc. • Shared ontology amongst Dutch VIVO installations • Shared identifiers for researchers in NL (and VIVO): ORCID, ResearcherID, Digital Author ID
  • 26. Event • Yearly event for all COMMIT people • Tap into registration process to get detailed info • Wireless sensor networks to capture “synergy” • Prizes whatnot...
  • 27. VIVO Pitfalls • Very “institutional” perspective • How to actively engage individual researchers? Reward mechanisms, integrate with Web 2.0 practices... http://oreilly.com/web2/archive/what-is-web-20.html (2005)
  • 28. Web 2.0 • Web applications generate your data • Rich user experience • You control your own data • Immediate reward • Quality increases by usage
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 46. Lightweight Web Application • Interface to API of existing data repositories • Enrich metadata by linking to Linked Data resources • Provide annotation services for data files • Plugin based architecture • Publish RDF metadata as new data publication
  • 48. Where to publish the RDF? http://linkitup.data2semantics.org
  • 49. Where to publish the RDF? Send me more! http://linkitup.data2semantics.org
  • 50.
  • 51. Future Work • Improve people scraper: first name, family name, affiliation • Ingest other content: deliverables, plans, etc. • Shared ontology amongst Dutch VIVO installations • Shared identifiers for researchers in NL: ORCID, ResearcherID, Digital Author ID • ... reward mechanisms for individual authors! http://www.data2semantics.org
  • 52. Future Work • Improve people scraper: first name, family name, affiliation • Ingest other content: deliverables, plans, etc. • Shared ontology amongst Dutch VIVO installations • Shared identifiers for researchers in NL: ORCID, ResearcherID, Digital Author ID • ... reward mechanisms for individual authors! Next week: COMMIT/ Data • Early March: COMMIT/ VIVO • Early April: COMMIT/ Days http://www.data2semantics.org
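
Slide 9 describes PROV-O-Matic as a Python wrapper for shell commands that emits PROV-O with timestamped URIs for files. The sketch below illustrates that idea only; it is not the code in prov.py. The `BASE` namespace, the function names and the URI layout are assumptions made for illustration, and the W3C Time vocabulary mentioned on the slide is left out in favour of plain `xsd:dateTime` literals.

```python
import subprocess
from datetime import datetime, timezone
from urllib.parse import quote

# Hypothetical namespace for minted URIs; not taken from the slides.
BASE = "http://example.org/prov/"


def stamped_uri(name, when):
    """Mint a URI that embeds a UTC timestamp, so each version of a
    file or each run of a command gets its own identifier."""
    stamp = when.strftime("%Y%m%dT%H%M%S%fZ")
    return f"<{BASE}{quote(name, safe='')}/{stamp}>"


def run_with_provenance(command, inputs=(), outputs=()):
    """Run a shell command and return (exit_code, turtle), where turtle
    describes the run using PROV-O terms."""
    started = datetime.now(timezone.utc)
    result = subprocess.run(command, shell=True)
    ended = datetime.now(timezone.utc)

    activity = stamped_uri("activity", started)
    # Minimal escaping (backslashes and double quotes only) for the label.
    label = command.replace("\\", "\\\\").replace('"', '\\"')
    lines = [
        "@prefix prov: <http://www.w3.org/ns/prov#> .",
        "@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .",
        "@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .",
        "",
        f"{activity} a prov:Activity ;",
        f'    rdfs:label "{label}" ;',
        f'    prov:startedAtTime "{started.isoformat()}"^^xsd:dateTime ;',
        f'    prov:endedAtTime "{ended.isoformat()}"^^xsd:dateTime .',
    ]
    # Inputs are entities the activity used.
    for path in inputs:
        entity = stamped_uri(path, started)
        lines.append(f"{entity} a prov:Entity .")
        lines.append(f"{activity} prov:used {entity} .")
    # Outputs are entities generated by the activity.
    for path in outputs:
        entity = stamped_uri(path, ended)
        lines.append(f"{entity} a prov:Entity ; prov:wasGeneratedBy {activity} .")
    return result.returncode, "\n".join(lines)
```

Calling `run_with_provenance("echo hello", inputs=["in.csv"], outputs=["out.ttl"])` would return the command's exit code together with a Turtle document that could be loaded into a triple store alongside the converted data. The file names here are placeholders; the wrapper does not check that they exist.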