SlideShare une entreprise Scribd logo
1  sur  62
Télécharger pour lire hors ligne
AN ECOSYSTEM TO SUPPORT FAIR DATA
Luiz Olavo Bonino - luiz.bonino@dtls.nl
April 3rd 2017
FAIR DATA PRINCIPLES
Findable:
F1. (meta)data are assigned a globally unique and
persistent identifier;
F2. data are described with rich metadata;
F3. metadata clearly and explicitly include the
identifier of the data it describes;
F4. (meta)data are registered or indexed in a
searchable resource;
Accessible:
A1. (meta)data are retrievable by their identifier
using a standardized communications protocol;
A1.1 the protocol is open, free, and universally
implementable;
A1.2. the protocol allows for an authentication and
authorization procedure, where necessary;
A2. metadata are accessible, even when the data
are no longer available;
Interoperable:
I1. (meta)data use a formal, accessible,
shared, and broadly applicable language for
knowledge representation.
I2. (meta)data use vocabularies that follow
FAIR principles;
I3. (meta)data include qualified references to
other (meta)data;
Reusable:
R1. meta(data) are richly described with a
plurality of accurate and relevant attributes;
R1.1. (meta)data are released with a clear and
accessible data usage license;
R1.2. (meta)data are associated with detailed
provenance;
R1.3. (meta)data meet domain-relevant
community standards;
FAIR DATA PRINCIPLES - METADATA
Findable:
F1. metadata are assigned a globally unique and
persistent identifier;
F2. data are described with rich metadata;
F3. metadata clearly and explicitly include the
identifier of the data it describes;
F4. (meta)data are registered or indexed in a
searchable resource;
Accessible:
A1. metadata are retrievable by their identifier
using a standardized communications protocol;
A1.1 the protocol is open, free, and universally
implementable;
A1.2. the protocol allows for an authentication
and authorization procedure, where necessary;
A2. metadata are accessible, even when the
data are no longer available;
Interoperable:
I1. metadata use a formal, accessible, shared,
and broadly applicable language for
knowledge representation;
I2. metadata use vocabularies that follow FAIR
principles;
I3. metadata include qualified references to
other (meta)data;
Reusable:
R1. metadata are richly described with a
plurality of accurate and relevant attributes;
R1.1. metadata are released with a clear and
accessible data usage license;
R1.2. metadata are associated with detailed
provenance;
R1.3. metadata meet domain-relevant
community standards;
FAIR DATA PRINCIPLES - DATA
Findable:
F1. data are assigned a globally unique and
persistent identifier;
F2. data are described with rich metadata;
F3. metadata clearly and explicitly include the
identifier of the data it describes;
F4. (meta)data are registered or indexed in a
searchable resource;
Accessible:
A1. data are retrievable by their identifier using a
standardized communications protocol;
A1.1 the protocol is open, free, and universally
implementable;
A1.2. the protocol allows for an authentication
and authorization procedure, where necessary;
A2. metadata are accessible, even when the
data are no longer available;
Interoperable:
I1. data use a formal, accessible, shared, and
broadly applicable language for knowledge
representation;
I2. data use vocabularies that follow FAIR
principles;
I3. data include qualified references to other
(meta)data;
Reusable:
R1. data are richly described with a plurality of
accurate and relevant attributes;
R1.1. data are released with a clear and
accessible data usage license;
R1.2. data are associated with detailed
provenance;
R1.3. data meet domain-relevant community
standards;
FAIR DATA PRINCIPLES - SUPPORTING INFRASTRUCTURE
Findable:
F1. (meta)data are assigned a globally unique and
persistent identifier;
F2. data are described with rich metadata;
F3. metadata clearly and explicitly include the
identifier of the data it describes;
F4. (meta)data are registered or indexed in a
searchable resource;
Accessible:
A1. (meta)data are retrievable by their identifier
using a standardized communications protocol;
A1.1 the protocol is open, free, and universally
implementable;
A1.2. the protocol allows for an authentication and
authorization procedure, where necessary;
A2. metadata are accessible, even when the data
are no longer available;
Interoperable:
I1. (meta)data use a formal, accessible,
shared, and broadly applicable language for
knowledge representation.
I2. (meta)data use vocabularies that follow
FAIR principles;
I3. (meta)data include qualified references to
other (meta)data;
Reusable:
R1. meta(data) are richly described with a
plurality of accurate and relevant attributes;
R1.1. (meta)data are released with a clear and
accessible data usage license;
R1.2. (meta)data are associated with detailed
provenance;
R1.3. (meta)data meet domain-relevant
community standards;
FAIR transformation FAIR transformation
Analysis transformation Analysis transformation
FAIR DATA ECOSYSTEM (DTL)
Create Publish AnnotateFind
011001
1
110010
1
100110
0
BYOD FAIR Hackathon
FAIR DATA ECOSYSTEM (DTL)
Create Publish AnnotateFind
011001
1
110010
1
100110
0
DataFAIRport
DTL
BRING YOUR OWN DATA - BYOD
■ Goals:
■ Learn how to make data linkable “hands-on” with experts
■ Create a “telling story” to demonstrate its use
■ Make FAIR Data at the source
■ Composition:
■ Data owners – specialists on given datasets
■ Data interoperability experts
■ Domain experts
Source: Marcos Roos
Domain Expert
Data Owner FAIR Data Expert
BYOD
BYOD
BYOD Planning
Preparation Execution Follow Up
BYOD Planning
Preparation
Identify Plan
Datasets
Attendees' profile
Output data access
Tentative dates
Tentative venue
Costs
Funds
Coordination
Set date
Invite attendees
Set venue
Catering
Lodging
Financial planning
Publicity
Working document
Preparatory calls
Data hosting
Software hosting
Documentation hosting
BYOD Planning
Execution
Day One
Introduction
SW, LD, Ontology intro
Use case intro
Workgroups division
Working sessions
WWW/TTTALA
Day Two
Progress report
Working sessions
Groups reports
WWW/TTTALA
Day Three
Data integration
Answer driving question
Explore data
Demo improvement
Final report
WWW/TTTALA
BYOD Planning
Follow-Up
D+15
Report difficulties
Clarifications
Next steps
D+45
Report difficulties
Clarifications
Next steps
Implementation
Expand FAIRification
Implement solution
Scale-up solution
Deploy
Based on OpenRefine
FAIR DATA PRINCIPLES - METADATA
Findable:
F1. metadata are assigned a globally unique and
persistent identifier;
F2. data are described with rich metadata;
F3. metadata clearly and explicitly include the
identifier of the data it describes;
F4. (meta)data are registered or indexed in a
searchable resource;
Accessible:
A1. metadata are retrievable by their identifier
using a standardized communications protocol;
A1.1 the protocol is open, free, and universally
implementable;
A1.2. the protocol allows for an authentication
and authorization procedure, where necessary;
A2. metadata are accessible, even when the
data are no longer available;
Interoperable:
I1. metadata use a formal, accessible, shared,
and broadly applicable language for
knowledge representation;
I2. metadata use vocabularies that follow FAIR
principles;
I3. metadata include qualified references to
other (meta)data;
Reusable:
R1. metadata are richly described with a
plurality of accurate and relevant attributes;
R1.1. metadata are released with a clear and
accessible data usage license;
R1.2. metadata are associated with detailed
provenance;
R1.3. metadata meet domain-relevant
community standards;
FAIRIFICATION
FAIR Data Resource
submit generate
Generic
semantic
model
FAIRIFIER
■ Transform non-FAIR datasets into FAIR Data Resources
(dataset in FAIR format, license and metadata)
■ Data munging
■ Semantic modeling
■ License definition
■ Metadata definition and extraction
■ Data publication
FAIRIFIER
FAIRIFICATION PROCESS
■ Retrieve original data
■ Dataset identification and analysis
■ Definition of the semantic model
■ Data transformation
■ License assignment
■ Metadata definition
■ FAIR Data resource (data, metadata, license)
deployment
FAIRIFICATION
FAIR Data Resource
submit generate
Semantic
model
FAIRIFICATION - NEW DATASET TYPE
FAIR Data Resource
submit generate
FAIR Data
Model Registry
store
Semantic
Model &
Non-FAIR
- FAIR
mapping
FAIRIFICATION - RECURRING DATASET TYPE
FAIR Data Resource
submit generate
FAIR Data
Model Registry
query
Semantic
Model &
Non-FAIR
- FAIR
mapping
retrieve
FAIR DATA PRINCIPLES - DATA
Findable:
F1. data are assigned a globally unique and
persistent identifier;
F2. data are described with rich metadata;
F3. metadata clearly and explicitly include the
identifier of the data it describes;
F4. (meta)data are registered or indexed in a
searchable resource;
Accessible:
A1. data are retrievable by their identifier using a
standardized communications protocol;
A1.1 the protocol is open, free, and universally
implementable;
A1.2. the protocol allows for an authentication
and authorization procedure, where necessary;
A2. metadata are accessible, even when the
data are no longer available;
Interoperable:
I1. data use a formal, accessible, shared, and
broadly applicable language for knowledge
representation;
I2. data use vocabularies that follow FAIR
principles;
I3. data include qualified references to other
(meta)data;
Reusable:
R1. data are richly described with a plurality of
accurate and relevant attributes;
R1.1. data are released with a clear and
accessible data usage license;
R1.2. data are associated with detailed
provenance;
R1.3. data meet domain-relevant community
standards;
FAIR Data Point metadata
Title
Responsible institution(s)
Contact
FAIR API version
License
…
FAIR Data Point metadata
Catalog metadata
Title
Theme taxonomy
Issued date
…
DCAT
FAIR Data Point metadata
Catalog 1 metadata
Dataset metadata
Title
Publisher
License
Theme(s)
Version
…
DCAT/HCLS
FAIR Data Point metadata
Catalog 1 metadata
Dataset 1 metadata
Distribution metadata
Title
Media type
Download/access URL
License
…
DCAT
FAIR Data Point metadata
Catalog metadata
Dataset metadata
Distribution
metadata
Data record metadata
Type
Domain
Range
…
RML
FAIR Data Point metadata
Catalog 2
metadata
Catalog 1 metadata
Dataset 1 metadata
Distribution
1.a
metadata
Data record
metadata
Distribution
1.b
metadata
Dataset 2 metadata
Distribution
2.a
metadata
Data record
metadata
Distribution
2.b
metadata
Dataset 3 metadata
Distribution
3.a
metadata
Data record
metadata
METADATA LAYERS
Data Repository (FDP)
(Dataset) Catalog(s)
Dataset
Distribution
Data Record
FAIR DATA PRINCIPLES - SUPPORTING INFRASTRUCTURE
Findable:
F1. (meta)data are assigned a globally unique and
persistent identifier;
F2. data are described with rich metadata;
F3. metadata clearly and explicitly include the
identifier of the data it describes;
F4. (meta)data are registered or indexed in a
searchable resource;
Accessible:
A1. (meta)data are retrievable by their identifier
using a standardized communications protocol;
A1.1 the protocol is open, free, and universally
implementable;
A1.2. the protocol allows for an authentication and
authorization procedure, where necessary;
A2. metadata are accessible, even when the data
are no longer available;
Interoperable:
I1. (meta)data use a formal, accessible,
shared, and broadly applicable language for
knowledge representation.
I2. (meta)data use vocabularies that follow
FAIR principles;
I3. (meta)data include qualified references to
other (meta)data;
Reusable:
R1. meta(data) are richly described with a
plurality of accurate and relevant attributes;
R1.1. (meta)data are released with a clear and
accessible data usage license;
R1.2. (meta)data are associated with detailed
provenance;
R1.3. (meta)data meet domain-relevant
community standards;
FAIR DATA POINT - GUI - FOR TECHIES
FAIR DATA POINT - GUI - FOR “NORMAL" PEOPLE
}
}
Repository
metadata
Catalog
metadata
summary
FAIR DATA POINT - GUI
}
}
Repository
metadata
Catalog
metadata
summary
}
Dataset/
distribution
metadata
summary
}Catalog
metadata
FAIR DATA POINT - GUI - DATASET
FAIR DATA POINT
EXISTING DATA REPOSITORIES
EXTENDING EXISTING DATA REPOSITORIES
+
FAIR HACKATHON - GOALS
■ Align solutions with FAIR Data Point specifications.
■ Metadata content
■ API
■ Data
FAIR HACKATHON OUTCOME
■ FAIR data model for solutions content;
■ Architecture of the required adjustments/extensions;
■ Technical specification of the adjustments/extensions;
■ Proof-of-concept of the adjusted solution;
FDP-COMPLIANT (BETA) SOLUTIONS
RDRF
FAIR transformation FAIR transformation
Analysis transformation Analysis transformation
0110011
1100101
1001100
metadata
index
retrieves
metadata
search
interfaces
(GUI and API)
■ Allow third-party annotation on existing knowledge
bases
■ Capture the provenance of the annotator and the
original statement
Open RDF
Knowledge AnnotatorORKA
DEMO: HTTP://DEV-VM.FAIR-DTLS.SURF-HOSTED.NL:8080/#/
DEMO: HTTP://DEV-VM.FAIR-DTLS.SURF-HOSTED.NL:8080/#/
DEMO: HTTP://DEV-VM.FAIR-DTLS.SURF-HOSTED.NL:8080/#/
ANNOTATIONS GO TO NANOPUB STORE
■ A particular class of FAIR Data System to provide
support for data interoperability;
■ Supports publication and access to FAIR data.
■ Fosters an ecosystems of applications and services;
■ Federated architecture: different FAIRports (and other
FAIR Data Systems) are interconnectable;
■ Supports citations of datasets and data items;
■ Provides metrics for data usage and citation;
F A I
R
QUESTIONS?
Luiz Bonino
luiz.bonino@dtls.nl
www.dtls.nl
METADATA LAYERS
Data Repository (FDP)
(Dataset) Catalog(s)
Dataset
Distribution
Data Record
DCAT/HCLS
RML
METADATA LAYERS’ EXTENSIONS - VOCABULARIES
Data Repository (FDP)
(Dataset) Catalog(s)
Dataset
Distribution
Data Record
METADATA LAYERS’ EXTENSIONS - VOCABULARIES
DCAT
dcat:publisher
biosch:organization
"@context": "http://schema.org",
"@type": "NGO",
"address": {
"@type": "PostalAddress",
"addressLocality": "Utrecht, The Netherlands"
"postalCode": “3511 GC",
"streetAddress": “Catharijnesingel 54"
},
"email": “info(at)dtls.nl",
"@type": “Organization”,
“@type”: “not-for-profit”,
"name": “Dutch Techncentre for Life Sciences",
"telephone": "( 31) 85 30 30 711"
METADATA LAYERS’ EXTENSIONS - VOCABULARIES
dbpedia: biobank
edam: biobank
METADATA LAYERS’ EXTENSIONS - EXTENDED MODEL
Data Repository (FDP)
(Dataset) Catalog(s)
Dataset
Distribution
Data Record
DatA Tag Suite
(DATS)
PROV
DatA Tag Suite
(DATS)
Dataset
Publication
citations primaryPublications

Contenu connexe

Tendances

Towards FAIR principles for research software @ FAIR Software Session, Nation...
Towards FAIR principles for research software @ FAIR Software Session, Nation...Towards FAIR principles for research software @ FAIR Software Session, Nation...
Towards FAIR principles for research software @ FAIR Software Session, Nation...
annalenalamprecht
 
Metadata & controlled vocabulary
Metadata & controlled vocabularyMetadata & controlled vocabulary
Metadata & controlled vocabulary
Daryl Superio
 
香港六合彩
香港六合彩香港六合彩
香港六合彩
shujia
 
AD_FTKX_BRO_ENG_19Nov2014
AD_FTKX_BRO_ENG_19Nov2014AD_FTKX_BRO_ENG_19Nov2014
AD_FTKX_BRO_ENG_19Nov2014
Leonard Cibelli
 

Tendances (20)

Towards FAIR principles for research software @ FAIR Software Session, Nation...
Towards FAIR principles for research software @ FAIR Software Session, Nation...Towards FAIR principles for research software @ FAIR Software Session, Nation...
Towards FAIR principles for research software @ FAIR Software Session, Nation...
 
Why institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRWhy institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIR
 
Mendeley Data FAIR hackathon
Mendeley Data FAIR hackathonMendeley Data FAIR hackathon
Mendeley Data FAIR hackathon
 
CARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practiceCARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practice
 
Webinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management PlanningWebinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management Planning
 
Meta data
Meta dataMeta data
Meta data
 
Preparing Data for Sharing: The FAIR Principles
Preparing Data for Sharing: The FAIR PrinciplesPreparing Data for Sharing: The FAIR Principles
Preparing Data for Sharing: The FAIR Principles
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge Graphs
 
Metadata & controlled vocabulary
Metadata & controlled vocabularyMetadata & controlled vocabulary
Metadata & controlled vocabulary
 
香港六合彩
香港六合彩香港六合彩
香港六合彩
 
Open Science goes FAIR
Open Science goes FAIROpen Science goes FAIR
Open Science goes FAIR
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
The necessity of metadata for linked open data and its contribution to policy...
The necessity of metadata for linked open data and its contribution to policy...The necessity of metadata for linked open data and its contribution to policy...
The necessity of metadata for linked open data and its contribution to policy...
 
FAIR data and data management
FAIR data and data managementFAIR data and data management
FAIR data and data management
 
AD_FTKX_BRO_ENG_19Nov2014
AD_FTKX_BRO_ENG_19Nov2014AD_FTKX_BRO_ENG_19Nov2014
AD_FTKX_BRO_ENG_19Nov2014
 
FAIR Data Management and FAIR Data Sharing
FAIR Data Management and FAIR Data SharingFAIR Data Management and FAIR Data Sharing
FAIR Data Management and FAIR Data Sharing
 
Research data management for historians
Research data management for historiansResearch data management for historians
Research data management for historians
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
Metadata an overview
Metadata an overviewMetadata an overview
Metadata an overview
 

Similaire à An ecosystem to support FAIR data

Fair data vs 5 star open data final
Fair data vs 5 star open data finalFair data vs 5 star open data final
Fair data vs 5 star open data final
Syed Muhammad Ali Hasnain
 

Similaire à An ecosystem to support FAIR data (20)

FAIR Data ecosystem
FAIR Data ecosystemFAIR Data ecosystem
FAIR Data ecosystem
 
FAIR Ecosystem - Health RI 2017
FAIR Ecosystem - Health RI 2017FAIR Ecosystem - Health RI 2017
FAIR Ecosystem - Health RI 2017
 
Towards cross-domain interoperation in the internet of FAIR data and services
Towards cross-domain interoperation in the internet of FAIR data and servicesTowards cross-domain interoperation in the internet of FAIR data and services
Towards cross-domain interoperation in the internet of FAIR data and services
 
Increasing the Reputation of your Published Data on the Web
Increasing the Reputation of your Published Data on the WebIncreasing the Reputation of your Published Data on the Web
Increasing the Reputation of your Published Data on the Web
 
Kr slides fair astronomy 20181019
Kr slides fair astronomy 20181019Kr slides fair astronomy 20181019
Kr slides fair astronomy 20181019
 
04 findable imming
04 findable imming04 findable imming
04 findable imming
 
Fair data principles for AOASG
Fair data principles for AOASGFair data principles for AOASG
Fair data principles for AOASG
 
FAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologiesFAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologies
 
Fair data vs 5 star open data final
Fair data vs 5 star open data finalFair data vs 5 star open data final
Fair data vs 5 star open data final
 
Science in the open, what does it take?
Science in the open, what does it take?Science in the open, what does it take?
Science in the open, what does it take?
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
 
FAIR data overview
FAIR data overviewFAIR data overview
FAIR data overview
 
FAIR in relation to drone and geosaptial data
FAIR in relation to drone and geosaptial dataFAIR in relation to drone and geosaptial data
FAIR in relation to drone and geosaptial data
 
06 interoperable neale
06 interoperable neale06 interoperable neale
06 interoperable neale
 
FAIR-Principles-and-ELN.pdf
FAIR-Principles-and-ELN.pdfFAIR-Principles-and-ELN.pdf
FAIR-Principles-and-ELN.pdf
 
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
 
DataCite and its Members: Connecting Research and Identifying Knowledge
DataCite and its Members: Connecting Research and Identifying KnowledgeDataCite and its Members: Connecting Research and Identifying Knowledge
DataCite and its Members: Connecting Research and Identifying Knowledge
 
FAIR data
FAIR dataFAIR data
FAIR data
 
OSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data setsOSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data sets
 
VODAN Africa IN.pptx
VODAN Africa IN.pptxVODAN Africa IN.pptx
VODAN Africa IN.pptx
 

Plus de Blue BRIDGE

Plus de Blue BRIDGE (20)

PerformFISH: Consumer Driven Production - Integrating Innovative Approaches f...
PerformFISH: Consumer Driven Production - Integrating Innovative Approaches f...PerformFISH: Consumer Driven Production - Integrating Innovative Approaches f...
PerformFISH: Consumer Driven Production - Integrating Innovative Approaches f...
 
BlueBRIDGE supporting education
BlueBRIDGE supporting educationBlueBRIDGE supporting education
BlueBRIDGE supporting education
 
LME: LEARN & IOC Capacity Building Activities
LME: LEARN & IOC Capacity Building ActivitiesLME: LEARN & IOC Capacity Building Activities
LME: LEARN & IOC Capacity Building Activities
 
Machine Learning methods to estimate the performance of aquafarms
Machine Learning methods to estimate the performance of aquafarms Machine Learning methods to estimate the performance of aquafarms
Machine Learning methods to estimate the performance of aquafarms
 
Environmental observation data to detect aquaculture structures: merging Cope...
Environmental observation data to detect aquaculture structures: merging Cope...Environmental observation data to detect aquaculture structures: merging Cope...
Environmental observation data to detect aquaculture structures: merging Cope...
 
Application of Earth Observation (EO) Data for Detection, Characterization an...
Application of Earth Observation (EO) Data for Detection, Characterization an...Application of Earth Observation (EO) Data for Detection, Characterization an...
Application of Earth Observation (EO) Data for Detection, Characterization an...
 
Capacity building, validation and repeatability
Capacity building, validation and repeatabilityCapacity building, validation and repeatability
Capacity building, validation and repeatability
 
Fostering global data management with public tuna fisheries data
Fostering global data management with public tuna fisheries dataFostering global data management with public tuna fisheries data
Fostering global data management with public tuna fisheries data
 
Understanding biodiversity features in marine protected areas
Understanding biodiversity features in marine protected areasUnderstanding biodiversity features in marine protected areas
Understanding biodiversity features in marine protected areas
 
Panel discussion on Global Repositories of Merged Public Data
Panel discussion on Global Repositories of Merged Public DataPanel discussion on Global Repositories of Merged Public Data
Panel discussion on Global Repositories of Merged Public Data
 
Invasive species and climate change
Invasive species and climate changeInvasive species and climate change
Invasive species and climate change
 
Blue Skills
Blue SkillsBlue Skills
Blue Skills
 
The BIG picture - Advanced data visualization for SDG, basic stock assessment...
The BIG picture - Advanced data visualization for SDG, basic stock assessment...The BIG picture - Advanced data visualization for SDG, basic stock assessment...
The BIG picture - Advanced data visualization for SDG, basic stock assessment...
 
Global Record of Stocks and Fisheries (GRFS)
Global Record of Stocks and Fisheries (GRFS)Global Record of Stocks and Fisheries (GRFS)
Global Record of Stocks and Fisheries (GRFS)
 
Projecting global fish stocks and catches up to 2100
Projecting global fish stocks and catches up to 2100Projecting global fish stocks and catches up to 2100
Projecting global fish stocks and catches up to 2100
 
BlueBRIDGE: Major Achievements & future vision
BlueBRIDGE: Major Achievements & future visionBlueBRIDGE: Major Achievements & future vision
BlueBRIDGE: Major Achievements & future vision
 
Managing tuna fisheries data at a global scale: the Tuna Atlas VRE
Managing tuna fisheries data at a global scale: the Tuna Atlas VREManaging tuna fisheries data at a global scale: the Tuna Atlas VRE
Managing tuna fisheries data at a global scale: the Tuna Atlas VRE
 
SeaDataCloud – further developing the pan-European SeaDataNet infrastructure ...
SeaDataCloud – further developing the pan-European SeaDataNet infrastructure ...SeaDataCloud – further developing the pan-European SeaDataNet infrastructure ...
SeaDataCloud – further developing the pan-European SeaDataNet infrastructure ...
 
The BlueBRIDGE Project - Pasquale Pagano
The BlueBRIDGE Project - Pasquale PaganoThe BlueBRIDGE Project - Pasquale Pagano
The BlueBRIDGE Project - Pasquale Pagano
 
Thematic clouds for EOSC : The Food Cloud and the Blue Cloud
Thematic clouds for EOSC: The Food Cloud and the Blue Cloud�Thematic clouds for EOSC: The Food Cloud and the Blue Cloud�
Thematic clouds for EOSC : The Food Cloud and the Blue Cloud
 

Dernier

Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 

Dernier (20)

Connecting the Dots in Product Design at KAYAK
Connecting the Dots in Product Design at KAYAKConnecting the Dots in Product Design at KAYAK
Connecting the Dots in Product Design at KAYAK
 
Buy Epson EcoTank L3210 Colour Printer Online.pptx
Buy Epson EcoTank L3210 Colour Printer Online.pptxBuy Epson EcoTank L3210 Colour Printer Online.pptx
Buy Epson EcoTank L3210 Colour Printer Online.pptx
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
The UX of Automation by AJ King, Senior UX Researcher, Ocado
The UX of Automation by AJ King, Senior UX Researcher, OcadoThe UX of Automation by AJ King, Senior UX Researcher, Ocado
The UX of Automation by AJ King, Senior UX Researcher, Ocado
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
THE BEST IPTV in GERMANY for 2024: IPTVreel
THE BEST IPTV in  GERMANY for 2024: IPTVreelTHE BEST IPTV in  GERMANY for 2024: IPTVreel
THE BEST IPTV in GERMANY for 2024: IPTVreel
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 

An ecosystem to support FAIR data

  • 1. AN ECOSYSTEM TO SUPPORT FAIR DATA Luiz Olavo Bonino - luiz.bonino@dtls.nl April 3rd 2017
  • 2. FAIR DATA PRINCIPLES Findable: F1. (meta)data are assigned a globally unique and persistent identifier; F2. data are described with rich metadata; F3. metadata clearly and explicitly include the identifier of the data it describes; F4. (meta)data are registered or indexed in a searchable resource; Accessible: A1. (meta)data are retrievable by their identifier using a standardized communications protocol; A1.1 the protocol is open, free, and universally implementable; A1.2. the protocol allows for an authentication and authorization procedure, where necessary; A2. metadata are accessible, even when the data are no longer available; Interoperable: I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. I2. (meta)data use vocabularies that follow FAIR principles; I3. (meta)data include qualified references to other (meta)data; Reusable: R1. meta(data) are richly described with a plurality of accurate and relevant attributes; R1.1. (meta)data are released with a clear and accessible data usage license; R1.2. (meta)data are associated with detailed provenance; R1.3. (meta)data meet domain-relevant community standards;
  • 3. FAIR DATA PRINCIPLES - METADATA Findable: F1. metadata are assigned a globally unique and persistent identifier; F2. data are described with rich metadata; F3. metadata clearly and explicitly include the identifier of the data it describes; F4. (meta)data are registered or indexed in a searchable resource; Accessible: A1. metadata are retrievable by their identifier using a standardized communications protocol; A1.1 the protocol is open, free, and universally implementable; A1.2. the protocol allows for an authentication and authorization procedure, where necessary; A2. metadata are accessible, even when the data are no longer available; Interoperable: I1. metadata use a formal, accessible, shared, and broadly applicable language for knowledge representation; I2. metadata use vocabularies that follow FAIR principles; I3. metadata include qualified references to other (meta)data; Reusable: R1. metadata are richly described with a plurality of accurate and relevant attributes; R1.1. metadata are released with a clear and accessible data usage license; R1.2. metadata are associated with detailed provenance; R1.3. metadata meet domain-relevant community standards;
  • 4. FAIR DATA PRINCIPLES - DATA Findable: F1. data are assigned a globally unique and persistent identifier; F2. data are described with rich metadata; F3. metadata clearly and explicitly include the identifier of the data it describes; F4. (meta)data are registered or indexed in a searchable resource; Accessible: A1. data are retrievable by their identifier using a standardized communications protocol; A1.1 the protocol is open, free, and universally implementable; A1.2. the protocol allows for an authentication and authorization procedure, where necessary; A2. metadata are accessible, even when the data are no longer available; Interoperable: I1. data use a formal, accessible, shared, and broadly applicable language for knowledge representation; I2. data use vocabularies that follow FAIR principles; I3. data include qualified references to other (meta)data; Reusable: R1. data are richly described with a plurality of accurate and relevant attributes; R1.1. data are released with a clear and accessible data usage license; R1.2. data are associated with detailed provenance; R1.3. data meet domain-relevant community standards;
  • 5. FAIR DATA PRINCIPLES - SUPPORTING INFRASTRUCTURE Findable: F1. (meta)data are assigned a globally unique and persistent identifier; F2. data are described with rich metadata; F3. metadata clearly and explicitly include the identifier of the data it describes; F4. (meta)data are registered or indexed in a searchable resource; Accessible: A1. (meta)data are retrievable by their identifier using a standardized communications protocol; A1.1 the protocol is open, free, and universally implementable; A1.2. the protocol allows for an authentication and authorization procedure, where necessary; A2. metadata are accessible, even when the data are no longer available; Interoperable: I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. I2. (meta)data use vocabularies that follow FAIR principles; I3. (meta)data include qualified references to other (meta)data; Reusable: R1. meta(data) are richly described with a plurality of accurate and relevant attributes; R1.1. (meta)data are released with a clear and accessible data usage license; R1.2. (meta)data are associated with detailed provenance; R1.3. (meta)data meet domain-relevant community standards;
  • 6. FAIR transformation FAIR transformation Analysis transformation Analysis transformation
  • 7. FAIR DATA ECOSYSTEM (DTL) Create Publish AnnotateFind 011001 1 110010 1 100110 0 BYOD FAIR Hackathon
  • 8. FAIR DATA ECOSYSTEM (DTL) Create Publish AnnotateFind 011001 1 110010 1 100110 0 DataFAIRport DTL
  • 9. BRING YOUR OWN DATA - BYOD ■ Goals: ■ Learn how to make data linkable “hands-on” with experts ■ Create a “telling story” to demonstrate its use ■ Make FAIR Data at the source ■ Composition: ■ Data owners – specialists on given datasets ■ Data interoperability experts ■ Domain experts Source: Marcos Roos
  • 10. Domain Expert Data Owner FAIR Data Expert BYOD
  • 11. BYOD
  • 13. BYOD Planning Preparation Identify Plan Datasets Attendees' profile Output data access Tentative dates Tentative venue Costs Funds Coordination Set date Invite attendees Set venue Catering Lodging Financial planning Publicity Working document Preparatory calls Data hosting Software hosting Documentation hosting
  • 14. BYOD Planning Execution Day One Introduction SW, LD, Ontology intro Use case intro Workgroups division Working sessions WWW/TTTALA Day Two Progress report Working sessions Groups reports WWW/TTTALA Day Three Data integration Answer driving question Explore data Demo improvement Final report WWW/TTTALA
  • 15. BYOD Planning Follow-Up D+15 Report difficulties Clarifications Next steps D+45 Report difficulties Clarifications Next steps Implementation Expand FAIRification Implement solution Scale-up solution Deploy
  • 17. FAIR DATA PRINCIPLES - METADATA Findable: F1. metadata are assigned a globally unique and persistent identifier; F2. data are described with rich metadata; F3. metadata clearly and explicitly include the identifier of the data it describes; F4. (meta)data are registered or indexed in a searchable resource; Accessible: A1. metadata are retrievable by their identifier using a standardized communications protocol; A1.1 the protocol is open, free, and universally implementable; A1.2. the protocol allows for an authentication and authorization procedure, where necessary; A2. metadata are accessible, even when the data are no longer available; Interoperable: I1. metadata use a formal, accessible, shared, and broadly applicable language for knowledge representation; I2. metadata use vocabularies that follow FAIR principles; I3. metadata include qualified references to other (meta)data; Reusable: R1. metadata are richly described with a plurality of accurate and relevant attributes; R1.1. metadata are released with a clear and accessible data usage license; R1.2. metadata are associated with detailed provenance; R1.3. metadata meet domain-relevant community standards;
  • 18. FAIRIFICATION FAIR Data Resource submit generate Generic semantic model
  • 19.
  • 20. FAIRIFIER ■ Transform non-FAIR datasets into FAIR Data Resources (dataset in FAIR format, license and metadata) ■ Data munging ■ Semantic modeling ■ License definition ■ Metadata definition and extraction ■ Data publication
  • 22. FAIRIFICATION PROCESS ■ Retrieve original data ■ Dataset identification and analysis ■ Definition of the semantic model ■ Data transformation ■ License assignment ■ Metadata definition ■ FAIR Data resource (data, metadata, license) deployment
  • 23. FAIRIFICATION FAIR Data Resource submit generate Semantic model
  • 24. FAIRIFICATION - NEW DATASET TYPE FAIR Data Resource submit generate FAIR Data Model Registry store Semantic Model & Non-FAIR - FAIR mapping
  • 25. FAIRIFICATION - RECURRING DATASET TYPE FAIR Data Resource submit generate FAIR Data Model Registry query Semantic Model & Non-FAIR - FAIR mapping retrieve
  • 26.
  • 27. FAIR DATA PRINCIPLES - DATA Findable: F1. data are assigned a globally unique and persistent identifier; F2. data are described with rich metadata; F3. metadata clearly and explicitly include the identifier of the data it describes; F4. (meta)data are registered or indexed in a searchable resource; Accessible: A1. data are retrievable by their identifier using a standardized communications protocol; A1.1 the protocol is open, free, and universally implementable; A1.2. the protocol allows for an authentication and authorization procedure, where necessary; A2. metadata are accessible, even when the data are no longer available; Interoperable: I1. data use a formal, accessible, shared, and broadly applicable language for knowledge representation; I2. data use vocabularies that follow FAIR principles; I3. data include qualified references to other (meta)data; Reusable: R1. data are richly described with a plurality of accurate and relevant attributes; R1.1. data are released with a clear and accessible data usage license; R1.2. data are associated with detailed provenance; R1.3. data meet domain-relevant community standards;
  • 28.
  • 29. FAIR Data Point metadata Title Responsible institution(s) Contact FAIR API version License …
  • 30. FAIR Data Point metadata Catalog metadata Title Theme taxonomy Issued date … DCAT
  • 31. FAIR Data Point metadata Catalog 1 metadata Dataset metadata Title Publisher License Theme(s) Version … DCAT/HCLS
  • 32. FAIR Data Point metadata Catalog 1 metadata Dataset 1 metadata Distribution metadata Title Media type Download/access URL License … DCAT
  • 33. FAIR Data Point metadata Catalog metadata Dataset metadata Distribution metadata Data record metadata Type Domain Range … RML
  • 34. FAIR Data Point metadata Catalog 2 metadata Catalog 1 metadata Dataset 1 metadata Distribution 1.a metadata Data record metadata Distribution 1.b metadata Dataset 2 metadata Distribution 2.a metadata Data record metadata Distribution 2.b metadata Dataset 3 metadata Distribution 3.a metadata Data record metadata
  • 35. METADATA LAYERS Data Repository (FDP) (Dataset) Catalog(s) Dataset Distribution Data Record
  • 36. FAIR DATA PRINCIPLES - SUPPORTING INFRASTRUCTURE Findable: F1. (meta)data are assigned a globally unique and persistent identifier; F2. data are described with rich metadata; F3. metadata clearly and explicitly include the identifier of the data it describes; F4. (meta)data are registered or indexed in a searchable resource; Accessible: A1. (meta)data are retrievable by their identifier using a standardized communications protocol; A1.1 the protocol is open, free, and universally implementable; A1.2. the protocol allows for an authentication and authorization procedure, where necessary; A2. metadata are accessible, even when the data are no longer available; Interoperable: I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. I2. (meta)data use vocabularies that follow FAIR principles; I3. (meta)data include qualified references to other (meta)data; Reusable: R1. meta(data) are richly described with a plurality of accurate and relevant attributes; R1.1. (meta)data are released with a clear and accessible data usage license; R1.2. (meta)data are associated with detailed provenance; R1.3. (meta)data meet domain-relevant community standards;
  • 37. FAIR DATA POINT - GUI - FOR TECHIES
  • 38. FAIR DATA POINT - GUI - FOR “NORMAL" PEOPLE } } Repository metadata Catalog metadata summary
  • 39. FAIR DATA POINT - GUI } } Repository metadata Catalog metadata summary } Dataset/ distribution metadata summary }Catalog metadata
  • 40. FAIR DATA POINT - GUI - DATASET
  • 43. EXTENDING EXISTING DATA REPOSITORIES +
  • 44. FAIR HACKATHON - GOALS ■ Align solutions with FAIR Data Point specifications. ■ Metadata content ■ API ■ Data
  • 45. FAIR HACKATHON OUTCOME ■ FAIR data model for solutions content; ■ Architecture of the required adjustments/extensions; ■ Technical specification of the adjustments/extensions; ■ Proof-of-concept of the adjusted solution;
  • 47. FAIR transformation FAIR transformation Analysis transformation Analysis transformation
  • 49.
  • 50. ■ Allow third-party annotation on existing knowledge bases ■ Capture the provenance of the annotator and the original statement Open RDF Knowledge AnnotatorORKA
  • 54. ■ A particular class of FAIR Data System to provide support for data interoperability; ■ Supports publication and access to FAIR data. ■ Fosters an ecosystems of applications and services; ■ Federated architecture: different FAIRports (and other FAIR Data Systems) are interconnectable; ■ Supports citations of datasets and data items; ■ Provides metrics for data usage and citation;
  • 57. METADATA LAYERS Data Repository (FDP) (Dataset) Catalog(s) Dataset Distribution Data Record DCAT/HCLS RML
  • 58. METADATA LAYERS’ EXTENSIONS - VOCABULARIES Data Repository (FDP) (Dataset) Catalog(s) Dataset Distribution Data Record
  • 59. METADATA LAYERS’ EXTENSIONS - VOCABULARIES DCAT dcat:publisher biosch:organization "@context": "http://schema.org", "@type": "NGO", "address": { "@type": "PostalAddress", "addressLocality": "Utrecht, The Netherlands" "postalCode": “3511 GC", "streetAddress": “Catharijnesingel 54" }, "email": “info(at)dtls.nl", "@type": “Organization”, “@type”: “not-for-profit”, "name": “Dutch Techncentre for Life Sciences", "telephone": "( 31) 85 30 30 711"
  • 60. METADATA LAYERS’ EXTENSIONS - VOCABULARIES dbpedia: biobank edam: biobank
  • 61. METADATA LAYERS’ EXTENSIONS - EXTENDED MODEL Data Repository (FDP) (Dataset) Catalog(s) Dataset Distribution Data Record DatA Tag Suite (DATS) PROV