SlideShare une entreprise Scribd logo
1  sur  52
Télécharger pour lire hors ligne
FAIRy stories: the FAIR Data
principles in theory and in practice
Carole Goble
The University of Manchester, UK
carole.goble@manchester.ac.uk
The views expressed in this talk are my own
NSF Convergence Accelerator Series Tracks A&B webinar, 19th May 2021
March 18, 2021
http://spatial.ucsb.edu/2021/Natasha-Noy
Why do we need FAIR data in Research?
“there must be loads of legacy data. We’re desperately trying to go
back and look at what we knew from SARS 10 years ago”
https://www.covid19dataportal.org/
https://www.rd-alliance.org/group/rda-covid19-rda-covid19-omics-rda-covid19-epidemiology-rda-covid19-
clinical-rda-covid19-1
https://doi.org/10.15497/rda00052
Why do we need FAIR data in Research?
COVID Data sharing boost – mobilising people, infrastructure & initiatives
Spotlighted technical, territorial & practices
Provider: collection, upload and governance bottlenecks
User: find and access to datasets, licenses, data and metadata quality
Access to data for processing at scale, common standards
Behaviour inertia and relapse
Long term sustainability
“global pandemic is not sufficient to radically modify
scientific practices”*
* Larregue et al https://blogs.lse.ac.uk/impactofsocialsciences/2020/11/30/covid-19-where-is-the-data/
https://www.nature.com/articles/d41586-021-00305-7
https://www.nature.com/articles/s41597-020-0524-5
Why do we need FAIR data in Research?
information flows, secondary use
Figure: KnowledgeTurning, Information Flow Josh Sommer, Chordoma Foundation, 2011
Community domain enclaves
Resource fragmentation
Flow across platforms/ sovereignties
Pan-discipline drivers
Knowledge churn, loss and cost
2016
A set of GUIDING PRINCIPLES to
enhance the value of all digital
resources and their reuse by PEOPLE
and by MACHINES
ALIGNING a COMMUNITY around
common data guidelines
FAIR Research Data
branding a trend
(re)-stimulating a
movement
What ARE the FAIR principles?
Aspirational guardrails
Not a standard, nor metrics
A contract between data
provider and user
In the original paper
https://www.go-fair.org/fair-principles/
Relaunch a dialogue - research and policy communities.
Reboot a journey - wider accessibility and reusability of data.
compare &
combine data
https://doi.org/10.1038/sdata.2016.18
“enhancing the ability of machines to
automatically find and use data or any digital
object, and support its reuse by individuals”
INCF Statement
Persistent identifiers
Globally unique, resolvable for
data and always for metadata
Structured metadata
Community defined descriptive
metadata using common
terminologies and standards
Linked Data
Vocabularies are FAIR, (meta)data
reference (meta)data, provenance
Automation-
readiness
Access protocols
Open, free and universally
implementable comms protocols
Semantic Web ->
Linked Data ->
Knowledge Graphs.
Machine-processable
metadata.
[Icons: FAIRsharing]
Open as possible, Closed as necessary
Clear licences for innovation and reuse
Sensitive data, GDPR, IPR, jumpy Deans.
Crossing sovereignty boundaries
• Data sharing becomes data visiting &
federated analysis
An industry in controlled secure access….
• Data Usage Ontology, Beacon Passports,
Trusted Research Environments etc….
Terms of access and use: FAIR ≠ OPEN
FAIR OPEN
SAFE
Privacy preservation
Regulatory rigour
FAIR Implicit Assumptions & Implications
Data are first class objects
Primarily aimed at data creators
and providers for benefit of
consumers.
Operating in an (Open) Data
Ecosystem.
Adoption at scale in legacy
settings.
Data sharing
The Life Sciences & pan-European scale data infrastructure
The Life Sciences Infrastructure Zoo
Flows around a Federated & Diverse System
1466 data repositories
(100+ in EOSC-Life)
916 data format and metadata
standards*
from compounds to clinical trials
https://fairsharing.org/ accessed May 2021
Common standards & agreements
mappings of PIDs and metadata
moving metadata around
accountability and responsibility
FAIR players simplified
Researchers and
company
scientists who
generate and use
the data
Service providers
who manage data
and infrastructure
Local -> Global level
Public -> Commercial
Authorities who
drive policy, practice
& resources
Funders, Policy makers,
Publishers, Professional
societies, Standards
organisations, Institutions
Global and national initiatives
Dedicated projects
Community Orgs
Funders
Policy
Publishers
FAIR
first
stage
Dedicated Services
Where we are going
Where we are
[Susanna Sansone]
FAIR
first
stage
FAIR first stage :
Policymakers, Data service providers
How to define, measure compliance and certify FAIR data?
What is a dataset?
General repos vs Curated authoritative archives?
Principles for Data Repositories
https://www.rd-alliance.org/trust-principles-rda-community-effort
https://fairassist.org/
https://www.natureindex.com/news-blog/what-scientists-need-to-know-about-fair-data
Open Data Survey, 2019
81% of researcher
respondents
unfamiliar with FAIR
1. A common mechanism for metadata
Respect and work with the huge legacy
resources: repositories, registries, tools …
community standards
Find, register, index, search resources
Move metadata between services
withoutAPIs
Repositories ->Tools, Aggregators (e.g. licenses)
-> Registries (upload, auto-curation)
Registries -> Registries (across disciplines)
Contribute to Knowledge Graphs
a little bit of semantics at scale
semantic underware
invisible to users
visible to developers & services
Picture: Carole Goble, Turing Lecture 2018
Schema.org: Semantic Mark up for the Web
Cartel of commercial search engines
Wide web use, web infrastructure
Web pages and sitemaps
Types (830+) IceCreamShop
Properties (1300+) hasMenu
Not targeted at science - too much / too little
Dataset type – 120 properties
(Google Data Profile requires 2 properties)
No type for Protein, Gene, Taxon
Harnessing Schema.org for Bioscience
Profile
Data model
Marginality information
Controlled vocabularies
Cardinality
Documentation
Examples
New (properties | types)
definition & consensus
deployment and use
tools & support
Opinionated conventions
Profiles & Link to domain ontologies
}Add Bioscience properties & types if necessary
Examples &Usage Guidelines
}
Community
Harnessing Schema.org for Bioscience
ChemicalSubstance
definition & consensus
deployment and use
tools & support
Opinionated conventions
Profiles & Link to domain ontologies
Add Bioscience properties & types if necessary
Examples &Usage Guidelines
Community
Bioschemas metadata stratification
broad & shallow / deepish & narrowish
Generic
Subject
specific
MolecularEntity,
Protein,
Sample,Taxon,
ChemicalSubstance…
DataCatalog
Dataset
dataset 5 minimum, 8
recommended properties
license & provenance
https://bioschemas.org/profiles/
Crosswalks to metadata schemas *
• DCAT, DataCite,CrossRef, OpenAIRE, DDI
• DCT:issued <-> Schema:dataPublished
What is a dataset?
Include community ontologies
• Type: ChemicalSubstance
• Property: biologicalRole
• ExpectedType: ChEBI ontology
* https://zenodo.org/record/4420116#.YKFOpaHTX18
400+
People
22
Types
32
Profiles
65
Sites
60M+
Pages
bioschemas.org/liveDeploys
bioschemas.org/
liveDeploys
20+
Countries
120
Profile deployments
bioschemas.org/
liveDeploys
Bioschemas Village
MolecularEntity ChemicalSubstance
Toxicology
Data Aggregator
[with thanks: EgonWillighagen]
MolecularEntity
Gene
Protein
Taxon
Dataset
Lessons: Putting FAIR into Practice
A little bit of semantics at scale -> build critical mass
Profiles
• Schema.org culture – Catch 22
• Consensus building, retention & Ontology-itis
Provider mark-up
• Developer friendly in house tools & wacky web implementations
• Adoption incentives & costs of adapting database processes
Consumer services
• Adoption incentives – Catch 22 & tipping points
• DataCatalog & Dataset popular -> Google Dataset search
Consumer-provider readiness
• Tools and training community take-up….
2. Packaging Research Objects
Gather together into a “crate” files,
unbounded references, & other
crates.
FAIR content: metadata,
identifiers, provenance, citation
about the content
FAIR crates: metadata, PIDs,
provenance, citation about the
crate.
more FAIR middleware -> towards FAIR Digital Objects*
*FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units:
https://doi.org/10.3390/publications8020021
Why “crate up” objects? FAIR+R
Flows:
Researchers work with multiple and
different objects using multiple
infrastructures over periods of time
exchange between platforms and people
Parts:
Research has associated objects
linked together by context
metadata files with files
datasets, scripts, SOPs, articles …
0
held in different places
made at different times by
different people & processes
publish, report, reuse, cite, reproduce
register, deposit, archive, port
point to big, sensitive & active content
Aggregate files and/or any URI-addressable
content with structured metadata
Web and Linked Data Native
machine and human readable PIDs + JSON-LD +
Schema.org, search engine & developer friendly
Flex for open ended content, respect legacy
typed by a profile + add more schema.org and
domain ontologies
http://www.researchobject.org/ro-crate/
Archive file
format
FAIR Object Middleware
FAIR Middleware
metadata carrying interchange format
Knowledge
Graph of
Research
Objects
It’s FAIR metadata middleware, stupid
• smart use of wheels already invented
• get tools, services on board
• developer friendly, firm best practice
Known and Unknown unknowns
One size does not fit all
• contextual interpretation
• descriptive openedness , multi-interpretation
Analogous to FAIR Software
• RDA/ReSA FAIR4Research SoftwareWG
Lessons: Putting FAIR into Practice
3. Making (legacy) datasets FAIR: FAIRification
[Picture credit: EgonWillighagen]
Credit to: Ian Harrow, FAIR & OM projects
FAIR as enabler for the digital transformation
● Biopharma R&D productivity can be
improved by implementing the FAIR Data
Principles.
● FAIR enables powerful new AI analytics access
to data for machine learning and prediction
● Fairly AI Ready
● Challenges
○ change the culture, show business value,
achieve the ‘FAIR enough’
○ Sustain FAIR solutions and activities
Slide credit: Susanna Sansone
Making (legacy) datasets FAIR: FAIRification
> 100 Public-Private partnerships of
European Commission, universities SMEs
and Big Pharma translational projects
Pharma’s own datasets
*https://www.go-fair.org/how-to-go-fair/fair-data-point/
Data visiting through a
FAIR Data Point*
Linked Data / RDF tech
Dataset transformation
Methodology
Linkset services
RDFWarehouse (Knowledge Graph)
- API not SPARQL
- Sustainability & maintenance
- Linksets PID mapping services
FAIRification of legacy datasets
Practical
advice
Assessment
processes
FAIR levels of
projects / data
Selection of
datasets
Cost/Benefit
analysis
Methodology
Steps for 1 or
more datasets
Cultural change
Legal templates
Squads & BYODs
Maturity models
Interlinking data from different sources
The lessons of good
global and persistent
identifiers.
Mapping identifiers
and services for
mapping ids to ids and
concepts to concepts.
https://fairplus.github.io/the-fair-cookbook/content/recipes/interoperability/identifier-mapping.html
FAIR by Design
At the start of a collection, built in throughout the life cycle
change management, capacity building
FAIRifying Retrospectively
Legacy datasets, build a cohort,
cost benefit and FAIR readiness over a collection of datasets
Reality
FA(I)R
New FAIRVariants
FAIR++
Legal > Organisational >
Semantic >Technical*
Business and change analysis.
Cost Benefit Analysis.
Scientific / BusinessValue
Sustainability
“…make a decision that
these data are valuable
enough to invest in the work
required for FAIRification.”
interoperability
*EOSC Interoperability Framework
What does FAIRifying a dataset mean?
A database?A pdf? Depositing to a public archive?
Identifier and ontology selecting, assigning,
mapping between and to existing vocabs, and knowing
about ontology services.
High-fidelity ETL loss-less moving (meta)data
from one system to another
Lessons: Putting FAIR into Practice
Lessons: Putting FAIR into Practice
FAIR enough.
Repository manager
Admin monitoring
Bioscientist
Scientific analysis
“Fairness does mean everyone
gets the same. Fairness means
everyone gets what they need”
(Rick Riordan).
Maturity and importance spectrum
Its not all worth it.
FAIR gardens + FAIR scrub
How to assess FAIR maturity
levels, not to be certified but
to make decisions.
FAIR ≠ FREE - an expensive, expert team sport
Mostly manual,
mostly specific
“It is a truth
universally
acknowledged
that a
Knowledge
Graph must be
in want of FAIR
data.
And FAIR data
is in want of
Knowledge
Graphs.”
harvesting
added value
DataCite PID Graph
Bottlenecks:
identifiers and ontologies
curating and ingest pipelines of data providers
4. FAIR Data by Design at Source
Data management platform for Project Hubs
organising, cataloguing, sharing and publishing
multiple kinds of research objects in multiple
repositories for multi-partner projects.
Community developed Knowledge Hub
for guides, examples, tools, and pointers.
Assembled and written by Life Science
researchers and data stewards for their peers.
https://rdmkit.elixir-europe.org
https://fair-dom.org
Lessons: Putting FAIR into Practice
Data creators
• Retention not sharing, act local not global
• Advantage*: intimate knowledge, data
flirting, credits & incentives
Process change and values
• Access to infrastructure with seamless
information flows,Values
• Time & resources to embed into practice
FAIR Stewardship skills
• Professionalisation & know-how
*Pasquetto, I. V., Borgman, C. L., & Wofford, M. F. (2019). Uses and Reuses of Scientific Data: The Data Creators’
Advantage. Harvard Data Science Review, 1(2). https://doi.org/10.1162/99608f92.fc14bf2d
Summary: FAIRy stories
Theory -> mobilised some
Practice -> marathon that takes a village
Move the story from data providers to
enabling creators & consumers prepare to
share FAIR -> Research on Research
Authorities Change Mgt
Stewardship
Service Providers
Sustained infrastructure
Acknowledgements
Special thanks to
• Stian Soiland-Reyes (Uni of Manchester/Uni of Amsterdam)
• Nick Juty & Ebtisam Alharbi (University of Manchester)
• Susanna Sansone (University of Oxford)
• Tony Burdett (EMBL-EBI)
• Ibrahim Emam (ImperialCollege)
• EgonWillighagen (Maastricht University)
• Alasdair Gray (Heriot-Watt University)
Manchester, Research Object, RDMkit, FAIRDOM, FAIRplus, Bioschemas colleagues
(about 130 people)
Icons from the noun project
(https://thenounproject.com/)

Contenu connexe

Tendances

“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKANChengjen Lee
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to MetadataEUDAT
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
Linked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and ExamplesLinked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and ExamplesOpen Data Support
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceDenodo
 
DSpace-CRIS technical level introduction
DSpace-CRIS technical level introductionDSpace-CRIS technical level introduction
DSpace-CRIS technical level introduction4Science
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lakeJames Serra
 
The future of FAIR
The future of FAIRThe future of FAIR
The future of FAIRSarah Jones
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Inside open metadata—the deep dive
Inside open metadata—the deep diveInside open metadata—the deep dive
Inside open metadata—the deep diveDataWorks Summit
 
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...Denodo
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesDATAVERSITY
 
DSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformDSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformAndrea Bollini
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture DesignKujambu Murugesan
 
Talend Open Studio Data Integration
Talend Open Studio Data IntegrationTalend Open Studio Data Integration
Talend Open Studio Data IntegrationRoberto Marchetto
 
Talend ETL Tutorial | Talend Tutorial For Beginners | Talend Online Training ...
Talend ETL Tutorial | Talend Tutorial For Beginners | Talend Online Training ...Talend ETL Tutorial | Talend Tutorial For Beginners | Talend Online Training ...
Talend ETL Tutorial | Talend Tutorial For Beginners | Talend Online Training ...Edureka!
 
Data council sf amundsen presentation
Data council sf    amundsen presentationData council sf    amundsen presentation
Data council sf amundsen presentationTao Feng
 

Tendances (20)

“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to Metadata
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Linked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and ExamplesLinked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and Examples
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and Governance
 
DSpace-CRIS technical level introduction
DSpace-CRIS technical level introductionDSpace-CRIS technical level introduction
DSpace-CRIS technical level introduction
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
The future of FAIR
The future of FAIRThe future of FAIR
The future of FAIR
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
CKAN as an open-source data management solution for open data
CKAN as an open-source data management solution for open data CKAN as an open-source data management solution for open data
CKAN as an open-source data management solution for open data
 
Data Warehouse 101
Data Warehouse 101Data Warehouse 101
Data Warehouse 101
 
Inside open metadata—the deep dive
Inside open metadata—the deep diveInside open metadata—the deep dive
Inside open metadata—the deep dive
 
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
 
DSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformDSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platform
 
Data Vault Overview
Data Vault OverviewData Vault Overview
Data Vault Overview
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture Design
 
Talend Open Studio Data Integration
Talend Open Studio Data IntegrationTalend Open Studio Data Integration
Talend Open Studio Data Integration
 
Talend ETL Tutorial | Talend Tutorial For Beginners | Talend Online Training ...
Talend ETL Tutorial | Talend Tutorial For Beginners | Talend Online Training ...Talend ETL Tutorial | Talend Tutorial For Beginners | Talend Online Training ...
Talend ETL Tutorial | Talend Tutorial For Beginners | Talend Online Training ...
 
Data council sf amundsen presentation
Data council sf    amundsen presentationData council sf    amundsen presentation
Data council sf amundsen presentation
 

Similaire à FAIRy stories: the FAIR Data principles in theory and in practice

FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsCarole Goble
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsCarole Goble
 
FAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsFAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsOpenAIRE
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
VODAN Africa IN.pptx
VODAN Africa IN.pptxVODAN Africa IN.pptx
VODAN Africa IN.pptxGetu Tadele
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 
LIBER Webinar: Turning FAIR Data Into Reality
LIBER Webinar: Turning FAIR Data Into RealityLIBER Webinar: Turning FAIR Data Into Reality
LIBER Webinar: Turning FAIR Data Into RealityLIBER Europe
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonAfrican Open Science Platform
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataStuart Chalk
 
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookFAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookSusanna-Assunta Sansone
 
FAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDASarah Jones
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 

Similaire à FAIRy stories: the FAIR Data principles in theory and in practice (20)

FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital Objects
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
 
FAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsFAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basics
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
VODAN Africa IN.pptx
VODAN Africa IN.pptxVODAN Africa IN.pptx
VODAN Africa IN.pptx
 
Prototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional RepositoryPrototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional Repository
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
LIBER Webinar: Turning FAIR Data Into Reality
LIBER Webinar: Turning FAIR Data Into RealityLIBER Webinar: Turning FAIR Data Into Reality
LIBER Webinar: Turning FAIR Data Into Reality
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
 
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookFAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
 
FAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDA
 
Shifting the Burden from the User to the Data Provider
Shifting the Burden from the User to the Data ProviderShifting the Burden from the User to the Data Provider
Shifting the Burden from the User to the Data Provider
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 

Plus de Carole Goble

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...Carole Goble
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...Carole Goble
 
Research Software Sustainability takes a Village
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a VillageCarole Goble
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Carole Goble
 
Open Research: Manchester leading and learning
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learningCarole Goble
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows Carole Goble
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects Carole Goble
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)Carole Goble
 
What is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpCarole Goble
 
FAIR History and the Future
FAIR History and the FutureFAIR History and the Future
FAIR History and the FutureCarole Goble
 
ELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardCarole Goble
 
FAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsCarole Goble
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpCarole Goble
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerCarole Goble
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better ResearchCarole Goble
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsCarole Goble
 

Plus de Carole Goble (20)

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
 
Research Software Sustainability takes a Village
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a Village
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
 
Open Research: Manchester leading and learning
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learning
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)
 
What is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can help
 
FAIR History and the Future
FAIR History and the FutureFAIR History and the Future
FAIR History and the Future
 
ELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR Board
 
FAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research Commons
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic career
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
 

Dernier

bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlshansessene
 
Food_safety_Management_pptx.pptx in microbiology
Food_safety_Management_pptx.pptx in microbiologyFood_safety_Management_pptx.pptx in microbiology
Food_safety_Management_pptx.pptx in microbiologyHemantThakare8
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxPayal Shrivastava
 
Speed Breeding in Vegetable Crops- innovative approach for present era of cro...
Speed Breeding in Vegetable Crops- innovative approach for present era of cro...Speed Breeding in Vegetable Crops- innovative approach for present era of cro...
Speed Breeding in Vegetable Crops- innovative approach for present era of cro...jana861314
 
Production technology of Brinjal -Solanum melongena
Production technology of Brinjal -Solanum melongenaProduction technology of Brinjal -Solanum melongena
Production technology of Brinjal -Solanum melongenajana861314
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Christina Parmionova
 
Think Science: What Are Eclipses (101), by Craig Bobchin
Think Science: What Are Eclipses (101), by Craig BobchinThink Science: What Are Eclipses (101), by Craig Bobchin
Think Science: What Are Eclipses (101), by Craig BobchinNathan Cone
 
Science (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsScience (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsDobusch Leonhard
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxfarhanvvdk
 
Introduction of Organ-On-A-Chip - Creative Biolabs
Introduction of Organ-On-A-Chip - Creative BiolabsIntroduction of Organ-On-A-Chip - Creative Biolabs
Introduction of Organ-On-A-Chip - Creative BiolabsCreative-Biolabs
 
Environmental acoustics- noise criteria.pptx
Environmental acoustics- noise criteria.pptxEnvironmental acoustics- noise criteria.pptx
Environmental acoustics- noise criteria.pptxpriyankatabhane
 
Timeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological CorrelationsTimeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological CorrelationsDanielBaumann11
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxGiDMOh
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests GlycosidesNandakishor Bhaurao Deshmukh
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsSérgio Sacani
 
Understanding Nutrition, 16th Edition pdf
Understanding Nutrition, 16th Edition pdfUnderstanding Nutrition, 16th Edition pdf
Understanding Nutrition, 16th Edition pdfHabibouKarbo
 
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary MicrobiologyLAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary MicrobiologyChayanika Das
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionJadeNovelo1
 

Dernier (20)

bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girls
 
Food_safety_Management_pptx.pptx in microbiology
Food_safety_Management_pptx.pptx in microbiologyFood_safety_Management_pptx.pptx in microbiology
Food_safety_Management_pptx.pptx in microbiology
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptx
 
Speed Breeding in Vegetable Crops- innovative approach for present era of cro...
Speed Breeding in Vegetable Crops- innovative approach for present era of cro...Speed Breeding in Vegetable Crops- innovative approach for present era of cro...
Speed Breeding in Vegetable Crops- innovative approach for present era of cro...
 
Production technology of Brinjal -Solanum melongena
Production technology of Brinjal -Solanum melongenaProduction technology of Brinjal -Solanum melongena
Production technology of Brinjal -Solanum melongena
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
 
Think Science: What Are Eclipses (101), by Craig Bobchin
Think Science: What Are Eclipses (101), by Craig BobchinThink Science: What Are Eclipses (101), by Craig Bobchin
Think Science: What Are Eclipses (101), by Craig Bobchin
 
Science (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsScience (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and Pitfalls
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptx
 
PLASMODIUM. PPTX
PLASMODIUM. PPTXPLASMODIUM. PPTX
PLASMODIUM. PPTX
 
Introduction of Organ-On-A-Chip - Creative Biolabs
Introduction of Organ-On-A-Chip - Creative BiolabsIntroduction of Organ-On-A-Chip - Creative Biolabs
Introduction of Organ-On-A-Chip - Creative Biolabs
 
Environmental acoustics- noise criteria.pptx
Environmental acoustics- noise criteria.pptxEnvironmental acoustics- noise criteria.pptx
Environmental acoustics- noise criteria.pptx
 
Timeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological CorrelationsTimeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptx
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptx
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive stars
 
Understanding Nutrition, 16th Edition pdf
Understanding Nutrition, 16th Edition pdfUnderstanding Nutrition, 16th Edition pdf
Understanding Nutrition, 16th Edition pdf
 
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary MicrobiologyLAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and Function
 

FAIRy stories: the FAIR Data principles in theory and in practice

  • 1. FAIRy stories: the FAIR Data principles in theory and in practice Carole Goble The University of Manchester, UK carole.goble@manchester.ac.uk The views expressed in this talk are my own NSF Convergence Accelerator Series Tracks A&B webinar, 19th May 2021
  • 3. Why do we need FAIR data in Research? “there must be loads of legacy data. We’re desperately trying to go back and look at what we knew from SARS 10 years ago” https://www.covid19dataportal.org/ https://www.rd-alliance.org/group/rda-covid19-rda-covid19-omics-rda-covid19-epidemiology-rda-covid19- clinical-rda-covid19-1 https://doi.org/10.15497/rda00052
  • 4. Why do we need FAIR data in Research? COVID Data sharing boost – mobilising people, infrastructure & initiatives Spotlighted technical, territorial & practices Provider: collection, upload and governance bottlenecks User: find and access to datasets, licenses, data and metadata quality Access to data for processing at scale, common standards Behaviour inertia and relapse Long term sustainability “global pandemic is not sufficient to radically modify scientific practices”* * Larregue et al https://blogs.lse.ac.uk/impactofsocialsciences/2020/11/30/covid-19-where-is-the-data/
  • 6. Why do we need FAIR data in Research? information flows, secondary use Figure: KnowledgeTurning, Information Flow Josh Sommer, Chordoma Foundation, 2011 Community domain enclaves Resource fragmentation Flow across platforms/ sovereignties Pan-discipline drivers Knowledge churn, loss and cost
  • 7. 2016 A set of GUIDING PRINCIPLES to enhance the value of all digital resources and their reuse by PEOPLE and by MACHINES ALIGNING a COMMUNITY around common data guidelines FAIR Research Data
  • 9. What ARE the FAIR principles? Aspirational guardrails Not a standard, nor metrics A contract between data provider and user In the original paper https://www.go-fair.org/fair-principles/ Relaunch a dialogue - research and policy communities. Reboot a journey - wider accessibility and reusability of data.
  • 11. “enhancing the ability of machines to automatically find and use data or any digital object, and support its reuse by individuals” INCF Statement
  • 12. Persistent identifiers Globally unique, resolvable for data and always for metadata Structured metadata Community defined descriptive metadata using common terminologies and standards Linked Data Vocabularies are FAIR, (meta)data reference (meta)data, provenance Automation- readiness Access protocols Open, free and universally implementable comms protocols Semantic Web -> Linked Data -> Knowledge Graphs. Machine-processable metadata. [Icons: FAIRsharing]
  • 13. Open as possible, Closed as necessary Clear licences for innovation and reuse Sensitive data, GDPR, IPR, jumpy Deans. Crossing sovereignty boundaries • Data sharing becomes data visiting & federated analysis An industry in controlled secure access…. • Data Usage Ontology, Beacon Passports, Trusted Research Environments etc…. Terms of access and use: FAIR ≠ OPEN FAIR OPEN SAFE Privacy preservation Regulatory rigour
  • 14. FAIR Implicit Assumptions & Implications Data are first class objects Primarily aimed at data creators and providers for benefit of consumers. Operating in an (Open) Data Ecosystem. Adoption at scale in legacy settings. Data sharing
  • 15. The Life Sciences & pan-European scale data infrastructure
  • 16. The Life Sciences Infrastructure Zoo Flows around a Federated & Diverse System 1466 data repositories (100+ in EOSC-Life) 916 data format and metadata standards* from compounds to clinical trials https://fairsharing.org/ accessed May 2021 Common standards & agreements mappings of PIDs and metadata moving metadata around accountability and responsibility
  • 17. FAIR players simplified Researchers and company scientists who generate and use the data Service providers who manage data and infrastructure Local -> Global level Public -> Commercial Authorities who drive policy, practice & resources Funders, Policy makers, Publishers, Professional societies, Standards organisations, Institutions
  • 18. Global and national initiatives Dedicated projects Community Orgs Funders Policy Publishers FAIR first stage Dedicated Services
  • 19. Where we are going Where we are [Susanna Sansone] FAIR first stage
  • 20. FAIR first stage : Policymakers, Data service providers How to define, measure compliance and certify FAIR data? What is a dataset? General repos vs Curated authoritative archives? Principles for Data Repositories https://www.rd-alliance.org/trust-principles-rda-community-effort https://fairassist.org/
  • 22.
  • 23. 1. A common mechanism for metadata Respect and work with the huge legacy resources: repositories, registries, tools … community standards Find, register, index, search resources Move metadata between services withoutAPIs Repositories ->Tools, Aggregators (e.g. licenses) -> Registries (upload, auto-curation) Registries -> Registries (across disciplines) Contribute to Knowledge Graphs a little bit of semantics at scale semantic underware invisible to users visible to developers & services
  • 24. Picture: Carole Goble, Turing Lecture 2018 Schema.org: Semantic Mark up for the Web Cartel of commercial search engines Wide web use, web infrastructure Web pages and sitemaps Types (830+) IceCreamShop Properties (1300+) hasMenu Not targeted at science - too much / too little Dataset type – 120 properties (Google Data Profile requires 2 properties) No type for Protein, Gene, Taxon
  • 25. Harnessing Schema.org for Bioscience Profile Data model Marginality information Controlled vocabularies Cardinality Documentation Examples New (properties | types) definition & consensus deployment and use tools & support Opinionated conventions Profiles & Link to domain ontologies }Add Bioscience properties & types if necessary Examples &Usage Guidelines } Community
  • 26. Harnessing Schema.org for Bioscience ChemicalSubstance definition & consensus deployment and use tools & support Opinionated conventions Profiles & Link to domain ontologies Add Bioscience properties & types if necessary Examples &Usage Guidelines Community
  • 27. Bioschemas metadata stratification broad & shallow / deepish & narrowish Generic Subject specific MolecularEntity, Protein, Sample,Taxon, ChemicalSubstance… DataCatalog Dataset dataset 5 minimum, 8 recommended properties license & provenance https://bioschemas.org/profiles/ Crosswalks to metadata schemas * • DCAT, DataCite,CrossRef, OpenAIRE, DDI • DCT:issued <-> Schema:dataPublished What is a dataset? Include community ontologies • Type: ChemicalSubstance • Property: biologicalRole • ExpectedType: ChEBI ontology * https://zenodo.org/record/4420116#.YKFOpaHTX18
  • 29. MolecularEntity ChemicalSubstance Toxicology Data Aggregator [with thanks: EgonWillighagen] MolecularEntity Gene Protein Taxon Dataset
  • 30. Lessons: Putting FAIR into Practice A little bit of semantics at scale -> build critical mass Profiles • Schema.org culture – Catch 22 • Consensus building, retention & Ontology-itis Provider mark-up • Developer friendly in house tools & wacky web implementations • Adoption incentives & costs of adapting database processes Consumer services • Adoption incentives – Catch 22 & tipping points • DataCatalog & Dataset popular -> Google Dataset search Consumer-provider readiness • Tools and training community take-up….
  • 31. 2. Packaging Research Objects Gather together into a “crate” files, unbounded references, & other crates. FAIR content: metadata, identifiers, provenance, citation about the content FAIR crates: metadata, PIDs, provenance, citation about the crate. more FAIR middleware -> towards FAIR Digital Objects* *FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units: https://doi.org/10.3390/publications8020021
  • 32. Why “crate up” objects? FAIR+R Flows: Researchers work with multiple and different objects using multiple infrastructures over periods of time exchange between platforms and people Parts: Research has associated objects linked together by context metadata files with files datasets, scripts, SOPs, articles … 0 held in different places made at different times by different people & processes publish, report, reuse, cite, reproduce register, deposit, archive, port point to big, sensitive & active content
  • 33. Aggregate files and/or any URI-addressable content with structured metadata Web and Linked Data Native machine and human readable PIDs + JSON-LD + Schema.org, search engine & developer friendly Flex for open ended content, respect legacy typed by a profile + add more schema.org and domain ontologies http://www.researchobject.org/ro-crate/ Archive file format FAIR Object Middleware
  • 34. FAIR Middleware metadata carrying interchange format Knowledge Graph of Research Objects
  • 35. It’s FAIR metadata middleware, stupid • smart use of wheels already invented • get tools, services on board • developer friendly, firm best practice Known and Unknown unknowns One size does not fit all • contextual interpretation • descriptive openedness , multi-interpretation Analogous to FAIR Software • RDA/ReSA FAIR4Research SoftwareWG Lessons: Putting FAIR into Practice
  • 36. 3. Making (legacy) datasets FAIR: FAIRification [Picture credit: EgonWillighagen]
  • 37. Credit to: Ian Harrow, FAIR & OM projects FAIR as enabler for the digital transformation ● Biopharma R&D productivity can be improved by implementing the FAIR Data Principles. ● FAIR enables powerful new AI analytics access to data for machine learning and prediction ● Fairly AI Ready ● Challenges ○ change the culture, show business value, achieve the ‘FAIR enough’ ○ Sustain FAIR solutions and activities Slide credit: Susanna Sansone
  • 38. Making (legacy) datasets FAIR: FAIRification > 100 Public-Private partnerships of European Commission, universities SMEs and Big Pharma translational projects Pharma’s own datasets
  • 39. *https://www.go-fair.org/how-to-go-fair/fair-data-point/ Data visiting through a FAIR Data Point* Linked Data / RDF tech Dataset transformation Methodology Linkset services RDFWarehouse (Knowledge Graph) - API not SPARQL - Sustainability & maintenance - Linksets PID mapping services
  • 40. FAIRification of legacy datasets Practical advice Assessment processes FAIR levels of projects / data Selection of datasets Cost/Benefit analysis Methodology Steps for 1 or more datasets Cultural change Legal templates Squads & BYODs Maturity models
  • 41. Interlinking data from different sources The lessons of good global and persistent identifiers. Mapping identifiers and services for mapping ids to ids and concepts to concepts. https://fairplus.github.io/the-fair-cookbook/content/recipes/interoperability/identifier-mapping.html
  • 42. FAIR by Design At the start of a collection, built in throughout the life cycle change management, capacity building FAIRifying Retrospectively Legacy datasets, build a cohort, cost benefit and FAIR readiness over a collection of datasets
  • 44. FA(I)R New FAIRVariants FAIR++ Legal > Organisational > Semantic >Technical* Business and change analysis. Cost Benefit Analysis. Scientific / BusinessValue Sustainability “…make a decision that these data are valuable enough to invest in the work required for FAIRification.” interoperability *EOSC Interoperability Framework
  • 45. What does FAIRifying a dataset mean? A database?A pdf? Depositing to a public archive? Identifier and ontology selecting, assigning, mapping between and to existing vocabs, and knowing about ontology services. High-fidelity ETL loss-less moving (meta)data from one system to another Lessons: Putting FAIR into Practice
  • 46. Lessons: Putting FAIR into Practice FAIR enough. Repository manager Admin monitoring Bioscientist Scientific analysis “Fairness does mean everyone gets the same. Fairness means everyone gets what they need” (Rick Riordan). Maturity and importance spectrum Its not all worth it. FAIR gardens + FAIR scrub How to assess FAIR maturity levels, not to be certified but to make decisions.
  • 47. FAIR ≠ FREE - an expensive, expert team sport Mostly manual, mostly specific
  • 48. “It is a truth universally acknowledged that a Knowledge Graph must be in want of FAIR data. And FAIR data is in want of Knowledge Graphs.” harvesting added value DataCite PID Graph Bottlenecks: identifiers and ontologies curating and ingest pipelines of data providers
  • 49. 4. FAIR Data by Design at Source Data management platform for Project Hubs organising, cataloguing, sharing and publishing multiple kinds of research objects in multiple repositories for multi-partner projects. Community developed Knowledge Hub for guides, examples, tools, and pointers. Assembled and written by Life Science researchers and data stewards for their peers. https://rdmkit.elixir-europe.org https://fair-dom.org
  • 50. Lessons: Putting FAIR into Practice Data creators • Retention not sharing, act local not global • Advantage*: intimate knowledge, data flirting, credits & incentives Process change and values • Access to infrastructure with seamless information flows,Values • Time & resources to embed into practice FAIR Stewardship skills • Professionalisation & know-how *Pasquetto, I. V., Borgman, C. L., & Wofford, M. F. (2019). Uses and Reuses of Scientific Data: The Data Creators’ Advantage. Harvard Data Science Review, 1(2). https://doi.org/10.1162/99608f92.fc14bf2d
  • 51. Summary: FAIRy stories Theory -> mobilised some Practice -> marathon that takes a village Move the story from data providers to enabling creators & consumers prepare to share FAIR -> Research on Research Authorities Change Mgt Stewardship Service Providers Sustained infrastructure
  • 52. Acknowledgements Special thanks to • Stian Soiland-Reyes (Uni of Manchester/Uni of Amsterdam) • Nick Juty & Ebtisam Alharbi (University of Manchester) • Susanna Sansone (University of Oxford) • Tony Burdett (EMBL-EBI) • Ibrahim Emam (ImperialCollege) • EgonWillighagen (Maastricht University) • Alasdair Gray (Heriot-Watt University) Manchester, Research Object, RDMkit, FAIRDOM, FAIRplus, Bioschemas colleagues (about 130 people) Icons from the noun project (https://thenounproject.com/)