SlideShare a Scribd company logo
1 of 18
| 1
Anita de Waard 0000-0002-9034-4119
VP Research Data Collaborations, Elsevier RDM
a.dewaard@elsevier.com
Data, Data, Everywhere:
What’s A Publisher To Do?
| 2
Are Scientists Sharing Their Data? Why (not)?
Attitudes to data sharing
73% agree that having
access to other
researchers’ data would
benefit their research
… but 64% agree that
they would be willing to
share their own research
data with other
researchers
The reasons for discrepancy?
Only 26% agreed they
had received sufficient
training in data
management
Only 37% agreed that
sharing research data is
associated with credit/
reward in their field
59% agreed that research data
management specialists need to play a
role in research data sharing
Perceived benefits of data sharing
Top 3
More possibilities
for collaboration
55%
Reproducibility of
research
53%
Article more likely to
be cited
50%
Perceived drawbacks of data sharing
Competitors using data
before collector has a
chance to re-use it
Use without
crediting/citing the
data collector
Cost (time and financial)
Legal concerns
(ownership, misuse,
confidentiality)
CWTS –Elsevier Open Data Report (2016): 1,200 researchers responded to a survey of 51,672 individuals randomly selected from Scopus
author database (2.3% response rate). Survey tool: Online survey available in English only. Co-branded with University of Leiden (CWTS).
Fieldwork took place in June-July 2016
https://www.elsevier.com/about/open-science/research-data/open-data-report
| 3
Publishers Getting Together: AGU FAIR Data Sharing Guidelines
1. https://www.elsevier.com/authors/author-services/research-data/data-guidelines/
2. https://cos.io/our-services/top-guidelines/
AGU FAIR DATA DRAFT
Consolidated proposal for data guidelines for publishers of Earth and Space Science journals:
1. Authors are required to deposit their research data in a relevant data repository:
• Before publication, large data sets (such as microarray data, protein or DNA sequences, atomic coordinates or
climate data) must be deposited in an approved database.
• An inventory of suggested and supported data repositories is provided by the Coalition on Publishing Data in the
Earth and Space Sciences (COPDESS).
• All data used in the analysis must be available to any researcher for purposes of reproducing or extending the
analysis.
2. Authors are required to cite and link to this dataset in your article, following the Force11 Data CItation Principles
3. If this is not possible, authors are required to submit a statement explaining why research data cannot be shared
Truly exceptional circumstances requiring special treatment, such as protecting personal privacy, should be discussed
with the editor no later than at the manuscript revision stage, and spelled out explicitly in the acknowledgments.
Elsevier + Science + Nature + Wiley + PLoS + Digital Science + AGU
| 4
Research data is more than just a journal supplement:
9
All forms of research data, which
includes everything needed to
reproduce and reuse experimental
and computational results.
Raw data Processed data
Machine &
environment settings
Protocols, methods, workflows Scripts, analyses, algorithms
| 5
3. ‘Metrics on data’
Monitoring and reporting
on institutional data
• Benchmark • Rank Evaluate
• Manage • Preserve
Institution
Search Repository
Notebook
Manager
Mendeley Data Platform:
Monitor
A modular, cloud-based platform designed for research institutions,
to manage the entire lifecycle of research data
Find Topic
Design
Identify gaps
Plan & Fund
Discover data, people,
methods & protocols
Collect, analyze
& visualize
Prepare, reproduce,
re-use & benchmark
Store &
Share
Publish
Disseminate
1. Lab data
Execute
Research
2. Open data: data publicly available
| 6
But how do we publish Data Science?
https://projectreporter.nih.gov/project_description.cfm?projectnumber=1R01MH107238-01
1R01MH107238-01 (Arnold, Fraser, Kesselman):
An experimental paradigm to allow dynamic monitoring of the strength
and location of every glutamatergic and GABA/Glycinergic synapse within the brain of a
living organism. This will involve combining three technologies:
1. Recombinant probes, ...
2. 2P-SPIM microscopy, …
3. Software to calculate and store the location and strength of each synapse in
such a manner that it can be easily manipulated and analyzed
| 7
The computer is a scientist, too:
“intelligent systems for computer-aided
discovery can integrate into the insight
generation loop in scalable ways…”
Computer-Aided Discovery: Towards Scientific Insight Generation with Machine Support, V. Pankratius, J. Li, M. Gowanlock, D. Blair, C. Rude, T.
Herring, F. Lind, P. Erickson, C. Lonsdale, IEEE Intelligent Systems 31(4), pp. 3-10, Jul/Aug 2016
“This work combines time series Principal
Component Analysis with InSAR to constrain
the space of possible model explanations on
current empirical data sets and achieve a better
identification of deformation patterns”
| 8
Data
Tool
Article
User
A Move Towards Networked Knowledge:
| 9
Moving from a pipeline to a platform model:
A platform:
• Is a nexus of rules and architecture
• Is open, allowing regulated participation
• Actively promotes (positive) interactions among different partners
• Scales much faster than a pipeline.
A network inherently has an external focus.
To have an external focus, you must have a community strategy.
Pipelines, Platforms, and the New Rules of Strategy, Harvard Business review, April 2016,
https://hbr.org/2016/04/pipelines-platforms-and-the-new-rules-of-strategy
| 10
existing integration
planned integration
Example #1: Mendeley Data Platform to integrate with
Research Data Management ecosystem
Index datasets
metadata
Mint DOIs Import / export notebooks,
experiments
Import / export
datasets
Repository
indexed by
OpenAIRE
Zenodo indexed
by DataSearch
Publish links
between articles
and datasets
Datasets indexed by
DataSearchLong-term
preservation
of published
datasets
+ 22 repositories
Integrate with
machine
readable DMPs
Open API
| 11
• Goal: develop a cloud-based solution for doing bioinformatics experiment
• Elsevier portion: build a Global Unique Identifier Broker tool
• Based on Mendeley Data DOI resolver, to be shared in OS through NIH Platform
Example #2: Collaboration NIH Data Commons Pilot
With SevenBridges Genomics
http://www.healthcareitnews.com/news/nih-taps-new-partners-build-commons-petabytes-biomedical-data
| 12
In Summary:
• Publishers are getting together to help store and share data:
- Open Data Report
- AGU Fair Data Group
- Mendeley Data Platform
• But: all science is becoming data science:
- Scientists are building the tools that other scientists work with
- We are moving ‘beyond download science’, where computers join in
• Publishers need to change:
- We need to enable network effects
- This means moving from a pipeline to a platform model
• Some things we are doing at Elsevier RDM:
- Mendeley Data Platform
- Participating in the NIH Data Commons Pilot
| 13
Many Questions Remain…
• How do we best publish Data Science?
• (How) do we connect this to the article/journal model that we all know
and love?
• How does scientific software become sustainable, used, connected?
• What role do all the parties play: funding agencies, tool builders
(commercial and academic), HPCCs, cloud providers, libraries,
standards bodies, publishers?
• How do all of us become connected into a viable network?
• How do we collectively transition from the old models into the new?
| 14
Extra Slides
| 15
https://www.elsevier.com/connect/10-aspects-of-highly-effective-research-data
10.Integrateupstreamanddownstream
–makemetadatatoserveuse.
Save
Share
Use
9. Re-usable (allow tools to run on it)
8. Reproducible
7. Trusted (e.g. reviewed)
6. Comprehensible (description / method is available)
5. Citable
4. Discoverable (data is indexed or data is linked from article)
3. Accessible
1. Stored (existing in some form)
2. Preserved (long-term & format-independent)
Ten Habits of Highly Effective Research Data:
| 16
Ideas are becoming distributed
Tools are becoming
distributed
Easy to create networks of
tools to run anywhere
(Docker, Jupyter Notbook
collections etc)
Many sources, formats,
owners, types: global,
interconnected
Computers make hypotheses, too*;
citizen science/MOOCs enable
ubiquitous access to knowledge
*
http://ieeexplore.ieee.org/abstract/document/7
515118/: Computer-Aided Discovery: Toward
Scientific Insight Generation with Machine
Data is becoming distributed
| 17
Data
Tool
Article
User
Towards Networked Knowledge:
| 18
Thank you!
Links to RDM Work:
• https://www.elsevier.com/connect/10-aspects-of-highly-effective-research-data
• https://www.elsevier.com/about/open-science/research-data
• https://www.hivebench.com
• https://data.mendeley.com/
• https://datasearch.elsevier.com/
• https://www.elsevier.com/books-and-journals/content-innovation/data-base-linking
• http://www.journals.elsevier.com/softwarex/
• https://rd-alliance.org/groups/rdawds-publishing-data-services-wg.html
• https://www.force11.org/
• http://www.nationaldataservice.org/
• https://rd-alliance.org/
Anita de Waard, a.dewaard@elsevier.com

More Related Content

What's hot

Trust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceTrust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceBeth Plale
 
SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010Jisc
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things dataARDC
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...SEAD
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the libraryColleen DeLory
 
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012SEAD
 
Capsule Computing: Safe Open Science
Capsule Computing: Safe Open Science Capsule Computing: Safe Open Science
Capsule Computing: Safe Open Science Beth Plale
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Jisc
 
SLIDES | 12 time-saving tips for research support
SLIDES | 12 time-saving tips for research supportSLIDES | 12 time-saving tips for research support
SLIDES | 12 time-saving tips for research supportLibrary_Connect
 
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEWUSING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEWNellore Harilakshmi
 
Slides | Targeting the librarian’s role in research services
Slides | Targeting the librarian’s role in research servicesSlides | Targeting the librarian’s role in research services
Slides | Targeting the librarian’s role in research servicesLibrary_Connect
 
Meeting Federal Research Requirements
Meeting Federal Research RequirementsMeeting Federal Research Requirements
Meeting Federal Research RequirementsICPSR
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data ManagementAmanda Whitmire
 

What's hot (20)

NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Trust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceTrust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail Science
 
SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
 
Capsule Computing: Safe Open Science
Capsule Computing: Safe Open Science Capsule Computing: Safe Open Science
Capsule Computing: Safe Open Science
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014
 
SLIDES | 12 time-saving tips for research support
SLIDES | 12 time-saving tips for research supportSLIDES | 12 time-saving tips for research support
SLIDES | 12 time-saving tips for research support
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEWUSING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
 
Baker - Evolution of Data Products and Designated Audiences
Baker - Evolution of Data Products and Designated AudiencesBaker - Evolution of Data Products and Designated Audiences
Baker - Evolution of Data Products and Designated Audiences
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Slides | Targeting the librarian’s role in research services
Slides | Targeting the librarian’s role in research servicesSlides | Targeting the librarian’s role in research services
Slides | Targeting the librarian’s role in research services
 
Meeting Federal Research Requirements
Meeting Federal Research RequirementsMeeting Federal Research Requirements
Meeting Federal Research Requirements
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
 

Similar to Data, Data Everywhere: What's A Publisher to Do?

Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharingJisc RDM
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?Anita de Waard
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
Effective research data management
Effective research data managementEffective research data management
Effective research data managementCatherine Gold
 
Why Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointWhy Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointMark Parsons
 
eCitizen Sensible-Data Design Challenge
eCitizen Sensible-Data Design ChallengeeCitizen Sensible-Data Design Challenge
eCitizen Sensible-Data Design Challengehopbeat
 
My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018Susanna-Assunta Sansone
 
FORCE11: Creating a data and tools ecosystem
FORCE11:  Creating a data and tools ecosystemFORCE11:  Creating a data and tools ecosystem
FORCE11: Creating a data and tools ecosystemMaryann Martone
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data ManagementCarole Goble
 
How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...ariadnenetwork
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data ManagementAnita de Waard
 
Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...ResearchSpace
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Anita de Waard
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseAnita de Waard
 
Rachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we nowRachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we nowJisc
 
Paving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsPaving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsThe University of Edinburgh
 
2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum Talk2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum TalkPaul Bracke
 

Similar to Data, Data Everywhere: What's A Publisher to Do? (20)

Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharing
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Effective research data management
Effective research data managementEffective research data management
Effective research data management
 
Why Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointWhy Data Citation Currently Misses the Point
Why Data Citation Currently Misses the Point
 
eCitizen Sensible-Data Design Challenge
eCitizen Sensible-Data Design ChallengeeCitizen Sensible-Data Design Challenge
eCitizen Sensible-Data Design Challenge
 
My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018
 
FORCE11: Creating a data and tools ecosystem
FORCE11:  Creating a data and tools ecosystemFORCE11:  Creating a data and tools ecosystem
FORCE11: Creating a data and tools ecosystem
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...
 
The Future of Research Communications and e-Scholarship: Are we there yet?
The Future of Research Communications and e-Scholarship: Are we there yet?The Future of Research Communications and e-Scholarship: Are we there yet?
The Future of Research Communications and e-Scholarship: Are we there yet?
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Rachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we nowRachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we now
 
Paving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsPaving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflows
 
Concept on e-Research
Concept on e-ResearchConcept on e-Research
Concept on e-Research
 
2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum Talk2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum Talk
 

More from Anita de Waard

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseAnita de Waard
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataAnita de Waard
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsAnita de Waard
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesAnita de Waard
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Anita de Waard
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of PublishingAnita de Waard
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data SharingAnita de Waard
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingAnita de Waard
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumAnita de Waard
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataAnita de Waard
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016Anita de Waard
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...Anita de Waard
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupAnita de Waard
 
Publishing the Full Research Data Lifecycle
Publishing the Full Research Data LifecyclePublishing the Full Research Data Lifecycle
Publishing the Full Research Data LifecycleAnita de Waard
 
The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to ReuseAnita de Waard
 
Collaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and softwareCollaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and softwareAnita de Waard
 
Argumentation in biology papers
Argumentation in biology papersArgumentation in biology papers
Argumentation in biology papersAnita de Waard
 
Ten Habits of Highly Effective Data
Ten Habits of Highly Effective DataTen Habits of Highly Effective Data
Ten Habits of Highly Effective DataAnita de Waard
 

More from Anita de Waard (20)

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring Guidelines
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.
 
History of the future
History of the futureHistory of the future
History of the future
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data Sharing
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest Group
 
Publishing the Full Research Data Lifecycle
Publishing the Full Research Data LifecyclePublishing the Full Research Data Lifecycle
Publishing the Full Research Data Lifecycle
 
The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to Reuse
 
Collaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and softwareCollaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and software
 
Argumentation in biology papers
Argumentation in biology papersArgumentation in biology papers
Argumentation in biology papers
 
Ten Habits of Highly Effective Data
Ten Habits of Highly Effective DataTen Habits of Highly Effective Data
Ten Habits of Highly Effective Data
 

Recently uploaded

Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomyDrAnita Sharma
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayupadhyaymani499
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx023NiWayanAnggiSriWa
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxThermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxuniversity
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
trihybrid cross , test cross chi squares
trihybrid cross , test cross chi squarestrihybrid cross , test cross chi squares
trihybrid cross , test cross chi squaresusmanzain586
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)riyaescorts54
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationColumbia Weather Systems
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxRitchAndruAgustin
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXDole Philippines School
 

Recently uploaded (20)

Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomy
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyay
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxThermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
trihybrid cross , test cross chi squares
trihybrid cross , test cross chi squarestrihybrid cross , test cross chi squares
trihybrid cross , test cross chi squares
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptx
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather Station
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
 

Data, Data Everywhere: What's A Publisher to Do?

  • 1. | 1 Anita de Waard 0000-0002-9034-4119 VP Research Data Collaborations, Elsevier RDM a.dewaard@elsevier.com Data, Data, Everywhere: What’s A Publisher To Do?
  • 2. | 2 Are Scientists Sharing Their Data? Why (not)? Attitudes to data sharing 73% agree that having access to other researchers’ data would benefit their research … but 64% agree that they would be willing to share their own research data with other researchers The reasons for discrepancy? Only 26% agreed they had received sufficient training in data management Only 37% agreed that sharing research data is associated with credit/ reward in their field 59% agreed that research data management specialists need to play a role in research data sharing Perceived benefits of data sharing Top 3 More possibilities for collaboration 55% Reproducibility of research 53% Article more likely to be cited 50% Perceived drawbacks of data sharing Competitors using data before collector has a chance to re-use it Use without crediting/citing the data collector Cost (time and financial) Legal concerns (ownership, misuse, confidentiality) CWTS –Elsevier Open Data Report (2016): 1,200 researchers responded to a survey of 51,672 individuals randomly selected from Scopus author database (2.3% response rate). Survey tool: Online survey available in English only. Co-branded with University of Leiden (CWTS). Fieldwork took place in June-July 2016 https://www.elsevier.com/about/open-science/research-data/open-data-report
  • 3. | 3 Publishers Getting Together: AGU FAIR Data Sharing Guidelines 1. https://www.elsevier.com/authors/author-services/research-data/data-guidelines/ 2. https://cos.io/our-services/top-guidelines/ AGU FAIR DATA DRAFT Consolidated proposal for data guidelines for publishers of Earth and Space Science journals: 1. Authors are required to deposit their research data in a relevant data repository: • Before publication, large data sets (such as microarray data, protein or DNA sequences, atomic coordinates or climate data) must be deposited in an approved database. • An inventory of suggested and supported data repositories is provided by the Coalition on Publishing Data in the Earth and Space Sciences (COPDESS). • All data used in the analysis must be available to any researcher for purposes of reproducing or extending the analysis. 2. Authors are required to cite and link to this dataset in your article, following the Force11 Data CItation Principles 3. If this is not possible, authors are required to submit a statement explaining why research data cannot be shared Truly exceptional circumstances requiring special treatment, such as protecting personal privacy, should be discussed with the editor no later than at the manuscript revision stage, and spelled out explicitly in the acknowledgments. Elsevier + Science + Nature + Wiley + PLoS + Digital Science + AGU
  • 4. | 4 Research data is more than just a journal supplement: 9 All forms of research data, which includes everything needed to reproduce and reuse experimental and computational results. Raw data Processed data Machine & environment settings Protocols, methods, workflows Scripts, analyses, algorithms
  • 5. | 5 3. ‘Metrics on data’ Monitoring and reporting on institutional data • Benchmark • Rank Evaluate • Manage • Preserve Institution Search Repository Notebook Manager Mendeley Data Platform: Monitor A modular, cloud-based platform designed for research institutions, to manage the entire lifecycle of research data Find Topic Design Identify gaps Plan & Fund Discover data, people, methods & protocols Collect, analyze & visualize Prepare, reproduce, re-use & benchmark Store & Share Publish Disseminate 1. Lab data Execute Research 2. Open data: data publicly available
  • 6. | 6 But how do we publish Data Science? https://projectreporter.nih.gov/project_description.cfm?projectnumber=1R01MH107238-01 1R01MH107238-01 (Arnold, Fraser, Kesselman): An experimental paradigm to allow dynamic monitoring of the strength and location of every glutamatergic and GABA/Glycinergic synapse within the brain of a living organism. This will involve combining three technologies: 1. Recombinant probes, ... 2. 2P-SPIM microscopy, … 3. Software to calculate and store the location and strength of each synapse in such a manner that it can be easily manipulated and analyzed
  • 7. | 7 The computer is a scientist, too: “intelligent systems for computer-aided discovery can integrate into the insight generation loop in scalable ways…” Computer-Aided Discovery: Towards Scientific Insight Generation with Machine Support, V. Pankratius, J. Li, M. Gowanlock, D. Blair, C. Rude, T. Herring, F. Lind, P. Erickson, C. Lonsdale, IEEE Intelligent Systems 31(4), pp. 3-10, Jul/Aug 2016 “This work combines time series Principal Component Analysis with InSAR to constrain the space of possible model explanations on current empirical data sets and achieve a better identification of deformation patterns”
  • 8. | 8 Data Tool Article User A Move Towards Networked Knowledge:
  • 9. | 9 Moving from a pipeline to a platform model: A platform: • Is a nexus of rules and architecture • Is open, allowing regulated participation • Actively promotes (positive) interactions among different partners • Scales much faster than a pipeline. A network inherently has an external focus. To have an external focus, you must have a community strategy. Pipelines, Platforms, and the New Rules of Strategy, Harvard Business review, April 2016, https://hbr.org/2016/04/pipelines-platforms-and-the-new-rules-of-strategy
  • 10. | 10 existing integration planned integration Example #1: Mendeley Data Platform to integrate with Research Data Management ecosystem Index datasets metadata Mint DOIs Import / export notebooks, experiments Import / export datasets Repository indexed by OpenAIRE Zenodo indexed by DataSearch Publish links between articles and datasets Datasets indexed by DataSearchLong-term preservation of published datasets + 22 repositories Integrate with machine readable DMPs Open API
  • 11. | 11 • Goal: develop a cloud-based solution for doing bioinformatics experiment • Elsevier portion: build a Global Unique Identifier Broker tool • Based on Mendeley Data DOI resolver, to be shared in OS through NIH Platform Example #2: Collaboration NIH Data Commons Pilot With SevenBridges Genomics http://www.healthcareitnews.com/news/nih-taps-new-partners-build-commons-petabytes-biomedical-data
  • 12. | 12 In Summary: • Publishers are getting together to help store and share data: - Open Data Report - AGU Fair Data Group - Mendeley Data Platform • But: all science is becoming data science: - Scientists are building the tools that other scientists work with - We are moving ‘beyond download science’, where computers join in • Publishers need to change: - We need to enable network effects - This means moving from a pipeline to a platform model • Some things we are doing at Elsevier RDM: - Mendeley Data Platform - Participating in the NIH Data Commons Pilot
  • 13. | 13 Many Questions Remain… • How do we best publish Data Science? • (How) do we connect this to the article/journal model that we all know and love? • How does scientific software become sustainable, used, connected? • What role do all the parties play: funding agencies, tool builders (commercial and academic), HPCCs, cloud providers, libraries, standards bodies, publishers? • How do all of us become connected into a viable network? • How do we collectively transition from the old models into the new?
  • 15. | 15 https://www.elsevier.com/connect/10-aspects-of-highly-effective-research-data 10.Integrateupstreamanddownstream –makemetadatatoserveuse. Save Share Use 9. Re-usable (allow tools to run on it) 8. Reproducible 7. Trusted (e.g. reviewed) 6. Comprehensible (description / method is available) 5. Citable 4. Discoverable (data is indexed or data is linked from article) 3. Accessible 1. Stored (existing in some form) 2. Preserved (long-term & format-independent) Ten Habits of Highly Effective Research Data:
  • 16. | 16 Ideas are becoming distributed Tools are becoming distributed Easy to create networks of tools to run anywhere (Docker, Jupyter Notbook collections etc) Many sources, formats, owners, types: global, interconnected Computers make hypotheses, too*; citizen science/MOOCs enable ubiquitous access to knowledge * http://ieeexplore.ieee.org/abstract/document/7 515118/: Computer-Aided Discovery: Toward Scientific Insight Generation with Machine Data is becoming distributed
  • 18. | 18 Thank you! Links to RDM Work: • https://www.elsevier.com/connect/10-aspects-of-highly-effective-research-data • https://www.elsevier.com/about/open-science/research-data • https://www.hivebench.com • https://data.mendeley.com/ • https://datasearch.elsevier.com/ • https://www.elsevier.com/books-and-journals/content-innovation/data-base-linking • http://www.journals.elsevier.com/softwarex/ • https://rd-alliance.org/groups/rdawds-publishing-data-services-wg.html • https://www.force11.org/ • http://www.nationaldataservice.org/ • https://rd-alliance.org/ Anita de Waard, a.dewaard@elsevier.com

Editor's Notes

  1. Outline: Some Trends Some Questions What Elsevier is interested in, and doing
  2. When we talk about data, we are meaning all forms of research data, which includes everything you need to reproduce and reuse.