SlideShare une entreprise Scribd logo
1  sur  23
Stories, that Persuade With Data:
Some Thoughts on Making Networks
of Knowledge.
Anita de Waard
VP Research Data Collaborations
a.dewaard@elsevier.com
Outline:
• Stories, that persuade with data
• Networks of claims and data
• Research Data Management: some
thoughts.
Scientific articles are stories...
The Story of Goldilocks and the
Three Bears

Story

Grammar

Paper

The AXH Domain of Ataxin-1 Mediates Neurodegeneration through Its
Interaction with Gfi-1/Senseless Proteins

Once upon a time

Time

Setting

Background

The mechanisms mediating SCA1 pathogenesis are still not fully
understood, but some general principles have emerged.

a little girl named Goldilocks

Characters

Objects of study the Drosophila Atx-1 homolog (dAtx-1) which lacks a polyQ tract,

She went for a walk in the forest.
Pretty soon, she came upon a
house.

Location

Experimental
setup

studied and compared in vivo effects and interactions to those of the
human protein

She knocked and, when no one
answered,

Goal

Research
goal

Gain insight into how Atx-1's function contributes to SCA1 pathogenesis.
How these interactions might contribute to the disease process and how
they might cause toxicity in only a subset of neurons in SCA1 is not fully
understood.

she walked right in.

Attempt

Hypothesis

Atx-1 may play a role in the regulation of gene expression

At the table in the kitchen, there
were three bowls of porridge.

Name

Name

dAtX-1 and hAtx-1 Induce Similar Phenotypes When Overexpressed in File

Goldilocks was hungry.

Subgoal

Subgoal

test the function of the AXH domain

She tasted the porridge from the
first bowl.

Attempt

Method

overexpressed dAtx-1 in flies using the GAL4/UAS system (Brand and
Perrimon, 1993) and compared its effects to those of hAtx-1.

This porridge is too hot! she
exclaimed.

Outcome

Results

Overexpression of dAtx-1 by Rhodopsin1(Rh1)-GAL4, which drives
expression in the differentiated R1-R6 photoreceptor cells (Mollereau et a
2000 and O'Tousa et al., 1985), results in neurodegeneration in the eye, as
does overexpression of hAtx-1[82Q]. Although at 2 days after eclosion,
overexpression of either Atx-1 does not show obvious morphological
changes in the photoreceptor cells

Data

(data not shown),

Results

both genotypes show many large holes and loss of cell integrity at 28 days

So, she tasted the porridge from the Activity
second bowl.
3
This porridge is too cold, she said
Outcome

Theme

Episode 1
...that persuade (editors/authors/readers!)…
Aristotle

Quintilian

Scientific Paper

prooimion

The introduction of a speech, where one announces the
Introduction subject and purpose of the discourse, and where one usually
/ exordium
employs the persuasive appeal to ethos in order to establish
credibility with the audience.

Introduction:
positioning

prothesis

Statement
of
The speaker here provides a narrative account of what has
Facts/narrati happened and generally explains the nature of the case.
o

Introduction: research
question

Summary/
propostitio

The propositio provides a brief summary of what one is about
to speak on, or concisely puts forth the charges or accusation.

Summary of contents

Proof/
confirmatio

The main body of the speech where one offers logical
arguments as proof. The appeal to logos is emphasized here.

Results

Refutation/
refutatio

As the name connotes, this section of a speech was devoted to
answering the counterarguments of one's opponent.

Related Work

peroratio

Following the refutatio and concluding the classical oration, the
peroratio conventionally employed appeals through pathos,
and often included a summing up.

Discussion: summary,
implications.

pistis

epilogos
... with data.

5
As claims get cited, they become facts:
Voorhoeve et al, Cell, 2006:
To investigate the possibility that miR-372 and miR-373 suppress the
expression of LATS2, we...

Hypothesis

Therefore, these results point to LATS2 as a mediator of the miR-372 and miR-373
effects on cell proliferation and tumorigenicity,
Implication

Raver-Shapira et.al, JMolCell 2007
Cited Implication
... two miRNAs, miRNA-372 and-373, function as potential novel oncogenes in
testicular germ cell tumors by inhibition of LATS2 expression, which suggests that
Lats2 is an important tumor suppressor (Voorhoeve et al., 2006).
Yabuta, JBioChem 2007:
miR-372 and miR-373 target the Lats2 tumor suppressor (Voorhoeve et al., 2006)

Fact
There are many problems
with this system!
• There are too many papers to read – for
people.
• Papers are not written in a way that makes
reading easy – for computers.
• Issues with reproducibility.
• Hard to access or assess the data.
• And how do we know how many data points a
claim was based on?
What might work: networks of claims and data:
Claim
PHC

undergo

Paper A:

Growth arrest

Paper B:
implication

implication

method

fact
method link

method

fact

goal

fact

goal

fact

results

results

data 1

data 4
data 2

data 3

data 5

data 6
How do we get there? Find claims:
E.g., scientific discourse analysis:
In contrast with previous hypotheses compact plaques form before significant
deposition of diffuse A beta, suggesting that different mechanisms are involved
in the deposition of diffuse amyloid and the aggregation into plaques.
Entities
Relationships
Temporality
Connections

Status

core information
(proposition)

information extraction

discourse structure

discourse analysis

rhetorical
metadiscourse

discourse analysis

thematic roles

Sándor, Àgnes and de Waard, Anita, (2012).
Turn claims into formal representations:
Biological statement with BEL/ epistemic
markup

BEL representation:

Epistemic
evaluation

These miRNAs neutralize p53-mediated CDK
inhibition, possibly through direct inhibition
of the expression of the tumor-suppressor
LATS2.

r(MIR:miR-372) |(tscript(p(HUGO:Trp53)) -|
kin(p(PFH:”CDK Family”)))
Increased abundance of miR372 decreases abundance of
LATS2
r(MIR:miR-372) -|
r(HUGO:LATS2)

Value =
Possible
Source =
Unknown
Basis =
Unknown

Biological statement with
Medscan/epistemic markup

MedScan Representation:

Epistemic
evaluation

Furthermore, we present evidence that the
secretion of nesfatin-1 into the culture
media was dramatically increased during the
differentiation of 3T3-L1 preadipocytes into
adipocytes (P < 0.001) and after treatments
with TNF-alpha, IL-6, insulin, and
dexamethasone (P < 0.01).

IL-6  NUCB2 (nesfatin-1)
Relation: MolTransport
Effect: Positive
CellType: Adipocytes
Cell Line: 3T3-L1

Value =
Probable
Source =
Author
Basis = Data
Use Linked Data to point to claims, and connect them:
the xml is fixed, but the structure is open!

allows for layers of annotation
but we all know
she was deluded then

said @anitawaard
on January 9, 2014

<ce:section id=#123>

this says

mice like cheese
What about the data?
•
•
•
•

Can we see it?
How can we evaluate it?
Can it be reproduced?
Can we combine or compare data points?
Elsevier Research Data Services
• Goals:
–
–
–
–

Increase data preservation: quantity and quality
Improve data use: by and for authors, readers, and lay people
Enhance interoperability: between systems, journals, databases
Help develop a sustainable data infrastructure.

• Guiding principles:
– In principle, all data stays open
– Work with existing repositories and tools (so URLs, front end etc
stay where they are)
– 2013/2014: Series of pilots and questionnaires to drive data
strategy/data policy and contribute optimally to an integrated
data infrastructure, enabling networked knowledge.
Research data management today:
Using antibodies
and squishy bits
Grad Students experiment
and enter details into their
lab notebook.
The PI then tries to make
sense of their slides,
and writes a paper.
End of story.
Where research data goes now:
> 50 My Papers
2 M scientists
2 My papers/year

A small portion of data
(1-2%?) stored in small,
topic-focused
data repositories

Majority of data
(90%?) is stored
on local hard drives

PDB:
88,3 k

PetDB:
1,5 k
MiRB:
25k

Some data
(8%?) stored in large,
generic data
repositories

Dryad:
7,631 files

SedDB:
0.6 k
TAIR:
72,1 k

Dataverse:
0.6 My

Institutional
Repositories
An Urban Legend is born:
• How can we make a standard neuroscience
wet lab more data-sharing savvy?
• Incorporate structured workflows into the daily
practice of a typical electrophysiology lab (the
Urban Lab at CMU)
– What does it take?
– Where are points of conflict?

• 1-year pilot, funded by Elsevier RDS:
– CMU: Shreejoy Tripathy, manage/user test
– Elsevier: development, UI, project management
Annotating data during
experimentation:
What does high-quality data
curation take?
Pilot project with IEDA:
– Build a database for lunar geochemistry
– Write joint report on building
repository, curation, costs and
challenges
Some thoughts
about the role of
the institute:

Funding
Agencies
Performance
reporting
Library

Institution
Research Office

Usage/Citation
reporting
Institutional
Repository

Indexing
Integrated
Performance
Query

Usage/Citation
reporting
Indexing

Research Data
Repositories

Unified Metadata Layer

Curation

Deposit /
Store
Indexing
Generic Data Storage
(such as Dropbox)

Electronic Lab
Notebooks

Integrated
Data Search

Data Flow

Deposit /
Store

Indexing & Search

Researchers

Performance
Reporting
Data Initiatives:
• Data Citation group:
– Synthesize principles of proper data citation
– ‘Declaration of Data Citation Principles’, 8 principles of
successful data citation -http://www.force11.org/datacitation

• Resource Identification Initiative:
– Promote research resource identification, discovery, and reuse
– Resource Identification Portal http://scicrunch.com/resources
– Central location for obtaining research resource identifiers
(RRIDs) for materials and software used in biomedical research
• Antibody: Abgent Cat# AP7251E, ABR:AB_2140114
• Tool: CellProfiler Image Analysis Software, NIFRegistry:nif-0000-00280
• Organism: MGI:MGI:3840442
In summary:
• Stories, that persuade with data:
– We need better ways to communicate science!

• Networks of claims and data:
– Promising steps towards identifying claims
– Entity identification and Linked Data helps
– Problem: access to data

• Research Data Management:
– Key issue: get data in up-front
– Evaluate and scale up role of repositories
– Codevelop view of role of institutions/libraries
Questions?
Anita de Waard
VP Research Data Collaborations
a.dewaard@elsevier.com

http://researchdata.elsevier.com/
Thank you!
Collaborations and discussions gratefully acknowledged:
• CMU: Nathan Urban, Shreejoy Tripathy, Shawn Burton, Rick
Gerkin, Santosh Chandrasekaran, Matthew Geramita, Eduard
Hovy
• UCSD: Phil Bourne, Brian Shoettlander, David Minor, Declan
Fleming, Ilya Zaslavsky
• NIF/Force11: Maryann Martone, Anita Bandrowski
• OHSU: Melissa Haendel, Nicole Vasilevsky
• California Digital Library: Carly Strasser, John Kunze, Stephen
Abrams
• IEDA: Kerstin Lehnert, Annika
• Elsevier: Mark Harviston, Jez Alder, David Marques

Contenu connexe

Tendances

2015 aem-grs-keynote
2015 aem-grs-keynote2015 aem-grs-keynote
2015 aem-grs-keynotec.titus.brown
 
Literature Mining and Systems Biology
Literature Mining and Systems BiologyLiterature Mining and Systems Biology
Literature Mining and Systems BiologyLars Juhl Jensen
 
Quality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic ModelingQuality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic ModelingStuti Nayak
 
Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)AllSeq
 
Text Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesText Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesDan Sullivan, Ph.D.
 
Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden Benjamin Good
 
Data101 pmcb retreat_09-20-13_final
Data101 pmcb retreat_09-20-13_finalData101 pmcb retreat_09-20-13_final
Data101 pmcb retreat_09-20-13_finalJackie Wirz, PhD
 
Exploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCHExploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCHbiocs
 
AI Systems @ Manchester
AI Systems @ ManchesterAI Systems @ Manchester
AI Systems @ ManchesterAndre Freitas
 
Biomedical literature mining
Biomedical literature miningBiomedical literature mining
Biomedical literature miningLars Juhl Jensen
 

Tendances (13)

2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbio
 
2015 aem-grs-keynote
2015 aem-grs-keynote2015 aem-grs-keynote
2015 aem-grs-keynote
 
Literature Mining and Systems Biology
Literature Mining and Systems BiologyLiterature Mining and Systems Biology
Literature Mining and Systems Biology
 
Quality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic ModelingQuality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic Modeling
 
Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)
 
Text Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesText Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious Diseases
 
Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden
 
Ashg sedlazeck grc_share
Ashg sedlazeck grc_shareAshg sedlazeck grc_share
Ashg sedlazeck grc_share
 
Data101 pmcb retreat_09-20-13_final
Data101 pmcb retreat_09-20-13_finalData101 pmcb retreat_09-20-13_final
Data101 pmcb retreat_09-20-13_final
 
Exploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCHExploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCH
 
171017 giab for giab grc workshop
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshop
 
AI Systems @ Manchester
AI Systems @ ManchesterAI Systems @ Manchester
AI Systems @ Manchester
 
Biomedical literature mining
Biomedical literature miningBiomedical literature mining
Biomedical literature mining
 

Similaire à 'Stories that persuade with data' - talk at CENDI meeting January 9 2014

How Scientists Read, How Computers Read, and What We Should Do
How Scientists Read, How Computers Read, and What We Should DoHow Scientists Read, How Computers Read, and What We Should Do
How Scientists Read, How Computers Read, and What We Should DoAnita de Waard
 
Are we finally ready for transclusion?*
Are we finally ready for transclusion?*Are we finally ready for transclusion?*
Are we finally ready for transclusion?*Paul Groth
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...Anita de Waard
 
How to persuade with data
How to persuade with dataHow to persuade with data
How to persuade with dataAnita de Waard
 
Argumentation in biology papers
Argumentation in biology papersArgumentation in biology papers
Argumentation in biology papersAnita de Waard
 
EEG and Telemetry: Best Practices for Managing Large Data Sets to Investigate...
EEG and Telemetry: Best Practices for Managing Large Data Sets to Investigate...EEG and Telemetry: Best Practices for Managing Large Data Sets to Investigate...
EEG and Telemetry: Best Practices for Managing Large Data Sets to Investigate...InsideScientific
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08Russ Altman
 
CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...CINECAProject
 
IJSRED-V2I1P5
IJSRED-V2I1P5IJSRED-V2I1P5
IJSRED-V2I1P5IJSRED
 
dkNET Webinar: RRIDs and Naughty Cell Lines 04/12/2019
dkNET Webinar: RRIDs and Naughty Cell Lines 04/12/2019dkNET Webinar: RRIDs and Naughty Cell Lines 04/12/2019
dkNET Webinar: RRIDs and Naughty Cell Lines 04/12/2019dkNET
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsmikaelhuss
 
Talk at ISWC 2012 Workshop on Semantic Technologies Applied to Biomedical In...
 Talk at ISWC 2012 Workshop on Semantic Technologies Applied to Biomedical In... Talk at ISWC 2012 Workshop on Semantic Technologies Applied to Biomedical In...
Talk at ISWC 2012 Workshop on Semantic Technologies Applied to Biomedical In...Anita de Waard
 
Friend harvard 2013-01-30
Friend harvard 2013-01-30Friend harvard 2013-01-30
Friend harvard 2013-01-30Sage Base
 
Crofton McKim Conf Thyroid QSAR Talk 10-17-2008
Crofton McKim Conf Thyroid QSAR Talk 10-17-2008Crofton McKim Conf Thyroid QSAR Talk 10-17-2008
Crofton McKim Conf Thyroid QSAR Talk 10-17-2008KevinCrofton
 
Bioinformatics final
Bioinformatics finalBioinformatics final
Bioinformatics finalRainu Rajeev
 
Molecular and data visualization in drug discovery
Molecular and data visualization in drug discoveryMolecular and data visualization in drug discovery
Molecular and data visualization in drug discoveryDeepak Bandyopadhyay
 

Similaire à 'Stories that persuade with data' - talk at CENDI meeting January 9 2014 (20)

How Scientists Read, How Computers Read, and What We Should Do
How Scientists Read, How Computers Read, and What We Should DoHow Scientists Read, How Computers Read, and What We Should Do
How Scientists Read, How Computers Read, and What We Should Do
 
Are we finally ready for transclusion?*
Are we finally ready for transclusion?*Are we finally ready for transclusion?*
Are we finally ready for transclusion?*
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
 
How to persuade with data
How to persuade with dataHow to persuade with data
How to persuade with data
 
Argumentation in biology papers
Argumentation in biology papersArgumentation in biology papers
Argumentation in biology papers
 
2013 alumni-webinar
2013 alumni-webinar2013 alumni-webinar
2013 alumni-webinar
 
EEG and Telemetry: Best Practices for Managing Large Data Sets to Investigate...
EEG and Telemetry: Best Practices for Managing Large Data Sets to Investigate...EEG and Telemetry: Best Practices for Managing Large Data Sets to Investigate...
EEG and Telemetry: Best Practices for Managing Large Data Sets to Investigate...
 
Xerox2009
Xerox2009Xerox2009
Xerox2009
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08
 
CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...
 
IJSRED-V2I1P5
IJSRED-V2I1P5IJSRED-V2I1P5
IJSRED-V2I1P5
 
dkNET Webinar: RRIDs and Naughty Cell Lines 04/12/2019
dkNET Webinar: RRIDs and Naughty Cell Lines 04/12/2019dkNET Webinar: RRIDs and Naughty Cell Lines 04/12/2019
dkNET Webinar: RRIDs and Naughty Cell Lines 04/12/2019
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 
Talk at ISWC 2012 Workshop on Semantic Technologies Applied to Biomedical In...
 Talk at ISWC 2012 Workshop on Semantic Technologies Applied to Biomedical In... Talk at ISWC 2012 Workshop on Semantic Technologies Applied to Biomedical In...
Talk at ISWC 2012 Workshop on Semantic Technologies Applied to Biomedical In...
 
Insilico binding studies on tau protein and pp2 a as alternative targets in a...
Insilico binding studies on tau protein and pp2 a as alternative targets in a...Insilico binding studies on tau protein and pp2 a as alternative targets in a...
Insilico binding studies on tau protein and pp2 a as alternative targets in a...
 
Friend harvard 2013-01-30
Friend harvard 2013-01-30Friend harvard 2013-01-30
Friend harvard 2013-01-30
 
Dynamics of developmental fate decisions - Luís A. Nunes Amaral
Dynamics of developmental fate decisions - Luís A. Nunes AmaralDynamics of developmental fate decisions - Luís A. Nunes Amaral
Dynamics of developmental fate decisions - Luís A. Nunes Amaral
 
Crofton McKim Conf Thyroid QSAR Talk 10-17-2008
Crofton McKim Conf Thyroid QSAR Talk 10-17-2008Crofton McKim Conf Thyroid QSAR Talk 10-17-2008
Crofton McKim Conf Thyroid QSAR Talk 10-17-2008
 
Bioinformatics final
Bioinformatics finalBioinformatics final
Bioinformatics final
 
Molecular and data visualization in drug discovery
Molecular and data visualization in drug discoveryMolecular and data visualization in drug discovery
Molecular and data visualization in drug discovery
 

Plus de Anita de Waard

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseAnita de Waard
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?Anita de Waard
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataAnita de Waard
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsAnita de Waard
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesAnita de Waard
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Anita de Waard
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?Anita de Waard
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data ManagementAnita de Waard
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseAnita de Waard
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of PublishingAnita de Waard
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsAnita de Waard
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryAnita de Waard
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data SharingAnita de Waard
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingAnita de Waard
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumAnita de Waard
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataAnita de Waard
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016Anita de Waard
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupAnita de Waard
 

Plus de Anita de Waard (20)

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring Guidelines
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
History of the future
History of the futureHistory of the future
History of the future
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost Recovery
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data Sharing
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest Group
 

Dernier

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 

Dernier (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 

'Stories that persuade with data' - talk at CENDI meeting January 9 2014

  • 1. Stories, that Persuade With Data: Some Thoughts on Making Networks of Knowledge. Anita de Waard VP Research Data Collaborations a.dewaard@elsevier.com
  • 2. Outline: • Stories, that persuade with data • Networks of claims and data • Research Data Management: some thoughts.
  • 3. Scientific articles are stories... The Story of Goldilocks and the Three Bears Story Grammar Paper The AXH Domain of Ataxin-1 Mediates Neurodegeneration through Its Interaction with Gfi-1/Senseless Proteins Once upon a time Time Setting Background The mechanisms mediating SCA1 pathogenesis are still not fully understood, but some general principles have emerged. a little girl named Goldilocks Characters Objects of study the Drosophila Atx-1 homolog (dAtx-1) which lacks a polyQ tract, She went for a walk in the forest. Pretty soon, she came upon a house. Location Experimental setup studied and compared in vivo effects and interactions to those of the human protein She knocked and, when no one answered, Goal Research goal Gain insight into how Atx-1's function contributes to SCA1 pathogenesis. How these interactions might contribute to the disease process and how they might cause toxicity in only a subset of neurons in SCA1 is not fully understood. she walked right in. Attempt Hypothesis Atx-1 may play a role in the regulation of gene expression At the table in the kitchen, there were three bowls of porridge. Name Name dAtX-1 and hAtx-1 Induce Similar Phenotypes When Overexpressed in File Goldilocks was hungry. Subgoal Subgoal test the function of the AXH domain She tasted the porridge from the first bowl. Attempt Method overexpressed dAtx-1 in flies using the GAL4/UAS system (Brand and Perrimon, 1993) and compared its effects to those of hAtx-1. This porridge is too hot! she exclaimed. Outcome Results Overexpression of dAtx-1 by Rhodopsin1(Rh1)-GAL4, which drives expression in the differentiated R1-R6 photoreceptor cells (Mollereau et a 2000 and O'Tousa et al., 1985), results in neurodegeneration in the eye, as does overexpression of hAtx-1[82Q]. Although at 2 days after eclosion, overexpression of either Atx-1 does not show obvious morphological changes in the photoreceptor cells Data (data not shown), Results both genotypes show many large holes and loss of cell integrity at 28 days So, she tasted the porridge from the Activity second bowl. 3 This porridge is too cold, she said Outcome Theme Episode 1
  • 4. ...that persuade (editors/authors/readers!)… Aristotle Quintilian Scientific Paper prooimion The introduction of a speech, where one announces the Introduction subject and purpose of the discourse, and where one usually / exordium employs the persuasive appeal to ethos in order to establish credibility with the audience. Introduction: positioning prothesis Statement of The speaker here provides a narrative account of what has Facts/narrati happened and generally explains the nature of the case. o Introduction: research question Summary/ propostitio The propositio provides a brief summary of what one is about to speak on, or concisely puts forth the charges or accusation. Summary of contents Proof/ confirmatio The main body of the speech where one offers logical arguments as proof. The appeal to logos is emphasized here. Results Refutation/ refutatio As the name connotes, this section of a speech was devoted to answering the counterarguments of one's opponent. Related Work peroratio Following the refutatio and concluding the classical oration, the peroratio conventionally employed appeals through pathos, and often included a summing up. Discussion: summary, implications. pistis epilogos
  • 6. As claims get cited, they become facts: Voorhoeve et al, Cell, 2006: To investigate the possibility that miR-372 and miR-373 suppress the expression of LATS2, we... Hypothesis Therefore, these results point to LATS2 as a mediator of the miR-372 and miR-373 effects on cell proliferation and tumorigenicity, Implication Raver-Shapira et.al, JMolCell 2007 Cited Implication ... two miRNAs, miRNA-372 and-373, function as potential novel oncogenes in testicular germ cell tumors by inhibition of LATS2 expression, which suggests that Lats2 is an important tumor suppressor (Voorhoeve et al., 2006). Yabuta, JBioChem 2007: miR-372 and miR-373 target the Lats2 tumor suppressor (Voorhoeve et al., 2006) Fact
  • 7. There are many problems with this system! • There are too many papers to read – for people. • Papers are not written in a way that makes reading easy – for computers. • Issues with reproducibility. • Hard to access or assess the data. • And how do we know how many data points a claim was based on?
  • 8. What might work: networks of claims and data: Claim PHC undergo Paper A: Growth arrest Paper B: implication implication method fact method link method fact goal fact goal fact results results data 1 data 4 data 2 data 3 data 5 data 6
  • 9. How do we get there? Find claims: E.g., scientific discourse analysis: In contrast with previous hypotheses compact plaques form before significant deposition of diffuse A beta, suggesting that different mechanisms are involved in the deposition of diffuse amyloid and the aggregation into plaques. Entities Relationships Temporality Connections Status core information (proposition) information extraction discourse structure discourse analysis rhetorical metadiscourse discourse analysis thematic roles Sándor, Àgnes and de Waard, Anita, (2012).
  • 10. Turn claims into formal representations: Biological statement with BEL/ epistemic markup BEL representation: Epistemic evaluation These miRNAs neutralize p53-mediated CDK inhibition, possibly through direct inhibition of the expression of the tumor-suppressor LATS2. r(MIR:miR-372) |(tscript(p(HUGO:Trp53)) -| kin(p(PFH:”CDK Family”))) Increased abundance of miR372 decreases abundance of LATS2 r(MIR:miR-372) -| r(HUGO:LATS2) Value = Possible Source = Unknown Basis = Unknown Biological statement with Medscan/epistemic markup MedScan Representation: Epistemic evaluation Furthermore, we present evidence that the secretion of nesfatin-1 into the culture media was dramatically increased during the differentiation of 3T3-L1 preadipocytes into adipocytes (P < 0.001) and after treatments with TNF-alpha, IL-6, insulin, and dexamethasone (P < 0.01). IL-6  NUCB2 (nesfatin-1) Relation: MolTransport Effect: Positive CellType: Adipocytes Cell Line: 3T3-L1 Value = Probable Source = Author Basis = Data
  • 11. Use Linked Data to point to claims, and connect them: the xml is fixed, but the structure is open! allows for layers of annotation but we all know she was deluded then said @anitawaard on January 9, 2014 <ce:section id=#123> this says mice like cheese
  • 12. What about the data? • • • • Can we see it? How can we evaluate it? Can it be reproduced? Can we combine or compare data points?
  • 13. Elsevier Research Data Services • Goals: – – – – Increase data preservation: quantity and quality Improve data use: by and for authors, readers, and lay people Enhance interoperability: between systems, journals, databases Help develop a sustainable data infrastructure. • Guiding principles: – In principle, all data stays open – Work with existing repositories and tools (so URLs, front end etc stay where they are) – 2013/2014: Series of pilots and questionnaires to drive data strategy/data policy and contribute optimally to an integrated data infrastructure, enabling networked knowledge.
  • 14. Research data management today: Using antibodies and squishy bits Grad Students experiment and enter details into their lab notebook. The PI then tries to make sense of their slides, and writes a paper. End of story.
  • 15. Where research data goes now: > 50 My Papers 2 M scientists 2 My papers/year A small portion of data (1-2%?) stored in small, topic-focused data repositories Majority of data (90%?) is stored on local hard drives PDB: 88,3 k PetDB: 1,5 k MiRB: 25k Some data (8%?) stored in large, generic data repositories Dryad: 7,631 files SedDB: 0.6 k TAIR: 72,1 k Dataverse: 0.6 My Institutional Repositories
  • 16. An Urban Legend is born: • How can we make a standard neuroscience wet lab more data-sharing savvy? • Incorporate structured workflows into the daily practice of a typical electrophysiology lab (the Urban Lab at CMU) – What does it take? – Where are points of conflict? • 1-year pilot, funded by Elsevier RDS: – CMU: Shreejoy Tripathy, manage/user test – Elsevier: development, UI, project management
  • 18. What does high-quality data curation take? Pilot project with IEDA: – Build a database for lunar geochemistry – Write joint report on building repository, curation, costs and challenges
  • 19. Some thoughts about the role of the institute: Funding Agencies Performance reporting Library Institution Research Office Usage/Citation reporting Institutional Repository Indexing Integrated Performance Query Usage/Citation reporting Indexing Research Data Repositories Unified Metadata Layer Curation Deposit / Store Indexing Generic Data Storage (such as Dropbox) Electronic Lab Notebooks Integrated Data Search Data Flow Deposit / Store Indexing & Search Researchers Performance Reporting
  • 20. Data Initiatives: • Data Citation group: – Synthesize principles of proper data citation – ‘Declaration of Data Citation Principles’, 8 principles of successful data citation -http://www.force11.org/datacitation • Resource Identification Initiative: – Promote research resource identification, discovery, and reuse – Resource Identification Portal http://scicrunch.com/resources – Central location for obtaining research resource identifiers (RRIDs) for materials and software used in biomedical research • Antibody: Abgent Cat# AP7251E, ABR:AB_2140114 • Tool: CellProfiler Image Analysis Software, NIFRegistry:nif-0000-00280 • Organism: MGI:MGI:3840442
  • 21. In summary: • Stories, that persuade with data: – We need better ways to communicate science! • Networks of claims and data: – Promising steps towards identifying claims – Entity identification and Linked Data helps – Problem: access to data • Research Data Management: – Key issue: get data in up-front – Evaluate and scale up role of repositories – Codevelop view of role of institutions/libraries
  • 22. Questions? Anita de Waard VP Research Data Collaborations a.dewaard@elsevier.com http://researchdata.elsevier.com/
  • 23. Thank you! Collaborations and discussions gratefully acknowledged: • CMU: Nathan Urban, Shreejoy Tripathy, Shawn Burton, Rick Gerkin, Santosh Chandrasekaran, Matthew Geramita, Eduard Hovy • UCSD: Phil Bourne, Brian Shoettlander, David Minor, Declan Fleming, Ilya Zaslavsky • NIF/Force11: Maryann Martone, Anita Bandrowski • OHSU: Melissa Haendel, Nicole Vasilevsky • California Digital Library: Carly Strasser, John Kunze, Stephen Abrams • IEDA: Kerstin Lehnert, Annika • Elsevier: Mark Harviston, Jez Alder, David Marques

Notes de l'éditeur

  1. Walk through pieces 1 by 1, also mention that this is very much an uncompleted work in progress