Sharing Detailed Research Data is Associated with Increased Citation Rate

Heather Piwowar
Roger Day and Douglas Fridsma
University of Pittsburgh

Published in PLoS ONE, March 27 2007
Funded by NLM Training Grant
Presented at NLM Trainee Conference, June 27 2007
Sharing research data




    PAST MEDICAL HISTORY:
    Past medical history showed she had
    superficial phlebitis times two in the past, had
    non-insulin dependent diabetes mellitus for
    four years.
    She had been hypothyroid for three years.
    HISTORY OF PRESENT ILLNESS:
    The patient is a 58-year-old female, …


Sources: http://upload.wikimedia.org/wikipedia/commons/7/76/PeptideMSMS.jpg; http://en.wikipedia.org/wiki/Image:Helices.png;
http://en.wikipedia.org/wiki/Image:Heatmap.png; http://en.wikipedia.org/wiki/Image:Microarray2.gif;
http://zellig.cpmc.columbia.edu/medlee/demo/; http://www.plosone.org/article/fetchArticle.action?articleURI=info:doi/10.1371/journal.pone.0000441
Shared data benefits science
•    Verify
•    Understand
•    Extend
•    Explore
•    Combine
•    Synergize
•    Train
•    Reduce
But… costly for authors
•    Find
•    Organize
•    Document
•    Deidentify
•    Format
•    Decide
•    Ask
•    Submit

•  Answer questions

•  Worry about mistakes being found
•  Worry about data being misinterpreted
•  Worry about being scooped

•  Forgo money and IP and prestige???
So what’s in it for them?
Carrot.
A currency of value?
Citations.
  $50!


Do trials which share their data receive more citations?
Methods
Cancer Microarray Trials
   Ntzani and Ioannidis identified 85 trials published 1999-2003


Citations
   ISI Web of Science Citation Index, citations from 2004-2005


Data availability
   Publisher and lab websites, microarray databases,
   WayBack Internet Archive, Oncomine


Statistics
   Multivariate linear regression
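
As a rough illustration of how a multivariate linear regression can yield a percentage statement about citations, the sketch below fits ordinary least squares to a log-transformed citation count. Everything in it is an assumption made for illustration: the data are synthetic and noise-free, the covariate name impact_factor is hypothetical, and the log transform is one common choice rather than something stated on the slide. With a log outcome, a coefficient b on a yes/no "shared data" variable corresponds to a multiplicative change of exp(b) in citations, holding the other covariates fixed.

```python
import math

def ols(X, y):
    """Ordinary least squares via the normal equations (X'X) b = X'y."""
    n, k = len(X), len(X[0])
    # Build the augmented matrix [X'X | X'y].
    A = [[sum(X[r][i] * X[r][j] for r in range(n)) for j in range(k)]
         + [sum(X[r][i] * y[r] for r in range(n))]
         for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]

# Synthetic trials (NOT the study data): (shared_data, impact_factor).
# impact_factor is a hypothetical covariate chosen for this sketch.
trials = [(1, 5.0), (0, 5.0), (1, 10.0), (0, 20.0), (1, 30.0), (0, 2.0)]

# Synthetic outcome: log citations built so that sharing multiplies
# citations by 1.7 at any fixed impact factor.
true_effect = math.log(1.7)
y = [1.0 + true_effect * s + 0.1 * f for s, f in trials]

# Design matrix: intercept, sharing indicator, impact factor.
X = [[1.0, float(s), f] for s, f in trials]
beta = ols(X, y)

# Convert the log-scale coefficient into a percentage change.
pct = (math.exp(beta[1]) - 1) * 100
print(f"sharing coefficient: {beta[1]:.3f}, percent change: {pct:.0f}%")
```

Because the synthetic outcome has no noise, the fit recovers the built-in effect; real citation data would also need standard errors and a check of the residuals.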
Results: Eligible trials
•  85 trials
•  41 (48%) made data available
•  Various locations:
  –  Lab websites (28)
  –  Publisher websites (4)
  –  SMD (6)
  –  GEO (6)
  –  GEDP (2)
•  6239 total citations
Results: Big picture
Number of trials
•  85 clinical trials used microarrays to study cancer between 1999-2003
•  41 (48%) of these trials made their microarray data publicly available on the internet
Number of citations
•  These 85 trials were cited 6239 times during 2004-2005
•  Trials which shared data received 5334 (85%) of these citations
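
The percentages on the results slides can be checked with a few lines of arithmetic. The sketch below uses only the counts shown on the slides; the variable names are made up for illustration. The raw per-trial means it prints are descriptive only: they are sensitive to a few very highly cited trials and are not the adjusted regression estimate.

```python
# Counts taken from the results slides.
trials_total = 85
trials_shared = 41
citations_total = 6239
citations_shared = 5334

# Derived counts for the trials that did not share data.
trials_not_shared = trials_total - trials_shared
citations_not_shared = citations_total - citations_shared

# Proportions reported on the slides.
share_of_trials = trials_shared / trials_total
share_of_citations = citations_shared / citations_total

# Raw mean citations per trial in each group (unadjusted).
mean_shared = citations_shared / trials_shared
mean_not_shared = citations_not_shared / trials_not_shared

# Locations where data were found, as listed on the slide.
locations = {"Lab websites": 28, "Publisher websites": 4,
             "SMD": 6, "GEO": 6, "GEDP": 2}
location_total = sum(locations.values())

print(f"trials sharing data: {share_of_trials:.0%}")
print(f"citations to sharing trials: {share_of_citations:.0%}")
print(f"raw mean citations: {mean_shared:.1f} shared, {mean_not_shared:.1f} not shared")
print(f"location counts sum to {location_total} for {trials_shared} trials")
```

The location counts add up to more than 41, which presumably means some trials posted their data in more than one place.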
Results: Distribution of citation counts

From: Piwowar HA, Day RS, Fridsma DB (2007) Sharing Detailed Research Data Is Associated with Increased Citation Rate.
PLoS ONE 2(3): e308. doi:10.1371/journal.pone.0000308
Results: Multivariate regression

From: Piwowar HA, Day RS, Fridsma DB (2007) Sharing Detailed Research Data Is Associated with Increased Citation Rate.
PLoS ONE 2(3): e308. doi:10.1371/journal.pone.0000308
Limitations
•  Outliers
  –  Subset analysis of lower profile papers


•  Complex timing
  –  Additional analysis of citations within 24 months


•  Association does not imply causation
  –  Could be common cause
Data sharing help on the way
•  Free, centralized databases
  –  SMD, GEO, ArrayExpress
•  Standards
  –  MIAME, CONSORT
•  Tools
  –  De-id, caBIG
•  Community
  –  Journals, Funders, Organizations, Blogs
Conclusions
•  70% increase in citation impact for trials
   which make data available

•  Result holds for lower-profile publications

•  Hopefully a motivation for authors to share
   data and thus maximize its usefulness
For more information
•  Participate in the discussion on this paper
   at PLoS ONE

•  Check out blogs on Open Access, Open Data,
   Open Notebook Science
   –  Peter Suber’s Open Access News blog
   –  Wikipedia: “Open Data”
   –  Nature Editorial: May 3, 2007

•  Contact Heather Piwowar for further discussion
   and enthusiasm!    hpiwowar@alumni.pitt.edu
Thank you
•  Peter Suber’s blog: “Open Access News”
•  Wikipedia: “Open Data”
•  Nature Editorial: May 3, 2007

I support Open Data
and share my literature, code, and data whenever possible.

Long term research interest:
data reuse as an underutilized informatics resource


                  Questions?
