Reuse of Repository Data Valerie Enriquez – DataONE – Summer 2010
Motivation
Initial Questions
To whose benefit?
Methods
Initial Analysis
Stumbles and other Worrisome Things
Image courtesy of: http://currentskateofmind.com/2008/03/25/glossary-of-skating-falls/
Initial Findings
Key: * invalid field input; $ effective; # ineffective
Search databases: ISI Web of Science, Scirus, Google Scholar
Repositories: TreeBASE, Pangaea, ORNL DAAC
Lessons Learned
Image courtesy of: http://www.squidoo.com/stop_information_overload
"Hey, I think I found that data citation you were looking for."
Where do we go from here?



Editor's Notes

  1. Data deposit versus data reuse: data is created in the course of a study and deposited in a repository; reuse happens when that data is later drawn on for new studies and articles.
     Why is it important that we track the reuse of data?
     Transparency: ensuring that neither misinterpretation of data nor outcome bias affects future studies.
     Collaboration: enabling researchers to share datasets with each other to find overlap and break new ground, as opposed to revisiting old territory. New data can confirm existing data, refute existing data, or combine with existing data to form new conclusions.
     Healthy competition: data citations could potentially bring a researcher or institute the same level of prestige that article citations currently bring.
     Invigoration: data that had gathered dust on the digital shelf gets new life when applied to new studies and articles.
     We track article citation, so why not data citation?
  2. Email [06/28/2010 01:34:12 AM EDT]
     Valerie,
     Heather Piwowar has given me your email address, as I wanted to contact you directly in relation to the project you are doing this summer. It sounds like a really interesting project and I, for one, will be really keen to see what you come up with. I should imagine that others will find it equally interesting.
     The reason I wanted to write to you directly was to let you know of some of the activities of the Australian National Data Service (ANDS), as these might relate to your work. It's also to let you know that there is a huge amount going on in the US and elsewhere about the whole issue of data re-use and sharing.
     ANDS was established early last year with funding from the Australian Department of Innovation, Industry, Science and Research. You can read all about us at our website, which is linked below. One of the important aspects of our work is developing the Australian Research Data Commons, which is to say that we want to identify data sets available in Australian research institutions or through government agencies, and make that information available as widely as we can. Metadata about data sets is provided to a central registry, and the public face of that is a web interface called Research Data Australia (http://services.ands.org.au/).
     The Australian government invests heavily in research, as do most governments, so it is keen to ensure that data is valued as an output of research and that it is available for re-use wherever that is possible. As part of this overall project we will be providing data sets published through us with DOIs to enable tracking of citations. We have joined DataCite (http://www.datacite.org/) along with other international partners and hope to start minting DOIs within the next couple of months.
     We have also had discussions with both Elsevier and ThomsonReuters about using them to track data citations, and both are interested. Elsevier has already successfully conducted trials with TIB (the German National Library of Science and Technology) to test this. We discussed the lack of a data publication format in EndNote with ThomsonReuters, who own it, so they are aware of the issue.
     This is a very brief outline of some of the activities we are doing in the area, but you can learn more from our short guide at http://ands.org.au/guides/data-citation-awareness.html. As you can see from all of this, we are very interested in the whole topic of data sharing and re-use and in all of those things, such as data citation, that are linked to the process. If you have any questions we would be happy to answer them.
     With best wishes,
     Margaret (Henty)

     Email [07/02/2010 09:38:23 AM EDT]
     I have just seen your blog entries and the data resources page on citation and have found them very interesting, as I have been doing some work on citation of data, most recently in the context of the Sage initiative (http://www.sagebase.org/). (At one time I used to work on the eBank UK project, and I was bemused to see the page created for that project rather a long time ago referenced in your list of resources :)
     I am forwarding some more recent resources that I am aware of. Apologies for the formatting (or lack of it), as I have just copied these from the wiki, and for just emailing them rather than going through the process of adding them to your wiki. The first in particular reviewed the citation policies of a number of data repositories. I hope that not many of them are repeated on your current list.
     I look forward to hearing further results of your research in the future.
     Best wishes,
     Monica Duke
  3. Initial search process: test searches for TreeBASE, yielding sample articles, study accession numbers, and data author names to search for later.
     ORNL DAAC: Oak Ridge National Laboratory Distributed Active Archive Center
  4. Reasons captured for a miss: no mention of the repository name (ex: the Pangaea supercontinent); mention of the repository only (as in articles about repositories that cite TreeBASE, Pangaea, or ORNL DAAC as examples); articles whose authors deposited data in the repository rather than downloading from it; and other.
     Reasons captured for a hit: citation mentioning the repository name; citation of a DOI or study accession number; full citation according to the repository's recommendations (these vary); citation of the data author's name.
     Interpretation: browse through the observations made in the OpenWetWare journal entries, and look through the Search Comparisons spreadsheet for the percentage of hits versus misses as well as the types of hits and misses that occurred.
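The hit and miss bookkeeping described in the note above could be tallied along the following lines. This is only a minimal sketch: the record layout, the function names, and the four sample rows are illustrative placeholders, not data from the study or from the Search Comparisons spreadsheet.

```python
from collections import Counter

# Placeholder rows: (search database, outcome, reason).
# The reason labels paraphrase the categories in the note; the rows are made up.
records = [
    ("ISI Web of Science", "hit", "data author name cited"),
    ("Scirus", "miss", "repository only mentioned as an example"),
    ("Google Scholar", "hit", "DOI or study accession number cited"),
    ("Google Scholar", "miss", "data deposited, not downloaded"),
]


def hit_rate(rows):
    """Return the share of rows marked as hits, as a percentage."""
    if not rows:
        return 0.0
    hits = sum(1 for _, outcome, _ in rows if outcome == "hit")
    return 100.0 * hits / len(rows)


def reasons_by_outcome(rows):
    """Count how often each reason occurs, grouped by outcome."""
    counts = {"hit": Counter(), "miss": Counter()}
    for _, outcome, reason in rows:
        counts[outcome][reason] += 1
    return counts


print(f"{hit_rate(records):.1f}% hits")
for outcome, reasons in reasons_by_outcome(records).items():
    for reason, count in reasons.items():
        print(outcome, reason, count)
```

Keeping the reason alongside each outcome means the same records answer both questions raised in the note: how many results were hits, and which kinds of hits and misses occurred.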
  5. Finding focus and the difficulty of going beyond the obvious
     A mention of the repository could mean either that data was deposited there or that it was downloaded from there.
     Sometimes narrowing search terms with Boolean operators or "-" exclusion produced no results at all, while broadening back out produced too many results to read through manually. A large result list can either be too much of a good thing, or just too much. A small list, however, makes me worry that I am excluding hits that would be valid under my fuzzy parameters, which the search function cannot process very well.
     TreeBASE study accession numbers cited in an article may have changed over time (from StudyID to LegacyID after study publication).
     "Pangaea" can refer to either the Pangaea.de data repository or the Pangaea supercontinent. How do I exclude the latter results? What if there is an article that mentions both? Do I risk excluding that article?
     Google Scholar does not distinguish between published journal articles and non-journal publications such as dissertations deposited in academic repositories.
     "Missing" searches (use a table like the Search Methodology Table at DataONE:Notebook/Reuse_of_repository_data/2010/06/28#Search_Methodology_Table as a visual aid): for the sake of thoroughness, I intended to go through each possible search combination. Not all searches worked, and I did not originally record the ones that failed in my notebook. However, it is important to record these "failures" for future reference, so I went back through each search and recorded the results. Using this table also showed me that I had missed some possible combinations.
     Is it possible that a majority of the citations I find only cite data from articles, where the researcher took the information from the article without ever looking at the data?
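One way to catch the "missing" searches mentioned in the note above is to generate every combination up front and compare it against what has been logged. The sketch below assumes three search strategies and a dictionary-based log; the strategy labels, the log format, and the two logged entries are hypothetical and only illustrate the idea.

```python
from itertools import product

repositories = ["TreeBASE", "Pangaea", "ORNL DAAC"]
databases = ["ISI Web of Science", "Scirus", "Google Scholar"]
# Hypothetical strategy labels, loosely based on the kinds of searches described.
strategies = [
    "repository name",
    "identifier (accession number or DOI)",
    "data author name",
]


def missing_searches(recorded):
    """Return every (repository, database, strategy) combination not yet logged."""
    all_combos = product(repositories, databases, strategies)
    return [combo for combo in all_combos if combo not in recorded]


# Hypothetical log: a search is recorded even when it failed or found nothing.
recorded = {
    ("TreeBASE", "ISI Web of Science", "data author name"): "effective",
    ("Pangaea", "Scirus", "repository name"): "ineffective",
}

todo = missing_searches(recorded)
print(f"{len(todo)} combinations still to run or record")
```

Because failed searches are logged with an outcome rather than left out, an empty result from `missing_searches` means every combination has been attempted, not merely that every successful one has been written down.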
  6. Findings by Repository
  TreeBASE: Mentions of the repository name were found in all three search databases, but a distinction had to be made as to whether data was deposited into or downloaded from TreeBASE. The search limits used to make that distinction were built from the wording of individual instances in the articles found, and they varied in effectiveness and structure between Scirus and Google Scholar. TreeBASE was listed as part of the Cited Author field within ISI. ISI did not allow searching by study accession number. Searching for specific author names and article publications proved more useful in ISI, since TreeBASE lists citations by article and the ISI Web of Science Cited Reference Search is geared towards finding citations of specific articles as opposed to datasets. Even using boolean operators to limit searches yielded limited results, still often with more misses than hits. Searching by general mention of a “study accession number” was not very useful, and the prefix “S####” was too general and vague for all search functions. Searches for individual study accession numbers (e.g., S1515, S2376, S1977) pulled no results at all in Scirus or Google Scholar. In Scirus, general searches for TreeBASE with boolean limits pulled more misses than hits, while searching for a specific study accession number (e.g., S1515) proved too specific and usually pulled the message “no results matched your query.” Searching by data author name in Scirus proved limited: the searches often pulled either articles by that particular author or articles that cited neither the dataset nor the article derived from the original dataset, even when year and publisher were included in the search. Searching in Google Scholar for general mentions of TreeBASE yielded almost all misses, while searching for a specific data author proved more useful, giving narrower results with more hits than misses; yet no mentions of study accession numbers were found in the resulting articles.
  Pangaea: ISI pulled a few hits just from a general mention of Pangaea in the Cited Author or Cited Work field. More hits (in the dozens) came when specific authors and articles were included in the search field. ISI did not allow searching by DOI. Searching by author in ISI was the most effective in terms of hit-to-miss ratio in the search results. A controlled boolean search with the repository name proved ineffective in Scirus, while searching by DOI prefix pulled only a few results (12) with even fewer hits (5). Then again, the DOI prefix search did not include the same controlled vocabulary. Searching by data author name proved useless in Scirus as well, since it pulled either articles by the author or non-journal resources. Google Scholar pulled a lot of articles with the Pangaea DOI prefix, but there were far too many results (1000+) to read through manually. Further search refinements may still be needed.
  ORNL DAAC: ISI once again worked best when searching for the specific authors and articles for which the data was originally created. DOIs could be found, but again, could not be entered specifically in the search fields. Scirus proved the most effective when using boolean operators with a keyword search of articles mentioning the repository name. Searching by specific project databases such as BOREAS or FLUXNET also proved useful. Many of the articles found were housed in Elsevier's ScienceDirect. Google Scholar took well to the ORNL DOI prefix when compared to the Pangaea DOI prefix: although it pulled fewer results, a majority of those results were hits. Searching for the project databases and data author names also found more solid hits.
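If the full text of a candidate article is at hand, identifier matching of the kind described in this slide can be sketched with regular expressions. The patterns below are assumptions for illustration: they take Pangaea DOIs to begin with 10.1594/PANGAEA. and ORNL DAAC DOIs with 10.3334/ORNLDAAC/, and treat a TreeBASE study accession number as "S" followed by digits. Because a bare "S####" is too general on its own, the sketch only counts it when the text also names TreeBASE.

```python
import re

# Assumed identifier shapes; adjust to the repositories' actual conventions.
PATTERNS = {
    "Pangaea": re.compile(r"10\.1594/PANGAEA\.\d+", re.IGNORECASE),
    "ORNL DAAC": re.compile(r"10\.3334/ORNLDAAC/\d+", re.IGNORECASE),
    "TreeBASE": re.compile(r"\bS\d{3,5}\b"),
}


def find_identifiers(text):
    """Return {repository: [identifiers found]} for one block of article text."""
    found = {}
    for repository, pattern in PATTERNS.items():
        matches = pattern.findall(text)
        # "S1515" alone could be a sample or strain label, so require
        # the repository name somewhere in the same text.
        if repository == "TreeBASE" and "treebase" not in text.lower():
            matches = []
        if matches:
            found[repository] = matches
    return found


pangaea_example = find_identifiers("Data: doi:10.1594/PANGAEA.123456.")
treebase_example = find_identifiers("The alignment is in TreeBASE (S1515).")
unrelated_example = find_identifiers("Sample S1515 was sequenced twice.")
```

The identifiers in the usage lines are made up; only S1515 appears in the slides, as an example accession number.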
  7. Confirmed: finding data citations in journal articles is difficult. “Like trying to find someone on Facebook only knowing their hair color and favorite breakfast cereal.” Why is this? Even if a repository has recommendations or best practices for data citation, they are not always used consistently. Even if a repository has a DOI or other identifier, it is not always mentioned in the citation. However, in the case of a repository like ORNL DAAC, where there are more ways to find data, it is less difficult. Even so, if just one method were used consistently, searching for data citations would be much easier. Google Scholar is a bit too simplified. While searching through the full text of an article is useful, there are some things that search fields are better at doing, such as the Cited Author or Cited Work field in Cited Reference Search. Scirus has a nice faceted search function that allows you to see only journal articles and to narrow things down further to individual journals. However, it too lacks a function that searches for citations. That said, ISI, Scirus and Google Scholar all have “cited by # articles” functionality. While more human-friendly, federated search engines like Scirus and Google Scholar aren’t very machine-friendly. There is little distinction between data deposited into repositories and data downloaded from repositories for an article or study. Some ways of finding out without reading the whole article: search for the phrases “are available,” “can be downloaded,” “can be obtained from” or “uploaded into [repository]” in relation to data being added to a repository; or search for the phrases “downloaded from” and “obtained from” in relation to data being taken from a repository. Hence another conundrum: the words “downloaded” and “obtained” are used in both contexts.
Even simplifying searches to include “from TreeBASE/Pangaea/ORNL DAAC” and exclude “into TreeBASE/Pangaea/ORNL DAAC” has limited success, since full-text search often ignores the quotes around the phrase, resulting in many irrelevant results. Even if a data download is mentioned, it is not always cited in the References section (often it is just mentioned in the introduction, methods or results sections instead).
  Things that would have made the search process easier or may encourage data citation: consistent use of a data citation format, as recommended by either the repository or the publication; consistent use of unique persistent identifiers; assigning data repositories and data sets a weight similar to the impact factors for journals and articles, based on criteria such as how often the data is cited, the impact factor of the journals in which the data is cited, and how often the data is updated; metadata tags for articles that indicate data citation; functions within search engines that search for data citations within metadata (similar to the ISI Web of Science Cited Reference Search). While TreeBASE has BibTeX and RIS integration, it was still difficult to find citations in other articles.
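The phrase-based shortcut from this slide, and the conundrum it runs into, can be illustrated with a small sentence classifier. This is a sketch only: it uses the phrases listed above, applied to text that has already been extracted from an article, and the labels it returns ("deposit", "reuse", "ambiguous", "unknown") are hypothetical names chosen for this example.

```python
DEPOSIT_PHRASES = (
    "are available",
    "can be downloaded",
    "can be obtained from",
    "uploaded into",
)
REUSE_PHRASES = ("downloaded from", "obtained from")


def classify_sentence(sentence):
    """Guess whether a sentence describes depositing data or reusing it."""
    text = sentence.lower()
    deposit = any(phrase in text for phrase in DEPOSIT_PHRASES)
    # "can be obtained from" contains "obtained from", so blank out the
    # deposit phrases before looking for reuse phrases.
    remainder = text
    for phrase in DEPOSIT_PHRASES:
        remainder = remainder.replace(phrase, " ")
    reuse = any(phrase in remainder for phrase in REUSE_PHRASES)
    if deposit and reuse:
        return "ambiguous"
    if deposit:
        return "deposit"
    if reuse:
        return "reuse"
    return "unknown"


labels = [
    classify_sentence("Alignments are available in TreeBASE."),
    classify_sentence("Sequences were downloaded from TreeBASE."),
    classify_sentence("The data can be obtained from Pangaea."),
]
```

Matching exact phrases sidesteps the problem of search engines ignoring quotation marks, but it still cannot tell whether a "reuse" sentence is backed by a formal citation in the reference list.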
  8. Other repositories, search terms and databases. Bigger samples using different time periods (2005-2010 as opposed to 2008-2010). It would be interesting to capture whether an article agrees with or refutes the data cited from a previous article. Compare data with Nic and Sarah to see how much influence the data citation policies of journals and repositories have on authors/researchers, and to see how many articles out of a random sample deposit, reuse and/or cite data. Also, compare data with other interns (especially the Baseline Assessment of Data Practices of Libraries and Librarians project).
  Article: some publication submission ideas.
  Collection Management: This journal recently ran an article titled "The Use of Web of Knowledge to Study Publishing and Citation Use for Local Researchers at the Campus Level" (doi:10.1080/01462671003597959), in which the authors used ISI Web of Science to seek and identify periodical literature citing local researchers.
  DLib: Link provided by Heather.
  Information Services & Use (Author Guidelines): International journal focusing on information technology, particularly applications to business and scientific fields.
  Informing Science: Quoted from their about page: "The academically peer refereed journal Informing Science endeavors to provide an understanding of the complexities in informing clientele. Fields from information systems, library science, journalism in all its forms to education all contribute to this science. These fields, which developed independently and have been researched in separate disciplines, are evolving to form a new transdiscipline, Informing Science. Informing Science publishes articles that provide insights into the nature, function and design of systems that inform clients. […] The ideal paper will serve to inform fellow researchers, perhaps from other fields, of contributions to this area."
  International Digital Curation Conference: Call for Papers. Link provided by Nic.
  Journal of the American Society for Information Science & Technology: Quoted from their page: "The Journal welcomes rigorous work of an empirical, experimental, ethnographic, conceptual, historical, socio-technical, policy-analytic, or critical-theoretical nature. JASIST also commissions in-depth review articles (Advances in Information Science) and reviews of print and other media." I find this relevant to my interests.
  Journal of Information Science: Quoted from their about page: "The Journal of Information Science is an international journal of high repute covering topics of interest to all those researching and working in the sciences of information and knowledge management. The Editors welcome material on any aspect of information science theory, policy, application or practice that will advance thinking in the field."
  Library Technology Reports: As a publication of the American Library Association, this could reach a wide audience of librarians interested in born-digital holdings or technological changes in scientific research.
  Scientometrics: Even if my study turns out to be more qualitative than quantitative, this may be useful for Nic and Sarah to consider.