Data Management: Tips & Tools
Stephanie Wright
University of Washington
swright@uw.edu
SPATIAL / IsoCamp
June 2015
Who Am I?
• Computing Trainer
• Cruise Ship Lecturer (Love Boat)
• Library Merger Manager
• Atmospheric Sciences Librarian
• Assessment Librarian
• Data Services Coordinator
HTTP://GUIDES.LIB.WASHINGTON.EDU/SWRIGHT
Disclaimer
I am not a scientist. I am a librarian…
Disclaimer
I am not a scientist. More like this…
What Do I Do?
• Data Management Plans (DMPs)
• Courses
• Consultations
• Research Projects
• DataONE, RDA, eScience Institute
• Institutional Data Repository (DRUW)
Why?
THEN / NOW
A Real Life Example
my spreadsheet: many tables, no headings, embedded figures
One More Example
Data Sharing and Management Snafu in 3 Short Acts
https://www.youtube.com/watch?v=66oNv_DJuPc
Why Does It Matter?
From Flickr by tomhilton
HTTP://WWW.SPARC.ARL.ORG/ISSUES/OPEN-DATA/DATA-SHARING-INITIATIVE/POLICIES
… “Federal agencies investing in research and development (more than $100 million in annual expenditures) must have clear and coordinated policies for increasing public access to research products.”
“The best thing to do with your data will be thought of by someone else.”
“We need open data because we don’t just want to use a car we want to poke around in the engine, see how it works and then rebuild it.”
~ Rufus Pollock
Founder and President of Open Knowledge Foundation (www.okfn.org)
From Flickr by cogdog
WICHERTS JM, BAKKER M, MOLENAAR D (2011) WILLINGNESS TO SHARE RESEARCH DATA IS RELATED TO THE STRENGTH OF THE EVIDENCE AND THE QUALITY OF REPORTING OF STATISTICAL RESULTS. PLOS ONE 6(11): E26828. DOI:10.1371/JOURNAL.PONE.0026828
How To Do It?
Data planning is more efficient than data forensics.
DATA MANAGEMENT PLANNING
•What will be collected
•Methods
•Standards
•Sharing/access
•Long-term storage
COLLECTING
•Keep raw data raw
• Use scripts to process data
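One way to honor "keep raw data raw" is to open the raw file for reading only and have a script write every derived file somewhere else. The sketch below is only an illustration of that idea, not a method from the talk. The function name `process_raw`, the `temperature` column, and the cleaning rule (drop rows where that column is blank) are all hypothetical.

```python
import csv
from pathlib import Path


def process_raw(raw_path, out_path, column="temperature"):
    """Write a cleaned copy of a raw CSV, dropping rows where `column` is blank.

    The raw file is only ever opened for reading. Returns the number of rows kept.
    """
    raw_path, out_path = Path(raw_path), Path(out_path)
    if raw_path.resolve() == out_path.resolve():
        raise ValueError("refusing to overwrite the raw file")

    # Remove write permission so later steps cannot change the raw file by accident.
    raw_path.chmod(raw_path.stat().st_mode & ~0o222)

    kept = 0
    with raw_path.open(newline="") as src, out_path.open("w", newline="") as dst:
        reader = csv.DictReader(src)
        writer = csv.DictWriter(dst, fieldnames=reader.fieldnames)
        writer.writeheader()
        for row in reader:
            # Hypothetical cleaning rule: keep only rows with a value in `column`.
            if (row.get(column) or "").strip():
                writer.writerow(row)
                kept += 1
    return kept
```

Because the cleaning step lives in a script, it can be rerun from the untouched raw file whenever the rule changes.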
ORGANIZING
• Machine readable
• Human readable
• Works well with default ordering
AVOID
• spaces
• punctuation
• special characters
• case sensitivity
20130503_DOEProject_DesignDocument_Smith_v2-01.docx
20130709_DOEProject_MasterData_Jones_v1-00.xlsx
20130825_DOEProject_Ex1Test1_Data_Gonzalez_v3-03.xlsx
20130825_DOEProject_Ex1Test1_Documentation_Gonzalez_v3-03.xlsx
20131002_DOEProject_Ex1Test2_Data_Gonzalez_v1-01.xlsx
20141023_DOEProject_ProjectMeetingNotes_Kramer_v1-00.docx
Eaffinis_nanaimo_2010_counts.xls
• Eaffinis: study organism
• nanaimo: site name
• 2010: year
• counts: what was measured
Dates in file names: YYYYMMDD
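A naming convention is easier to stick to when a script builds and checks the names. The sketch below assumes the pattern seen in the DOEProject examples above (date as YYYYMMDD, underscore-separated parts, then a `vN-NN` version). The regular expression and the helper names `build_name` and `follows_convention` are illustrative assumptions, not part of the talk.

```python
import re
from datetime import date

# Assumed pattern, modeled on 20130825_DOEProject_Ex1Test1_Data_Gonzalez_v3-03.xlsx
NAME_PATTERN = re.compile(
    r"^(?P<date>\d{8})_"                          # YYYYMMDD
    r"(?P<parts>[A-Za-z0-9]+(?:_[A-Za-z0-9]+)*)_"  # project, content, author, ...
    r"v(?P<major>\d+)-(?P<minor>\d{2})"           # version, e.g. v3-03
    r"\.(?P<ext>[A-Za-z0-9]+)$"                   # file extension
)


def build_name(day, parts, major, minor, ext):
    """Join a date, descriptive parts, and a version into one file name."""
    for part in parts:
        if not re.fullmatch(r"[A-Za-z0-9]+", part):
            raise ValueError(f"avoid spaces, punctuation, special characters: {part!r}")
    return f"{day:%Y%m%d}_{'_'.join(parts)}_v{major}-{minor:02d}.{ext}"


def follows_convention(name):
    """Return True if the file name matches the assumed pattern."""
    return NAME_PATTERN.match(name) is not None


name = build_name(date(2013, 8, 25), ["DOEProject", "Ex1Test1", "Data", "Gonzalez"], 3, 3, "xlsx")
print(name, follows_convention(name))
```

Putting the date first in YYYYMMDD form is what makes the names sort chronologically under default ordering.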
NOBLE, WILLIAM S. (2009) "A QUICK GUIDE TO ORGANIZING COMPUTATIONAL BIOLOGY PROJECTS." PLOS COMPUTATIONAL BIOLOGY 5(7): E1000424. DOI:10.1371/JOURNAL.PCBI.1000424
• Pick a method that works for you and stick to it
• DOCUMENT IT!
METADATA
•Who?
•What?
•Where?
•When?
•How?
•Why?
Digital context
• Name of the data set
• The name(s) of the data file(s) in the data set
• Date the data set was last modified
• Example data file records for each data type file
• Pertinent companion files
• List of related or ancillary data sets
• Software (including version number) used to prepare/read the data set
• Data processing that was performed
Personnel & stakeholders
• Who collected
• Who to contact with questions
• Funders
Scientific context
• Scientific reason why the data were collected
• What data were collected
• What instruments (including model & serial number) were used
• Environmental conditions during collection
• Temporal & spatial resolution
• Standards or calibrations used
Information about parameters
• How each was measured or produced
• Units of measure
• Format used in the data set
• Precision & accuracy if known
Information about data
• Definitions of codes used
• Quality assurance & control measures
• Known problems that limit data use (e.g. uncertainty, sampling problems)
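The checklist above can be captured in a small machine-readable file kept next to the data. The sketch below is one possible approach under stated assumptions: the field names in `REQUIRED_FIELDS` are a hypothetical subset chosen for illustration, and JSON is an arbitrary format choice. A disciplinary metadata standard, where one exists, would be the better target.

```python
import json

# Hypothetical field names covering a subset of the checklist above.
REQUIRED_FIELDS = [
    "dataset_name", "file_names", "last_modified", "software",   # digital context
    "collected_by", "contact", "funders",                        # personnel
    "purpose", "instruments",                                    # scientific context
    "units", "code_definitions", "known_problems",               # parameters and data
]


def missing_fields(record):
    """Return the required fields that are absent or empty in `record`."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]


def write_metadata(record, path):
    """Write `record` as JSON, refusing to write an incomplete record."""
    missing = missing_fields(record)
    if missing:
        raise ValueError("metadata incomplete: " + ", ".join(missing))
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(record, handle, indent=2, sort_keys=True)
```

Refusing to write an incomplete record turns "DOCUMENT IT!" into something a script can enforce.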
WORKFLOW
Simple: Flow chart
Temperature data, Salinity data → Data import into Excel → Data in spreadsheet → Quality control & data cleaning → “Clean” T & S data → Analysis: mean, SD → Summary statistics → Graph production
Simple: Commented script
Resulting output
More Fancy: Kepler, Taverna
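As a rough idea of what a "simple commented script" for the temperature and salinity workflow might look like, here is a minimal sketch. The slide's actual script is not in the transcript, so everything here is an assumption: the CSV layout, the column names, and the function names `load_column` and `summarize` are hypothetical.

```python
import csv
import math
import statistics


def load_column(path, column):
    """Quality control step: read one numeric column, skipping unusable cells."""
    values = []
    with open(path, newline="") as handle:
        for row in csv.DictReader(handle):
            try:
                value = float(row[column])
            except (KeyError, TypeError, ValueError):
                continue  # blank, missing, or non-numeric cell
            if math.isfinite(value):
                values.append(value)
    return values


def summarize(values):
    """Analysis step: count, mean, and sample standard deviation (needs >= 2 values)."""
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "sd": statistics.stdev(values),
    }
```

A call such as `summarize(load_column("cruise.csv", "temperature"))` would then produce the summary statistics for one variable, where `cruise.csv` is a placeholder file name.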
From Flickr by cogdog
BACKING UP: 3 places, 3 ways
From Flickr by lippo
From Flickr by see phar
Original
Near
Far
What software?
What hardware?
What personnel?
How often?
Set up reminders!
Test system
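"Test system" can be partly automated by comparing checksums after each copy. The sketch below assumes the "near" and "far" locations are reachable as ordinary folders (for example a mounted external drive or a synced directory). The function names `sha256_of` and `back_up` are hypothetical.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def back_up(original, destinations):
    """Copy `original` into each destination folder and verify every copy.

    Returns a dict mapping each copy's path to True if its checksum matches.
    """
    original = Path(original)
    expected = sha256_of(original)
    results = {}
    for folder in destinations:
        folder = Path(folder)
        folder.mkdir(parents=True, exist_ok=True)
        copy = folder / original.name
        shutil.copy2(original, copy)  # copy2 also preserves timestamps
        results[str(copy)] = sha256_of(copy) == expected
    return results
```

A copy that fails the checksum comparison is a signal to repeat the backup before the original is touched again.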
SHARING
Repositories
Institutional
Disciplinary
Journal
re3data.org
Sustainable formats
Open, non-proprietary
Commonly used in your discipline
Not encrypted or compressed
Review your DMP
Did you do what you said you would?
Photo credit Michael Ham
How Do I Learn More?
•Funding Mandates
http://chronicle.com/article/Where-Should-You-Keep-Your/231065/
http://datapub.cdlib.org/2013/02/28/the-new-ostp-policy-what-it-means/
•File Naming Conventions:
http://www.exadox.com/en/articles/file-naming-convention-ten-rules-best-practice
•Folder Structures:
http://www.damlearningcenter.com/resources/articles/best-practices-for-folder-organization/
•Metadata:
http://www.dcc.ac.uk/resources/metadata-standards
•DataONE Primer
https://www.dataone.org/best-practices
•Software Carpentry
http://software-carpentry.org/
•Research Data Alliance
https://rd-alliance.org/
•Your Library
http://guides.lib.washington.edu/dmg
Tools
•Data Mgmt Planning
DMPTool https://dmptool.org/
•Metadata
Morpho https://www.dataone.org/software-tools/morpho
NOAA MERMaid http://www.ncddc.noaa.gov/metadata-standards/mermaid/
•Workflows
Kepler https://kepler-project.org/
Taverna http://www.taverna.org.uk/
•Sharing
re3data http://www.re3data.org/
GitHub https://github.com/
•Miscellaneous
EZID http://ezid.cdlib.org/
ImpactStory https://impactstory.org/
ORCID http://orcid.org/
Any Other Questions?
Stephanie Wright
Web data.blogspot.com
Twitter @UWLibsData
Email swright@uw.edu
