DATA MANAGEMENT OPEN HOUSE
OHSU Library
October 9th, 2013
#OHSUdata @force11rescomm
0 | Introductions
1 | Scientific Communication
2 | Making it work for you
3 | Your impact
4 | Hands On
5 | Making it matter
0 | Introductions
Melissa Haendel | Ontology Development Group and DMICE
Nicole Vasilevsky | Ontology Development Group
Jackie Wirz | Research Specialist, Research Roadmap SOM
Robin Champieux | Scholarly Communication
http://www.force11.org/
@force11rescomm
Who is FORCE11?
Publishers
Library and Information scientists
Policy makers
Tool builders
Funders
Scholars
Social Science
Humanities
Science
Free to join!
Beyond-the-PDF
San Diego, Jan 2011 | Amsterdam, March 2013
www.force11.org/beyondthepdf2 | #btpdf2
How does OHSU fit in?
We won 1K to find out.
Today | Discuss data-research cycle, reproducibility, and communication of findings
Later | Data playground with researchers:
• Your data needs
• Identify the material and services you need
• Get paid $50
1 | Scientific Communication
Once upon a time….
Research, Present, Publish.
Repeat.
You might say it wears a uniform
Our relationship
is so one-sided.
From Paper to Tweet
www.sciencemag.org/site/special/scicomm/infographic.jpg
New Modes & Models
Manage Your
Footprint
Data can be pretty complex…
Data does not speak for itself…
You speak for your data
You need to manage it
But, even more fundamentally…
what does data mean to you?
You speak for your data
How do you speak for your data
when you are not around?
Do you know what metadata is?
a. Philosophy
b. describes data
c. dating site
d. data
Title
Author
Call number
Publisher
ISBN
- Anne Gilliland
Your metadata should make your data understandable to others… without your involvement
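One lightweight way to let metadata speak for a dataset when you are not around is a plain-text "sidecar" file stored next to the data. The sketch below is only an illustration: the function name, the sidecar file naming, and every field and value in the example record are hypothetical placeholders, not a standard named in this talk.

```python
import json
from pathlib import Path


def write_sidecar(data_file, metadata):
    """Write metadata as JSON next to the data file and return the sidecar path."""
    data_path = Path(data_file)
    # "counts.csv" -> "counts.csv.metadata.json" (this naming scheme is an assumption)
    sidecar = data_path.with_name(data_path.name + ".metadata.json")
    sidecar.write_text(json.dumps(metadata, indent=2, sort_keys=True))
    return sidecar


# Hypothetical example record; every value here is a placeholder.
example_metadata = {
    "title": "Example cell counts",
    "creator": "A. Researcher",
    "date_collected": "2013-10-09",
    "units": {"count": "cells per mL"},
    "description": "One row per sample; see protocol.txt for methods.",
}
```

Calling write_sidecar("counts.csv", example_metadata) would leave a human-readable description beside the data file, so a colleague opening the folder later has the context without asking you.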
Metadata
2 | Making it work for you
http://www.phd2published.com/wp-content/uploads/2011/09/publications_image.jpg
Biomed Res Int. 2013;2013:350419.
A Solution: Antibody Registry
The Antibody Registry
www.antibodyregistry.org
Data standards can help with reproducibility
Average of ~50% of resources were not identifiable
Vasilevsky et al., 2013 PeerJ 1:e148
www.force11.org/node/4463 biosharing.org/bsg-000532
Data Analysis Pipeline Reproducibility
Platforms
RESOURCES
www.wf4ever-project.org runmycode.org
galaxyproject.org/
Are you aware of data standards in your field?
@OHSU, 72% said no or didn’t know!
Data standards are the rules by which data are described and recorded. In order to share, exchange, and understand data, we must standardize the format as well as the meaning.
www.usgs.gov/datamanagement/plan/datastandards.php
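As a rough illustration of what "standardizing the format" can mean in practice, the sketch below checks a record against a minimal list of required fields, loosely in the spirit of a reporting guideline. The field names and the example record are hypothetical and do not come from any specific standard.

```python
# Hypothetical minimal checklist; a real reporting guideline defines its own fields.
REQUIRED_FIELDS = {"organism", "sample_id", "measurement", "units"}


def missing_fields(record):
    """Return the required fields that are absent or empty in a record, sorted."""
    missing = []
    for field in REQUIRED_FIELDS:
        value = record.get(field)
        # Treat None and blank strings as "not reported".
        if value is None or str(value).strip() == "":
            missing.append(field)
    return sorted(missing)


# Placeholder record: the units were never written down.
record = {"organism": "Mus musculus", "sample_id": "S-001", "measurement": 4.2}
print(missing_fields(record))  # ['units']
```

A check like this only covers format. Agreeing on meaning (for example, which vocabulary term "organism" should use) is where terminology standards come in.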
Data Standards
Types of data standards
Reporting guidelines
Terminology artifacts (includes ontologies)
Exchange formats
Can be used together
Reporting guidelines
Terminology artifacts (includes ontologies)
Exchange formats
MIAME
Data standards examples
Many microarray transcriptomics standards
JAMIA:sea-of-standards
www.cdisc.org
RESOURCES
Minimum Information for Biological and Biomedical Investigations
biosharing.org/
But it isn’t just about reproducibility…
It’s about
Data reuse?
www.erp-recycling.org
Ontologies as a tool for unification
Disease-Phenotype databases
Disease phenotype ontology
Expression data
Gene function data
Cell and tissue ontology
GO annotations
ontologies
For example, there are many useful ways to classify organism parts:
its parts and their arrangement
its relation to other structures
what is it: part of, connected to, adjacent to, overlapping?
its shape
its function
its developmental origins
its species or clade
its evolutionary history
Cajal 1915, “Accept the view that nothing in nature is useless, even from the human point of view.”
Ontologies classify data in multiple ways
http://www.boloncol.com/images/stories/boletin19/cajal16.jpg
Human disease: PFEIFFER SYNDROME
Most similar mouse model: CD1.Cg-Fgfr2tm4Lni/H
Human phenotypes (HP terms):
Brachyturricephaly HP:0000244
Hypoplasia of the maxilla HP:0000327
Dental crowding HP:0000678
Hypertelorism HP:0000316
Coronal craniosynostosis HP:0004440
Mouse phenotypes (MP terms):
shortened head MP:0000435
malocclusion MP:0000120
ocular hypertelorism MP:0001300
short maxilla MP:0000097
premature suture closure MP:0000081
Bridging labels in the figure: premature suture closure, maxilla hypoplasia, malocclusion, shortened head, ocular hypertelorism
Cross-species
Phenotype
Ontologies aid candidate gene identification for
undiagnosed diseases
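To make the cross-species comparison concrete, here is a toy sketch that scores the Pfeiffer syndrome example by plain set overlap. The term IDs and labels are taken from the slide, but the pairing of each term with a shared label is our assumed reading of the figure, and simple Jaccard overlap stands in for the ontology-aware similarity measures that real tools use. It is not the presenters' method.

```python
# Term IDs are from the slide. The shared label assigned to each term is an
# assumed reading of the figure, not a curated ontology mapping.
HUMAN_PFEIFFER = {
    "HP:0000244": "shortened head",            # Brachyturricephaly
    "HP:0000327": "maxilla hypoplasia",        # Hypoplasia of the maxilla
    "HP:0000678": "malocclusion",              # Dental crowding
    "HP:0000316": "ocular hypertelorism",      # Hypertelorism
    "HP:0004440": "premature suture closure",  # Coronal craniosynostosis
}
MOUSE_MODEL = {
    "MP:0000435": "shortened head",
    "MP:0000120": "malocclusion",
    "MP:0001300": "ocular hypertelorism",
    "MP:0000097": "maxilla hypoplasia",        # short maxilla
    "MP:0000081": "premature suture closure",
}


def jaccard(a, b):
    """Jaccard overlap of two collections: |A and B| / |A or B| (0.0 if both empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


score = jaccard(HUMAN_PFEIFFER.values(), MOUSE_MODEL.values())
print(score)
```

Under this assumed mapping every human phenotype has a mouse counterpart, which is the intuition behind calling this the most similar mouse model. Ranking many models this way is how a shared vocabulary can point toward candidate genes.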
Data Sharing Mandates
How can I make my data reusable?
There are tools to help!
Tools for research management
RESOURCES
www.labguru.com
www.labarchives.com
Data management plan tool
RESOURCES
https://dmp.cdlib.org/
What to do with data?
Storage
Back up in multiple locations:
• Local hard drive
• Removable storage
• Shared network
• Cloud server
Versioning
• File name versioning
• Dropbox
• Version control software: CVS, SVN, Git
Publication
Data sharing repositories:
• Local repository
• Domain specific
• Generic public repository
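File name versioning can be as simple as agreeing on one naming pattern and sticking to it. The helper below is a minimal sketch of one such pattern (base name, ISO date, zero-padded version number). The pattern and the function name are assumptions for illustration, not a convention prescribed here.

```python
from datetime import date
from pathlib import Path


def versioned_name(filename, version, on=None):
    """Build a name like 'results_2013-10-09_v03.csv' from 'results.csv' and 3."""
    on = on or date.today()  # default to today's date when none is given
    path = Path(filename)
    # ISO dates (YYYY-MM-DD) sort chronologically when files are listed by name.
    return f"{path.stem}_{on.isoformat()}_v{version:02d}{path.suffix}"


print(versioned_name("results.csv", 3, date(2013, 10, 9)))  # results_2013-10-09_v03.csv
```

For anything beyond a handful of files, version control software such as Git records the same history without renaming files by hand.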
Computing in the cloud
Uniquely identifying data
• Digital Object Identifier (DOI)
• Uniform Resource Identifier (URI)
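A DOI becomes a clickable, citable link once it is placed behind the doi.org resolver. The sketch below shows that conversion. The DOI used in the example is a made-up placeholder, and the helper name is our own.

```python
from urllib.parse import quote


def doi_to_url(doi):
    """Turn a bare or 'doi:'-prefixed DOI into an https://doi.org/ URL."""
    doi = doi.strip()
    if doi.lower().startswith("doi:"):
        doi = doi[4:].strip()
    # All DOIs begin with the directory indicator "10."
    if not doi.startswith("10."):
        raise ValueError(f"not a DOI: {doi!r}")
    # Percent-encode unusual characters but keep the prefix/suffix slash.
    return "https://doi.org/" + quote(doi, safe="/")


# Placeholder DOI for illustration only; it does not point to a real dataset.
print(doi_to_url("doi:10.1234/example-dataset"))
```

Repositories that mint DOIs for datasets give you an identifier that keeps working even if the files later move.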
www.flickr.com/photos/pmeimon
figshare.com datadryad.org thedata.org
n2t.net/ezid www.dataone.org data.rutgers.edu/
Data journals and repositories
RESOURCES
nature.com/scientificdata/
3 | Your impact
Thinking Beyond the PDF
Raw Science
• Datasets
• Code
• Experimental design
Small publications
• Argument or passage
• Single figure publications
• Nanopublications
Self-publishing
• Blogging
• Microblogging
• Comments & Reviews
• Annotations
Who are you?
Impact.Story
impactstory.org
www.plumanalytics.com
orcid.org
RESOURCES
Services to identify yourself and your impact
rubriq.com
scalar.usc.edu
RESOURCES
Alternative publishing mechanisms
thedata.org
http://theconversation.com/scientists-must-share-early-and-share-often-to-boost-citations-18699
Citing products of your research
4 | Hands On
What is your scientific footprint?
5 | Making it Matter
• Legitimate, citable products of research
• Same importance as traditional citations
• Data management is central
Data
Data citation principles.
http://thedata.org/files/thedata_new2/files/datacitationprinciples-datacite.pdf
Data Management 101
libguides.ohsu.edu/data
Thank you!

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 

Data Management Open House

Editor's notes

  1. MH introduce
  2. MH: Introduce
  3. JW
  4. MH: introduce everyone
  5. MH: A grassroots effort to accelerate the pace and nature of scholarly communications and e-scholarship through technology, education, and community. Why 11? We were born in 2011.
  6. MH: Force11 is comprised of a diversity of participants to best aid in the redefinition of scholarly communication
  7. MH: (Un)conference where stakeholders came together as equals to discuss issues. Incubator for change. What would you do to change scholarly communication if you had $1K? M. Haendel award winner.
  8. MH-why we are here today, and how all of you can help. JW: put the slide 2 here perhaps? This still has too much text; slide 2 is much less intimidating.
  9. JW
  10. RC: The traditional model of scientific communication is fairly straightforward. Successful research is shared via presentations and papers, after data is collected and analyzed.
  11. RC: This model is slow, even when considered within the context of electronic journals. A recent study clocked the average time from submission to publication for biomedical journals at over 9 months: http://www.openaccesspublishing.org/2013/09/06/the-publishing-delay-in-scholarly-peer-reviewed-journals/.
  12. RC: The traditional model is also very formalized with respect to when in the research cycle the science is shared (well after the study has taken place), how it is disseminated (peer-reviewed articles), to whom one is communicating (most often scientists in your specialized field), and how impact is measured (citation counts to articles).
  13. RC: Finally, it is unilateral in that it doesn’t facilitate dynamic, real-time interaction between scientists outside of the society meeting or conference. Nor does it further conversation between scientists and the public.
  14. The Internet has had a profound effect on science and scientific communication, wherein the traditional model I just described is being reimagined, ultimately in the pursuit of advancing the scientific process. But the traditional model of scholarly communication still dominates how many scientists manage and share their research and data.
  15. RC: The volume of literature has exploded since the first online journals were launched in the late 1980s. Today, virtually all science journals are online. There are over 28,000 active peer-reviewed journals publishing nearly 2 million articles per year, with a new paper published every 20 seconds. This is a huge industry, with revenues of about 10 billion/year. 50% of new research is freely available online either immediately or within 12 months of publication. But the other 50% lives behind high paywalls, limiting the scope of science available to potential readers (human and computer; scientists and the public). Infographic: http://www.sciencemag.org/site/special/scicomm/infographic.jpg
  16. RC: We have also seen a proliferation of new publishing modes and models. This includes a variety of open access publishers and journals with new peer-review models, such as open and post-publication peer review, and new economic models, wherein authors, funders, and libraries are sharing the cost of publication. New modes include, but are not limited to, self-publication and social media, such as science blogs and Twitter, and data sharing via public repositories.
  17. Communication is also occurring at more points across the research cycle. For instance, ideas are shared and developed via online conversations on blogs and Twitter. Code and data are being released as they are built and recorded via open lab notebooks. This activity complements and feeds the traditional products of research: papers and presentations.
  18. RC: However, if scientists don’t thoughtfully and actively manage their research products in this new system, the advantages are minimized and all this new stuff becomes noise.
  19. Data. Complex…
  20. JW
  21. JW
  22. JW
  23. JW
  24. JW: This is raw crystallography data, collected at OHSU. This is visual, high-resolution data.
  25. The image then gets integrated along the spots, transforming the image into a series of mathematical values.
  26. The “model” we are used to seeing is actually a mathematical representation of how well a model (the sticks) fits into the mathematical distillation of the raw image. This looks static, but is actually a best representation along one axis of the data (which is to say, confidence levels). Crystallography boils down to solving the “phase problem”, which can be done two ways: brute force (holy hell!), or using an existing model as a jumping-off point. The latter is the fastest and most efficient way of solving structures, and is, in fact, what I did to solve this structure. I got the previously published data from pdb.org, which is also where I deposited my data. The point of this is threefold: 1) data comes in many shapes and forms, 2) data transforms, and 3) data helps inform more data.
  27. JW: This is raw crystallography data, collected at OHSU. This is visual, high-resolution data.
  28. Ask them to think about what type of data they deal with/generate. Give a couple minutes.
  29. Ask if they have additional data types that they brainstormed. JW: Yes, need this slide if we are to cover the examples listed later. Also, we are eventually getting to alt metrics, which means the third quadrant; therefore, important to cover here.
  30. Data. Complex…
  31. Data. Complex…
  32. JW
  33. JW
  34. Add metadata not only to your experimental results, but also to your process during research, such as resources, protocols, etc. Ways to apply metadata to every moving part of your research.
  35. JW: This is raw crystallography data, collected at OHSU. This is visual, high-resolution data.
  36. JW: This is raw crystallography data, collected at OHSU. This is visual, high-resolution data.
  37. JW: This is raw crystallography data, collected at OHSU. This is visual, high-resolution data.
  38. JW
  39. NV: The literature was the place we would go to get protocols, information about techniques, and resources/reagents. Assuming you got to your relevant paper, look at the methods section: is there enough info there for you to be able to reuse/reproduce the info/experiment/technique?
  40. NV: For example, if you look in the materials and methods section for an antibody used in a western blot, oftentimes the name is reported, along with the vendor and vendor’s location. Say here that the authors met the journal standards, but that they really aren’t sufficient.
  41. NV: However, there are several antibodies generated against one target, so how do you know which one works in this assay? Need to report catalog numbers…
  42. NV: Alternatively, report the AR ID. Permanent identifier; stays with the Ab even as it changes vendors or catalog #’s change. Similar to GenBank for antibodies. Most resources can be reported more specifically than publisher guidelines require; those guidelines are not intended to support reproducibility.
  43. NV: An area with poor data standards shows poor reproducibility. Here we showed how irreproducible many studies were simply due to lack of specificity in the resources used in the experiments. We therefore developed guidelines to support resource reporting; these are now in effect in a number of journals, with more to come. OHSU participates in the Reproducibility Initiative, aimed at developing policies and tools to aid scientific reproducibility. Some bioinformatics tools to aid reproducibility are Workflow4Ever and RunMyCode.org. Outcomes from data standards: reproducibility and data reuse. Place URLs in separate document, not on slide: www.scienceexchange.com/reproducibility, www.wf4ever-project.org, runmycode.org
  44. Bioinformatics workflow standards such as Workflow4Ever and RunMyCode have been developed to help with standardization and sharing of scientific workflows and code. RunMyCode is a repository where people can share or reuse code that is associated with scientific publications. For data manipulations, here is an example of tools that can help with reproducibility.
  45. MH: Yes 28.0%, No 26.9%, I don't know 45.1%. 175 answered the question.
  46. MH: Put URL in supporting document. Too distracting here. http://www.usgs.gov/datamanagement/plan/datastandards.php
  47. MH: each type serves a different purpose: Reporting guidelines serve to ensure that a minimum of metadata is reported, so that someone else can know what your data is about. Terminology artifacts allow some of the data to be structured for reuse and interoperability. Think of these as interoperability handles. Exchange formats provide the syntax for the data structure, and further enable data integration and mashup.
  48. MH: each type serves a different purpose: Reporting guidelines serve to ensure that a minimum of metadata is reported, so that someone else can know what your data is about. Terminology artifacts allow some of the data to be structured for reuse and interoperability. Think of these as interoperability handles. Exchange formats provide the syntax for the data structure, and further enable data integration and mashup.
  49. MH: Which one to use? Need a solution to help identify the right standard, and to contribute to and/or extend existing ones to best support community reproducibility and reuse.
  50. MH: Both of these resources provide a survey of data standards of all three types: Reporting Guidelines, Terminology Artifacts (includes ontologies), and Exchange Formats. Biosharing has a biology focus; CDISC has a clinical focus. There are others; these are just two resources. Takeaway: there are different standards, and no standard meets everyone’s needs.
  51. NV: this is transition back to melissa
  52. MH: Reusing data is not as easy as dumpster diving. You don’t always know that a coke can or a keyboard key can be a critical data element. JW: Oh. My. God.
  53. MH: Slide from Chris Mungall. Ontologies provide the handle by which data from different databases and of different types can be linked and integrated for maximal biological knowledge. Do we need this slide? JW: Maybe not IN the deck, but at the back. If somebody asks what an ontology is during the Q&A, we can bring it up. I did this all the time for my seminars: always have extra slides at the back end for potential questions.
  54. MH: Ontologies, unlike a file system, allow data to be classified in many different ways using logic and standardized identifiers.
  55. MH: When data is encoded using ontologies, it can allow mashup in novel ways. Here, we are using clinical phenotype data and comparing it with model organism phenotype data to identify candidate genes for undiagnosed human diseases. JW: Please let me clean up the original image. The pixelated borders are driving me nuts, and the human head has some white pixels that can very easily and quickly be cleaned up!
  56. MH: Those pesky data sharing mandates, what are they really for? Does dumping my data into a data repository with no metadata or use of standards really help? Answer: no, it doesn’t. If you want your data to be a first-class citizen as a scholarly product that can be cited and actually be reused, then you need to go a bit further. Need to add links to policies.
  57. Transition: how can I meet data sharing requirements, and actually make my data reusable? ANSWER: Just like any experiment or quality statistical approach, you need to plan ahead. There are tools to help. The library can help too.
  58. FigShare, Dryad, Data.gov
  59. MH: add link
  60. Want people to come to the library for help with archiving/data publication. Where can you keep your data? Does it have sensitive info (yes/no)? Does it need to be archived? Make a decision tree for one-on-one meetings.
  61. What does this mean? It means storing or performing analyses on (many times) unsecured shared servers that may exist anywhere in the world. Why should you care? Tools like Dropbox and Google Docs are research efficiency lifesavers, but come with an IP risk as well as the risk of sharing PHI data. Similarly, Amazon cloud servers and genomics data analysis platforms are all too easy to set up or use, and can lead to PHI data being leaked.
  62. MH: Example: DOIs for publications, data, or other research products, e.g. doi: 10.1371/journal.pbio.1001339. A URI will resolve to a single location on the web. URIs for people.
  63. RC
  64. Scientific output and potential impact are more complex, dynamic, and diverse than peer-reviewed papers. Actively managing your research footprint, which of course includes your data, can positively affect your scientific impact.
  65. MH – I updated a bit..
  66. MELISSA
  67. Robin: add better title? Needs cleanup still.
  68. Grab info for NIH. Melissa: also talk about NSF biosketch and how everything you create speaks to you as a scientist. Make it citable! End with your scholarly footprint; lead into breakouts.
  69. JW
  70. JW
  71. MH: Should add links to libguide, library pages etc.
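
Note 62 gives a DOI as an example of a persistent identifier that resolves to a single location on the web. A minimal sketch of that idea follows, assuming the public https://doi.org/ resolver; the function name `doi_to_url` and the loose matching pattern are illustrative assumptions, not part of the original deck.

```python
import re

# Public DOI resolver; prepending it to a DOI gives a resolvable URI.
DOI_RESOLVER = "https://doi.org/"

# Loose, illustrative pattern (an assumption, not an official grammar):
# "10.", a 4-9 digit registrant code, "/", then a non-whitespace suffix.
DOI_PATTERN = re.compile(r"10\.\d{4,9}/\S+")


def doi_to_url(text: str) -> str:
    """Find the first DOI in free text and return its resolver URL."""
    match = DOI_PATTERN.search(text)
    if match is None:
        raise ValueError(f"no DOI found in: {text!r}")
    # Drop sentence punctuation that may trail the DOI in running text.
    doi = match.group(0).rstrip(".,;")
    return DOI_RESOLVER + doi


if __name__ == "__main__":
    # The DOI quoted in note 62.
    print(doi_to_url("doi: 10.1371/journal.pbio.1001339"))
```

Citing the resolver URL instead of a publisher or repository page address keeps the citation stable if the landing page moves.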