SlideShare une entreprise Scribd logo
1  sur  79
DATA MANAGEMENT 101
Nicole Vasilevsky, Jackie Wirz and Melissa Haendel
PMCB New Student Orientation
20 September 2013
1 | Data definitions
2 | Dealing with data
3 | How the OHSU
Library can help
Nicole
Vasilevsky, Ph
D
Project
Manager, Ontolo
gy Development
Group
Jackie Wirz,
PhD
Assistant
Professor,
Bioinformation
Specialist
Melissa
Haendel, PhD
Assistant
Professor,
Lead,
Ontology
Development
Group
1 | Data definitions
Data does not speak for itself…
YOU speak for YOUR data
But First, you need to manage it
But, even more fundamentally…
data
means many
things…
what does
data mean to
you?
What are data?
Experimental data
Social data
School related data
Personal data
Do you know what metadata is?
a. Philosophy
b. describes data
c. dating site
d. data
2 | dealing with data
Do you get frustrated with any of the following?
a. Storing data
b. Backing up data
c. Analyzing/manipulating data
d. Finding data produced by other researchers/clinicians
e. Ensuring data are secure
f. Making data accessible to other researchers
g. Controlling access to data
h. Tracking updates to data (ie versioning)
i. Creating metadata (ie describing the data to be more useful at a later
time or by others)
j. Protecting intellectual property rights
k. Ensuring appropriate professional credit/citation is given to data
sets/generated
Why?
Personal
organization
Efficiency
Credit where
credit is due
Accelerate
scientific and
clinical discovery
Reproducibility of
science and
medicine
naming | metadata | tools | standards
How?
naming
File naming
Naming conventions
Project_instrument_location_YYYYMMDDhhm
mss_extra.ext
Index/grant
conditions
Leading zero!
s/n, variable
Retain
order
Naming: Directory Structure
PCMB presentation
Library presentation
DMICE presentation
Presentations
PMCB Library DMICE
http://ftp.ihmc.us/
ReadMe
Version Control
Versioning
• Save a copy of every version of a file
• Follow a file naming convention
Data101_PMCB_Retreat_09-20-13_v1
Data101_PMCB_Retreat_09-20-13_v2
Data101_PMCB_Retreat_09-20-13_Final
Versioning
Versioning
Versioning
Version Control software:
• GIT
• SVN
Backups
Which of the following do you do?
a. Save copies of data on a disk, USB drive, or computer
hard drive
b. Save copies of data on a local server
c. Save copies of data on a central campus server
d. Save copies of data on a web-based or cloud server
e. Store data in a repository or archives
f. Automatically backup files
g. Manually generate backup
h. Restrict access to files
 1 on your local workstation
 1 local/removable, such as external hard
drive
 1 on central server
 1 remote, such as on a cloud server*
*Depending on the type of data, as cloud servers are not
always secure
Where can you backup your data?
Metadata
What is Metadata?
Title
Author
Call number
Publisher
ISBN
- Anne Gilliland
Your metadata
should make
your data
understandable
to others without
your
involvement
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Are you aware of data
standards in your field?
data standards
Data standards are the rules by which data are
described and recorded. In order to
share, exchange, and understand data, we must
standardize the format as well as the meaning.
http://www.usgs.gov/datamanagement/plan/datastandards.php
Controlled vocabularies
Structured data helps with searching
Craigslist search: Chaise
Craigslist matches on strings only
Craigslist search: Fainting couch
Structured data helps with searching
PubMed indexes articles with
MeSH Terms
Structured data helps with searching
Why are CVs and Ontologies useful?
• Can be used to structure your metadata
• Are often used to structure information in
databases
Cell Ontology Linnean Taxonomy
Order
Genus
Species
Phylum
Class
Family
Kingdom
tools
File renaming applications
• Bulk Rename Utility (Windows)
• Renamer (Mac)
• PSRenamer
Data Management tools and
repositories
• Purpose: Software where you can
organize, store and/or share data
• Often contain metadata to assist with data
entry and create structured data
Tools for data management
Repositories use Unique IDs
• Document Object Identifier (DOI)
• Example: DOIs for publications
– doi: 10.1371/journal.pbio.1001339
• Unique resource identifier (URI)
• A URI will resolve to a single location on the
web
• URIs for people
• Example:
• John L Campbell, Research Ecologist, Oregon State University, Corvallis
OR
• John L Campbell, Research Ecologist, Center for Research on
Ecosystem Change, Durham, NC
standards
nomenclature
antibodies
Western Blot
Immunohistochemstry
ELISA
Co-immunoprecipitation
ChIP
Radioimmunoassay
FACS analysis of T cells from LNs and tumors
T cells were liberated from LNs by disruption between two
frosted glass slides. Cells from LNs and tumors were stained
with various combination of the following Abs: FITC-
CD4, allophycocyanin-CD25, PE Cy7-CD8, APC-CD62L, PE-
CD25, PE Cy7-CD25, and biotinylated-KJ-126 and in some
experiments made permeable with
fixation/permeablization buffers and stained with PE-FoxP3
(eBioscience). Harvested samples, isotype controls, and
single stain controls were run on the FACSCalibur (BD
Biosciences).
Ruby and Weinberg (2009) J Immunol. 182(3):1481-9.
Which antibody did they use in the paper?
A Solution: Antibody Registry
antibodyregistry.org
Meet the Urban Lab
Meet the Urban Lab
A+ organization!
The Urban lab antibodies
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
Commerical Ab
identifiable
Catalog number
reported
Source organism
reported
Target uniquely
identifiable
Of 14 antibodies published in 45
articles, only 38% were identifiable
Percentidentifiable
http://www.force11.org/node/4463
http://biosharing.org/bsg-000532
http://www.biosharing.org/standards/mibbi
Minimum Information for Biological and Biomedical Investigations
data publication and sharing
Why share data?
• Data sharing
mandates
• Further science and
and medicine
• Build collaborations
• Enable new
discoveries with
your data
• Can be required at
time of publication
Distribution of 2004–2005 citation counts of 85 trials by data availability.
How?
Beyond the PDF:
What can be published (and cited)?
Raw Science Nanopublications Self-publishing
Beyond the PDF:
What can be published (and cited)?
Raw Science Nanopublications Self-publishing
Datasets
Code
Experimental
design
Argument or
passage
Blogging
Microblogging
Comments on
existing work
Annotations on
existing work
Single figure
publications
How?
Data Journals and Repositories
• FigShare
• Dryad
• DataVerse (social science)
• Institutional repositories
www.impactstory.org
3 | How the OHSU
Library can help
1 | Large Lecture: Data Management 101
2 | 10 –15 Small Groups: data playground
• 1 researcher paired with 2 or 3 library staff
• Tailored analysis of data reporting and instruction
Save the date:
10/09/13
4-6pm
1k challenge award recipients
Thank you!
URLs to resources
Go to:
http://libguides.ohsu.edu/data

Contenu connexe

Tendances

Dna the next big thing in data storage
Dna the next big thing in data storageDna the next big thing in data storage
Dna the next big thing in data storageOther Mother
 
State of the Art Natural Language Processing at Scale with Alexander Thomas a...
State of the Art Natural Language Processing at Scale with Alexander Thomas a...State of the Art Natural Language Processing at Scale with Alexander Thomas a...
State of the Art Natural Language Processing at Scale with Alexander Thomas a...Databricks
 
Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.Elena Sügis
 
The DNA Era: Why DNA is important in your life.
The DNA Era:  Why DNA is important in your life.The DNA Era:  Why DNA is important in your life.
The DNA Era: Why DNA is important in your life.Richard Brownell
 
Scott Edmunds: Data publication in the data deluge
Scott Edmunds: Data publication in the data delugeScott Edmunds: Data publication in the data deluge
Scott Edmunds: Data publication in the data delugeGigaScience, BGI Hong Kong
 
Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...nolmar01
 
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...Neuroscience Information Framework
 
Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)AllSeq
 
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data HandlingScott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data HandlingGigaScience, BGI Hong Kong
 
DNA as memory storage device
DNA as memory storage deviceDNA as memory storage device
DNA as memory storage deviceKiran Gajare
 
Karin Strauss - DNA Storage, July 2016
Karin Strauss - DNA Storage, July 2016Karin Strauss - DNA Storage, July 2016
Karin Strauss - DNA Storage, July 2016Seattle DAML meetup
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and modelsmyGrid team
 
2015 aem-grs-keynote
2015 aem-grs-keynote2015 aem-grs-keynote
2015 aem-grs-keynotec.titus.brown
 
'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014Anita de Waard
 
Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"GigaScience, BGI Hong Kong
 
Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...GigaScience, BGI Hong Kong
 
Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"GigaScience, BGI Hong Kong
 

Tendances (20)

Dna the next big thing in data storage
Dna the next big thing in data storageDna the next big thing in data storage
Dna the next big thing in data storage
 
Dr Justin Schonfeld - Bioinformatics Applications
Dr Justin Schonfeld - Bioinformatics ApplicationsDr Justin Schonfeld - Bioinformatics Applications
Dr Justin Schonfeld - Bioinformatics Applications
 
State of the Art Natural Language Processing at Scale with Alexander Thomas a...
State of the Art Natural Language Processing at Scale with Alexander Thomas a...State of the Art Natural Language Processing at Scale with Alexander Thomas a...
State of the Art Natural Language Processing at Scale with Alexander Thomas a...
 
Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
 
The DNA Era: Why DNA is important in your life.
The DNA Era:  Why DNA is important in your life.The DNA Era:  Why DNA is important in your life.
The DNA Era: Why DNA is important in your life.
 
Scott Edmunds: Data publication in the data deluge
Scott Edmunds: Data publication in the data delugeScott Edmunds: Data publication in the data deluge
Scott Edmunds: Data publication in the data deluge
 
Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...
 
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
 
Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)
 
EVQLV Deck
EVQLV Deck EVQLV Deck
EVQLV Deck
 
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data HandlingScott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
 
DNA as memory storage device
DNA as memory storage deviceDNA as memory storage device
DNA as memory storage device
 
Karin Strauss - DNA Storage, July 2016
Karin Strauss - DNA Storage, July 2016Karin Strauss - DNA Storage, July 2016
Karin Strauss - DNA Storage, July 2016
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and models
 
2015 aem-grs-keynote
2015 aem-grs-keynote2015 aem-grs-keynote
2015 aem-grs-keynote
 
'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014
 
Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"
 
Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...
 
Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"
 
Introduction to Database Research Projects @ CWHR
Introduction to Database Research Projects @ CWHRIntroduction to Database Research Projects @ CWHR
Introduction to Database Research Projects @ CWHR
 

En vedette

Data Herding for Scientists - IGERT Symposium at UF
Data Herding for Scientists - IGERT Symposium at UFData Herding for Scientists - IGERT Symposium at UF
Data Herding for Scientists - IGERT Symposium at UFCarly Strasser
 
UC Santa Cruz: Data Management for Scientists
UC Santa Cruz: Data Management for ScientistsUC Santa Cruz: Data Management for Scientists
UC Santa Cruz: Data Management for ScientistsCarly Strasser
 
Data Matters for AGU Early Career Conference
Data Matters for AGU Early Career ConferenceData Matters for AGU Early Career Conference
Data Matters for AGU Early Career ConferenceCarly Strasser
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data LocallyErin D. Foster
 
Funders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeFunders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeCarly Strasser
 
NGP Retreat Open Science 2015
NGP Retreat Open Science 2015NGP Retreat Open Science 2015
NGP Retreat Open Science 2015Jackie Wirz, PhD
 
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015Carly Strasser
 
Deep phenotyping to aid identification of coding & non-coding rare disease v...
Deep phenotyping to aid identification  of coding & non-coding rare disease v...Deep phenotyping to aid identification  of coding & non-coding rare disease v...
Deep phenotyping to aid identification of coding & non-coding rare disease v...mhaendel
 

En vedette (9)

Data Herding for Scientists - IGERT Symposium at UF
Data Herding for Scientists - IGERT Symposium at UFData Herding for Scientists - IGERT Symposium at UF
Data Herding for Scientists - IGERT Symposium at UF
 
Science101 slideshare
Science101 slideshareScience101 slideshare
Science101 slideshare
 
UC Santa Cruz: Data Management for Scientists
UC Santa Cruz: Data Management for ScientistsUC Santa Cruz: Data Management for Scientists
UC Santa Cruz: Data Management for Scientists
 
Data Matters for AGU Early Career Conference
Data Matters for AGU Early Career ConferenceData Matters for AGU Early Career Conference
Data Matters for AGU Early Career Conference
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data Locally
 
Funders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeFunders and Publishers: Agents of Change
Funders and Publishers: Agents of Change
 
NGP Retreat Open Science 2015
NGP Retreat Open Science 2015NGP Retreat Open Science 2015
NGP Retreat Open Science 2015
 
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
 
Deep phenotyping to aid identification of coding & non-coding rare disease v...
Deep phenotyping to aid identification  of coding & non-coding rare disease v...Deep phenotyping to aid identification  of coding & non-coding rare disease v...
Deep phenotyping to aid identification of coding & non-coding rare disease v...
 

Similaire à Data101 pmcb retreat_09-20-13_final

Data management workshop 101113
Data management workshop 101113Data management workshop 101113
Data management workshop 101113Jackie Wirz, PhD
 
Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesJosef Scheiber
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...GigaScience, BGI Hong Kong
 
SPARC 2013 Data Management Presentation
SPARC 2013 Data Management Presentation SPARC 2013 Data Management Presentation
SPARC 2013 Data Management Presentation Jackie Wirz, PhD
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...GigaScience, BGI Hong Kong
 
Managing sensitive data in your repository
Managing sensitive data in your repositoryManaging sensitive data in your repository
Managing sensitive data in your repositoryARDC
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Robert Grossman
 
The CDISC-HL7 Project
The CDISC-HL7 ProjectThe CDISC-HL7 Project
The CDISC-HL7 Projectolivaa
 
Talk at OHSU, September 25, 2013
Talk at OHSU, September 25, 2013Talk at OHSU, September 25, 2013
Talk at OHSU, September 25, 2013Anita de Waard
 
Scott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScienceScott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScienceGigaScience, BGI Hong Kong
 
FAIRness and Accountability BioIT 2019 FAIR track
FAIRness and Accountability BioIT 2019 FAIR trackFAIRness and Accountability BioIT 2019 FAIR track
FAIRness and Accountability BioIT 2019 FAIR trackHelena Deus
 
CLIR Fellows - Science Data - 14_0730
CLIR Fellows - Science Data - 14_0730CLIR Fellows - Science Data - 14_0730
CLIR Fellows - Science Data - 14_0730jeffreylancaster
 
Scott Edmunds: Channeling the Deluge: Reproducibility & Data Dissemination in...
Scott Edmunds: Channeling the Deluge: Reproducibility & Data Dissemination in...Scott Edmunds: Channeling the Deluge: Reproducibility & Data Dissemination in...
Scott Edmunds: Channeling the Deluge: Reproducibility & Data Dissemination in...GigaScience, BGI Hong Kong
 
Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholdermhaendel
 

Similaire à Data101 pmcb retreat_09-20-13_final (20)

Data management workshop 101113
Data management workshop 101113Data management workshop 101113
Data management workshop 101113
 
Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use Cases
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
 
SPARC 2013 Data Management Presentation
SPARC 2013 Data Management Presentation SPARC 2013 Data Management Presentation
SPARC 2013 Data Management Presentation
 
Data Management
Data ManagementData Management
Data Management
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
METRO RDM Webinar
METRO RDM WebinarMETRO RDM Webinar
METRO RDM Webinar
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...
 
Managing sensitive data in your repository
Managing sensitive data in your repositoryManaging sensitive data in your repository
Managing sensitive data in your repository
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
 
The CDISC-HL7 Project
The CDISC-HL7 ProjectThe CDISC-HL7 Project
The CDISC-HL7 Project
 
Talk at OHSU, September 25, 2013
Talk at OHSU, September 25, 2013Talk at OHSU, September 25, 2013
Talk at OHSU, September 25, 2013
 
Scott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScienceScott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScience
 
Martone grethe
Martone gretheMartone grethe
Martone grethe
 
FAIRness and Accountability BioIT 2019 FAIR track
FAIRness and Accountability BioIT 2019 FAIR trackFAIRness and Accountability BioIT 2019 FAIR track
FAIRness and Accountability BioIT 2019 FAIR track
 
CLIR Fellows - Science Data - 14_0730
CLIR Fellows - Science Data - 14_0730CLIR Fellows - Science Data - 14_0730
CLIR Fellows - Science Data - 14_0730
 
Scott Edmunds: Channeling the Deluge: Reproducibility & Data Dissemination in...
Scott Edmunds: Channeling the Deluge: Reproducibility & Data Dissemination in...Scott Edmunds: Channeling the Deluge: Reproducibility & Data Dissemination in...
Scott Edmunds: Channeling the Deluge: Reproducibility & Data Dissemination in...
 
Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholder
 
Data at the NIH
Data at the NIHData at the NIH
Data at the NIH
 

Plus de Jackie Wirz, PhD

Online NW 2015 Wirz Developing Novel Outreach Data Visualization
Online NW 2015 Wirz Developing Novel Outreach Data VisualizationOnline NW 2015 Wirz Developing Novel Outreach Data Visualization
Online NW 2015 Wirz Developing Novel Outreach Data VisualizationJackie Wirz, PhD
 
AM Career Marketing OHSU RIPSS 2014
AM Career Marketing OHSU RIPSS 2014AM Career Marketing OHSU RIPSS 2014
AM Career Marketing OHSU RIPSS 2014Jackie Wirz, PhD
 
Data Viz CE 2014 Vision and the Brain
Data Viz CE 2014 Vision and the BrainData Viz CE 2014 Vision and the Brain
Data Viz CE 2014 Vision and the BrainJackie Wirz, PhD
 
Data Viz CE 2014 Storytelling
Data Viz CE 2014 StorytellingData Viz CE 2014 Storytelling
Data Viz CE 2014 StorytellingJackie Wirz, PhD
 
Data Viz CE 2014 Intro and Overview
Data Viz CE 2014 Intro and OverviewData Viz CE 2014 Intro and Overview
Data Viz CE 2014 Intro and OverviewJackie Wirz, PhD
 
Data Viz CE 2014 Libraries
Data Viz CE 2014 LibrariesData Viz CE 2014 Libraries
Data Viz CE 2014 LibrariesJackie Wirz, PhD
 
Scientific Writing 2014 IEH
Scientific Writing 2014 IEHScientific Writing 2014 IEH
Scientific Writing 2014 IEHJackie Wirz, PhD
 
Posters & Presentations that Don't Suck
Posters & Presentations that Don't SuckPosters & Presentations that Don't Suck
Posters & Presentations that Don't SuckJackie Wirz, PhD
 
Data Management Open House
Data Management Open HouseData Management Open House
Data Management Open HouseJackie Wirz, PhD
 
Science is a moving target
Science is a moving targetScience is a moving target
Science is a moving targetJackie Wirz, PhD
 
Powered by Libraries: Leveraging Libraries for Semantic Web and Linked Open D...
Powered by Libraries: Leveraging Libraries for Semantic Web and Linked Open D...Powered by Libraries: Leveraging Libraries for Semantic Web and Linked Open D...
Powered by Libraries: Leveraging Libraries for Semantic Web and Linked Open D...Jackie Wirz, PhD
 
NCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesNCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesJackie Wirz, PhD
 

Plus de Jackie Wirz, PhD (19)

Online NW 2015 Wirz Developing Novel Outreach Data Visualization
Online NW 2015 Wirz Developing Novel Outreach Data VisualizationOnline NW 2015 Wirz Developing Novel Outreach Data Visualization
Online NW 2015 Wirz Developing Novel Outreach Data Visualization
 
AM Career Marketing OHSU RIPSS 2014
AM Career Marketing OHSU RIPSS 2014AM Career Marketing OHSU RIPSS 2014
AM Career Marketing OHSU RIPSS 2014
 
Data Viz CE 2014 Vision and the Brain
Data Viz CE 2014 Vision and the BrainData Viz CE 2014 Vision and the Brain
Data Viz CE 2014 Vision and the Brain
 
Data Viz CE 2014 Toolbox
Data Viz CE 2014 ToolboxData Viz CE 2014 Toolbox
Data Viz CE 2014 Toolbox
 
Data Viz CE 2014 Storytelling
Data Viz CE 2014 StorytellingData Viz CE 2014 Storytelling
Data Viz CE 2014 Storytelling
 
Data Viz CE 2014 Intro and Overview
Data Viz CE 2014 Intro and OverviewData Viz CE 2014 Intro and Overview
Data Viz CE 2014 Intro and Overview
 
Data Viz CE 2014 Color
Data Viz CE 2014 ColorData Viz CE 2014 Color
Data Viz CE 2014 Color
 
Data Viz CE 2014 Libraries
Data Viz CE 2014 LibrariesData Viz CE 2014 Libraries
Data Viz CE 2014 Libraries
 
Scientific Writing 2014 IEH
Scientific Writing 2014 IEHScientific Writing 2014 IEH
Scientific Writing 2014 IEH
 
Posters & Presentations that Don't Suck
Posters & Presentations that Don't SuckPosters & Presentations that Don't Suck
Posters & Presentations that Don't Suck
 
Rw 2014 poster final
Rw 2014 poster finalRw 2014 poster final
Rw 2014 poster final
 
Rw 2014 data visulization
Rw 2014 data visulizationRw 2014 data visulization
Rw 2014 data visulization
 
Data Management Open House
Data Management Open HouseData Management Open House
Data Management Open House
 
Foundations of data viz
Foundations of data vizFoundations of data viz
Foundations of data viz
 
Science is a moving target
Science is a moving targetScience is a moving target
Science is a moving target
 
Powered by Libraries: Leveraging Libraries for Semantic Web and Linked Open D...
Powered by Libraries: Leveraging Libraries for Semantic Web and Linked Open D...Powered by Libraries: Leveraging Libraries for Semantic Web and Linked Open D...
Powered by Libraries: Leveraging Libraries for Semantic Web and Linked Open D...
 
RML NCBI Resources
RML NCBI ResourcesRML NCBI Resources
RML NCBI Resources
 
Science 101 Preview
Science 101 PreviewScience 101 Preview
Science 101 Preview
 
NCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesNCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners Slides
 

Dernier

Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxdhanalakshmis0310
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxcallscotland1987
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxAmita Gupta
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 

Dernier (20)

Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 

Data101 pmcb retreat_09-20-13_final

Notes de l'éditeur

  1. JW
  2. JW
  3. JW
  4. JW
  5. JW
  6. JW
  7. Ask them to think about what type of data they deal with/generate. Give a couple minutes.
  8. Ask if they have additional data types that they brainstormed
  9. JW
  10. These are all things that the library can help you do
  11. JW
  12. JW
  13. http://patenteux.com/Messy_desktop/messy_wallpaper-1280x1024.jpg
  14. If you work on the command line, you can see all the file paths
  15. JW
  16. Show examples of versionsCan go back when you make mistakes when changes are madeShare work with other peopleBoth work on things at the same time and merge back togetherAkin to game of telephone- version control can let you see exactly when a change was made
  17. Show examples of versionsCan go back when you make mistakes when changes are madeShare work with other peopleBoth work on things at the same time and merge back togetherAkin to game of telephone- version control can let you see exactly when a change was madeNEW SLIDES:Examples of versions of dataData101_NV_v1Data101_NV_v2Simple software solutionsSome software keeps versions for youShow where to go get itVersion Control SoftwareVersion control softwareSVN, GITShow example of google codeCan write commit messages you version you commit
  18. Show examples of versionsCan go back when you make mistakes when changes are madeShare work with other peopleBoth work on things at the same time and merge back togetherAkin to game of telephone- version control can let you see exactly when a change was made
  19. Show examples of versionsCan go back when you make mistakes when changes are madeShare work with other peopleBoth work on things at the same time and merge back togetherAkin to game of telephone- version control can let you see exactly when a change was made
  20. Show examples of versionsCan go back when you make mistakes when changes are madeShare work with other peopleBoth work on things at the same time and merge back togetherAkin to game of telephone- version control can let you see exactly when a change was made
  21. NICOLE
  22. Central servers will have multiple redundancy, back ups of back upsHigh quality secure USBs with passwords and encyrption, or burn to disk
  23. JW
  24. !
  25. Move this
  26. Information science is a parent
  27. Ontologies classify terms and the relationships between them.
  28. JW
  29. Software that can rename your files, if you already have them named
  30. Goal is to solve the author/contributor name ambiguity problem in scholarly communications Creating a central registry of unique identifiers for individual researchers Identifiers, and the relationships among them, can be linked to the researcher
  31. JW
  32. JW
  33. JW
  34. JW
  35. JW
  36. JW
  37. JW
  38. Maybe discuss the PlumX project?
  39. JW
  40. Say that we won an award to sponsor this program