SlideShare une entreprise Scribd logo
1  sur  42
deepcarbon.net
Xiaogang (Marshall) Ma and DCO-Data Science Team
Tetherless World Constellation
Rensselaer Polytechnic Institute
Why Data Science Matters?
and what can we do with it
Outline
• Data Management and Publication
• Interoperability of Data
• Provenance of Research
• Era of Science 2.0
2
Data Management and Publication
3
• Meet grant requirements
– Many funding agencies now require researchers formally state
how they will manage and preserve datasets generated from a
research project.
4
… …
Why Manage and Publish Data
• Increase your research efficiency
– Have you ever had a hard time understanding the data that
you or your colleagues have collected?
5
Data work
6
Image courtesy of British Geological Survey
Nice, now I have my DATA well managed, and next…
• Increase the visibility of your research
– Making your data available to other researchers through
widely-searched repositories can increase your prominence
and demonstrate continued use of the data and relevance of
your research.
• Facilitate new discoveries
– Enabling other researchers to use your data reinforces open
scientific inquiry and can lead to new and unanticipated
discoveries. And doing so prevents duplication of effort by
enabling others to use your data rather than trying to gather
the data themselves.
7
Data Management Plan: What and How
• What is a Data Management Plan?
– A data management plan is a formal document that outlines
what you will do with your data during and after you complete
your research.
• What is involved in developing one?
– Developing a data management plan can be time-consuming,
tedious, and daunting, but it's a very important step in ensuring
that your research data is safe and sound for the present and
future.
– With the right process and framework it does not take too long
time and can pay-off enormously in the long-run.
8
• Topics in a data management plan include
– Introduction and context
– Data types, formats, standards and capture methods
– Short-term storage and data management
– Deposit and long-term preservation
– Data sharing, access and re-use
– Resourcing
– Adherence and review
• Resources/Tools help create DMPs:
– DCC Data Management Plans:
http://www.dcc.ac.uk/resources/data-management-plans
– MIT Data Management and Publishing:
http://libraries.mit.edu/data-management/
– NSF Data Management Plan Requirements:
http://www.nsf.gov/eng/general/dmp.jsp
– DMPTool: https://dmptool.org
– IEDA Data Management Plan Tool:
http://www.iedadata.org/compliance/plan
– DCC DMPOnline: https://dmponline.dcc.ac.uk
10
12
Image from WWW
Data Publication & Citation
• Data as first class products of research
– NSF bio-sketches can include data publications
13
Image from j4h.net
• Ways of data publication
– Data as supplemental material of a paper
– Standalone data
– Data paper: data + descriptive ‘data paper’
14
(Strasser, 2014)
Examples:
• Standalone data journals: Nature Scientific Data, Geoscience Data
Journal, Ecological Archives
• Journals that publish data papers: GigaScience, F1000 Research,
Internet Archaeology
15
What does a DCO data
publication look like?
• Data Citation Index
– Indexes the world's leading data repositories
– Records for the datasets are connected to related peer-
reviewed literature indexed in the Web of Science™
– Allow researchers to efficiently access to data across subjects
and regions
16
Interoperability of Data
17
A good example
• OneGeology
18
• Web-accessible geologic map data
worldwide (scale ~1:1 million)
• Stimulate a rapid increase in interoperability
(i.e. disseminate GeoSciML and
vocabularies further and faster)
• 120 participating countries (July 2014)
http://portal.onegeology.org 19
20
Wyoming
Colorado
More challenges are still to be addressed
http://mrdata.usgs.gov/
21
Earth Resource Form
Environmental Impact Value
Exploration Activity Type
Exploration Result
UNFC Value
Earth Resource Expression
Earth Resource Shape
Enduse Potential
Mineral Occurrence Type
Mining Activity Type
Processing Activity Type
Mining Waste Type Value
Commodity Code
Mineral Deposit Group
Mineral Deposit Type
Product Value
A list of recently finished vocabularies
CGI Geoscience Terminology Workgroup
• Construct a collection of vocabularies for
populating information interchange documents and
enabling interoperability
• Provide labels for concepts, scope to various
communities defined by language, science domain,
or application domain
22
Another major effort...
And there is a vocabulary created by the CGI Geoscience Terminology Workgroup!
Golden Spike information portal
http://geotime.tw.rpi.edu/
23
Golden spike - Global Boundary
Stratotype Section and Point (GSSP)
(Haq, 2007)
24
Still, challenges …
25
(Ma et al., 2011)
Interoperable:
“Data should be discoverable, accessible, decodable,
understandable and usable, and data sharing should be
legal and ethical for all participants.”
• Interoperability does not mean that all data should be
mediated or standardized.
• However, it is important that data archives are
accompanied by detailed documentation, clarifying data
provenance, data model, vocabularies used, etc.
26
(Ma et al., 2011)
Provenance of Research
27
Provenance capture
• Documenting provenance
– Linking a range of observations and model outputs, research
activities, people and organizations involved in the production of
scientific findings with the supporting data sets and methods
used to generate them.
28
Well-curated provenance information
makes scientific workflows transparent
and improves the credibility and
trustworthiness of their outputs. It also
facilitates informed and rational policy
and decision-making.
Image from nature.com
(Ma et al., 2014)
“Figure 1.2: Sea Level Rise: Past, Present, and Future” in the Third National
Climate Assessment report draft of USA (NCA3) 29
What is the provenance
of this figure?
• Detailed caption of that figure:
– Estimated, observed and possible amounts of global sea level
rise from 1800 to 2100. Proxy estimates (Kemp et al. 2012)
(for example, based on sediment records) are shown in red
(pink band shows uncertainty), tide gauge data in blue
(Church and White 2011a), and satellite observations are
shown in green (Nerem et al. 2010). The future scenarios
range from 0.66 feet to 6.6 feet in 2100 (Parris et al. 2012).
Higher or lower amounts of sea level rise are considered
implausible, as represented by the gray shading. The orange
line at right shows the currently projected range of sea level
rise of 1 to 4 feet by 2100, which falls within the larger risk-
based scenario range. The large projected range reflects
uncertainty about how glaciers and ice sheets will react to the
warming ocean, the warming atmosphere, and changing winds
and currents. As seen in the observations, there are year-to-
year variations in the trend. (Figure source: Josh Willis, NASA
Jet Propulsion Laboratory)
30
As a case study, let’s trace the
provenance of this paper.
Provenance tracing of NASA contributions to Figure 1.2 in draft NCA3
Here only the details of
Topex-Poseidon mission are
shown
Here only the details of one
paper (i.e., “paper/103”) cited
by that figure are shown
(a) Instances of
calibration, model and
software underpinning
“paper/103”
(b) Instances of sensor,
instrument and platform
underpinning that paper
31
34
35
http://data.globalchange.gov
Era of Science 2.0
36
Practice
• Science 2.0
– New practices of scientists who post raw experimental results,
nascent theories, claims of discovery and draft papers on the
Web for others to see and comment on.
– Proponents say these “open access” practices make scientific
progress more collaborative and therefore more productive.
– Critics say scientists who put preliminary findings online risk
having others copy or exploit the work to gain credit or even
patents.
37
(Waldrop, 2008)
38
• Social scholarship: Reconsidering scholarly practices in
the age of social media
– Polled 1,600 US and Canadian faculty members
– Found that 15% use Twitter, 28% use YouTube and 39% use
Facebook for scholarly activity
39
(Greenhow and Gleason, 2014)
Using social media more often would help scientists to disseminate
their results, debate findings and engage a wider audience
Researchers must learn to create a robust
online presence
Social-media metrics to be added to the
tenure process
• Altmetrics
– A very broad group of metrics, capturing various parts of
impact a paper or work can have.
40
(Lin and Fenner, 2013)
The ImpactStory
Altermetrics Classifications
• altmetric.com
– already a product used by NPG, Springer, etc.
41
This Altmetric score means that the article is:
• in the 99 percentile (ranked 181st) of the
81,582 tracked articles of a similar age in all
journals
• in the 93 percentile (ranked 69th) of the 992
tracked articles of a similar age in Nature
http://www.nature.com/nature/journal/v497/n7449/nature12127/metrics
Summary
make data count
42
43
max7@rpi.edu
Thank you!

Contenu connexe

Tendances

Ecosystem data and TERN: Genes to geosciences workshop 19 May 2014
Ecosystem data and TERN: Genes to geosciences workshop 19 May 2014Ecosystem data and TERN: Genes to geosciences workshop 19 May 2014
Ecosystem data and TERN: Genes to geosciences workshop 19 May 2014TERN Australia
 
Pacific Wave and PRP Update Big News for Big Data
Pacific Wave and PRP Update Big News for Big DataPacific Wave and PRP Update Big News for Big Data
Pacific Wave and PRP Update Big News for Big DataLarry Smarr
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research PlatformLarry Smarr
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)Dag Endresen
 
Open Science: Where Theory Meets Practice
Open Science: Where Theory Meets PracticeOpen Science: Where Theory Meets Practice
Open Science: Where Theory Meets PracticePhilip Bourne
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)Dag Endresen
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Philip Bourne
 
ischools future of data managemente dec2017
ischools future of data managemente dec2017ischools future of data managemente dec2017
ischools future of data managemente dec2017ARDC
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...GigaScience, BGI Hong Kong
 
A SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIHA SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIHPhilip Bourne
 
Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2Dan Taylor
 
Astromat Update on Developments 2021-01-29
Astromat Update on Developments 2021-01-29Astromat Update on Developments 2021-01-29
Astromat Update on Developments 2021-01-29Kerstin Lehnert
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?Philip Bourne
 
Understanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceUnderstanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceAndrew Sallans
 
RDAP13 Lorrie Johnson: Facilitating Access to Scientific Data
RDAP13 Lorrie Johnson: Facilitating Access to Scientific DataRDAP13 Lorrie Johnson: Facilitating Access to Scientific Data
RDAP13 Lorrie Johnson: Facilitating Access to Scientific DataASIS&T
 
E Research Chapter 1
E Research Chapter 1E Research Chapter 1
E Research Chapter 1guest2426e1d
 
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14Dag Endresen
 
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)EUDAT
 

Tendances (20)

Ecosystem data and TERN: Genes to geosciences workshop 19 May 2014
Ecosystem data and TERN: Genes to geosciences workshop 19 May 2014Ecosystem data and TERN: Genes to geosciences workshop 19 May 2014
Ecosystem data and TERN: Genes to geosciences workshop 19 May 2014
 
Pacific Wave and PRP Update Big News for Big Data
Pacific Wave and PRP Update Big News for Big DataPacific Wave and PRP Update Big News for Big Data
Pacific Wave and PRP Update Big News for Big Data
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)
 
Open Science: Where Theory Meets Practice
Open Science: Where Theory Meets PracticeOpen Science: Where Theory Meets Practice
Open Science: Where Theory Meets Practice
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?
 
ischools future of data managemente dec2017
ischools future of data managemente dec2017ischools future of data managemente dec2017
ischools future of data managemente dec2017
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
 
A SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIHA SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIH
 
Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2
 
Astromat Update on Developments 2021-01-29
Astromat Update on Developments 2021-01-29Astromat Update on Developments 2021-01-29
Astromat Update on Developments 2021-01-29
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?
 
Understanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceUnderstanding the Big Picture of e-Science
Understanding the Big Picture of e-Science
 
RDAP13 Lorrie Johnson: Facilitating Access to Scientific Data
RDAP13 Lorrie Johnson: Facilitating Access to Scientific DataRDAP13 Lorrie Johnson: Facilitating Access to Scientific Data
RDAP13 Lorrie Johnson: Facilitating Access to Scientific Data
 
E Research Chapter 1
E Research Chapter 1E Research Chapter 1
E Research Chapter 1
 
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14
 
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLANINCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
 
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)
 

Similaire à Why data science matters and what we can do with it

Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...EarthCube
 
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...Sky Bristol
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...hsuleslie
 
The changing landscape of science
The changing landscape of scienceThe changing landscape of science
The changing landscape of scienceChelle Gentemann
 
British Library Social Science National Postgraduate Training Day - Datasets ...
British Library Social Science National Postgraduate Training Day - Datasets ...British Library Social Science National Postgraduate Training Day - Datasets ...
British Library Social Science National Postgraduate Training Day - Datasets ...johnkayebl
 
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...datacite
 
IEDA Overview & Updates, March 2014
IEDA Overview & Updates, March 2014IEDA Overview & Updates, March 2014
IEDA Overview & Updates, March 2014iedadata
 
Panel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewPanel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewLarry Smarr
 
VIVO Conference 2013 Panel Slides
VIVO Conference 2013 Panel SlidesVIVO Conference 2013 Panel Slides
VIVO Conference 2013 Panel SlidesPatrick West
 
Next Generation Citizen Science
Next Generation Citizen ScienceNext Generation Citizen Science
Next Generation Citizen ScienceLea Shanley
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityTERN Australia
 
Lightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded ProjectsLightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded ProjectsEarthCube
 
SemWeb 4 Gov – opportunities and challenges
SemWeb 4 Gov – opportunities and challengesSemWeb 4 Gov – opportunities and challenges
SemWeb 4 Gov – opportunities and challengesAndrew Woolf
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsMartin Donnelly
 
The FOSTER project - general overview
The FOSTER project - general overviewThe FOSTER project - general overview
The FOSTER project - general overviewMartin Donnelly
 
Learn to speak open
Learn to speak openLearn to speak open
Learn to speak openLilian Juma
 
EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube
 

Similaire à Why data science matters and what we can do with it (20)

Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
 
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
 
Herring Noaa Spring08
Herring Noaa Spring08Herring Noaa Spring08
Herring Noaa Spring08
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
 
The changing landscape of science
The changing landscape of scienceThe changing landscape of science
The changing landscape of science
 
Johnston - How to Curate Research Data
Johnston - How to Curate Research DataJohnston - How to Curate Research Data
Johnston - How to Curate Research Data
 
British Library Social Science National Postgraduate Training Day - Datasets ...
British Library Social Science National Postgraduate Training Day - Datasets ...British Library Social Science National Postgraduate Training Day - Datasets ...
British Library Social Science National Postgraduate Training Day - Datasets ...
 
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
 
IEDA Overview & Updates, March 2014
IEDA Overview & Updates, March 2014IEDA Overview & Updates, March 2014
IEDA Overview & Updates, March 2014
 
Panel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewPanel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An Overview
 
VIVO Conference 2013 Panel Slides
VIVO Conference 2013 Panel SlidesVIVO Conference 2013 Panel Slides
VIVO Conference 2013 Panel Slides
 
Next Generation Citizen Science
Next Generation Citizen ScienceNext Generation Citizen Science
Next Generation Citizen Science
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive Capability
 
Lightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded ProjectsLightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded Projects
 
SemWeb 4 Gov – opportunities and challenges
SemWeb 4 Gov – opportunities and challengesSemWeb 4 Gov – opportunities and challenges
SemWeb 4 Gov – opportunities and challenges
 
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon HodsonCODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and Solutions
 
The FOSTER project - general overview
The FOSTER project - general overviewThe FOSTER project - general overview
The FOSTER project - general overview
 
Learn to speak open
Learn to speak openLearn to speak open
Learn to speak open
 
EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013
 

Plus de Xiaogang (Marshall) Ma

From data portal to knowledge portal: Leveraging semantic technologies to sup...
From data portal to knowledge portal: Leveraging semantic technologies to sup...From data portal to knowledge portal: Leveraging semantic technologies to sup...
From data portal to knowledge portal: Leveraging semantic technologies to sup...Xiaogang (Marshall) Ma
 
Exploring the Web of Data for Earth and Environmental Sciences
Exploring the Web of Data for Earth and Environmental SciencesExploring the Web of Data for Earth and Environmental Sciences
Exploring the Web of Data for Earth and Environmental SciencesXiaogang (Marshall) Ma
 
Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal
Adoption of RDA DTR and PID in Deep Carbon Observatory Data PortalAdoption of RDA DTR and PID in Deep Carbon Observatory Data Portal
Adoption of RDA DTR and PID in Deep Carbon Observatory Data PortalXiaogang (Marshall) Ma
 
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...Xiaogang (Marshall) Ma
 
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award LectureWhy Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award LectureXiaogang (Marshall) Ma
 
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...Xiaogang (Marshall) Ma
 
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...Xiaogang (Marshall) Ma
 
Ontology Development for Provenance Tracing in National Climate Assessment o...
Ontology Development for Provenance Tracing in National Climate Assessment o...Ontology Development for Provenance Tracing in National Climate Assessment o...
Ontology Development for Provenance Tracing in National Climate Assessment o...Xiaogang (Marshall) Ma
 
A short review of Connected China: A visualization of elite social networks i...
A short review of Connected China: A visualization of elite social networks i...A short review of Connected China: A visualization of elite social networks i...
A short review of Connected China: A visualization of elite social networks i...Xiaogang (Marshall) Ma
 
Ontology spectrum for geological data interoperability (PhD defense nov 2011)
Ontology spectrum for geological data interoperability (PhD defense nov 2011)Ontology spectrum for geological data interoperability (PhD defense nov 2011)
Ontology spectrum for geological data interoperability (PhD defense nov 2011)Xiaogang (Marshall) Ma
 
A use case-driven iterative method for building a provenance-aware GCIS onto...
A use case-driven iterative method for building a provenance-aware GCIS onto...A use case-driven iterative method for building a provenance-aware GCIS onto...
A use case-driven iterative method for building a provenance-aware GCIS onto...Xiaogang (Marshall) Ma
 
A short story of geologic time ontologies and vocabularies
A short story of geologic time ontologies and vocabulariesA short story of geologic time ontologies and vocabularies
A short story of geologic time ontologies and vocabulariesXiaogang (Marshall) Ma
 
Exploratory visualization of earth science data in a Semantic Web context
Exploratory visualization of earth science data in a Semantic Web contextExploratory visualization of earth science data in a Semantic Web context
Exploratory visualization of earth science data in a Semantic Web contextXiaogang (Marshall) Ma
 

Plus de Xiaogang (Marshall) Ma (14)

From data portal to knowledge portal: Leveraging semantic technologies to sup...
From data portal to knowledge portal: Leveraging semantic technologies to sup...From data portal to knowledge portal: Leveraging semantic technologies to sup...
From data portal to knowledge portal: Leveraging semantic technologies to sup...
 
Exploring the Web of Data for Earth and Environmental Sciences
Exploring the Web of Data for Earth and Environmental SciencesExploring the Web of Data for Earth and Environmental Sciences
Exploring the Web of Data for Earth and Environmental Sciences
 
Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal
Adoption of RDA DTR and PID in Deep Carbon Observatory Data PortalAdoption of RDA DTR and PID in Deep Carbon Observatory Data Portal
Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal
 
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...
 
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award LectureWhy Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
 
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...
 
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...
 
Ontology Development for Provenance Tracing in National Climate Assessment o...
Ontology Development for Provenance Tracing in National Climate Assessment o...Ontology Development for Provenance Tracing in National Climate Assessment o...
Ontology Development for Provenance Tracing in National Climate Assessment o...
 
A short review of Connected China: A visualization of elite social networks i...
A short review of Connected China: A visualization of elite social networks i...A short review of Connected China: A visualization of elite social networks i...
A short review of Connected China: A visualization of elite social networks i...
 
Ontology spectrum for geological data interoperability (PhD defense nov 2011)
Ontology spectrum for geological data interoperability (PhD defense nov 2011)Ontology spectrum for geological data interoperability (PhD defense nov 2011)
Ontology spectrum for geological data interoperability (PhD defense nov 2011)
 
A use case-driven iterative method for building a provenance-aware GCIS onto...
A use case-driven iterative method for building a provenance-aware GCIS onto...A use case-driven iterative method for building a provenance-aware GCIS onto...
A use case-driven iterative method for building a provenance-aware GCIS onto...
 
A short introduction to GIS
A short introduction to GISA short introduction to GIS
A short introduction to GIS
 
A short story of geologic time ontologies and vocabularies
A short story of geologic time ontologies and vocabulariesA short story of geologic time ontologies and vocabularies
A short story of geologic time ontologies and vocabularies
 
Exploratory visualization of earth science data in a Semantic Web context
Exploratory visualization of earth science data in a Semantic Web contextExploratory visualization of earth science data in a Semantic Web context
Exploratory visualization of earth science data in a Semantic Web context
 

Dernier

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 

Dernier (20)

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 

Why data science matters and what we can do with it

  • 1. deepcarbon.net Xiaogang (Marshall) Ma and DCO-Data Science Team Tetherless World Constellation Rensselaer Polytechnic Institute Why Data Science Matters? and what can we do with it
  • 2. Outline • Data Management and Publication • Interoperability of Data • Provenance of Research • Era of Science 2.0 2
  • 3. Data Management and Publication 3
  • 4. • Meet grant requirements – Many funding agencies now require researchers formally state how they will manage and preserve datasets generated from a research project. 4 … … Why Manage and Publish Data
  • 5. • Increase your research efficiency – Have you ever had a hard time understanding the data that you or your colleagues have collected? 5 Data work
  • 6. 6 Image courtesy of British Geological Survey Nice, now I have my DATA well managed, and next…
  • 7. • Increase the visibility of your research – Making your data available to other researchers through widely-searched repositories can increase your prominence and demonstrate continued use of the data and relevance of your research. • Facilitate new discoveries – Enabling other researchers to use your data reinforces open scientific inquiry and can lead to new and unanticipated discoveries. And doing so prevents duplication of effort by enabling others to use your data rather than trying to gather the data themselves. 7
  • 8. Data Management Plan: What and How • What is a Data Management Plan? – A data management plan is a formal document that outlines what you will do with your data during and after you complete your research. • What is involved in developing one? – Developing a data management plan can be time-consuming, tedious, and daunting, but it's a very important step in ensuring that your research data is safe and sound for the present and future. – With the right process and framework it does not take too long time and can pay-off enormously in the long-run. 8
  • 9. • Topics in a data management plan include – Introduction and context – Data types, formats, standards and capture methods – Short-term storage and data management – Deposit and long-term preservation – Data sharing, access and re-use – Resourcing – Adherence and review
  • 10. • Resources/Tools help create DMPs: – DCC Data Management Plans: http://www.dcc.ac.uk/resources/data-management-plans – MIT Data Management and Publishing: http://libraries.mit.edu/data-management/ – NSF Data Management Plan Requirements: http://www.nsf.gov/eng/general/dmp.jsp – DMPTool: https://dmptool.org – IEDA Data Management Plan Tool: http://www.iedadata.org/compliance/plan – DCC DMPOnline: https://dmponline.dcc.ac.uk 10
  • 12. Data Publication & Citation • Data as first class products of research – NSF bio-sketches can include data publications 13 Image from j4h.net
  • 13. • Ways of data publication – Data as supplemental material of a paper – Standalone data – Data paper: data + descriptive ‘data paper’ 14 (Strasser, 2014) Examples: • Standalone data journals: Nature Scientific Data, Geoscience Data Journal, Ecological Archives • Journals that publish data papers: GigaScience, F1000 Research, Internet Archaeology
  • 14. 15 What does a DCO data publication look like?
  • 15. • Data Citation Index – Indexes the world's leading data repositories – Records for the datasets are connected to related peer- reviewed literature indexed in the Web of Science™ – Allow researchers to efficiently access to data across subjects and regions 16
  • 17. A good example • OneGeology 18 • Web-accessible geologic map data worldwide (scale ~1:1 million) • Stimulate a rapid increase in interoperability (i.e. disseminate GeoSciML and vocabularies further and faster) • 120 participating countries (July 2014)
  • 19. 20 Wyoming Colorado More challenges are still to be addressed http://mrdata.usgs.gov/
  • 20. 21 Earth Resource Form Environmental Impact Value Exploration Activity Type Exploration Result UNFC Value Earth Resource Expression Earth Resource Shape Enduse Potential Mineral Occurrence Type Mining Activity Type Processing Activity Type Mining Waste Type Value Commodity Code Mineral Deposit Group Mineral Deposit Type Product Value A list of recently finished vocabularies CGI Geoscience Terminology Workgroup • Construct a collection of vocabularies for populating information interchange documents and enabling interoperability • Provide labels for concepts, scope to various communities defined by language, science domain, or application domain
  • 21. 22 Another major effort... And there is a vocabulary created by the CGI Geoscience Terminology Workgroup!
  • 22. Golden Spike information portal http://geotime.tw.rpi.edu/ 23 Golden spike - Global Boundary Stratotype Section and Point (GSSP)
  • 24. 25 (Ma et al., 2011) Interoperable: “Data should be discoverable, accessible, decodable, understandable and usable, and data sharing should be legal and ethical for all participants.”
  • 25. • Interoperability does not mean that all data should be mediated or standardized. • However, it is important that data archives are accompanied by detailed documentation, clarifying data provenance, data model, vocabularies used, etc. 26 (Ma et al., 2011)
  • 27. Provenance capture • Documenting provenance – Linking a range of observations and model outputs, research activities, people and organizations involved in the production of scientific findings with the supporting data sets and methods used to generate them. 28 Well-curated provenance information makes scientific workflows transparent and improves the credibility and trustworthiness of their outputs. It also facilitates informed and rational policy and decision-making. Image from nature.com (Ma et al., 2014)
  • 28. “Figure 1.2: Sea Level Rise: Past, Present, and Future” in the Third National Climate Assessment report draft of USA (NCA3) 29 What is the provenance of this figure?
  • 29. • Detailed caption of that figure: – Estimated, observed and possible amounts of global sea level rise from 1800 to 2100. Proxy estimates (Kemp et al. 2012) (for example, based on sediment records) are shown in red (pink band shows uncertainty), tide gauge data in blue (Church and White 2011a), and satellite observations are shown in green (Nerem et al. 2010). The future scenarios range from 0.66 feet to 6.6 feet in 2100 (Parris et al. 2012). Higher or lower amounts of sea level rise are considered implausible, as represented by the gray shading. The orange line at right shows the currently projected range of sea level rise of 1 to 4 feet by 2100, which falls within the larger risk- based scenario range. The large projected range reflects uncertainty about how glaciers and ice sheets will react to the warming ocean, the warming atmosphere, and changing winds and currents. As seen in the observations, there are year-to- year variations in the trend. (Figure source: Josh Willis, NASA Jet Propulsion Laboratory) 30 As a case study, let’s trace the provenance of this paper.
  • 30. Provenance tracing of NASA contributions to Figure 1.2 in draft NCA3 Here only the details of Topex-Poseidon mission are shown Here only the details of one paper (i.e., “paper/103”) cited by that figure are shown (a) Instances of calibration, model and software underpinning “paper/103” (b) Instances of sensor, instrument and platform underpinning that paper 31
  • 31.
  • 32.
  • 33. 34
  • 35. Era of Science 2.0 36 Practice
  • 36. • Science 2.0 – New practices of scientists who post raw experimental results, nascent theories, claims of discovery and draft papers on the Web for others to see and comment on. – Proponents say these “open access” practices make scientific progress more collaborative and therefore more productive. – Critics say scientists who put preliminary findings online risk having others copy or exploit the work to gain credit or even patents. 37 (Waldrop, 2008)
  • 37. 38
  • 38. • Social scholarship: Reconsidering scholarly practices in the age of social media – Polled 1,600 US and Canadian faculty members – Found that 15% use Twitter, 28% use YouTube and 39% use Facebook for scholarly activity 39 (Greenhow and Gleason, 2014) Using social media more often would help scientists to disseminate their results, debate findings and engage a wider audience Researchers must learn to create a robust online presence Social-media metrics to be added to the tenure process
  • 39. • Altmetrics – A very broad group of metrics, capturing various parts of impact a paper or work can have. 40 (Lin and Fenner, 2013) The ImpactStory Altermetrics Classifications
  • 40. • altmetric.com – already a product used by NPG, Springer, etc. 41 This Altmetric score means that the article is: • in the 99 percentile (ranked 181st) of the 81,582 tracked articles of a similar age in all journals • in the 93 percentile (ranked 69th) of the 992 tracked articles of a similar age in Nature http://www.nature.com/nature/journal/v497/n7449/nature12127/metrics

Notes de l'éditeur

  1. We saw this figure at the beginning of this presentation. So, now what we can do with the provenance tracing?
  2. (a) Instances of calibration, model and software underpinning “paper/103” and (b) Instances of sensor, instrument and platform underpinning that paper.
  3. (a) Instances of calibration, model and software underpinning “paper/103” and (b) Instances of sensor, instrument and platform underpinning that paper.