SlideShare une entreprise Scribd logo
1  sur  58
Télécharger pour lire hors ligne
INDEX
1.  Develop a data management plan
2.  Publish & archive biodiversity data
3.  Use persistent data identifiers
4.  Use data standards
5.  Ecological Metadata Language (EML)
6.  Data paper & data citation
EXPONENTIAL GROWTH OF DIGITAL DATA
Source: EMC/IDC Digital Universe Study, 2012
The	digital	universe	will	double	every	two	years	between	now	and	2020.	The	growth	is	mostly	unstructured	data.	
A	major	factor	behind	the	expansion	is	the	growth	of	machine	generated	data	(from	11%	in	2005	to	over	40%	in	2020).
DATA EXPLOSION
Source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
•  More and more data is being
created
•  Issue is not creating data, but
being able to navigate and
use it
•  Data management is critical
to make sure data are well-
organised, understandable
and reusable
Source: OpenAIRE, 2013
DATA LOSS
Digital data are fragile and susceptible
to loss for a wide variety of reasons:
•  Natural disaster
•  Facilities infrastructure failure
•  Storage failure
•  Server hardware/software failure
•  Application software failure
•  Format obsolescence
•  Legal encumbrance
•  Human error
•  Malicious attack
•  Loss of staffing competencies
•  Loss of institutional commitment
•  Loss of financial stability
•  Changes in user expectations
Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013 Image CC BY-NC-SA 2.0 by Dave Hill https://www.flickr.com/photos/dmh650/4031607067
WHERE TO STORE DATA?
Your own drive (PC, server, flash drive, etc.)
–  And if you lose it? Or it breaks?
Somebody else’s drive / departmental drive
“Cloud” drive
–  Do they care as much about your data as you do?
Large scale infrastructure services like Data
One or EUDAT
3... 2... 1... backup!
–  at least 3 copies of a file
–  on at least 2 different media
–  with at least 1 offsite
Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
BACKUP AND PRESERVATION
– NOT THE SAME THING!
Backups
–  Periodic snapshots of data in case the current
version is destroyed or lost
–  Backups are copies of files stored for short or near-
long-term
–  Often performed on a somewhat frequent schedule
Archiving
–  Preserve data for historical reference
–  Usually the final version, stored for long-term, and
generally not copied over
–  Often performed at the end of a project or during
major milestones
Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
FAIR DATA
Findable
–  assign persistent IDs, provide rich metadata, register
in a searchable resource... (such as GBIF)
Accessible
–  Retrievable by their ID using a standard protocol,
metadata remain accessible even if data aren’t...
Interoperable
–  Use formal, broadly applicable languages, use
standard vocabularies, qualified references... (e.g.
Darwin Core, …)
Reusable
–  Rich, accurate metadata, clear licences, provenance,
use of community standards... (e.g. Dublin Core,
EML, …)
www.force11.org/group/fairgroup/fairprinciples
Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
Develop a data
management plan
DATA MANAGEMENT PLAN
•  Making your data available to others ensures
that your research is truly reproducible.
•  Managing your research data saves you time
because it ensures that you and others in your
collaboration will be able to find, understand, and
use the data.
•  Sharing your research data enables wider
dissemination of your work.
•  Enabling others to use your data reinforces open
scientific inquiry and can lead to new and
unanticipated discoveries.
Source: Georgia Tech Library
Graphics	by	Jørgen	Stamp	CC-BY
DATA MANAGEMENT PLAN
•  Description of data to be produced or collected, including
data standards or formats. [e.g. Darwin Core & Darwin Core archive]
•  Identification of protocols or workflows to help manage
data throughout the project.
•  Description of documentation and metadata standards to
describe the data. [e.g. Ecological Metadata Language, EML]
•  Plan for short-term data storage & backup, including
necessary security measures.
•  Plan for sharing data, including legal and ethical issues,
intellectual property issues, or access policies and provisions.
•  Plan for data preservation, archiving and long-term access.
•  Plan for allocating responsibility for data management
within your research project.
Source: Georgia Tech Library
DATA MANAGEMENT PLAN
Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
A DMP is a brief plan to define:
•  how the data will be created?
•  how it will be documented?
•  who will access it?
•  where it will be stored?
•  who will back it up?
•  whether (and how) it will be shared & preserved?
DMPs are often submitted as part of grant applications, but are useful whenever
researchers are creating data.
Publish & archive
your research data!
Graphic	by	Jørgen	Stamp	CC-BY
Many	biodiversity	research	data	are	
“hidden”	in	reports	and	documents	
produced	by	universi9es,	research	
ins9tutes,	public	agencies	and	the	
university	museums.	
	
Publish	your	biodiversity	data!	
Photo	by:	Niklas	Bildhauer
PUBLISH	AND	ARCHIVE	YOUR	
OWN	BIODIVERSITY	DATA	
•  The	GBIF	helpdesk	will	assist		with	
publishing	your	species	occurrence	data:	
send	an	email	to	helpdesk@gbif.org		
•  You	can	also	install	a	data	publishing	
soLware	such	as	the	GBIF	Integrated	
Publishing	Toolkit	(IPT).	
•  Ci9zen	Science	portals	such	as	iNaturalist,	…	
•  We	recommend	using	a	long-term	data	
archiving	plaRorm	such	as	Data	ONE,	Dryad,	
FigShare,	etc…
ONLINE DATA ARCHIVING REPOSITORY
Rather than leaving your research data on a local server
or in cloud storage, archive your data with a trusted
digital repository. Many repositories create metadata
and documentation to ensure that the data will be
discoverable in the future.
DATA ONE
Source: GBIF News story, September 2014, DataONE: http://www.gbif.org/page/8199
GBIF and the Data Observation Network for Earth (DataONE)
collaborate to support long-term persistence of the biodiversity
data shared through the GBIF network. The collaboration is part
of a commitment in the GBIF Work Programme to explore data
archival services offering redundancy to handle scenarios such
as technical failure and the disappearance of projects or
institutions that share data through the network.
Source: GBIF News story, September 2014, DataONE: http://www.gbif.org/page/8199
ZENODO.ORG
For all content types!
With GitHub integration!
Upload	 Describe	 Publish	
Create communities!
Slide	source:		
OpenAIRE	&	EUDAT,		
CC-BY-4.0,	2013
Use data
standards
METADATA STANDARDS
Use relevant standards for interoperability
Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
http://rd-alliance.github.io/metadata-directory
STRUCTURED DATA & METADATA
Slide modified from: IDigBio workshop 2015
Unstructured	 Structured
DATA STANDARDS
Slide source: GB23 Nodes Madagascar October 2015 & iDigBio Florida January 2015 - http://www.tdwg.org/standards/
ABCD Access to Biological Collection
Data (2005)
DwC Darwin Core (2009)
AC Audubon Core Multimedia Resources
Metadata Schema (2013)
NCD Natural Collection Descriptions
(Draft 2008)
EML Ecological Metadata Language
(Ecological Society of America)
ECOLOGICAL METADATA LANGUAGE - EML
•  Developed for ecologists by ecologists
•  Lead by NCEAS, DataONE
•  XML schema for the structural expression of metadata
•  Basic subset of EML used by GBIF, « eml-gbif-profile »
http://www.gbif.org/resources/2559
http://rs.gbif.org/schema/eml-gbif-profile/
Slide modified from iDigBio workshop 2015
Persistent identifiers
The	purpose	of	iden9fiers	
	
						…is	to	name	things,		
						making	it	possible	to	refer	to	them.	
	
	
“Each	iden^fier	refers	to	one	and	only	one	thing”	(Coyle	2006).	
“A	assosia1on	between	a	string	and	a	thing”	(Kunze	2003).	
“A	stated	associa1on	between	a	symbol	and	a	thing;	that	the	symbol	may	be	used	
to	unambiguously	refer	to	the	thing	within	a	given	context”	(Campbell	2007).
Many	things	(in	GBIF)	are	named	123	
Catalog	number:	123	
GBIF	ID:	543392241	
urn:catalog:CAS:BOT:123	
Bigelowia	juncea	
Catalog	number:	123	
GBIF	ID:	1030591721	
UAMb:Herb:123	
Sphagnum	girgensohnii	
Catalog	number:	123	
GBIF	ID:	893477175	
Parides	erithalion	
Catalog	number:	123	
GBIF	ID:	1050327334	
Cinchona	ledgeriana	 Catalog	number:	123	
GBIF	ID:	931031820	
Bromus	kalmii	
Catalog	number:	123	
GBIF	ID:	283363	
urn:occurrence:Arctos:MVZ:Egg:123:164	
Mercurialis	ovata	
Catalog	number:	123	
GBIF	ID:	231564351	
Umbrina	canariensis	
Catalog	number:	123	
GBIF	ID:	896547722	
urn:occurrence:Arctos:MVZ:Egg:123:164	
Contopus	sordidulus	veliei	
NAME AMBIGUITY:
HTTP – PURL – UUID
http://purl.org/nhmuio/id/41d9cbb4-4590-4265-8079-ca44d46d27c3
Including	machine	
readable	formats
Dataset metadata
WHAT IS
METADATA?
Image CC-BY ‘Metadata is a love note to the future’ by
Cea+ www.flickr.com/photos/ centralasian/8071729256
Commonly defined as ‘data about data’, metadata
helps to make data findable and understandable
Metadata can be:
Descriptive: information about the content and context
of the data
Structural: information about the structure of the data
Administrative: information about the file type, rights
management and preservation processes
WHAT IS METADATA?
Slide	source:	CC-BY	EUDAT,	2015
METADATA CATALOG
Image CC-BY ‘University of Michigan Library Card Catalog’ by David Fulmer www.flickr.com/photos/annarbor/4350629792
Comprehensive metadata will:
Facilitate data discovery
Help users determine the applicability of the data
Enable interpretation and reuse
Allow any limitations to be understood
Clarify ownership and restrictions on reuse
Offer permanence as it transcends people and time
Provide interoperability
WHY USE METADATA?
Slide	source:	CC-BY	EUDAT,	2015
METADATA AND DOCUMENTATION
Think about what will be needed in order to find, evaluate,
understand, and reuse the data.
  Have you documented what you did and how?
  Did you develop code to run analyses? If so, this should be
kept and shared too.
  Is it clear what each bit of your dataset means? Make sure
the units are labelled and abbreviations explained.
  Record all the information needed for you and others to
understand the data in the future
Slide	source:	CC-BY	EUDAT,	2015
INFORMATION ENTROPY
The	Loss	of	Informa^on	about	Data	(Metadata)	Over	Time,		Michener	et	al,	1997	
Slide	source:	CC-BY	EUDAT,	2015
Create metadata at the time of data creation
Information will be forgotten and there won’t be time
or effort left to capture it later.
Metadata benefits from quality control at an early
stage too.
TIME MATTERS!
Image CC-BY-SA ‘egg timer – hour glass running out’ by
OpenDemocracy www.flickr.com/photos/opendemocracy/523438942
Slide	source:	CC-BY	EUDAT,	2015
DATASET TITLES
Titles are critical in helping readers find your data
–  While individuals are searching for the most appropriate data
sets, they are most likely going to use the title as the first
criteria to determine if a dataset meets their needs.
–  Treat the title as the opportunity to sell your dataset.
A complete title includes: What, Where, When, Who, and
Scale
An informative title includes: topic, timeliness of the data,
specific information about place and geography
Slide	source:	CC-BY	EUDAT,	2015
WHICH IS THE BETTER TITLE?
Rivers
OR
Greater Yellowstone Rivers from 1:126,700 U.S. Forest
Service Visitor Maps (1961-1983)
Greater Yellowstone (where) Rivers (what) from
1:126,700 (scale) U.S. Forest Service (who) Visitor
Maps (1961-1983) (when)
Slide	source:	CC-BY	EUDAT,	2015
WRITE FOR MACHINES, NOT JUST HUMANS
Remember: a computer will read your metadata
Do not use symbols that could be misinterpreted:
Examples: ! @ # % { } | /  < > ~
Don’t use tabs, indents, or line feeds/carriage
returns
When copying and pasting from other sources,
use a text editor (e.g., Notepad) to eliminate
hidden characters
Slide	source:	CC-BY	EUDAT,	2015
Data paper
METADATA CATEGORIES
Dataset description
Project description
People and Organizations (including roles)
Coverage
•  Taxonomic coverage
•  Geographic coverage
•  Temporal coverage
Methods
Intellectual property rights, licensing
Keywords
Peer	review	op^on	for	biodiversity	datasets.	
Authors	get	scien^fic	credit	for	data	publica^on.	
Mee^ng	concerns	over	data	quality.	
Mee^ng	concerns	over	data	cita9on	mechanism.	
	
hlp://www.gbif.org/publishingdata/datapapers
Slide source: Alberto González-Talavá, 2015
RATIONALE FOR DATA PUBLISHING
IndFauna, electronic catalogue of known Indian fauna
1
IndFauna, electronic catalogue of known Indian fauna
Jitendra Gaikwad1
, Rebecca James2
, Monica Peterson3
, David Robertson4
,Tom Griswold5
, S. Krishnan1
1 National Chemical Laboratory, 411007, Pune, India 2 Bulgarian Academy of Sciences, 2300, Sofia, Bulga-
ria 3 National Natural History Museum, 1722, Leiden, The Netherlands 4 1988 ½ South Shenandoah Street,
3041, Los Angeles, USA 5 California Academy of Sciences, 1111, San Francisco, USA
Corresponding author: Jitendra Gaikwad (jgaikwad@ncl.res.in), Monica Peterson (mpeterson@nnhm.nl)
Academic editor: ............................  |  Received 6 June 2010  |  Accepted 15 July 2010  |  Published 29 July 2010Citation: Gaikwad J, James R, Peterson M, Robertson D, Griswold T, Krishnan S (2010) IndFauna, electronic catalogue
of known Indian fauna. ZooKeys xx: xx-xx. doi: 10.3897/zookeys.xx.xxx
Abstract
This article describes the development and features of IndFauna, electronic catalogue of known Indian
fauna. Accessible at http://www.ncbi.org.in, this catalogue raises several issues concerned with taxonomy
or systematics and information technology in biodiversity information management. Baseline informa-
tion on more than 93% of the 90,000 known faunal species in India has been documented in IndFauna,
which demonstrates a model of collaboration between domain experts and IT managers. It is our belief
that such ECATs would be effective in overcoming taxonomic impediments as well as better sustainable
use and conservation of our biotic resources.
Keywords
Biodiversity informatics, IndFauna, data publishing, electronic catalogue
Taxonomic coverage
General taxonomic coverage description: The coverage of this database spans wholeof Kingdom Animalia. Database collates occurrences of over 90000 species belongingto 2222 genus.
Taxonomic ranks: Kingdom: Animalia, Phylum: Acanthocephala, Annelida,Arthropoda, Mollusca, Chordata, Rotifera, Class: Amphibia, Aves, Chondrichthyes,Mammalia, Reptalia, Order: Monotremata, Anura, Caudata, Gymnophiona, Family:
ZooKeys xx: x-xx (2010)
doi: 10.3897/zookeys.xx.xxx
www.pensoftonline.net/zookeys
Copyright Jitendra Gaikwad et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which
permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Launched to accelerate biodiversity research
A peer-reviewed open-access journal
DATA PAPER
•  Promote and publicize the
existence of the data
•  Provide scholarly credit to data
publishers through citable journal
publications
•  Describe the data in a structured
human-readable form
Data	Paper	
A	scholarly	publica^on	of	searchable	
metadata	document	describing	a	dataset,	
or	a	group	of	datasets.
Data citation
RATIONALE FOR DATA PUBLISHING: CITATION &
USAGE
Slide source: Alberto González-Talaván, 2015
“We believe that the lack of incentive similar to the impact
factor for scholarly publication remains a major impediment
to the provision of free and open access to biodiversity data”	
GBIF Data Publishing Framework Task Group
“Data citation standards can form the basis for increased
incentives, recognition, and rewards for scientific data
activities. Unfortunately, such standards and good practices
are lacking”
CODATA Data Citation Task Group
A REPRODUCIBILITY CRISIS
Slide	source:	OpenAIRE	&	EUDAT,	CC-BY-4.0,	2013
WHY MANAGE DATA?
Make your research easier
Stop yourself drowning in irrelevant stuff
Save data for later
Avoid accusations of fraud or bad science
Share your data for re-use
Get credit for it
Meet funder/institution requirements
Because well-managed data opens up
opportunities for re-use, sharing and makes for
better science!
Slide	source:	OpenAIRE	&	EUDAT,	CC-BY-4.0,	2013
H2020 - OPEN DATA BY DEFAULT FROM 2017
Slide	source:	OpenAIRE	&	EUDAT,	CC-BY-4.0,	2013
CONCERNS ABOUT DATA SHARING
Concern Solution
inappropriate use due to
misunderstanding of research
purpose or parameters
security and confidentiality of
sensitive data
lack of acknowledgement / credit
loss of advantage when
competing for research funding
	 	 	metadata	
	 	 	metadata	
	 	 	metadata	
	 	 	metadata	
Slide	source:	OpenAIRE	&	EUDAT,	CC-BY-4.0,	2013
CONCERNS ABOUT DATA SHARING
Concern Solution
inappropriate use due to
misunderstanding of research
purpose or parameters
provide rich Abstract, Purpose,
Use Constraints and Supplemental
Information where needed
security and confidentiality of
sensitive data
•  the metadata does NOT
contain the data
•  Use Constraints specify who
may access the data and how
lack of acknowledgement / credit
specify a required data citation
within the Use Constraints
loss of data insight and
competitive advantage when
vying for research funding
create second, public version with
generalised Data Processing
Description
Slide	source:	OpenAIRE	&	EUDAT,	CC-BY-4.0,	2013
MAKE DATA SHAREABLE
Create robust metadata that has been checked
Include reference information in metadata e.g. unique IDs & properly
formatted data citations
Publish your metadata so it’s discoverable. Use portals, clearing houses,
online resources…
Package up the data and associated metadata to deposit in repositories
License the data clearly
Slide	source:	OpenAIRE	&	EUDAT,	CC-BY-4.0,	2013
www.dcc.ac.uk/resources/how-guides/license-research-data
LICENSING RESEARCH DATA
This DCC guide outlines the pros and
cons of each approach and gives
practical advice on how to
implement your licence
CREATIVE COMMONS LIMITATIONS
NC Non-Commercial
What counts as commercial?
ND No Derivatives
Severely restricts use
These clauses are not open licenses
Horizon 2020 Open Access
guidelines point to:
or
Slide	source:	OpenAIRE	&	EUDAT,	CC-BY-4.0,	2013
LICENSING FOR DATA PUBLISHED THROUGH GBIF
GBIF Governing Board established in 2014 GBIF support for three licenses:
•  CC0, under which data are made available for any use
without restriction or particular requirements on the part of
users CC0 is strongly recommended whenever possible
•  CC-BY, under which data are made available for any use
provided that attribution is appropriately given for the sources
of data used, in the manner specified by the owner
•  CC-BY-NC, under which data are made available for any use
provided that attribution is appropriately given and provided
the use is not for commercial purposes
http://www.gbif.org/terms/licences
WHAT TO PRESERVE & SHARE
It’s not possible to keep everything.
Select based on:
•  What has to be kept e.g. data underlying publications
•  What can’t be recreated e.g. environmental recordings
•  What is potentially useful to others
•  What has scientific, cultural or historical value
•  What legally must be destroyed
How to select and appraise research data:
www.dcc.ac.uk/resources/how-guides/appraise-select-research-data
Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
SLIDE CREDIT – GB23, EUDAT & OPENAIRE
GB23 Nodes Madagascar, October 2015
•  http://community.gbif.org/pg/pages/view/47903/
Norwegian GBIF data publishing workshop in Trondheim, October 2015,
•  http://www.gbif.no/events/2015/data-publishing-workshop-october-2015.html
Slide credits: EUDAT/OpenAire, December 2015 & May 2016, CC-BY-4.0
•  http://www.slideshare.net/EUDAT/eudat-research-data-management
•  http://www.slideshare.net/EUDAT/research-data-management-introduction-eudatopen-aire-webinar?ref=https://
eudat.eu/events/webinar/research-data-management-an-introductory-webinar-from-openaire-and-eudat
•  https://eudat.eu/events/webinar/research-data-management-an-introductory-webinar-from-openaire-and-eudat
•  http://www.instantpresenter.com/WebConference/RecordingDefault.aspx?c_psrid=EB57D6888147
Slide credits: EUDAT & OpenAire
GBIF BIFA mentoring, Day 5a Data management, July 2016

Contenu connexe

Tendances

Global Biodiversity Information Facility (GBIF) - 2012
Global Biodiversity Information Facility (GBIF) - 2012Global Biodiversity Information Facility (GBIF) - 2012
Global Biodiversity Information Facility (GBIF) - 2012Dag Endresen
 
TDWG and GBIF, at European genbank network meeting (Bonn, April 2004)
TDWG and GBIF, at European genbank network meeting (Bonn, April 2004)TDWG and GBIF, at European genbank network meeting (Bonn, April 2004)
TDWG and GBIF, at European genbank network meeting (Bonn, April 2004)Dag Endresen
 
GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016Dag Endresen
 
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)Dag Endresen
 
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...Dag Endresen
 
Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)Dag Endresen
 
The Global Biodiversity Information Facility and Africa Rising
The Global Biodiversity Information Facility and Africa RisingThe Global Biodiversity Information Facility and Africa Rising
The Global Biodiversity Information Facility and Africa RisingFatima Parker-Allie
 
Darwin Core extension for germplasm (11th December 2013)
Darwin Core extension for germplasm (11th December 2013)Darwin Core extension for germplasm (11th December 2013)
Darwin Core extension for germplasm (11th December 2013)Dag Endresen
 
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14Dag Endresen
 
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14GBIF data publishing. GBIF seminar in Bergen. 2016-12-14
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14Dag Endresen
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18Dag Endresen
 
Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Dag Endresen
 
Prototype Crop Wild Relatives Portal, at the IMC Meeting (2007)
Prototype Crop Wild Relatives Portal, at the IMC Meeting (2007)Prototype Crop Wild Relatives Portal, at the IMC Meeting (2007)
Prototype Crop Wild Relatives Portal, at the IMC Meeting (2007)Dag Endresen
 
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016Dag Endresen
 
Trait data mining using FIGS (2006)
Trait data mining using FIGS (2006)Trait data mining using FIGS (2006)
Trait data mining using FIGS (2006)Dag Endresen
 
GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)Dag Endresen
 
Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeVince Smith
 
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...Dag Endresen
 
GBIF and Open Science
GBIF and Open ScienceGBIF and Open Science
GBIF and Open ScienceDag Endresen
 

Tendances (20)

Global Biodiversity Information Facility (GBIF) - 2012
Global Biodiversity Information Facility (GBIF) - 2012Global Biodiversity Information Facility (GBIF) - 2012
Global Biodiversity Information Facility (GBIF) - 2012
 
TDWG and GBIF, at European genbank network meeting (Bonn, April 2004)
TDWG and GBIF, at European genbank network meeting (Bonn, April 2004)TDWG and GBIF, at European genbank network meeting (Bonn, April 2004)
TDWG and GBIF, at European genbank network meeting (Bonn, April 2004)
 
GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016
 
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
 
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
 
Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)
 
The Global Biodiversity Information Facility and Africa Rising
The Global Biodiversity Information Facility and Africa RisingThe Global Biodiversity Information Facility and Africa Rising
The Global Biodiversity Information Facility and Africa Rising
 
Darwin Core extension for germplasm (11th December 2013)
Darwin Core extension for germplasm (11th December 2013)Darwin Core extension for germplasm (11th December 2013)
Darwin Core extension for germplasm (11th December 2013)
 
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14
Introduction to GBIF. GBIF seminar in Bergen. 2016-12-14
 
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14GBIF data publishing. GBIF seminar in Bergen. 2016-12-14
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18
 
Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)
 
Prototype Crop Wild Relatives Portal, at the IMC Meeting (2007)
Prototype Crop Wild Relatives Portal, at the IMC Meeting (2007)Prototype Crop Wild Relatives Portal, at the IMC Meeting (2007)
Prototype Crop Wild Relatives Portal, at the IMC Meeting (2007)
 
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
 
Trait data mining using FIGS (2006)
Trait data mining using FIGS (2006)Trait data mining using FIGS (2006)
Trait data mining using FIGS (2006)
 
GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)
 
Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics Landscape
 
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
 
GBIF and Open Science
GBIF and Open ScienceGBIF and Open Science
GBIF and Open Science
 

En vedette

Informatica tics saula katy trabajo n1
Informatica tics saula katy trabajo n1Informatica tics saula katy trabajo n1
Informatica tics saula katy trabajo n1Katy Saula
 
Cbn responsable administratif-2
Cbn responsable administratif-2Cbn responsable administratif-2
Cbn responsable administratif-2reefreef
 
Grantwriting presentation for Puppetmongers
Grantwriting presentation for Puppetmongers Grantwriting presentation for Puppetmongers
Grantwriting presentation for Puppetmongers Arts Cubed
 
Communication Colloque scientifique PGV : Prendre le virage compétences... Ba...
Communication Colloque scientifique PGV : Prendre le virage compétences... Ba...Communication Colloque scientifique PGV : Prendre le virage compétences... Ba...
Communication Colloque scientifique PGV : Prendre le virage compétences... Ba...Eric Giraudin
 
Office 365 Dev PnP & PowerShell : exploitez enfin le potentiel de votre écosy...
Office 365 Dev PnP & PowerShell : exploitez enfin le potentiel de votre écosy...Office 365 Dev PnP & PowerShell : exploitez enfin le potentiel de votre écosy...
Office 365 Dev PnP & PowerShell : exploitez enfin le potentiel de votre écosy...Microsoft
 
Pourquoi le collaboratif bouleverse le management ? 
Pourquoi le collaboratif bouleverse le management ? Pourquoi le collaboratif bouleverse le management ? 
Pourquoi le collaboratif bouleverse le management ? Microsoft Ideas
 
7 habits that kill dialogue
7 habits that kill dialogue7 habits that kill dialogue
7 habits that kill dialogueCraig Daniels
 
Fondamentaux de la gestion de projet (cours 2)
Fondamentaux de la gestion de projet (cours 2)Fondamentaux de la gestion de projet (cours 2)
Fondamentaux de la gestion de projet (cours 2)Françoise Gouzi
 
How l spent my summer
How l spent my summerHow l spent my summer
How l spent my summerVadim Belyaev
 

En vedette (12)

Informatica tics saula katy trabajo n1
Informatica tics saula katy trabajo n1Informatica tics saula katy trabajo n1
Informatica tics saula katy trabajo n1
 
Cbn responsable administratif-2
Cbn responsable administratif-2Cbn responsable administratif-2
Cbn responsable administratif-2
 
Notice this moment
Notice this momentNotice this moment
Notice this moment
 
Grantwriting presentation for Puppetmongers
Grantwriting presentation for Puppetmongers Grantwriting presentation for Puppetmongers
Grantwriting presentation for Puppetmongers
 
Communication Colloque scientifique PGV : Prendre le virage compétences... Ba...
Communication Colloque scientifique PGV : Prendre le virage compétences... Ba...Communication Colloque scientifique PGV : Prendre le virage compétences... Ba...
Communication Colloque scientifique PGV : Prendre le virage compétences... Ba...
 
Office 365 Dev PnP & PowerShell : exploitez enfin le potentiel de votre écosy...
Office 365 Dev PnP & PowerShell : exploitez enfin le potentiel de votre écosy...Office 365 Dev PnP & PowerShell : exploitez enfin le potentiel de votre écosy...
Office 365 Dev PnP & PowerShell : exploitez enfin le potentiel de votre écosy...
 
Pourquoi le collaboratif bouleverse le management ? 
Pourquoi le collaboratif bouleverse le management ? Pourquoi le collaboratif bouleverse le management ? 
Pourquoi le collaboratif bouleverse le management ? 
 
Trustee Conference PM3: Fundraising regulation and data protection: How can c...
Trustee Conference PM3: Fundraising regulation and data protection: How can c...Trustee Conference PM3: Fundraising regulation and data protection: How can c...
Trustee Conference PM3: Fundraising regulation and data protection: How can c...
 
CV Juhani Polvi
CV Juhani PolviCV Juhani Polvi
CV Juhani Polvi
 
7 habits that kill dialogue
7 habits that kill dialogue7 habits that kill dialogue
7 habits that kill dialogue
 
Fondamentaux de la gestion de projet (cours 2)
Fondamentaux de la gestion de projet (cours 2)Fondamentaux de la gestion de projet (cours 2)
Fondamentaux de la gestion de projet (cours 2)
 
How l spent my summer
How l spent my summerHow l spent my summer
How l spent my summer
 

Similaire à GBIF BIFA mentoring, Day 5a Data management, July 2016

e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and LibariesRob Grim
 
Using e-Infrastructures for Biodiversity Conservation
Using e-Infrastructures for Biodiversity ConservationUsing e-Infrastructures for Biodiversity Conservation
Using e-Infrastructures for Biodiversity ConservationBlue BRIDGE
 
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...IFLAAcademicandResea
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Sarah Anna Stewart
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATTony Ross-Hellauer
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATOpenAIRE
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | EUDAT
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
 
Data hosting infrastructure for primary biodiversity data
Data hosting infrastructure for primary biodiversity dataData hosting infrastructure for primary biodiversity data
Data hosting infrastructure for primary biodiversity dataPhil Cryer
 
Impact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationImpact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationMANENDRASINGH30
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012IUPUI
 
Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
 
I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17Tom Nyongesa
 
Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012IUPUI
 

Similaire à GBIF BIFA mentoring, Day 5a Data management, July 2016 (20)

e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and Libaries
 
Using e-Infrastructures for Biodiversity Conservation
Using e-Infrastructures for Biodiversity ConservationUsing e-Infrastructures for Biodiversity Conservation
Using e-Infrastructures for Biodiversity Conservation
 
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
Data hosting infrastructure for primary biodiversity data
Data hosting infrastructure for primary biodiversity dataData hosting infrastructure for primary biodiversity data
Data hosting infrastructure for primary biodiversity data
 
Impact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationImpact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and Education
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012
 
Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...
 
EDI Training Module 2: EDI Project
EDI Training Module 2:  EDI ProjectEDI Training Module 2:  EDI Project
EDI Training Module 2: EDI Project
 
I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17
 
Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012
 
Getaneh Alemu
Getaneh AlemuGetaneh Alemu
Getaneh Alemu
 
What is a DMP
What is a DMPWhat is a DMP
What is a DMP
 
Data management
Data management Data management
Data management
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 

Plus de Dag Endresen

Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...Dag Endresen
 
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...Dag Endresen
 
Ontologies for biodiversity informatics, UiO DSC June 2023
 Ontologies for biodiversity informatics, UiO DSC June 2023 Ontologies for biodiversity informatics, UiO DSC June 2023
Ontologies for biodiversity informatics, UiO DSC June 2023Dag Endresen
 
Evacuation of the Kherson herbarium
Evacuation of the Kherson herbariumEvacuation of the Kherson herbarium
Evacuation of the Kherson herbariumDag Endresen
 
2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC RomeDag Endresen
 
BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24Dag Endresen
 
Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26Dag Endresen
 
BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022Dag Endresen
 
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20Dag Endresen
 
GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03Dag Endresen
 
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...Dag Endresen
 
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19Dag Endresen
 
GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021Dag Endresen
 
2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portalsDag Endresen
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)Dag Endresen
 
FAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementFAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementDag Endresen
 
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...Dag Endresen
 
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019Dag Endresen
 
Museum collections as research data - October 2019
Museum collections as research data - October 2019Museum collections as research data - October 2019
Museum collections as research data - October 2019Dag Endresen
 
Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Dag Endresen
 

Plus de Dag Endresen (20)

Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
 
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
 
Ontologies for biodiversity informatics, UiO DSC June 2023
 Ontologies for biodiversity informatics, UiO DSC June 2023 Ontologies for biodiversity informatics, UiO DSC June 2023
Ontologies for biodiversity informatics, UiO DSC June 2023
 
Evacuation of the Kherson herbarium
Evacuation of the Kherson herbariumEvacuation of the Kherson herbarium
Evacuation of the Kherson herbarium
 
2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome
 
BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24
 
Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26
 
BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022
 
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
 
GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03
 
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
 
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
 
GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021
 
2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)
 
FAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementFAIR and open biodiversity collection data management
FAIR and open biodiversity collection data management
 
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
 
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
 
Museum collections as research data - October 2019
Museum collections as research data - October 2019Museum collections as research data - October 2019
Museum collections as research data - October 2019
 
Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Open science curriculum for students, June 2019
Open science curriculum for students, June 2019
 

Dernier

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfsimulationsindia
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfrahulyadav957181
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 

Dernier (20)

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdf
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 

GBIF BIFA mentoring, Day 5a Data management, July 2016

  • 1.
  • 2. INDEX 1.  Develop a data management plan 2.  Publish & archive biodiversity data 3.  Use persistent data identifiers 4.  Use data standards 5.  Ecological Metadata Language (EML) 6.  Data paper & data citation
  • 3. EXPONENTIAL GROWTH OF DIGITAL DATA Source: EMC/IDC Digital Universe Study, 2012 The digital universe will double every two years between now and 2020. The growth is mostly unstructured data. A major factor behind the expansion is the growth of machine generated data (from 11% in 2005 to over 40% in 2020).
  • 4. DATA EXPLOSION Source: OpenAIRE & EUDAT, CC-BY-4.0, 2013 •  More and more data is being created •  Issue is not creating data, but being able to navigate and use it •  Data management is critical to make sure data are well- organised, understandable and reusable Source: OpenAIRE, 2013
  • 5. DATA LOSS Digital data are fragile and susceptible to loss for a wide variety of reasons: •  Natural disaster •  Facilities infrastructure failure •  Storage failure •  Server hardware/software failure •  Application software failure •  Format obsolescence •  Legal encumbrance •  Human error •  Malicious attack •  Loss of staffing competencies •  Loss of institutional commitment •  Loss of financial stability •  Changes in user expectations Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013 Image CC BY-NC-SA 2.0 by Dave Hill https://www.flickr.com/photos/dmh650/4031607067
  • 6. WHERE TO STORE DATA? Your own drive (PC, server, flash drive, etc.) –  And if you lose it? Or it breaks? Somebody else’s drive / departmental drive “Cloud” drive –  Do they care as much about your data as you do? Large scale infrastructure services like Data One or EUDAT 3... 2... 1... backup! –  at least 3 copies of a file –  on at least 2 different media –  with at least 1 offsite Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 7. BACKUP AND PRESERVATION – NOT THE SAME THING! Backups –  Periodic snapshots of data in case the current version is destroyed or lost –  Backups are copies of files stored for short or near- long-term –  Often performed on a somewhat frequent schedule Archiving –  Preserve data for historical reference –  Usually the final version, stored for long-term, and generally not copied over –  Often performed at the end of a project or during major milestones Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 8. FAIR DATA Findable –  assign persistent IDs, provide rich metadata, register in a searchable resource... (such as GBIF) Accessible –  Retrievable by their ID using a standard protocol, metadata remain accessible even if data aren’t... Interoperable –  Use formal, broadly applicable languages, use standard vocabularies, qualified references... (e.g. Darwin Core, …) Reusable –  Rich, accurate metadata, clear licences, provenance, use of community standards... (e.g. Dublin Core, EML, …) www.force11.org/group/fairgroup/fairprinciples Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 10. DATA MANAGEMENT PLAN •  Making your data available to others ensures that your research is truly reproducible. •  Managing your research data saves you time because it ensures that you and others in your collaboration will be able to find, understand, and use the data. •  Sharing your research data enables wider dissemination of your work. •  Enabling others to use your data reinforces open scientific inquiry and can lead to new and unanticipated discoveries. Source: Georgia Tech Library Graphics by Jørgen Stamp CC-BY
  • 11. DATA MANAGEMENT PLAN •  Description of data to be produced or collected, including data standards or formats. [e.g. Darwin Core & Darwin Core archive] •  Identification of protocols or workflows to help manage data throughout the project. •  Description of documentation and metadata standards to describe the data. [e.g. Ecological Metadata Language, EML] •  Plan for short-term data storage & backup, including necessary security measures. •  Plan for sharing data, including legal and ethical issues, intellectual property issues, or access policies and provisions. •  Plan for data preservation, archiving and long-term access. •  Plan for allocating responsibility for data management within your research project. Source: Georgia Tech Library
  • 12. DATA MANAGEMENT PLAN Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013 A DMP is a brief plan to define: •  how the data will be created? •  how it will be documented? •  who will access it? •  where it will be stored? •  who will back it up? •  whether (and how) it will be shared & preserved? DMPs are often submitted as part of grant applications, but are useful whenever researchers are creating data.
  • 13. Publish & archive your research data! Graphic by Jørgen Stamp CC-BY
  • 16. ONLINE DATA ARCHIVING REPOSITORY Rather than leaving your research data on a local server or in cloud storage, archive your data with a trusted digital repository. Many repositories create metadata and documentation to ensure that the data will be discoverable in the future.
  • 17. DATA ONE Source: GBIF News story, September 2014, DataONE: http://www.gbif.org/page/8199
  • 18. GBIF and the Data Observation Network for Earth (DataONE) collaborate to support long-term persistence of the biodiversity data shared through the GBIF network. The collaboration is part of a commitment in the GBIF Work Programme to explore data archival services offering redundancy to handle scenarios such as technical failure and the disappearance of projects or institutions that share data through the network. Source: GBIF News story, September 2014, DataONE: http://www.gbif.org/page/8199
  • 19. ZENODO.ORG For all content types! With GitHub integration! Upload Describe Publish Create communities! Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 20.
  • 22. METADATA STANDARDS Use relevant standards for interoperability Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013 http://rd-alliance.github.io/metadata-directory
  • 23. STRUCTURED DATA & METADATA Slide modified from: IDigBio workshop 2015 Unstructured Structured
  • 24. DATA STANDARDS Slide source: GB23 Nodes Madagascar October 2015 & iDigBio Florida January 2015 - http://www.tdwg.org/standards/ ABCD Access to Biological Collection Data (2005) DwC Darwin Core (2009) AC Audubon Core Multimedia Resources Metadata Schema (2013) NCD Natural Collection Descriptions (Draft 2008) EML Ecological Metadata Language (Ecological Society of America)
  • 25. ECOLOGICAL METADATA LANGUAGE - EML •  Developed for ecologists by ecologists •  Lead by NCEAS, DataONE •  XML schema for the structural expression of metadata •  Basic subset of EML used by GBIF, « eml-gbif-profile » http://www.gbif.org/resources/2559 http://rs.gbif.org/schema/eml-gbif-profile/ Slide modified from iDigBio workshop 2015
  • 29. HTTP – PURL – UUID http://purl.org/nhmuio/id/41d9cbb4-4590-4265-8079-ca44d46d27c3
  • 32. WHAT IS METADATA? Image CC-BY ‘Metadata is a love note to the future’ by Cea+ www.flickr.com/photos/ centralasian/8071729256
  • 33. Commonly defined as ‘data about data’, metadata helps to make data findable and understandable Metadata can be: Descriptive: information about the content and context of the data Structural: information about the structure of the data Administrative: information about the file type, rights management and preservation processes WHAT IS METADATA? Slide source: CC-BY EUDAT, 2015
  • 34. METADATA CATALOG Image CC-BY ‘University of Michigan Library Card Catalog’ by David Fulmer www.flickr.com/photos/annarbor/4350629792
  • 35. Comprehensive metadata will: Facilitate data discovery Help users determine the applicability of the data Enable interpretation and reuse Allow any limitations to be understood Clarify ownership and restrictions on reuse Offer permanence as it transcends people and time Provide interoperability WHY USE METADATA? Slide source: CC-BY EUDAT, 2015
  • 36. METADATA AND DOCUMENTATION Think about what will be needed in order to find, evaluate, understand, and reuse the data.   Have you documented what you did and how?   Did you develop code to run analyses? If so, this should be kept and shared too.   Is it clear what each bit of your dataset means? Make sure the units are labelled and abbreviations explained.   Record all the information needed for you and others to understand the data in the future Slide source: CC-BY EUDAT, 2015
  • 38. Create metadata at the time of data creation Information will be forgotten and there won’t be time or effort left to capture it later. Metadata benefits from quality control at an early stage too. TIME MATTERS! Image CC-BY-SA ‘egg timer – hour glass running out’ by OpenDemocracy www.flickr.com/photos/opendemocracy/523438942 Slide source: CC-BY EUDAT, 2015
  • 39. DATASET TITLES Titles are critical in helping readers find your data –  While individuals are searching for the most appropriate data sets, they are most likely going to use the title as the first criteria to determine if a dataset meets their needs. –  Treat the title as the opportunity to sell your dataset. A complete title includes: What, Where, When, Who, and Scale An informative title includes: topic, timeliness of the data, specific information about place and geography Slide source: CC-BY EUDAT, 2015
  • 40. WHICH IS THE BETTER TITLE? Rivers OR Greater Yellowstone Rivers from 1:126,700 U.S. Forest Service Visitor Maps (1961-1983) Greater Yellowstone (where) Rivers (what) from 1:126,700 (scale) U.S. Forest Service (who) Visitor Maps (1961-1983) (when) Slide source: CC-BY EUDAT, 2015
  • 41. WRITE FOR MACHINES, NOT JUST HUMANS Remember: a computer will read your metadata Do not use symbols that could be misinterpreted: Examples: ! @ # % { } | / < > ~ Don’t use tabs, indents, or line feeds/carriage returns When copying and pasting from other sources, use a text editor (e.g., Notepad) to eliminate hidden characters Slide source: CC-BY EUDAT, 2015
  • 43. METADATA CATEGORIES Dataset description Project description People and Organizations (including roles) Coverage •  Taxonomic coverage •  Geographic coverage •  Temporal coverage Methods Intellectual property rights, licensing Keywords
  • 45. Slide source: Alberto González-Talavá, 2015 RATIONALE FOR DATA PUBLISHING IndFauna, electronic catalogue of known Indian fauna 1 IndFauna, electronic catalogue of known Indian fauna Jitendra Gaikwad1 , Rebecca James2 , Monica Peterson3 , David Robertson4 ,Tom Griswold5 , S. Krishnan1 1 National Chemical Laboratory, 411007, Pune, India 2 Bulgarian Academy of Sciences, 2300, Sofia, Bulga- ria 3 National Natural History Museum, 1722, Leiden, The Netherlands 4 1988 ½ South Shenandoah Street, 3041, Los Angeles, USA 5 California Academy of Sciences, 1111, San Francisco, USA Corresponding author: Jitendra Gaikwad (jgaikwad@ncl.res.in), Monica Peterson (mpeterson@nnhm.nl) Academic editor: ............................  |  Received 6 June 2010  |  Accepted 15 July 2010  |  Published 29 July 2010Citation: Gaikwad J, James R, Peterson M, Robertson D, Griswold T, Krishnan S (2010) IndFauna, electronic catalogue of known Indian fauna. ZooKeys xx: xx-xx. doi: 10.3897/zookeys.xx.xxx Abstract This article describes the development and features of IndFauna, electronic catalogue of known Indian fauna. Accessible at http://www.ncbi.org.in, this catalogue raises several issues concerned with taxonomy or systematics and information technology in biodiversity information management. Baseline informa- tion on more than 93% of the 90,000 known faunal species in India has been documented in IndFauna, which demonstrates a model of collaboration between domain experts and IT managers. It is our belief that such ECATs would be effective in overcoming taxonomic impediments as well as better sustainable use and conservation of our biotic resources. Keywords Biodiversity informatics, IndFauna, data publishing, electronic catalogue Taxonomic coverage General taxonomic coverage description: The coverage of this database spans wholeof Kingdom Animalia. Database collates occurrences of over 90000 species belongingto 2222 genus. Taxonomic ranks: Kingdom: Animalia, Phylum: Acanthocephala, Annelida,Arthropoda, Mollusca, Chordata, Rotifera, Class: Amphibia, Aves, Chondrichthyes,Mammalia, Reptalia, Order: Monotremata, Anura, Caudata, Gymnophiona, Family: ZooKeys xx: x-xx (2010) doi: 10.3897/zookeys.xx.xxx www.pensoftonline.net/zookeys Copyright Jitendra Gaikwad et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Launched to accelerate biodiversity research A peer-reviewed open-access journal DATA PAPER •  Promote and publicize the existence of the data •  Provide scholarly credit to data publishers through citable journal publications •  Describe the data in a structured human-readable form Data Paper A scholarly publica^on of searchable metadata document describing a dataset, or a group of datasets.
  • 47. RATIONALE FOR DATA PUBLISHING: CITATION & USAGE Slide source: Alberto González-Talaván, 2015 “We believe that the lack of incentive similar to the impact factor for scholarly publication remains a major impediment to the provision of free and open access to biodiversity data” GBIF Data Publishing Framework Task Group “Data citation standards can form the basis for increased incentives, recognition, and rewards for scientific data activities. Unfortunately, such standards and good practices are lacking” CODATA Data Citation Task Group
  • 49. WHY MANAGE DATA? Make your research easier Stop yourself drowning in irrelevant stuff Save data for later Avoid accusations of fraud or bad science Share your data for re-use Get credit for it Meet funder/institution requirements Because well-managed data opens up opportunities for re-use, sharing and makes for better science! Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 50. H2020 - OPEN DATA BY DEFAULT FROM 2017 Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 51. CONCERNS ABOUT DATA SHARING Concern Solution inappropriate use due to misunderstanding of research purpose or parameters security and confidentiality of sensitive data lack of acknowledgement / credit loss of advantage when competing for research funding metadata metadata metadata metadata Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 52. CONCERNS ABOUT DATA SHARING Concern Solution inappropriate use due to misunderstanding of research purpose or parameters provide rich Abstract, Purpose, Use Constraints and Supplemental Information where needed security and confidentiality of sensitive data •  the metadata does NOT contain the data •  Use Constraints specify who may access the data and how lack of acknowledgement / credit specify a required data citation within the Use Constraints loss of data insight and competitive advantage when vying for research funding create second, public version with generalised Data Processing Description Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 53. MAKE DATA SHAREABLE Create robust metadata that has been checked Include reference information in metadata e.g. unique IDs & properly formatted data citations Publish your metadata so it’s discoverable. Use portals, clearing houses, online resources… Package up the data and associated metadata to deposit in repositories License the data clearly Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 54. www.dcc.ac.uk/resources/how-guides/license-research-data LICENSING RESEARCH DATA This DCC guide outlines the pros and cons of each approach and gives practical advice on how to implement your licence CREATIVE COMMONS LIMITATIONS NC Non-Commercial What counts as commercial? ND No Derivatives Severely restricts use These clauses are not open licenses Horizon 2020 Open Access guidelines point to: or Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 55. LICENSING FOR DATA PUBLISHED THROUGH GBIF GBIF Governing Board established in 2014 GBIF support for three licenses: •  CC0, under which data are made available for any use without restriction or particular requirements on the part of users CC0 is strongly recommended whenever possible •  CC-BY, under which data are made available for any use provided that attribution is appropriately given for the sources of data used, in the manner specified by the owner •  CC-BY-NC, under which data are made available for any use provided that attribution is appropriately given and provided the use is not for commercial purposes http://www.gbif.org/terms/licences
  • 56. WHAT TO PRESERVE & SHARE It’s not possible to keep everything. Select based on: •  What has to be kept e.g. data underlying publications •  What can’t be recreated e.g. environmental recordings •  What is potentially useful to others •  What has scientific, cultural or historical value •  What legally must be destroyed How to select and appraise research data: www.dcc.ac.uk/resources/how-guides/appraise-select-research-data Slide source: OpenAIRE & EUDAT, CC-BY-4.0, 2013
  • 57. SLIDE CREDIT – GB23, EUDAT & OPENAIRE GB23 Nodes Madagascar, October 2015 •  http://community.gbif.org/pg/pages/view/47903/ Norwegian GBIF data publishing workshop in Trondheim, October 2015, •  http://www.gbif.no/events/2015/data-publishing-workshop-october-2015.html Slide credits: EUDAT/OpenAire, December 2015 & May 2016, CC-BY-4.0 •  http://www.slideshare.net/EUDAT/eudat-research-data-management •  http://www.slideshare.net/EUDAT/research-data-management-introduction-eudatopen-aire-webinar?ref=https:// eudat.eu/events/webinar/research-data-management-an-introductory-webinar-from-openaire-and-eudat •  https://eudat.eu/events/webinar/research-data-management-an-introductory-webinar-from-openaire-and-eudat •  http://www.instantpresenter.com/WebConference/RecordingDefault.aspx?c_psrid=EB57D6888147 Slide credits: EUDAT & OpenAire