SlideShare une entreprise Scribd logo
1  sur  32
www.iita.orgA member of CGIAR consortium
Workshop on Management and
Analyses of ISFM Data
Monday, May 25, 2015
1
www.iita.orgA member of CGIAR consortium
Data management
"Data management is the development,
execution and supervision of plans, policies,
programs and practices that control, protect,
deliver and enhance the value of data and
information assets.“
(DAMA Data Management Association International )
2
www.iita.orgA member of CGIAR consortium
Data management
Objective:
• to maximize the potential of data while
integrating them into business processes
Topics:
• Data quality
• Data security
• Data organization
3
www.iita.orgA member of CGIAR consortium
Data management principles
• Data are correct
• Data are consistent
(uniform in content, content structure, notation, units,
methods used, meaning, language)
• Data are complete
• Data are up to date
• Data are relevant
• Data are precise enough
• Datasets are free of redundancies
• Data are reliable and comprehensible
• Data are understandable by all involved
users and processible by machines
• Data are unambiguous/explicit
4
Data quality
www.iita.orgA member of CGIAR consortium
Data management principles
• Every data needs a
frequent backup
• no data without
access permission
control
• Treatment of data of
different ownership
(private) is clarified
5
Data security
www.iita.orgA member of CGIAR consortium
Data management principles
• There is no data
without a person
responsible for it (clear
roles & responsibilities)
• There is no data
without one, clearly
defined, easy to find
and communicated
location for it
6
Data
organization
www.iita.orgA member of CGIAR consortium
Main roles in data management
• Data Editor: The person that validates, creates
and edits the data
• Data Steward: The person that holds the data,
usually they will take care of the data, ensuring
the data consumers obtain exactly the data
approved by the data owner
• Data Owner: The person that approves data
before it is published for the eventual audience
• Data Consumer: A person that uses the data
without editing, correcting or modifying it
7
www.iita.orgA member of CGIAR consortium
Operational levels
• Individual
(Execution of data activities, self-organizing)
• Project/working group
(Plans&deliveries, rules&responsibilities, workflow&steering,
communication, access/permission control, data organizing (content
mgt./file order, file naming strategies, templates, Project data…) )
• Organization
(Policies, Infrastructure&repositories, Ressources, …)
• Global
(Metadata standards, data exchange protocols, vocabularies/
ontologies, legal issues, Open Access, …)
8
www.iita.orgA member of CGIAR consortium
Data
lifecycle
9
interpret data
derive data (apply statistical
and analytical methods)
produce research outputs
author publications
create metadata
and documentation
Identify (tracking)
Categorize
migrate data to
suitable medium
back-up and store
data
archive data
collect data (experiment, observe, measure, simulate)
design research
plan data management (formats, storage etc)
plan consent for sharing
locate existing data
enter data, digitize, transcribe, translate
check, validate, clean data
anonymize data where necessary
describe data
migrate data to best
format
Locate, explore and understand data
scrutinize findings
distribute data
share data
control access
establish copyright
promote data
establish copyright
promote data
follow-up research
undertake research reviews
teach and learn
Exposing metadata through a searchable
interface
Source: Boston University Libraries
www.iita.orgA member of CGIAR consortium
Data intervention areas
Data capturing and preprocessing
Data transfer
Data flow/content mgt.
Data storage
Data analytics
Data delivery
10
www.iita.orgA member of CGIAR consortium
From capture to delivery
11
Find answers to
• ensure all data mgt. principles are respected
• in and across all intervention areas
• at all operational levels
Start planning from the desired outcomes!
www.iita.orgA member of CGIAR consortium
Plan data management
12
Datamanagementprinciples
Data lifecycle / intervention areas
www.iita.orgA member of CGIAR consortium
Data presentation/publication
• Who are the end users of which data?
• Mode of presentation per information product
• Ease of extraction of the right data in the right
format for the right (authorized) people
• Automized? real-time data? Personalized data?
• Consumers conditions (file formats? Com. tools?)
• ability to search&browse (metadata, tags)
• Presentation mode and conditions (inclusive
visualization)
• licensing
13
www.iita.orgA member of CGIAR consortium
Data transfer
• Transfer format and requirements (Data
Transformation needed?)
• Transfer initiative (receiver or sender?)
• Transfer mode and instructions
• Transfer compression needs (zip, tar…),
limited internet availability?
• Transfer channels (email, phone, skype, RSS
etc.)
• Transfer check (i.e. email)
14
www.iita.orgA member of CGIAR consortium
Data transfer
• Transfer security
• Platform Openness
• Authorization Controls (user credentials)
• Encryption Standards (SSL, S/MIME etc.)
• Transfer scheduling
• Use of API’s?
15
www.iita.orgA member of CGIAR consortium
Data storage
• Suitable end repository
(server folder, Sharepoint, MySQL database, cloud based solution, PC, external repository)
• Suitable data infrastructure hardware
(servers, network(s), bandwidth, databases, security facilities, PCs, external hard drive, USB stick,
Smartphones/tablets, scanners, field or laboratory sensors with digital data capturing, etc.)
• Data categorization, file order, filing order criteria
• Data deleting policy and archiving for
evidence/documentation purposes
• Data disposal/sharing/access control +
administration
16
www.iita.orgA member of CGIAR consortium
Data analytics and data search
• Goal and mode of analysis
• Frequency of a data analysis
• Participating units and data integration
(Business intelligence)
• Storage and backup of analysis results
• Speed of search
• eventual transition or termination of the
data?
17
www.iita.orgA member of CGIAR consortium
Data backup
• Risk assessment:
loss/theft/damage/overload/hacker attack…
• Backup mode and regulations
• Backup frequency/scheduling and discipline
• Suitable backup repository (server folder,
Sharepoint, MySQL database, cloud, PC, external
repository, external hard drive, USB stick etc.)
• Backup tool/software/opportunities to automize
18
www.iita.orgA member of CGIAR consortium
Data capturing and preprocessing
• Capturing location and its conditions
• Capturing mode (manual typing, crowd sourcing, data
mining, etc.)
• Capturing tools/hardware (PC, Smartphones, tablets, GPS,
mobile phone, scanners etc.)
• Capturing software and requirements (field data capturing
tools, scanning & OCR read software, etc.)
• Capturing instructions (metadata, data protocols, add. data
descriptions, methodological correctness)
• Data validation rules + data checks: Ensuring Data quality
• Referencing captured data in time & space
• Data structure at capturing
• Capturing data intermediate storage
19
www.iita.orgA member of CGIAR consortium
Platforms
• MS SharePoint
• CKAN
• aWhere
• Collaboration tools
• File sharing services (google drive,
dropbox, FTP server, etc.)
20
www.iita.orgA member of CGIAR consortium
Data mgt. platforms (1)
MS SharePoint
• Fits to existing Microsoft environment
(MS Office (especially Outlook, Excel, Access, Visio, Project), MS Server
databases, Exchange server, skype)
• With proper permission settings, allows to
create as much pages, apps or subsites as
necessary
• Useful features for data mgt.
(Metadata tagging, version control, templates (MS office only), validation
rules, linking data lists, workflows (approvals etc.), many predefined apps
come with customizable metadata sets)
• Weak: issues linking open repositories
21
www.iita.orgA member of CGIAR consortium
Data mgt. platforms (2)
CKAN – “Meta-repository”
• functional emphasis: defacto standard software
for publishing open data
(started as a catalogue for harvesting published data spread of knowledge)
• Python based (DKAN in PHP)
• Strength: customizable, data organization,
harvesting multiple repositories
• Weak: no workflow or bulk operations:
processing need to be done before cataloguing;
no collaboration tools; no upload of multiple
ressources at a time and batch edit the metadata
• Example: http://data.ilri.org/portal/
22
www.iita.orgA member of CGIAR consortium
Data mgt. platforms (3a)
ILRI
dataset
portal
based
on
CKAN
23
www.iita.orgA member of CGIAR consortium
Data mgt. platforms (3b)
ILRI
dataset
portal
based
on
CKAN
24
www.iita.orgA member of CGIAR consortium
Data mgt. platforms (4)
aWhere
• Functional emphasis: (geo)data exploration
• Strength: easy to use platform to explore data
from xls or ODK as tables, diagram or maps and
in connection with data from other users, the
library and the weather module
• Weak: xls only; collaboration functionality
• More by Hannah and Courtney
25
www.iita.orgA member of CGIAR consortium
Data mgt. platforms (5)
Collaboration tools - basecamp
• Functional emphasis: collaboration with many
different partners in projects
• Strength: easy to use platform with typical
collab. tools (file sharing+tagging, calendar, wiki,
task tracking)
• Weak: not customizable, no data linkage to
databases
26
www.iita.orgA member of CGIAR consortium
Data mgt. platforms (6)
File sharing services – Google drive
• Functional emphasis: synchronized working on
office apps in the cloud
• Strength: data sharing and synchronizing, widely
known, easy to use
• Weak: not customizable, no data linkage to
databases, google account necessary; adverts
27
www.iita.orgA member of CGIAR consortium
28
www.iita.orgA member of CGIAR consortium
File naming strategies
29
 Order by date:
2013-04-12_interview-recording_THD.mp3
2013-04-12_interview-transcript_THD.docx
2012-12-15_interview-recording_MBD.mp3
2012-12-15_interview-transcript_MBD.docx
 Order by subject:
MBD_interview-recording_2012-12-15.mp3
MBD_interview-transcript_2012-12-15.docx
THD_interview-recording_2013-04-12.mp3
THD_interview-transcript_2013-04-12.docx
 Order by type:
Interview-recording_MBD_2012-12-15.mp3
Interview-recording_THD_2013-04-12.mp3
Interview-transcript_MBD_2012-12-15.docx
Interview-transcript_THD_2013-04-12.docx
 Forced order with numbering:
01_THD_interview-recording_2013-04-12.mp3
02_THD_interview-transcript_2013-04-12.docx
03_MBD_interview-recording_2012-12-15.mp3
04_MBD_interview-transcript_2012-12-15.docx
www.iita.orgA member of CGIAR consortium
Supporting documentation(1)
30
Supporting documentation is information in
separate files that accompanies data in order to
provide
• context,
• explanation, or
• instructions on
• confidentiality and
• data use or
• reuse
Source: Dublin UCD Library
www.iita.orgA member of CGIAR consortium
Supporting documentation(1)
31
Examples of supporting documentation include:
Source: Dublin UCD Library
Information about the project and data creators;
Working papers or laboratory notebooks
Questionnaires or interview guides
Codebooks
Details on how the data were created, analysed,
anonymised etc;
Final project reports and publications
www.iita.orgA member of CGIAR consortium
Metadata
32
There are three broad categories of metadata:
Source: Dublin UCD Library
 Descriptive - common fields such as title, author,
abstract, keywords which help users to discover
online sources through searching and browsing.
 Administrative - preservation, rights
management, and technical metadata about
formats.
 Structural - how different components of a set of
associated data relate to one another, such as a
schema describing relations between tables in a
database.

Contenu connexe

Tendances

Data quality overview
Data quality overviewData quality overview
Data quality overview
Alex Meadows
 
Data and database administration(database)
Data and database administration(database)Data and database administration(database)
Data and database administration(database)
welcometofacebook
 

Tendances (20)

Data Quality Presentation.ppt
Data Quality Presentation.pptData Quality Presentation.ppt
Data Quality Presentation.ppt
 
Chapter 5: Data Development
Chapter 5: Data Development Chapter 5: Data Development
Chapter 5: Data Development
 
Database Administration
Database AdministrationDatabase Administration
Database Administration
 
Reference Data Management
Reference Data ManagementReference Data Management
Reference Data Management
 
Chapter1 introduction
Chapter1 introductionChapter1 introduction
Chapter1 introduction
 
structuration des métadonnées de pérennisation
structuration des métadonnées de pérennisationstructuration des métadonnées de pérennisation
structuration des métadonnées de pérennisation
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
‏‏‏‏‏‏‏‏Chapter 11: Meta-data Management
‏‏‏‏‏‏‏‏Chapter 11: Meta-data Management‏‏‏‏‏‏‏‏Chapter 11: Meta-data Management
‏‏‏‏‏‏‏‏Chapter 11: Meta-data Management
 
Digital preservation: an introduction
Digital preservation: an introductionDigital preservation: an introduction
Digital preservation: an introduction
 
Data quality overview
Data quality overviewData quality overview
Data quality overview
 
planning & project management for DWH
planning & project management for DWHplanning & project management for DWH
planning & project management for DWH
 
Data warehouse presentaion
Data warehouse presentaionData warehouse presentaion
Data warehouse presentaion
 
Data Quality & Data Governance
Data Quality & Data GovernanceData Quality & Data Governance
Data Quality & Data Governance
 
Business intelligence and data warehouses
Business intelligence and data warehousesBusiness intelligence and data warehouses
Business intelligence and data warehouses
 
Data quality management Basic
Data quality management BasicData quality management Basic
Data quality management Basic
 
Data and database administration(database)
Data and database administration(database)Data and database administration(database)
Data and database administration(database)
 
‏‏‏‏‏‏‏‏‏‏Chapter 12: Data Quality Management
‏‏‏‏‏‏‏‏‏‏Chapter 12: Data Quality Management‏‏‏‏‏‏‏‏‏‏Chapter 12: Data Quality Management
‏‏‏‏‏‏‏‏‏‏Chapter 12: Data Quality Management
 
System analysis and design
System analysis and designSystem analysis and design
System analysis and design
 
Data Quality Strategies
Data Quality StrategiesData Quality Strategies
Data Quality Strategies
 
Gathering Business Requirements for Data Warehouses
Gathering Business Requirements for Data WarehousesGathering Business Requirements for Data Warehouses
Gathering Business Requirements for Data Warehouses
 

Similaire à Introduction to data management, terminologies and use of data management platforms

Data Privacy at Scale
Data Privacy at ScaleData Privacy at Scale
Data Privacy at Scale
DataWorks Summit
 
PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013
Frauke Ziedorn
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
SEAD
 

Similaire à Introduction to data management, terminologies and use of data management platforms (20)

Data management for proposal writing
Data management for proposal writingData management for proposal writing
Data management for proposal writing
 
Building blocks for success: criteria for trusted institutional repositories
Building blocks for success: criteria for trusted institutional repositoriesBuilding blocks for success: criteria for trusted institutional repositories
Building blocks for success: criteria for trusted institutional repositories
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech Proposals
 
Building blocks for success: criteria for trusted institutional repositories
Building blocks for success: criteria for trusted institutional repositoriesBuilding blocks for success: criteria for trusted institutional repositories
Building blocks for success: criteria for trusted institutional repositories
 
SMART Seminar Series: SMART Data Management
SMART Seminar Series: SMART Data ManagementSMART Seminar Series: SMART Data Management
SMART Seminar Series: SMART Data Management
 
Criteria for a trusted institutional repository
Criteria for a trusted institutional repositoryCriteria for a trusted institutional repository
Criteria for a trusted institutional repository
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Data Privacy at Scale
Data Privacy at ScaleData Privacy at Scale
Data Privacy at Scale
 
PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013
 
Building Cyber-infrastructure at UNC-CH
Building Cyber-infrastructure at UNC-CHBuilding Cyber-infrastructure at UNC-CH
Building Cyber-infrastructure at UNC-CH
 
Intelligent Cloud Enablement
Intelligent Cloud EnablementIntelligent Cloud Enablement
Intelligent Cloud Enablement
 
Strata San Jose 2017 - Ben Sharma Presentation
Strata San Jose 2017 - Ben Sharma PresentationStrata San Jose 2017 - Ben Sharma Presentation
Strata San Jose 2017 - Ben Sharma Presentation
 
Data Domain-Driven Design
Data Domain-Driven DesignData Domain-Driven Design
Data Domain-Driven Design
 
Big Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingBig Data and Semantic Web in Manufacturing
Big Data and Semantic Web in Manufacturing
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
 
Harness the power of Data in a Big Data Lake
Harness the power of Data in a Big Data LakeHarness the power of Data in a Big Data Lake
Harness the power of Data in a Big Data Lake
 
Rocana Deep Dive OC Big Data Meetup #19 Sept 21st 2016
Rocana Deep Dive OC Big Data Meetup #19 Sept 21st 2016Rocana Deep Dive OC Big Data Meetup #19 Sept 21st 2016
Rocana Deep Dive OC Big Data Meetup #19 Sept 21st 2016
 
Data Lakes - The Key to a Scalable Data Architecture
Data Lakes - The Key to a Scalable Data ArchitectureData Lakes - The Key to a Scalable Data Architecture
Data Lakes - The Key to a Scalable Data Architecture
 
Bertenthal
BertenthalBertenthal
Bertenthal
 
Metadata Strategies
Metadata StrategiesMetadata Strategies
Metadata Strategies
 

Plus de International Institute of Tropical Agriculture

Plus de International Institute of Tropical Agriculture (20)

Make your research visible and create more impact using DataCite DOIs
Make your research visible and  create more impact using  DataCite DOIsMake your research visible and  create more impact using  DataCite DOIs
Make your research visible and create more impact using DataCite DOIs
 
Induction of early flowering in cassava through light supplementation and CM...
Induction of early flowering in cassava  through light supplementation and CM...Induction of early flowering in cassava  through light supplementation and CM...
Induction of early flowering in cassava through light supplementation and CM...
 
Producing yam mother plants to collect vines for propagation
Producing yam mother plants to collect  vines for propagationProducing yam mother plants to collect  vines for propagation
Producing yam mother plants to collect vines for propagation
 
Effects of moult and breeding on the body condition of some forest birds in s...
Effects of moult and breeding on the body condition of some forest birds in s...Effects of moult and breeding on the body condition of some forest birds in s...
Effects of moult and breeding on the body condition of some forest birds in s...
 
Conserving Nigeria’s rarest endemic bird: Ibadan Malimbe, Malimbusibadanensis
Conserving Nigeria’s rarest endemic bird: Ibadan Malimbe, MalimbusibadanensisConserving Nigeria’s rarest endemic bird: Ibadan Malimbe, Malimbusibadanensis
Conserving Nigeria’s rarest endemic bird: Ibadan Malimbe, Malimbusibadanensis
 
Cassava brown streak epidemiology in Eastern Democratic Republic of the Congo
Cassava brown streak epidemiology in Eastern Democratic  Republic of the CongoCassava brown streak epidemiology in Eastern Democratic  Republic of the Congo
Cassava brown streak epidemiology in Eastern Democratic Republic of the Congo
 
Assessment of genetic diversity among Rwandan cassava (Manihot esculenta) ger...
Assessment of genetic diversity among Rwandan cassava (Manihot esculenta) ger...Assessment of genetic diversity among Rwandan cassava (Manihot esculenta) ger...
Assessment of genetic diversity among Rwandan cassava (Manihot esculenta) ger...
 
9 osunbade identification of end users preferences of a cassava product
9 osunbade identification of end users preferences of a cassava product9 osunbade identification of end users preferences of a cassava product
9 osunbade identification of end users preferences of a cassava product
 
7 helen ufondu perception of yam landraces quality among value chain actors i...
7 helen ufondu perception of yam landraces quality among value chain actors i...7 helen ufondu perception of yam landraces quality among value chain actors i...
7 helen ufondu perception of yam landraces quality among value chain actors i...
 
8 kazeem quality attributes and consumer acceptability of cookies flavoured
8 kazeem quality attributes and consumer acceptability of cookies flavoured8 kazeem quality attributes and consumer acceptability of cookies flavoured
8 kazeem quality attributes and consumer acceptability of cookies flavoured
 
6 anajekwu ekpereka chemical, functional and pasting properties of flours pro...
6 anajekwu ekpereka chemical, functional and pasting properties of flours pro...6 anajekwu ekpereka chemical, functional and pasting properties of flours pro...
6 anajekwu ekpereka chemical, functional and pasting properties of flours pro...
 
5 seun olowote effect of drying method on caroteniod content of yellow maize
5 seun olowote effect of drying method on caroteniod content of yellow maize5 seun olowote effect of drying method on caroteniod content of yellow maize
5 seun olowote effect of drying method on caroteniod content of yellow maize
 
4 ayodele adenitan survey of dried plantain (musa paradisiaca) chips processo...
4 ayodele adenitan survey of dried plantain (musa paradisiaca) chips processo...4 ayodele adenitan survey of dried plantain (musa paradisiaca) chips processo...
4 ayodele adenitan survey of dried plantain (musa paradisiaca) chips processo...
 
2 akin olagunju does crop diversification influenc e food and nutrition secur...
2 akin olagunju does crop diversification influenc e food and nutrition secur...2 akin olagunju does crop diversification influenc e food and nutrition secur...
2 akin olagunju does crop diversification influenc e food and nutrition secur...
 
3 akinsola carotenoid apparent retention in ogi flour made from different pro...
3 akinsola carotenoid apparent retention in ogi flour made from different pro...3 akinsola carotenoid apparent retention in ogi flour made from different pro...
3 akinsola carotenoid apparent retention in ogi flour made from different pro...
 
1 pearl amadi assessing the level of consumption of pro vitamin a cassava pr...
1 pearl amadi assessing the level of consumption of pro  vitamin a cassava pr...1 pearl amadi assessing the level of consumption of pro  vitamin a cassava pr...
1 pearl amadi assessing the level of consumption of pro vitamin a cassava pr...
 
Prof janice olawoye
Prof janice olawoyeProf janice olawoye
Prof janice olawoye
 
Inqaba biotech presentation
Inqaba biotech presentationInqaba biotech presentation
Inqaba biotech presentation
 
Iarsaf symposium adaptation to climate change
Iarsaf symposium adaptation to climate changeIarsaf symposium adaptation to climate change
Iarsaf symposium adaptation to climate change
 
Bimaf iita iarsaf presentation-ibadan 21.05.19
Bimaf  iita iarsaf presentation-ibadan 21.05.19Bimaf  iita iarsaf presentation-ibadan 21.05.19
Bimaf iita iarsaf presentation-ibadan 21.05.19
 

Dernier

Russian🍌Dazzling Hottie Get☎️ 9053900678 ☎️call girl In Chandigarh By Chandig...
Russian🍌Dazzling Hottie Get☎️ 9053900678 ☎️call girl In Chandigarh By Chandig...Russian🍌Dazzling Hottie Get☎️ 9053900678 ☎️call girl In Chandigarh By Chandig...
Russian🍌Dazzling Hottie Get☎️ 9053900678 ☎️call girl In Chandigarh By Chandig...
Chandigarh Call girls 9053900678 Call girls in Chandigarh
 

Dernier (20)

Call Girls Chakan Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Chakan Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Chakan Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Chakan Call Me 7737669865 Budget Friendly No Advance Booking
 
Night 7k to 12k Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...
Night 7k to 12k  Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...Night 7k to 12k  Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...
Night 7k to 12k Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...
 
Call On 6297143586 Viman Nagar Call Girls In All Pune 24/7 Provide Call With...
Call On 6297143586  Viman Nagar Call Girls In All Pune 24/7 Provide Call With...Call On 6297143586  Viman Nagar Call Girls In All Pune 24/7 Provide Call With...
Call On 6297143586 Viman Nagar Call Girls In All Pune 24/7 Provide Call With...
 
The NAP process & South-South peer learning
The NAP process & South-South peer learningThe NAP process & South-South peer learning
The NAP process & South-South peer learning
 
2024: The FAR, Federal Acquisition Regulations, Part 30
2024: The FAR, Federal Acquisition Regulations, Part 302024: The FAR, Federal Acquisition Regulations, Part 30
2024: The FAR, Federal Acquisition Regulations, Part 30
 
WORLD DEVELOPMENT REPORT 2024 - Economic Growth in Middle-Income Countries.
WORLD DEVELOPMENT REPORT 2024 - Economic Growth in Middle-Income Countries.WORLD DEVELOPMENT REPORT 2024 - Economic Growth in Middle-Income Countries.
WORLD DEVELOPMENT REPORT 2024 - Economic Growth in Middle-Income Countries.
 
(NEHA) Call Girls Nagpur Call Now 8250077686 Nagpur Escorts 24x7
(NEHA) Call Girls Nagpur Call Now 8250077686 Nagpur Escorts 24x7(NEHA) Call Girls Nagpur Call Now 8250077686 Nagpur Escorts 24x7
(NEHA) Call Girls Nagpur Call Now 8250077686 Nagpur Escorts 24x7
 
Incident Command System xxxxxxxxxxxxxxxxxxxxxxxxx
Incident Command System xxxxxxxxxxxxxxxxxxxxxxxxxIncident Command System xxxxxxxxxxxxxxxxxxxxxxxxx
Incident Command System xxxxxxxxxxxxxxxxxxxxxxxxx
 
Top Rated Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated  Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Top Rated  Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
 
Zechariah Boodey Farmstead Collaborative presentation - Humble Beginnings
Zechariah Boodey Farmstead Collaborative presentation -  Humble BeginningsZechariah Boodey Farmstead Collaborative presentation -  Humble Beginnings
Zechariah Boodey Farmstead Collaborative presentation - Humble Beginnings
 
The Economic and Organised Crime Office (EOCO) has been advised by the Office...
The Economic and Organised Crime Office (EOCO) has been advised by the Office...The Economic and Organised Crime Office (EOCO) has been advised by the Office...
The Economic and Organised Crime Office (EOCO) has been advised by the Office...
 
↑VVIP celebrity ( Pune ) Serampore Call Girls 8250192130 unlimited shot and a...
↑VVIP celebrity ( Pune ) Serampore Call Girls 8250192130 unlimited shot and a...↑VVIP celebrity ( Pune ) Serampore Call Girls 8250192130 unlimited shot and a...
↑VVIP celebrity ( Pune ) Serampore Call Girls 8250192130 unlimited shot and a...
 
Expressive clarity oral presentation.pptx
Expressive clarity oral presentation.pptxExpressive clarity oral presentation.pptx
Expressive clarity oral presentation.pptx
 
PPT BIJNOR COUNTING Counting of Votes on ETPBs (FOR SERVICE ELECTORS
PPT BIJNOR COUNTING Counting of Votes on ETPBs (FOR SERVICE ELECTORSPPT BIJNOR COUNTING Counting of Votes on ETPBs (FOR SERVICE ELECTORS
PPT BIJNOR COUNTING Counting of Votes on ETPBs (FOR SERVICE ELECTORS
 
Junnar ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
Junnar ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...Junnar ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...
Junnar ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
 
Top Rated Pune Call Girls Dapodi ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated  Pune Call Girls Dapodi ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Top Rated  Pune Call Girls Dapodi ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls Dapodi ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
 
The U.S. Budget and Economic Outlook (Presentation)
The U.S. Budget and Economic Outlook (Presentation)The U.S. Budget and Economic Outlook (Presentation)
The U.S. Budget and Economic Outlook (Presentation)
 
Just Call Vip call girls Wardha Escorts ☎️8617370543 Starting From 5K to 25K ...
Just Call Vip call girls Wardha Escorts ☎️8617370543 Starting From 5K to 25K ...Just Call Vip call girls Wardha Escorts ☎️8617370543 Starting From 5K to 25K ...
Just Call Vip call girls Wardha Escorts ☎️8617370543 Starting From 5K to 25K ...
 
PPT Item # 4 - 231 Encino Ave (Significance Only)
PPT Item # 4 - 231 Encino Ave (Significance Only)PPT Item # 4 - 231 Encino Ave (Significance Only)
PPT Item # 4 - 231 Encino Ave (Significance Only)
 
Russian🍌Dazzling Hottie Get☎️ 9053900678 ☎️call girl In Chandigarh By Chandig...
Russian🍌Dazzling Hottie Get☎️ 9053900678 ☎️call girl In Chandigarh By Chandig...Russian🍌Dazzling Hottie Get☎️ 9053900678 ☎️call girl In Chandigarh By Chandig...
Russian🍌Dazzling Hottie Get☎️ 9053900678 ☎️call girl In Chandigarh By Chandig...
 

Introduction to data management, terminologies and use of data management platforms

  • 1. www.iita.orgA member of CGIAR consortium Workshop on Management and Analyses of ISFM Data Monday, May 25, 2015 1
  • 2. www.iita.orgA member of CGIAR consortium Data management "Data management is the development, execution and supervision of plans, policies, programs and practices that control, protect, deliver and enhance the value of data and information assets.“ (DAMA Data Management Association International ) 2
  • 3. www.iita.orgA member of CGIAR consortium Data management Objective: • to maximize the potential of data while integrating them into business processes Topics: • Data quality • Data security • Data organization 3
  • 4. www.iita.orgA member of CGIAR consortium Data management principles • Data are correct • Data are consistent (uniform in content, content structure, notation, units, methods used, meaning, language) • Data are complete • Data are up to date • Data are relevant • Data are precise enough • Datasets are free of redundancies • Data are reliable and comprehensible • Data are understandable by all involved users and processible by machines • Data are unambiguous/explicit 4 Data quality
  • 5. www.iita.orgA member of CGIAR consortium Data management principles • Every data needs a frequent backup • no data without access permission control • Treatment of data of different ownership (private) is clarified 5 Data security
  • 6. www.iita.orgA member of CGIAR consortium Data management principles • There is no data without a person responsible for it (clear roles & responsibilities) • There is no data without one, clearly defined, easy to find and communicated location for it 6 Data organization
  • 7. www.iita.orgA member of CGIAR consortium Main roles in data management • Data Editor: The person that validates, creates and edits the data • Data Steward: The person that holds the data, usually they will take care of the data, ensuring the data consumers obtain exactly the data approved by the data owner • Data Owner: The person that approves data before it is published for the eventual audience • Data Consumer: A person that uses the data without editing, correcting or modifying it 7
  • 8. www.iita.orgA member of CGIAR consortium Operational levels • Individual (Execution of data activities, self-organizing) • Project/working group (Plans&deliveries, rules&responsibilities, workflow&steering, communication, access/permission control, data organizing (content mgt./file order, file naming strategies, templates, Project data…) ) • Organization (Policies, Infrastructure&repositories, Ressources, …) • Global (Metadata standards, data exchange protocols, vocabularies/ ontologies, legal issues, Open Access, …) 8
  • 9. www.iita.orgA member of CGIAR consortium Data lifecycle 9 interpret data derive data (apply statistical and analytical methods) produce research outputs author publications create metadata and documentation Identify (tracking) Categorize migrate data to suitable medium back-up and store data archive data collect data (experiment, observe, measure, simulate) design research plan data management (formats, storage etc) plan consent for sharing locate existing data enter data, digitize, transcribe, translate check, validate, clean data anonymize data where necessary describe data migrate data to best format Locate, explore and understand data scrutinize findings distribute data share data control access establish copyright promote data establish copyright promote data follow-up research undertake research reviews teach and learn Exposing metadata through a searchable interface Source: Boston University Libraries
  • 10. www.iita.orgA member of CGIAR consortium Data intervention areas Data capturing and preprocessing Data transfer Data flow/content mgt. Data storage Data analytics Data delivery 10
  • 11. www.iita.orgA member of CGIAR consortium From capture to delivery 11
  • 12. Find answers to • ensure all data mgt. principles are respected • in and across all intervention areas • at all operational levels Start planning from the desired outcomes! www.iita.orgA member of CGIAR consortium Plan data management 12 Datamanagementprinciples Data lifecycle / intervention areas
  • 13. www.iita.orgA member of CGIAR consortium Data presentation/publication • Who are the end users of which data? • Mode of presentation per information product • Ease of extraction of the right data in the right format for the right (authorized) people • Automized? real-time data? Personalized data? • Consumers conditions (file formats? Com. tools?) • ability to search&browse (metadata, tags) • Presentation mode and conditions (inclusive visualization) • licensing 13
  • 14. www.iita.orgA member of CGIAR consortium Data transfer • Transfer format and requirements (Data Transformation needed?) • Transfer initiative (receiver or sender?) • Transfer mode and instructions • Transfer compression needs (zip, tar…), limited internet availability? • Transfer channels (email, phone, skype, RSS etc.) • Transfer check (i.e. email) 14
  • 15. www.iita.orgA member of CGIAR consortium Data transfer • Transfer security • Platform Openness • Authorization Controls (user credentials) • Encryption Standards (SSL, S/MIME etc.) • Transfer scheduling • Use of API’s? 15
  • 16. www.iita.orgA member of CGIAR consortium Data storage • Suitable end repository (server folder, Sharepoint, MySQL database, cloud based solution, PC, external repository) • Suitable data infrastructure hardware (servers, network(s), bandwidth, databases, security facilities, PCs, external hard drive, USB stick, Smartphones/tablets, scanners, field or laboratory sensors with digital data capturing, etc.) • Data categorization, file order, filing order criteria • Data deleting policy and archiving for evidence/documentation purposes • Data disposal/sharing/access control + administration 16
  • 17. www.iita.orgA member of CGIAR consortium Data analytics and data search • Goal and mode of analysis • Frequency of a data analysis • Participating units and data integration (Business intelligence) • Storage and backup of analysis results • Speed of search • eventual transition or termination of the data? 17
  • 18. www.iita.orgA member of CGIAR consortium Data backup • Risk assessment: loss/theft/damage/overload/hacker attack… • Backup mode and regulations • Backup frequency/scheduling and discipline • Suitable backup repository (server folder, Sharepoint, MySQL database, cloud, PC, external repository, external hard drive, USB stick etc.) • Backup tool/software/opportunities to automize 18
  • 19. www.iita.orgA member of CGIAR consortium Data capturing and preprocessing • Capturing location and its conditions • Capturing mode (manual typing, crowd sourcing, data mining, etc.) • Capturing tools/hardware (PC, Smartphones, tablets, GPS, mobile phone, scanners etc.) • Capturing software and requirements (field data capturing tools, scanning & OCR read software, etc.) • Capturing instructions (metadata, data protocols, add. data descriptions, methodological correctness) • Data validation rules + data checks: Ensuring Data quality • Referencing captured data in time & space • Data structure at capturing • Capturing data intermediate storage 19
  • 20. www.iita.orgA member of CGIAR consortium Platforms • MS SharePoint • CKAN • aWhere • Collaboration tools • File sharing services (google drive, dropbox, FTP server, etc.) 20
  • 21. www.iita.orgA member of CGIAR consortium Data mgt. platforms (1) MS SharePoint • Fits to existing Microsoft environment (MS Office (especially Outlook, Excel, Access, Visio, Project), MS Server databases, Exchange server, skype) • With proper permission settings, allows to create as much pages, apps or subsites as necessary • Useful features for data mgt. (Metadata tagging, version control, templates (MS office only), validation rules, linking data lists, workflows (approvals etc.), many predefined apps come with customizable metadata sets) • Weak: issues linking open repositories 21
  • 22. www.iita.orgA member of CGIAR consortium Data mgt. platforms (2) CKAN – “Meta-repository” • functional emphasis: defacto standard software for publishing open data (started as a catalogue for harvesting published data spread of knowledge) • Python based (DKAN in PHP) • Strength: customizable, data organization, harvesting multiple repositories • Weak: no workflow or bulk operations: processing need to be done before cataloguing; no collaboration tools; no upload of multiple ressources at a time and batch edit the metadata • Example: http://data.ilri.org/portal/ 22
  • 23. www.iita.orgA member of CGIAR consortium Data mgt. platforms (3a) ILRI dataset portal based on CKAN 23
  • 24. www.iita.orgA member of CGIAR consortium Data mgt. platforms (3b) ILRI dataset portal based on CKAN 24
  • 25. www.iita.orgA member of CGIAR consortium Data mgt. platforms (4) aWhere • Functional emphasis: (geo)data exploration • Strength: easy to use platform to explore data from xls or ODK as tables, diagram or maps and in connection with data from other users, the library and the weather module • Weak: xls only; collaboration functionality • More by Hannah and Courtney 25
  • 26. www.iita.orgA member of CGIAR consortium Data mgt. platforms (5) Collaboration tools - basecamp • Functional emphasis: collaboration with many different partners in projects • Strength: easy to use platform with typical collab. tools (file sharing+tagging, calendar, wiki, task tracking) • Weak: not customizable, no data linkage to databases 26
  • 27. www.iita.orgA member of CGIAR consortium Data mgt. platforms (6) File sharing services – Google drive • Functional emphasis: synchronized working on office apps in the cloud • Strength: data sharing and synchronizing, widely known, easy to use • Weak: not customizable, no data linkage to databases, google account necessary; adverts 27
  • 28. www.iita.orgA member of CGIAR consortium 28
  • 29. www.iita.orgA member of CGIAR consortium File naming strategies 29  Order by date: 2013-04-12_interview-recording_THD.mp3 2013-04-12_interview-transcript_THD.docx 2012-12-15_interview-recording_MBD.mp3 2012-12-15_interview-transcript_MBD.docx  Order by subject: MBD_interview-recording_2012-12-15.mp3 MBD_interview-transcript_2012-12-15.docx THD_interview-recording_2013-04-12.mp3 THD_interview-transcript_2013-04-12.docx  Order by type: Interview-recording_MBD_2012-12-15.mp3 Interview-recording_THD_2013-04-12.mp3 Interview-transcript_MBD_2012-12-15.docx Interview-transcript_THD_2013-04-12.docx  Forced order with numbering: 01_THD_interview-recording_2013-04-12.mp3 02_THD_interview-transcript_2013-04-12.docx 03_MBD_interview-recording_2012-12-15.mp3 04_MBD_interview-transcript_2012-12-15.docx
  • 30. www.iita.orgA member of CGIAR consortium Supporting documentation(1) 30 Supporting documentation is information in separate files that accompanies data in order to provide • context, • explanation, or • instructions on • confidentiality and • data use or • reuse Source: Dublin UCD Library
  • 31. www.iita.orgA member of CGIAR consortium Supporting documentation(1) 31 Examples of supporting documentation include: Source: Dublin UCD Library Information about the project and data creators; Working papers or laboratory notebooks Questionnaires or interview guides Codebooks Details on how the data were created, analysed, anonymised etc; Final project reports and publications
  • 32. www.iita.orgA member of CGIAR consortium Metadata 32 There are three broad categories of metadata: Source: Dublin UCD Library  Descriptive - common fields such as title, author, abstract, keywords which help users to discover online sources through searching and browsing.  Administrative - preservation, rights management, and technical metadata about formats.  Structural - how different components of a set of associated data relate to one another, such as a schema describing relations between tables in a database.

Notes de l'éditeur

  1. identifying, classifying, prioritizing, storing, securing, archiving, preserving, retrieving, tracking and destroying of records
  2. identifying, classifying, prioritizing, storing, securing, archiving, preserving, retrieving, tracking and destroying of records
  3. identifying, classifying, prioritizing, storing, securing, archiving, preserving, retrieving, tracking and destroying of records