DataOne - Suzie Allard - RDAP12

•Télécharger en tant que PPTX, PDF•

1 j'aime•448 vues

DataOne Suzie Allard, Ph.D. University of Tennessee Presentation at Research Data Access & Preservation Summit 21 March 2012

Technologie Formation

DataONE

Research Data Access & Preservation
21 March 2012

Suzie Allard, Ph.D.
University of Tennessee

DataONE vision and approach
Enable new science and knowledge creation through
universal access to data about life on earth and the
environment that sustains it.
1. Build on existing
cyberinfrastructure 2. Create new
cyberinfrastructure 3. Support communities
of practice

2
2

DataONE Cyberinfrastructure
Three major components for a Member Nodes
flexible, scalable, sustainable • diverse institutions
Coordinating Nodes
network • serve local community
• retain complete metadata
Investigator Toolkit
• provide resources for
catalog
managing their data
• indexing for search
• retain copies of data
• network-wide services
• ensure content
availability (preservation)
• replication services

3

Training in all elements of the data life cycle

Plan

Analyze Collect
Kepler

Integrate Assure

Discover Describe

Preserve
4

DataONE Education and Training

Summer Internships
Training at Conferences and Workshops
• Supercomputing 2011
• DataONE Implementation Workshop: Publishing data as a
Member Node
• Ecological Society of America (ESA)
• American Geophysical Union (AGU)
Educational Modules
Graduate-level course
• Summer Institute for Environmental Informatics

5

Environmental Information Management (EIM) Institute
Graduate students biology, geology, ecology, or other
environmental sciences, environmental engineering, geography
or science librarianship
Conceptual and practical hands-on
training to effectively
design, manage, analyze, visualize, and
preserve data and information:
• Managing data files
• Creating databases and web portals
• Data analysis and visualization
• Techniques for
managing, analyzing, and visualizing
geospatial data

7

DataONE Team and Sponsors
• Amber Budden, Roger Dahl, Rebecca • Ewa Deelman
Koskela, Bill Michener, Robert Nahf, Mark
• Servilla
Dave Vieglais • Peter Honeyman

• Suzie Allard, Carol Tenopir, Maribeth • Jeff Horsburgh
Manoff, Kimberley Douglass, Robert
• Waltz, Bruce Wilson Giri
John Cobb, Bob Cook, • Robert Sandusky
Palanismy, Line Pouchard
• Patricia Cruse, John Kunze • Bertram Ludaescher

• Sky Bristol, Mike Frame, Richard Huffine, Viv • Peter Buneman
Hutchison, Jeff Morisette, Jake Weltzin, Lisa Zolly
• Chris Jones, Stephanie Hampton, Matt • Cliff Duke
Jones
• Paul Allen, Rick Bonney, Steve Kelling • Carole Goble

• Ryan Scherle, Todd Vision • Donald Hobern

• Randy Butler • David DeRoure

LEON LEVY
FOUNDATION 8

A Science Use Case

Diverse bird observations and Model results
environmental data from
300,00 locations in the US Occurrence of Indigo Bunting (2008)
integrated and analyzed using
High Performance Computing
Resources

Land Cover

Jan Ap Jun Sep Dec
r
Meteorology
• Examine patterns of
migration
MODIS – Spatio-Temporal Exploratory • Infer how climate
Remote Model identifies factors change may affect
sensing data affecting patterns of bird migration
migration

11

Recommandé

EDI Training Module 2: EDI ProjectEnvironmental Data Initiative

Introduction to the Environmental Data Initiative (EDI)Corinna Gries

EDI Training Module 12: An Introduction to Metadata and Data RepositoriesEnvironmental Data Initiative

Certifying CISER! A Data Seal of Approval Case StudyHistoric Environment Scotland

Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012SEAD

Data Repositories: Recommendation, Certification and Models for Cost RecoveryAnita de Waard

PEER End of Project ReportEDINA, University of Edinburgh

Rots RDAP11 Data Archives in Federal AgenciesASIS&T

Recommandé

EDI Training Module 2: EDI ProjectEnvironmental Data Initiative

Introduction to the Environmental Data Initiative (EDI)Corinna Gries

EDI Training Module 12: An Introduction to Metadata and Data RepositoriesEnvironmental Data Initiative

Certifying CISER! A Data Seal of Approval Case StudyHistoric Environment Scotland

Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012SEAD

Data Repositories: Recommendation, Certification and Models for Cost RecoveryAnita de Waard

PEER End of Project ReportEDINA, University of Edinburgh

Rots RDAP11 Data Archives in Federal AgenciesASIS&T

Real-World Data Challenges: Moving Towards Richer Data EcosystemsAnita de Waard

DataONE Education Module 01: Why Data Management?DataONE

Data management: international challenges, national infrastructure, and insti...Andrew Treloar

EDI Training Module 4: Organizing Data Into Publishable UnitsEnvironmental Data Initiative

ANDS Applications Program: Building Tools to Facilitate Data ReuseAndrew Treloar

Provenance in Support of the ANDS Four TransformationsAndrew Treloar

Natasha intro to rdm c3 dis may 2018.pptxARDC

John morrissey c3 dis fair working data.pptxARDC

Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareHistoric Environment Scotland

Birgit Schmidt: RDA for Libraries from an International Perspectivedri_ireland

Sue cook c3 dis dm-ps 1.pptxARDC

Comeaux RDAP11 Data Archives in Federal AgenciesASIS&T

End of COBWEB Co-Design Projects Celebration EDINA, University of Edinburgh

Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...SEAD

Global registries initiative frumkin omodeiASIS&T

Geospatial metadata and spatial data workshop: 19 June 2014EDINA, University of Edinburgh

Smith RDAP11 NSF Data Management Plan Case StudiesASIS&T

A National Approach to Open Data in Ireland: Publishers and Research Data Man...Rebecca Grant

Altman RDAP11 Policy-based Data ManagementASIS&T

Ignite@AGU14SEAD

Hands-On Data Management Planning for Life SciencesAndrew Sallans

DataONE User's Group Lifecycle Management: PlanningAndrew Sallans

Contenu connexe

Tendances

Real-World Data Challenges: Moving Towards Richer Data EcosystemsAnita de Waard

DataONE Education Module 01: Why Data Management?DataONE

Data management: international challenges, national infrastructure, and insti...Andrew Treloar

EDI Training Module 4: Organizing Data Into Publishable UnitsEnvironmental Data Initiative

ANDS Applications Program: Building Tools to Facilitate Data ReuseAndrew Treloar

Provenance in Support of the ANDS Four TransformationsAndrew Treloar

Natasha intro to rdm c3 dis may 2018.pptxARDC

John morrissey c3 dis fair working data.pptxARDC

Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareHistoric Environment Scotland

Birgit Schmidt: RDA for Libraries from an International Perspectivedri_ireland

Sue cook c3 dis dm-ps 1.pptxARDC

Comeaux RDAP11 Data Archives in Federal AgenciesASIS&T

End of COBWEB Co-Design Projects Celebration EDINA, University of Edinburgh

Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...SEAD

Global registries initiative frumkin omodeiASIS&T

Geospatial metadata and spatial data workshop: 19 June 2014EDINA, University of Edinburgh

Smith RDAP11 NSF Data Management Plan Case StudiesASIS&T

A National Approach to Open Data in Ireland: Publishers and Research Data Man...Rebecca Grant

Altman RDAP11 Policy-based Data ManagementASIS&T

Ignite@AGU14SEAD

Tendances (20)

Real-World Data Challenges: Moving Towards Richer Data Ecosystems

DataONE Education Module 01: Why Data Management?

Data management: international challenges, national infrastructure, and insti...

EDI Training Module 4: Organizing Data Into Publishable Units

ANDS Applications Program: Building Tools to Facilitate Data Reuse

Provenance in Support of the ANDS Four Transformations

Natasha intro to rdm c3 dis may 2018.pptx

John morrissey c3 dis fair working data.pptx

Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare

Birgit Schmidt: RDA for Libraries from an International Perspective

Sue cook c3 dis dm-ps 1.pptx

Comeaux RDAP11 Data Archives in Federal Agencies

End of COBWEB Co-Design Projects Celebration

Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...

Global registries initiative frumkin omodei

Geospatial metadata and spatial data workshop: 19 June 2014

Smith RDAP11 NSF Data Management Plan Case Studies

A National Approach to Open Data in Ireland: Publishers and Research Data Man...

Altman RDAP11 Policy-based Data Management

Ignite@AGU14

En vedette

Hands-On Data Management Planning for Life SciencesAndrew Sallans

DataONE User's Group Lifecycle Management: PlanningAndrew Sallans

NSF Data Management Plan Case Study: UVa’s Response.Andrew Sallans

NSF Data Management Plan - Implications for LibrariansAndrew Sallans

Marketing With LinkedInVikram Rajan

Improving Integrity, Transparency, and Reproducibility Through Connection of ...Andrew Sallans

Badges to Acknowledge Open PracticesAndrew Sallans

En vedette (7)

Hands-On Data Management Planning for Life Sciences

DataONE User's Group Lifecycle Management: Planning

NSF Data Management Plan Case Study: UVa’s Response.

NSF Data Management Plan - Implications for Librarians

Marketing With LinkedIn

Improving Integrity, Transparency, and Reproducibility Through Connection of ...

Badges to Acknowledge Open Practices

Similaire à DataOne - Suzie Allard - RDAP12

Michener Plenary PPSR2012CitizenScience.org

NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...National Information Standards Organization (NISO)

Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...TERN Australia

ESI Supplemental Webinar 2 - DataONE presentation slides DuraSpace

RDAP13 Ixchel Faniel: Can Quantitative Social Scientists Get Data Reuse Satis...ASIS&T

DataONE_cobb_hubbub2012_20120924_v05John Cobb

Stuart Phinn and Andy Lowe_TERN's national ecosystem data infrastructure is d...TERN Australia

Knowledge Exchange, Nov 2011, BonnTodd Vision

Ausplots Training - Session 1bensparrowau

Research Data Sharing LERU LIBER Europe

An Oz Mammals Bioinformatics and Data ResourcePhilippa Griffin

Introduction to Research Data Management for postgraduate studentsMarieke Guy

Walker odi -uksg_2013-jenny_walkerUKSG: connecting the knowledge community

Engaging the Researcher in RDMEDINA, University of Edinburgh

Sharing & Sustaining Ecosystem DataTERN Australia

e-Science, Research Data and LibariesRob Grim

Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...GigaScience, BGI Hong Kong

Data Facilties Workshop - Panel on Global Data Sharing ExemplarsEarthCube

Research data lifecycle diagramSteven Cracknell

Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...ProgCity

Similaire à DataOne - Suzie Allard - RDAP12 (20)

Michener Plenary PPSR2012

NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...

Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...

ESI Supplemental Webinar 2 - DataONE presentation slides

RDAP13 Ixchel Faniel: Can Quantitative Social Scientists Get Data Reuse Satis...

DataONE_cobb_hubbub2012_20120924_v05

Stuart Phinn and Andy Lowe_TERN's national ecosystem data infrastructure is d...

Knowledge Exchange, Nov 2011, Bonn

Ausplots Training - Session 1

Research Data Sharing LERU

An Oz Mammals Bioinformatics and Data Resource

Introduction to Research Data Management for postgraduate students

Walker odi -uksg_2013-jenny_walker

Engaging the Researcher in RDM

Sharing & Sustaining Ecosystem Data

e-Science, Research Data and Libaries

Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...

Data Facilties Workshop - Panel on Global Data Sharing Exemplars

Research data lifecycle diagram

Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...

Plus de ASIS&T

RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)ASIS&T

RDAP 16: Sustainability of data infrastructure: The history of science scienc...ASIS&T

RDAP 16: DMPs and Public Access: Agency and Data Service ExperiencesASIS&T

RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...ASIS&T

RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...ASIS&T

RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...ASIS&T

RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)ASIS&T

RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...ASIS&T

RDAP 16 Poster: Interpreting Local Data Policies in PracticeASIS&T

RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...ASIS&T

RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...ASIS&T

RDAP 16 Lightning: Spreading the love: Bringing data management training to s...ASIS&T

RDAP 16 Lightning: RDM Discussion Group: How'd that go?ASIS&T

RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...ASIS&T

RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge BrokerASIS&T

RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...ASIS&T

RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...ASIS&T

RDAP 16 Lightning: Personas as a Policy Development Tool for Research DataASIS&T

RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide CollaborationASIS&T

RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...ASIS&T

Plus de ASIS&T (20)

RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)

RDAP 16: Sustainability of data infrastructure: The history of science scienc...

RDAP 16: DMPs and Public Access: Agency and Data Service Experiences

RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...

RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...

RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...

RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)

RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...

RDAP 16 Poster: Interpreting Local Data Policies in Practice

RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...

RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...

RDAP 16 Lightning: Spreading the love: Bringing data management training to s...

RDAP 16 Lightning: RDM Discussion Group: How'd that go?

RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...

RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge Broker

RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...

RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...

RDAP 16 Lightning: Personas as a Policy Development Tool for Research Data

RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide Collaboration

RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...

Dernier

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

Scaling API-first – The story of a global engineering organizationRadu Cotescu

My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

How to convert PDF to text with Nanonetsnaman860154

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

Salesforce Community Group Quito, Salesforce 101Paola De la Torre

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh

A Call to Action for Generative AI in 2024Results

Google AI Hackathon: LLM based Evaluator for RAGSujit Pal

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

GenCyber Cyber Security Day PresentationMichael W. Hawkins

🐬 The future of MySQL is Postgres 🐘RTylerCroy

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700

Dernier (20)

How to Troubleshoot Apps for the Modern Connected Worker

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Scaling API-first – The story of a global engineering organization

My Hashitalk Indonesia April 2024 Presentation

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Unblocking The Main Thread Solving ANRs and Frozen Frames

[2024]Digital Global Overview Report 2024 Meltwater.pdf

How to convert PDF to text with Nanonets

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365

Injustice - Developers Among Us (SciFiDevCon 2024)

Salesforce Community Group Quito, Salesforce 101

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi

A Call to Action for Generative AI in 2024

Google AI Hackathon: LLM based Evaluator for RAG

Handwritten Text Recognition for manuscripts and early printed texts

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

GenCyber Cyber Security Day Presentation

🐬 The future of MySQL is Postgres 🐘

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...

DataOne - Suzie Allard - RDAP12

1. DataONE Research Data Access & Preservation 21 March 2012 Suzie Allard, Ph.D. University of Tennessee

2. DataONE vision and approach Enable new science and knowledge creation through universal access to data about life on earth and the environment that sustains it. 1. Build on existing cyberinfrastructure 2. Create new cyberinfrastructure 3. Support communities of practice 2 2

3. DataONE Cyberinfrastructure Three major components for a Member Nodes flexible, scalable, sustainable • diverse institutions Coordinating Nodes network • serve local community • retain complete metadata Investigator Toolkit • provide resources for catalog managing their data • indexing for search • retain copies of data • network-wide services • ensure content availability (preservation) • replication services 3

4. Training in all elements of the data life cycle Plan Analyze Collect Kepler Integrate Assure Discover Describe Preserve 4

5. DataONE Education and Training Summer Internships Training at Conferences and Workshops • Supercomputing 2011 • DataONE Implementation Workshop: Publishing data as a Member Node • Ecological Society of America (ESA) • American Geophysical Union (AGU) Educational Modules Graduate-level course • Summer Institute for Environmental Informatics 5

6. On-line Education Modules 6

7. Environmental Information Management (EIM) Institute Graduate students biology, geology, ecology, or other environmental sciences, environmental engineering, geography or science librarianship Conceptual and practical hands-on training to effectively design, manage, analyze, visualize, and preserve data and information: • Managing data files • Creating databases and web portals • Data analysis and visualization • Techniques for managing, analyzing, and visualizing geospatial data 7

8. DataONE Team and Sponsors • Amber Budden, Roger Dahl, Rebecca • Ewa Deelman Koskela, Bill Michener, Robert Nahf, Mark • Servilla Dave Vieglais • Peter Honeyman • Suzie Allard, Carol Tenopir, Maribeth • Jeff Horsburgh Manoff, Kimberley Douglass, Robert • Waltz, Bruce Wilson Giri John Cobb, Bob Cook, • Robert Sandusky Palanismy, Line Pouchard • Patricia Cruse, John Kunze • Bertram Ludaescher • Sky Bristol, Mike Frame, Richard Huffine, Viv • Peter Buneman Hutchison, Jeff Morisette, Jake Weltzin, Lisa Zolly • Chris Jones, Stephanie Hampton, Matt • Cliff Duke Jones • Paul Allen, Rick Bonney, Steve Kelling • Carole Goble • Ryan Scherle, Todd Vision • Donald Hobern • Randy Butler • David DeRoure LEON LEVY FOUNDATION 8

9. DataONE Team Year 1 Year 2 Year 3 9

10. Questions 10

11. A Science Use Case Diverse bird observations and Model results environmental data from 300,00 locations in the US Occurrence of Indigo Bunting (2008) integrated and analyzed using High Performance Computing Resources Land Cover Jan Ap Jun Sep Dec r Meteorology • Examine patterns of migration MODIS – Spatio-Temporal Exploratory • Infer how climate Remote Model identifies factors change may affect sensing data affecting patterns of bird migration migration 11

Notes de l'éditeur

The DataONE mission/vision is to “enable new science and knowledge creation through universal access to data about life on earth and the environment that sustains it.” DataONE is based on three precepts. 1. We are leveraging existing infrastructure such as the hundreds of existing data centers and repositories, and the myriad of software tools. 2. We are focusing our efforts on developing new infrastructure that better enables interoperability across data centers and between scientific tools and data resources. [The new cyberinfrastructure being created by DataONE is illustrated on a future slide.] 3. We recognize that the largest challenges are sociocultural in nature, and thus we focus significant attention on engaging and supporting the broader community of stakeholders (e.g. scientists, students, librarians).
DataONE is a federated data network built to improve access to Earth science data, and to support science by: engaging the relevant science, data, and policy communities; facilitating easy, secure, and persistent storage of data; and disseminating integrated and user-friendly tools for data discovery, analysis, visualization, and decision-making. There are three principal components:Member Nodes that include a diverse array of data centers and repositories that are associated with national and international agencies and research networks, universities, libraries, etc.Coordinating Nodes that support data replication across Member Nodes (i.e., data centers) as well as network wide services like 24/7 access to metadata at the CNs, indexing and rapid search and discovery, etc.An Investigator Toolkit that includes tools that are widely used by scientists, The tools are coupled with the DataONE resources so that it is, for example, possible to seamlessly and transparently access data at Member Nodes through the tool of your choice.
Other development activities during years 3-5 will focus on expanding the suite of tools that are available through the Investigator Toolkit. New tool additions will be identified and prioritized by the DataONE Users Group.
Other development activities during years 2-5 will focus on expanding the suite of tools that are available through the Investigator Toolkit. New tool additions will be identified and prioritized by the DataONE Users Group.
This final slide illustrates the initial DataONE partners that have now been involved for over 3 years, since the proposal was conceived. The DataONE Users Group now includes significantly more partners and we expect to grow exponentially over the next five years.
The DataONE team is growing!
The Scientific Exploration, Visualization and Analysis Working Group is an example of a scientific use case. By running through a comprehensive case study, this working group was able to provide specific guidance on the challenges faced when conducting data intensive science. Challenges that were communicated to, and met by, the DataONE core CI team and developers.Science requires: Multiple cooperating extreme scale CI components (EVA/eBird pilot lesson learned)EVA pilot collaborated with TeraGrid (now XSEDE) to use HPC and “schlep” data as part of the workflow50K cpu-core hours (SU’s) last year(supporting SOTB 2011)3M hours allocated this year (Cornell CLO team has optimized code for 3-10X speedup, loosened data transfer bottleneck, so we will under run)Plan for 500 species (3 yr data) runs. Currently: 70/wk for 2011 campaignHPC use 10X 2 years in a row. Data increases as well.Conclusion: success breeds scale