Towards a global open scientific notebook infrastructure
Jeremy Frey, Andrew Milsted, Simon Coles, Colin Bird, Cerys Willoughby, Cameron Neylon & Matthew Todd
Science is increasingly interdisciplinary
Infrastructures - Architecture
• Collaboration
• Sharing
• Curation
• Reuse
Electronic Laboratory Notebooks (ELNs)
Comparison with traditional paper notebooks
• Higher Quality Record
• Natural linking to data and external resources
• Easier Collaboration
• Improved planning
• Improved discussions
• Efficiency gain in production of presentations/reports
• Change the nature of Professor/Student interactions
Communication, Collaboration, Sharing, Linking, Curating
Developments in ELN implementation and characteristics
[Timeline figure, 1980 to 2010. Systems shown: RS/1, PNNL, Smart Tea, LabTrove, Web 2.0, Commercial offerings. Characteristics shown: Trust in ELNs for IP compliance, Collaboration, User focus, Semantics.]
The LabTrove story
http://www.labtrove.org
How do we communicate?
• Surprisingly difficult to explain what a process involves
• Much of the detail is assumed to be understood and not explicitly discussed
• This is where the misunderstandings usually arise.
"If you can't describe what you are doing as a process, you don't know what you're doing." W. Edwards Deming
Growing need for the global (virtual) equivalent of the “Tea Room”
LabTrove: Easy Communication
AutoTrove from Matlab
Computational processes also blog
BlogMyData Project - Godiva
LabTrove Open Notebooks: Mat Todd’s PZQ Project
Open Notebooks
• Troves can be open Read/Comment/Write
  – Can control this access so it is your choice
• All contributions attributable (login needed)
  – Anonymous contributions not usually enabled
• Open contribution does worry the IT services
  – Provides potential pathway for abuse of systems
  – Not just our systems
Global open scientific notebook infrastructure
• Global collaboration:
  – International
  – Interdisciplinary
• Open science

• To ascend the knowledge pyramid, we need
  open collaboration and sharing of results
We must speed up the knowledge discovery process
"All I am saying is that now is the time to develop the technology to deflect an asteroid"

RDAP13 Cerys Willoughby: Towards a global open scientific notebook infrastructure


Editor's notes

  1. The talk discusses applications of work that originated in Southampton on the development of electronic laboratory notebooks to support collaborative investigations, illustrated by work undertaken at Southampton, the ISIS neutron facility (Neylon) and the University of Sydney (Todd). The work comes out of e-Science funding (the CombeChem Project) from RCUK (Research Councils UK) [e-Science maps to Cyber-Infrastructure in the USA], further developed with funding from the Universities Modernization Fund as collaborative R&D between chemistry, computer science and the library.
  2. The Open Access debate has been high profile, but it has primarily been an economic argument. From our perspective the question is: open access to what? We are interested in access to the data, hence the role of data management plans. The Royal Society report is key, as it stresses that access to the data is essential to the whole basis of science: it enables other researchers to build on published work, which is much harder, and can be impossible, if the data is not available (and easier if it is freely available). This holds only if the data is comprehensible, so intelligent access is highlighted as necessary (i.e. the importance of metadata).
  3. Infrastructure needs to support the collection and curation of data for high quality dissemination with context and provenance. The infrastructure parallels the DIKW (Data, Information, Knowledge, Wisdom) hierarchy.
  4. Having the ELN leads to changes in behaviour.
  5. Development of ELNs involves a trade-off in the effort devoted to semantics, usability and IP, building these up over time; the slide shows our Smart Tea and LabTrove projects.
  6. The LabTrove system is designed to be quite easy to use for open and closed projects, and to allow and encourage the use of metadata without requiring or enforcing it, an approach needed for adoption. It is open source software, with hosting and advice services.
  7. Skip this slide. LabTrove was further developed under the SRF project.
  8. Process is important, as important as the data. We need to describe it because we can’t all “visit”: hence the global tea room [chemists are big on tea rooms].
  9. Images are important, as are the ability to add sketch comments as well as text comments, and highly linked notes. For example, starting from a record (post) about a substrate, one can trace which processes used this substrate and what results were then produced, so if it transpires that there was an issue with the material, the consequences can be readily traced.
  10. Computational processes can “blog” as well. A Matlab script can be run from a publish script so that the data, code, figures and output are all added to a Trove, giving full provenance of a figure or result, so a clear record is kept of what material generated what outputs. This is very useful once students have left and figures need modifying for a paper.
  11. Comments on computational models: in this case GODIVA is a way to show ocean models over the web (University of Reading). With LabTrove added, people can comment on geo-coded regions of the model results and have the video in the post; metadata is taken from the models and put in the Trove.
  12. This just shows the use in the x-ray project… computationally intensive image reconstruction in a complex, multi-disciplinary project, with use of timelines. I have this to show that my work is grounded in physical science as well as computer science. You may want to stress your background in usability, which, as we know, is so important to actually making this all work.
  13. Examples from USyd of Open Notebook Science applied to malaria drugs. It enables global collaboration, links back to the notebook from the publications, has industrial participation, and links with other platforms (wikis etc.). Pictures of the research are really useful.
  14. Use social media to disseminate open research, with links to Twitter and perhaps Facebook etc. Make sure the metadata is good enough for search engines to find the work; some specialist metadata for research findings may be needed, and researcher and funder IDs are certainly useful!
  15. Attribution requires similar infrastructure to security, so switching between Open Notebook Science, Open on Publication and Closed (i.e. industrially funded private research) is not hard. In industry the work may not be public but often does need to be shared within the company, so similar issues to Open Science apply.
  16. Well, more rapidly and more efficiently. But openness is viewed by many as a problem when it comes to establishing reputation, career advancement or potential financial gain. Open does not mean free: perhaps free at the point of use, but someone has paid for the work and is paying to maintain the access. Could comment that the collective action of the long tail of laboratory science needs the global collaboration that semantics plus the web (not necessarily the formal semantic web) provides.
  17. Attitudes to undertaking research need to change, so that when data is collected the assumption is that it will be shared (at some point) and that collaboration is essential for rapid progress. Don’t wait until it is right before you share, at least with your collaborators; this is something students seem to resist, not understanding that sharing and discussing is the best way to find out what is right.
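
Note 10 describes adding the data, code, figures and output of a computation to a Trove so that the provenance of a figure is recorded. The idea can be sketched in Python as below. This is only an illustration and not how LabTrove or AutoTrove actually work: the function names, the record fields and the file names are all hypothetical, and the sketch builds the record locally instead of posting it to any notebook.

```python
import hashlib
import json
from datetime import datetime, timezone


def sha256_of(content: bytes) -> str:
    """Return the SHA-256 hex digest of some file content."""
    return hashlib.sha256(content).hexdigest()


def build_provenance_record(title, inputs, outputs):
    """Bundle named inputs and outputs into one notebook-style record.

    `inputs` and `outputs` map a (hypothetical) file name to its bytes.
    Hashing every item lets a later reader check which exact code and
    data are recorded alongside which exact figure.
    """
    return {
        "title": title,
        "created": datetime.now(timezone.utc).isoformat(),
        "inputs": {name: sha256_of(data) for name, data in inputs.items()},
        "outputs": {name: sha256_of(data) for name, data in outputs.items()},
    }


def produced_by(record, input_name, content):
    """Check whether `content` is the exact input recorded under `input_name`."""
    return record["inputs"].get(input_name) == sha256_of(content)


if __name__ == "__main__":
    # Placeholder contents standing in for a script, a data file and a figure.
    record = build_provenance_record(
        "Figure regeneration",
        inputs={"analysis.m": b"plot(x, y)", "run.csv": b"x,y\n1,2\n"},
        outputs={"figure.png": b"not-a-real-image"},
    )
    print(json.dumps(record, indent=2))
```

With a record like this, someone revisiting a figure after a student has left can call `produced_by(record, "analysis.m", current_script_bytes)` to see whether the script on disk is still the one that was recorded with the figure.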