From Data to Data: One Version
   of a History of Scholarly
        Communication


    PRDLA 2008 Closing Keynote


Dr Andrew Treloar – andrew.treloar.net
  Australian National Data Service –
             ands.org.au
Data led to early writing




http://www.utexas.edu/features/archive/2003/vase.html
But early preservation technologies
     were a bit problematic…




      http://www.earth-history.com/_images/ms2340.jpg
Time passes…
Doomed data
http://www.learningcurve.gov.uk/focuson/domesday/take-a-closer-look/




In the vill in which St. Peter’s Church is situated [Westminster] the abbot of the same
place holds 13½ hides. There is land for 11 ploughs. To the demesne belongs 9 hides
and 1 virgate, and there are 4 ploughs. The villeins have 6 ploughs, and there could be 1
plough more. There are 9 villeins each on 1 virgate and 1 villein on 1 hide, and 9 villeins
on each half a virgate and 1 cottar on 5 acres, and 41 cottars who pay 40 shillings a
year for their gardens. [There is] Meadow for 11 ploughs, pasture for the livestock of the
vill, woodland for 100 pigs, and 25 houses of the abbot’s knights and other men who pay
8 shillings a year. In all it is worth £10; when received, the same; TRE £12. This manor
More time passes…
Scholarly communication for the
         last 350 years
(a data-centric view, that is)
“A Correct Tide-
Table, Shewing the True
Times of the High-Waters
at London-Bridge, to Every
Day in the Year 1683. By
Mr. Flamstead”
Philosophical
Transactions, Vol.
13, (1683), pp. 10-15
Eclipse tables

                 “An Observation of
                 the Beginning of the
                 Lunar Eclipse which
                 Hapned Aug. 19.
                 1681. in the Morning,
                 Made on the Island of
                 St. Lawrence or
                 Madagascar, by Mr.
                 Tho. Heathcot, and
                 Communicated by Mr.
                 Flamstead”
                 Philosophical
                 Transactions, Vol. 13,
                 (1683), p. 15
Data problems in published
         literature
Inconvenient data




   DOI: 10.1098/rsta.2005.1569
Imprisoned data




  DOI 10.1098/rsta.2006.1793
Invisible data




DOI 10.1098/rsta.2006.1793
Inaccessible data
Missing negative data

• Need title capture for negative results
“Selective Publication of Antidepressant Trials and Its
Influence on Apparent Efficacy”

Turner, Erick, Matthews, Annette, Linardatos, Eftihia,
Tell, Robert, Rosenthal, Robert.

New England Journal of Medicine. 358(3):252-260,
January 17, 2008.

From the Abstract:
“Evidence-based medicine is valuable to the extent that
the evidence base is complete and unbiased. Selective
publication of clinical trials - and the outcomes within
those trials - can lead to unrealistic estimates of drug
effectiveness and alter the apparent risk-benefit ratio”
Why is data now so important?
• We are in an era of increasing data-intensive
  research
• Almost all data is now born digital
• Increasing amount of data generated
  (semi-)automatically
• “Consequently, increasing effort and
  therefore funding will necessarily be diverted
  to data and data management over time”
  – Towards the Australian Data Commons, p. 4
    (http://www.pfc.org.au/bin/view/Main/Data)
Need for standardisation
• Software and silicon-based hardware keep getting
  cheaper, carbon-based wetware keeps getting
  more expensive
• Fixing data management problems is enormously
  labour intensive and costly
• “Consequently, standardisation within forms of
  data and simplification in the frameworks around
  retention, storage, access and use of data, and
  the elimination of differences whose resolution
  requires labour, must be made, if the on-going
  keeping and reuse of data is to remain affordable”
  – Towards the Australian Data Commons, p. 5

Role of data federations
• With more data online, more can be done
• Possible now to answer questions unrelated
  to reasons why data was collected originally
• Increasing focus on cross-disciplinary
  science
• “Consequently greater clarity is needed over
  control and access to community-funded
  data, and the means of
  aggregating, federating and accessing such
  data are increasingly important”
  – Towards the Australian Data Commons, p. 5
Changing Data, Changing
            Research
• New scientific instruments
  – Large Hadron Collider at CERN: 1.5 GB/sec
  – Square Kilometre Array telescope: 1 EB/day!
     • Exabyte = a thousand million gigabytes (10^18 bytes)
• New scientific models
  – The mapping of the human genome: about three billion DNA
    letters in a human sequence
  – Global climate models: ever finer time/space
    resolution
• New knowledge from unlocked data
  – Hubble data has to be shared six months after
    collection
  – Majority of published research from Hubble telescope
    data was not “first use”
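To put the instrument figures above on a common scale, here is a rough back-of-the-envelope sketch. It assumes decimal (SI) units and treats the slide's 1.5 GB/sec and 1 EB/day as sustained rates, which is a simplification; the helper names are illustrative only.

```python
# Decimal (SI) units, matching the slide's definition of an exabyte.
GB = 10**9    # bytes in a gigabyte
TB = 10**12   # bytes in a terabyte
EB = 10**18   # bytes in an exabyte
SECONDS_PER_DAY = 24 * 60 * 60

def per_day(bytes_per_second):
    """Convert a sustained per-second rate into a per-day volume."""
    return bytes_per_second * SECONDS_PER_DAY

def per_second(bytes_per_day):
    """Convert a per-day volume into a sustained per-second rate."""
    return bytes_per_day / SECONDS_PER_DAY

# Figures quoted on the slide (assumed here to be sustained rates).
lhc_bytes_per_day = per_day(1.5 * GB)
ska_bytes_per_second = per_second(1 * EB)

print("Gigabytes in an exabyte:", EB // GB)
print("LHC volume in TB/day:", lhc_bytes_per_day / TB)
print("SKA rate in TB/sec:", round(ska_bytes_per_second / TB, 2))
print("SKA/LHC ratio:", round(EB / lhc_bytes_per_day))
```

On these assumptions the Square Kilometre Array figure is several thousand times the Large Hadron Collider figure, which is the point of the exclamation mark on the slide.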
Data desiderata

• Easy deposit for researchers
• Greater (preferably open) access for all
• Easier (or any!) citability
• Easier discoverability, particularly outside
  generating discipline
• More context for those outside the
  generating discipline
A partial solution:
data in institutional repositories
ARROW
ARROW
ARROW Discovery Service
ARROW Discovery Service
Another partial solution:
researcher workflow integration
Repository domains




Treloar, A. and Harboe-Ree, C. (2008). "Data management and the curation continuum:
how the Monash experience is informing repository relationships". Proceedings of VALA
2008, Melbourne, February.
ARCHER’s Data-centric Model
[Diagram labels: Service Provider (Shib Protected); Federation; IdP; Web Access; Desktop Access; Content Management System; Private/Shared Research Repository; Automated Instrument Data Deposition; Analysis Workflow Automation; PKI]
ARCHER portal screenshot




Another partial solution:
discipline self-organisation
TARDIS overview
TARDIS partners
A national solution:
       ANDS
Australian National Data
              Service
• Funded by Australian Government at
  A$21M from mid-2008 through mid-2011
• Goal: to deliver greater access to
  Australia’s research data assets in forms
  that support easier and more effective
  data use and reuse
• Approach: building the Australian
  Research Data Commons
ARDC diagram
ANDS Delivery Structure
• ANDS has been structured as four inter-
  related and co-ordinated service delivery
  programs:
  – Developing Frameworks (policy, planning)
  – Providing Utilities (discovery, persistent ID)
  – Seeding the Commons (more data, better
    managed)
  – Building Capabilities (researcher and support)
• Plus candidate service development
  activities funded through a discipline-driven
Conclusion

• Data is becoming steadily more important
  for research
• Research results need to be
  communicated
• Data is the next great challenge for
  scholarly communication
• And so, it should be the next great
  challenge for libraries
• Over to you!
Questions?

• andrew.treloar@its.monash.edu.au
• http://andrew.treloar.net/




• http://arrow.edu.au/
• http://archer.edu.au/
• http://ands.org.au/

Contenu connexe

Tendances

Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...EDINA, University of Edinburgh
 
Riding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessRiding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessdatacite
 
DataCite - services and support for opening up research data
DataCite - services and support for opening up research dataDataCite - services and support for opening up research data
DataCite - services and support for opening up research dataHerbert Gruttemeier
 
Ensuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEnsuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEDINA, University of Edinburgh
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...DuraSpace
 
APLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataAPLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataHamilton Public Library
 
DataCite at APE 2011
DataCite at APE 2011DataCite at APE 2011
DataCite at APE 2011datacite
 
鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107皓仁 柯
 
Demo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataDemo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataStefan Dietze
 
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)EUDAT
 
Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...Robin Rice
 
Research Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghResearch Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghEDINA, University of Edinburgh
 
CLOCKSS archive - Preserving our digital heritage - The community taking control
CLOCKSS archive - Preserving our digital heritage - The community taking controlCLOCKSS archive - Preserving our digital heritage - The community taking control
CLOCKSS archive - Preserving our digital heritage - The community taking controlEDINA, University of Edinburgh
 
Data are the new black : Susan Robbins
Data are the new black : Susan RobbinsData are the new black : Susan Robbins
Data are the new black : Susan Robbinstherese nolan-brown
 
Linked Data and Semantic Web - EUDAT Summer School (Yann Le Franc, e-Science ...
Linked Data and Semantic Web - EUDAT Summer School (Yann Le Franc, e-Science ...Linked Data and Semantic Web - EUDAT Summer School (Yann Le Franc, e-Science ...
Linked Data and Semantic Web - EUDAT Summer School (Yann Le Franc, e-Science ...EUDAT
 

Tendances (20)

Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...
 
Riding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessRiding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information access
 
Smith - Developing Campus Stakeholders' Collaborations - Sept 8
Smith - Developing Campus Stakeholders' Collaborations - Sept 8Smith - Developing Campus Stakeholders' Collaborations - Sept 8
Smith - Developing Campus Stakeholders' Collaborations - Sept 8
 
British Library Datasets Programme Feb 2011
British Library Datasets Programme Feb 2011British Library Datasets Programme Feb 2011
British Library Datasets Programme Feb 2011
 
DataCite - services and support for opening up research data
DataCite - services and support for opening up research dataDataCite - services and support for opening up research data
DataCite - services and support for opening up research data
 
Ensuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEnsuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly Resources
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
 
APLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataAPLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with Data
 
Preserving the Integrity of the Scholarly Record
Preserving the Integrity of the Scholarly RecordPreserving the Integrity of the Scholarly Record
Preserving the Integrity of the Scholarly Record
 
DataCite at APE 2011
DataCite at APE 2011DataCite at APE 2011
DataCite at APE 2011
 
鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107
 
Demo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataDemo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open Data
 
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)
 
Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...
 
Research Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghResearch Data Management at the University of Edinburgh
Research Data Management at the University of Edinburgh
 
Implementation of semantic network dictionary system
Implementation of semantic network dictionary system Implementation of semantic network dictionary system
Implementation of semantic network dictionary system
 
CLOCKSS archive - Preserving our digital heritage - The community taking control
CLOCKSS archive - Preserving our digital heritage - The community taking controlCLOCKSS archive - Preserving our digital heritage - The community taking control
CLOCKSS archive - Preserving our digital heritage - The community taking control
 
Derk Haank: Open Access publishing at Springer
Derk Haank: Open Access publishing at SpringerDerk Haank: Open Access publishing at Springer
Derk Haank: Open Access publishing at Springer
 
Data are the new black : Susan Robbins
Data are the new black : Susan RobbinsData are the new black : Susan Robbins
Data are the new black : Susan Robbins
 
Linked Data and Semantic Web - EUDAT Summer School (Yann Le Franc, e-Science ...
Linked Data and Semantic Web - EUDAT Summer School (Yann Le Franc, e-Science ...Linked Data and Semantic Web - EUDAT Summer School (Yann Le Franc, e-Science ...
Linked Data and Semantic Web - EUDAT Summer School (Yann Le Franc, e-Science ...
 

Similaire à From Data to Data: History of Scholarly Communication

Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince Smith
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
ELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesRafael C. Jimenez
 
British Library Datasets Programme 2010
British Library Datasets Programme 2010British Library Datasets Programme 2010
British Library Datasets Programme 2010ALISS
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless OpportunityRachel Frick
 
The Role of Librarians in transforming the world through Open Data and Open S...
The Role of Librarians in transforming the world through Open Data and Open S...The Role of Librarians in transforming the world through Open Data and Open S...
The Role of Librarians in transforming the world through Open Data and Open S...African Open Science Platform
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional RepositoriesRobin Rice
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourKNOWeSCAPE2014
 
Integrating repositories and eLab notebooks through an open science framework
Integrating repositories and eLab notebooks through an open science frameworkIntegrating repositories and eLab notebooks through an open science framework
Integrating repositories and eLab notebooks through an open science frameworkrmacneil88
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeEdward Baker
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeVince Smith
 
Understanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceUnderstanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceAndrew Sallans
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...hsuleslie
 
Ausplots Training - Session 1
Ausplots Training - Session 1Ausplots Training - Session 1
Ausplots Training - Session 1bensparrowau
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureRoss Mounce
 

Similaire à From Data to Data: History of Scholarly Communication (20)

Data Publishing in Archaeozoology
Data Publishing in ArchaeozoologyData Publishing in Archaeozoology
Data Publishing in Archaeozoology
 
Coleman: Latest trends in Data Analysis for the Scholarly and Academic Publis...
Coleman: Latest trends in Data Analysis for the Scholarly and Academic Publis...Coleman: Latest trends in Data Analysis for the Scholarly and Academic Publis...
Coleman: Latest trends in Data Analysis for the Scholarly and Academic Publis...
 
Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notext
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
Researh data management
Researh data managementResearh data management
Researh data management
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
ELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciences
 
British Library Datasets Programme 2010
British Library Datasets Programme 2010British Library Datasets Programme 2010
British Library Datasets Programme 2010
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
 
The Role of Librarians in transforming the world through Open Data and Open S...
The Role of Librarians in transforming the world through Open Data and Open S...The Role of Librarians in transforming the world through Open Data and Open S...
The Role of Librarians in transforming the world through Open Data and Open S...
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional Repositories
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
 
Integrating repositories and eLab notebooks through an open science framework
Integrating repositories and eLab notebooks through an open science frameworkIntegrating repositories and eLab notebooks through an open science framework
Integrating repositories and eLab notebooks through an open science framework
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-Life
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-Life
 
Understanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceUnderstanding the Big Picture of e-Science
Understanding the Big Picture of e-Science
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
 
Ausplots Training - Session 1
Ausplots Training - Session 1Ausplots Training - Session 1
Ausplots Training - Session 1
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | Future
 

Plus de Andrew Treloar

Building a National Research Data Commons – Transforming Scholarship Through ...
Building a National Research Data Commons – Transforming Scholarship Through ...Building a National Research Data Commons – Transforming Scholarship Through ...
Building a National Research Data Commons – Transforming Scholarship Through ...Andrew Treloar
 
Provenance in Support of the ANDS Four Transformations
Provenance in Support of the ANDS Four TransformationsProvenance in Support of the ANDS Four Transformations
Provenance in Support of the ANDS Four TransformationsAndrew Treloar
 
ANDS Applications Program: Building Tools to Facilitate Data Reuse
ANDS Applications Program: Building Tools to Facilitate Data ReuseANDS Applications Program: Building Tools to Facilitate Data Reuse
ANDS Applications Program: Building Tools to Facilitate Data ReuseAndrew Treloar
 
Instutional repositories and data
Instutional repositories and dataInstutional repositories and data
Instutional repositories and dataAndrew Treloar
 
Closing comments at #iPres 2014 conference
Closing comments at #iPres 2014 conferenceClosing comments at #iPres 2014 conference
Closing comments at #iPres 2014 conferenceAndrew Treloar
 
The universe of identifiers and how ANDS is using them
The universe of identifiers and how ANDS is using themThe universe of identifiers and how ANDS is using them
The universe of identifiers and how ANDS is using themAndrew Treloar
 
Adding value to researchers' data
Adding value to researchers' dataAdding value to researchers' data
Adding value to researchers' dataAndrew Treloar
 
The life-sciences as a pathfinder in data-intensive research practice
The life-sciences as a pathfinder in data-intensive research practiceThe life-sciences as a pathfinder in data-intensive research practice
The life-sciences as a pathfinder in data-intensive research practiceAndrew Treloar
 
Past, present, and future of scholarly technology and practices
Past, present, and future of scholarly technology and practicesPast, present, and future of scholarly technology and practices
Past, present, and future of scholarly technology and practicesAndrew Treloar
 
Scholarly archive-of-the-future
Scholarly archive-of-the-futureScholarly archive-of-the-future
Scholarly archive-of-the-futureAndrew Treloar
 
Data Infrastructure and the Scholarly Ecosystem of the Future
Data Infrastructure and the Scholarly Ecosystem of the FutureData Infrastructure and the Scholarly Ecosystem of the Future
Data Infrastructure and the Scholarly Ecosystem of the FutureAndrew Treloar
 
Research data and the ANDS agenda in Australia
Research data and the ANDS agenda in AustraliaResearch data and the ANDS agenda in Australia
Research data and the ANDS agenda in AustraliaAndrew Treloar
 
Building on the Atlas (of Living Australia)
Building on the Atlas (of Living Australia)Building on the Atlas (of Living Australia)
Building on the Atlas (of Living Australia)Andrew Treloar
 
Journal literature size in the context of the LHC data
Journal literature size in the context of the LHC dataJournal literature size in the context of the LHC data
Journal literature size in the context of the LHC dataAndrew Treloar
 
Data management: international challenges, national infrastructure, and insti...
Data management: international challenges, national infrastructure, and insti...Data management: international challenges, national infrastructure, and insti...
Data management: international challenges, national infrastructure, and insti...Andrew Treloar
 
The Past, Present and Future of data
The Past, Present and Future of dataThe Past, Present and Future of data
The Past, Present and Future of dataAndrew Treloar
 

Plus de Andrew Treloar (20)

Building a National Research Data Commons – Transforming Scholarship Through ...
Building a National Research Data Commons – Transforming Scholarship Through ...Building a National Research Data Commons – Transforming Scholarship Through ...
Building a National Research Data Commons – Transforming Scholarship Through ...
 
Provenance in Support of the ANDS Four Transformations
Provenance in Support of the ANDS Four TransformationsProvenance in Support of the ANDS Four Transformations
Provenance in Support of the ANDS Four Transformations
 
ANDS Applications Program: Building Tools to Facilitate Data Reuse
ANDS Applications Program: Building Tools to Facilitate Data ReuseANDS Applications Program: Building Tools to Facilitate Data Reuse
ANDS Applications Program: Building Tools to Facilitate Data Reuse
 
Instutional repositories and data
Instutional repositories and dataInstutional repositories and data
Instutional repositories and data
 
Closing comments at #iPres 2014 conference
Closing comments at #iPres 2014 conferenceClosing comments at #iPres 2014 conference
Closing comments at #iPres 2014 conference
 
I pres 2014 slides
I pres 2014 slidesI pres 2014 slides
I pres 2014 slides
 
The universe of identifiers and how ANDS is using them
From Data to Data: History of Scholarly Communication

  • 1. From Data to Data: One Version of a History of Scholarly Communication PRDLA 2008 Closing Keynote Dr Andrew Treloar – andrew.treloar.net Australian National Data Service – ands.org.au
  • 2.
  • 3. Data led to early writing http://www.utexas.edu/features/archive/2003/vase.html
  • 4. But early preservation technologies were a bit problematic… http://www.earth-history.com/_images/ms2340.jpg
  • 6. Doomed data http://www.learningcurve.gov.uk/focuson/domesday/take-a-closer-look/ In the vill in which St. Peter’s Church is situated [Westminster] the abbot of the same place holds 13½ hides. There is land for 11 ploughs. To the demesne belongs 9 hides and 1 virgate, and there are 4 ploughs. The villeins have 6 ploughs, and there could be 1 plough more. There are 9 villeins each on 1 virgate and 1 villein on 1 hide, and 9 villeins on each half a virgate and 1 cottar on 5 acres, and 41 cottars who pay 40 shillings a year for their gardens. [There is] Meadow for 11 ploughs, pasture for the livestock of the vill, woodland for 100 pigs, and 25 houses of the abbot’s knights and other men who pay 8 shillings a year. In all it is worth £10; when received, the same; TRE £12. This manor
  • 8. Scholarly communication for the last 350 years
  • 10. “A Correct Tide-Table, Shewing the True Times of the High-Waters at London-Bridge, to Every Day in the Year 1683. By Mr. Flamstead” Philosophical Transactions, Vol. 13, (1683), pp. 10-15
  • 11. Eclipse tables “An Observation of the Beginning of the Lunar Eclipse which Hapned Aug. 19. 1681. in the Morning, Made on the Island of St. Lawrence or Madagascar, by Mr. Tho. Heathcot, and Communicated by Mr. Flamstead” Philosophical Transactions, Vol. 13, (1683), p. 15
  • 12. Data problems in published literature
  • 13. Inconvenient data DOI: 10.1098/rsta.2005.1569
  • 14. Imprisoned data DOI 10.1098/rsta.2006.1793
  • 17. Missing negative data • Need title capture for negative results
  • 18. “Selective Publication of Antidepressant Trials and Its Influence on Apparent Efficacy” Turner, Erick, Matthews, Annette, Linardatos, Eftihia, Tell, Robert, Rosenthal, Robert. New England Journal of Medicine. 358(3):252-260, January 17, 2008. From the Abstract: “Evidence-based medicine is valuable to the extent that the evidence base is complete and unbiased. Selective publication of clinical trials - and the outcomes within those trials - can lead to unrealistic estimates of drug effectiveness and alter the apparent risk-benefit ratio”
  • 19. Why is data now so important? • We are in an era of increasing data-intensive research • Almost all data is now born digital • Increasing amount of data generated (semi-)automatically • “Consequently, increasing effort and therefore funding will necessarily be diverted to data and data management over time” – Towards the Australian Data Commons, p. 4 (http://www.pfc.org.au/bin/view/Main/Data)
  • 20. Need for standardisation • Software and silicon-based hardware keep getting cheaper, carbon-based wetware keeps getting more expensive • Fixing data management problems is enormously labour intensive and costly • “Consequently, standardisation within forms of data and simplification in the frameworks around retention, storage, access and use of data, and the elimination of differences whose resolution requires labour, must be made, if the on-going keeping and reuse of data is to remain affordable” – Towards the Australian Data Commons, p. 5
  • 21. Role of data federations • With more data online, more can be done • Possible now to answer questions unrelated to reasons why data was collected originally • Increasing focus on cross-disciplinary science • “Consequently greater clarity is needed over control and access to community-funded data, and the means of aggregating, federating and accessing such data are increasingly important” – Towards the Australian Data Commons, p. 5
  • 22. Changing Data, Changing Research • New scientific instruments – Large Hadron Collider at CERN: 1.5 GB/sec – Square Kilometre Array telescope: 1 EB/day! • Exabyte = a thousand million gigabytes (10^18 bytes) • New scientific models – The mapping of the Human Genome: A billion DNA letters in a human sequence – Global climate models: ever finer time/space resolution • New knowledge from unlocked data – Hubble data has to be shared six months after collection – Majority of published research from Hubble telescope data was not “first use”
  • 23. Data desiderata • Easy deposit for researchers • Greater (preferably open) access for all • Easier (or any!) citability • Easier discoverability, particularly outside generating discipline • More context for those outside the generating discipline
  • 24. A partial solution: data in institutional repositories
  • 25. ARROW
  • 26. ARROW
  • 29. Another partial solution: researcher workflow integration
  • 30. Repository domains Treloar, A. and Harboe-Ree, C. (2008). "Data management and the curation continuum: how the Monash experience is informing repository relationships". Proceedings of VALA 2008, Melbourne, February.
  • 31. ARCHER’s Data-centric Model [diagram labels: Shib Protected Federation; IdP; Service Provider; Web Access; Desktop Access; Automated Instrument Data Deposition; Content Management System; Private/Shared Research Repository; Analysis Workflow Automation; PKI]
  • 33.
  • 38. Australian National Data Service • Funded by Australian Government at A$21M from mid-2008 through mid-2011 • Goal: to deliver greater access to Australia’s research data assets in forms that support easier and more effective data use and reuse • Approach: building the Australian Research Data Commons
  • 40. ANDS Delivery Structure • ANDS has been structured as four inter-related and co-ordinated service delivery programs: – Developing Frameworks (policy, planning) – Providing Utilities (discovery, persistent ID) – Seeding the Commons (more data, better managed) – Building Capabilities (researcher and support) • Plus candidate service development activities funded through a discipline-driven
  • 41.
  • 42. Conclusion • Data is becoming steadily more important for research • Research results need to be communicated • Data is the next great challenge for scholarly communication • And so, it should be the next great challenge for libraries • Over to you!
  • 43. Questions? • andrew.treloar@its.monash.edu.au • http://andrew.treloar.net/ • http://arrow.edu.au/ • http://archer.edu.au/ • http://ands.org.au/
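
The two instrument data rates quoted on slide 22 use different units, so they are hard to compare at a glance. The following is a minimal sketch that puts them on a common per-day scale. It assumes decimal units (1 GB = 10^9 bytes, consistent with the slide's definition of an exabyte as 10^18 bytes) and assumes both rates are sustained around the clock, which the slide does not state. All names are illustrative.

```python
# Decimal (SI) units, matching the slide's "Exabyte = 10^18 bytes".
GB = 10**9
EB = 10**18
SECONDS_PER_DAY = 24 * 60 * 60


def bytes_per_day(bytes_per_second: int) -> int:
    """Scale a sustained per-second rate up to a full day."""
    return bytes_per_second * SECONDS_PER_DAY


# Rates as quoted on the slide (assumed to be sustained rates).
lhc_per_day = bytes_per_day(3 * GB // 2)  # 1.5 GB/sec
ska_per_day = 1 * EB                      # 1 EB/day

# How many gigabytes make up one exabyte, and how the two rates compare.
gigabytes_per_exabyte = EB // GB
ratio = ska_per_day / lhc_per_day

print(f"LHC: {lhc_per_day // GB:,} GB/day")
print(f"SKA: {ska_per_day // GB:,} GB/day")
print(f"SKA/LHC ratio: about {ratio:,.0f}x")
```

Under these assumptions the LHC figure works out to 129,600 GB per day, so the quoted Square Kilometre Array rate is several thousand times larger.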

Editor's notes

  1. An extended reflection on the last 12 years of my career
  2. NOTE: This is going to be a Western perspective. My apologies, but I haven’t yet had time to research the background of the Chinese writing system and publishing in sufficient detail. Tokens in envelope -> tokens plus number signs -> table plus number signs
  3. Burned library
  4. Note: sign like 7 is a full stop. Numbers are in roman numerals
  5. 1665 + 350 = 2015
  6. 18 years after journal founded
  7. Actual scientific observations
  8. Illustrated from the journal whose cover I showed: Philosophical Transactions of the Royal Society A
  9. Need to retype
  10. Near impossible to liberate. Talk about ChemXSeer example if time
  11. Too transformed
  12. Scientists may know how to get these data, but I don’t
  13. Only journal like this I know. Anecdotal evidence that it is hard to get negative papers published
  14. Institutional repositories