Talk given by @atreloar and @hvdsomp at a workshop sponsored by http://dans.knaw.nl/ with the title "Riding the Wave and the Scholarly Archive of the Future". NOTE: This reflects thinking in progress, which may well change in the future.
Scholarly archive-of-the-future
1. Data Archiving and Networked Services
Riding the Wave and the Scholarly
Archive of the Future
Thinking in Progress by:
Andrew Treloar
DANS Visiting Fellow
ANDS Director of Technology
Herbert van de Sompel
DANS Visiting Fellow
LANL Scientist
#rtwsaf
DANS is an institute of KNAW and NWO
2. Structure presentation
• Where we are today
• Pointers to the future
• Characterising that future
– Fundamental concepts
– Observations about archiving
– Diagramming the infrastructure
January 20, 2014
CC-BY-SA, @atreloar and @hvdsomp
3. Let’s go on a journey
• Republic of Letters
• System of Journals
• Web of Objects
4. Functions of Research Communication
Roosendaal and Geurts (1997)
• Registration: Allows claims of precedence for a
scholarly finding
• Certification: Establishes validity of claim
• Awareness: Allows actors in the system to remain
aware of new claims
• Archiving: Preserves the scholarly record
5. System of Journals
• Registration
– submission of manuscript
• Certification
– peer-review (pre-publication)
– commentary (post-publication)
• Awareness
– discovery services
• Archiving
– libraries (print)
– publishers (electronic)
– special purpose organisations (e.g. Portico)
6. Pointers to the future
“the future is already here – it’s
just not very evenly distributed”
William Gibson, NPR interview
25. Awareness: Observations
• Awareness for various types of objects
• Real time awareness
• Awareness support targeted at machines
• Awareness through social media
30. Archiving: EU Trusted Digital Repositories
31. Archiving: Observations
• Archiving for various types of objects
• Distributed archives
• Archival consortia
• Audit for trustworthiness
32. Characterising the future
• Research process: Hidden to Visible
• Nature of object: Fixed to Varying
• Atomicity of object: Atomic to Compound
• Process of making public: Discrete to Continuous
• Speed of communication: Delayed to Instant
• Communicated object: Publication + data proxies to Publication + linked data + linked models
• Nature of process: Formal to Informal
33. Fundamental changes
• The research process (objects, social dimension)
is becoming more exposed
• Articles, books are no longer the only relevant
objects for research communication
• Objects are no longer static
• Machines are joining humans as (co-)creators
and consumers of research objects
34. Web of Objects
• Registration
– Recording of a wide variety of objects, versions of objects
• Certification
– Content/Form
– Human/Machine
• Awareness
– Real-time
– Social
– Variety of objects
• Archiving
– Archiving a wide variety of objects
– Trusted archives
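A minimal sketch of how registration of object versions and a later trust check might fit together, assuming a SHA-256 digest and a UTC timestamp are enough to identify a version. The `VersionRecord` structure, the function names, and the `dataset-42` identifier are hypothetical illustrations, not something proposed in the talk.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class VersionRecord:
    """One registered version of a research object (hypothetical structure)."""
    object_id: str      # illustrative identifier, not a real identifier scheme
    sha256: str         # digest of the content at registration time
    registered_at: str  # UTC timestamp in ISO 8601 form


def register_version(object_id: str, content: bytes) -> VersionRecord:
    """Registration: record what the object looked like, and when."""
    digest = hashlib.sha256(content).hexdigest()
    stamp = datetime.now(timezone.utc).isoformat()
    return VersionRecord(object_id, digest, stamp)


def verify_fixity(record: VersionRecord, content: bytes) -> bool:
    """Audit: check that held content still matches the registered digest."""
    return hashlib.sha256(content).hexdigest() == record.sha256


# Two versions of the same (made-up) object are registered separately.
v1 = register_version("dataset-42", b"temperature,12.5\n")
v2 = register_version("dataset-42", b"temperature,12.7\n")

print(verify_fixity(v1, b"temperature,12.5\n"))  # True: content unchanged
print(verify_fixity(v1, b"temperature,12.7\n"))  # False: content differs from v1
print(v1.sha256 != v2.sha256)                    # True: versions are distinguishable
```

The sketch only covers recording and checking. It says nothing about who holds the records or how long they are kept, which is the archiving question the slides raise.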
43. Web platforms for scholarship
• Common web platforms are increasingly used for
scholarship
– Wikis, GitHub, Twitter, Wordpress, etc.
• Many of these have desirable characteristics:
– Versioning
– Timestamping
– Social embedding
• Still, they record rather than archive
44. Recording not Archiving
“GitHub reserves the right at any time and from time to
time to modify or discontinue, temporarily or
permanently, the Service (or any part thereof) with or
without notice.”
“GitHub does not warrant that (i) the service will meet
your specific requirements, (ii) the service will be
uninterrupted, timely, secure, or error-free, (iii) the
results that may be obtained from the use of the service
will be accurate or reliable, (iv) the quality of any
products, services, information, or other material
purchased or obtained by you through the service will
meet your expectations, and (v) any errors in the
Service will be corrected.”
46. Infrastructure implications
• This infrastructure needs to include
– use of common platforms to support recording
– availability of specialist platforms to support archiving
• We need an archiving infrastructure that
underpins research activity that is
– trusted
– sustainable
– distributed
– interoperable
– standards-based
48. Implications
• Need organizational, technical,
curational interfaces between recording
and archiving platforms
• Need organizational, technical
interfaces across archiving platforms
Editor's notes
Content: Multiple sources checking the validity/classification of data