SlideShare une entreprise Scribd logo
1  sur  38
LOTS OF COPIES KEEP STUFF SAFE
Lots of LOCKSS Keeping
Stuff Safe: The Future of
the LOCKSS Program
Nicholas Taylor (@nullhandle)
Program Manager for LOCKSS and Web Archiving
Stanford University Libraries
CNI Fall 2016 Membership Meeting
12 December 2016
why more LOCKSS?
• mature, community-
validated technology
• research-based + built to
a specific threat model
• web-centric preservation
for web-centric
scholarship
• community-centric
preservation for
collective challenges +
opportunities
• robust, distributed digital
preservation “Cologne Love Padlocks” by orkomedix under CC BY-NC-SA 2.0
Program History
“Grant Park” by xelipe under CC BY-NC-SA 2.0
inception
• a serials librarian + a
computer scientist
• print journals → Web
• conserve library’s role as
preserver
• collect from publishers’
websites
• preserve w/ cheap,
distributed, library-
managed hardware
• disseminate when
unavailable from
publisher
Chris Dobson: “From Bright Idea to Beta Test”
philosophy + focus
• lots of copies keep stuff
safe
• preservation is an active
community effort
• lots of communities keep
stuff safe
• enable communities to
preserve + access their
scholarly record
“Le Penseur” by Ian Abbott under CC BY-NC-SA 2.0
present day
• financially self-
sustaining
• tens of networks
• hundreds of institutions
• all types of content
“LOCKSS | Lots of Copies Keep Stuff Safe”
looking forward
• organizational changes
• software evolution
• LOCKSS networks
• distributed digital
preservation
“Looking for a brighter Future?” by Vincent Brassinne under CC BY-NC-ND 2.0
“Olympic Relay Handoff” by Dr. Mark Kubert under CC BY-NC-ND 2.0
Organizational Changes
David + Vicky
American Library Association: “Victoria Reich and David S.H. Rosenthal”
personal introduction
• 10 years in research libraries:
• Stanford University Libraries (2013 – present)
• Library of Congress (2010 – 2013)
• U.S. Supreme Court (2007 – 2010)
• professional background:
• web archives
• digital library services
• library technology
• what I care about:
• scalability + sustainability of PLNs, CLOCKSS
• mainstreaming LOCKSS for digital
preservation
• building collaborative technical communities
SUL Web Archiving
• end-to-end service:
• collect
• preserve
• make accessible
• make discoverable
• integrate w/ collection
development
• use cases:
• scholarly inputs/outputs
• institutional
legacy/compliance
• government information
Internet Archive: “Stanford University Homepage”
LOCKSS + DLSS administrativa
• LOCKSS integrating w/
SUL Digital Library
Systems & Services
(DLSS)
• led by Tom Cramer,
Director & Associate
University Librarian
• LOCKSS + SUL Web
Archiving, under
Nicholas Taylor
“SPO.101514.SLIDERlathrop.jpg” by Michael Hong
LOCKSS + DLSS synergies
• realize operational
efficiencies
• adopt, drive shared
engineering best
practices
• promote API-oriented
architectures
• streamline repository →
PLN data hand-offs
• contribute upstream to
shared tools
• broaden, diversify
community outreach
Software Evolution
“Why we love our macs” by Jason Corneveaux under CC BY-NC-ND 2.0
new functionality
• supported by Mellon
Foundation grant
• ingest/harvest
• form-filling
• AJAX
• dissemination
• Memento
• Shibboleth
• preservation
• polling performance
“1.13.09: versatility” by Team Dalog under CC BY 2.0
new architecture
• existing functionality
• discrete components as
web services
• incorporate external
software
“San Francisco Oakland Bay Bridge, East Spans New and Old” by Shanan under CC BY-NC 2.0
web services imperative
1. “All teams will henceforth expose their data and
functionality through service interfaces.”
2. “Teams must communicate with each other through
these interfaces.”
3. “There will be no other form of interprocess
communication allowed: no direct linking, no direct reads
of another team's data store, no shared-memory model,
no back-doors whatsoever.”
4. “All service interfaces, without exception, must be
designed from the ground up to be externalizable. That
is to say, the team must plan and design to be able to
expose the interface to developers in the outside world.”
5. “Anyone who doesn't do this will be fired.”
Steve Yegge: “Stevey's Google Platforms Rant”
risk of large projects
small projects (< $1 million)
4%
20%
76%
large projects (> $10 million)
38%
52%
10%
Standish Group: “Chaos Manifesto 2013: Think Big, Act Small”
successful
(on time,
on budget)
challenged
(late, over budget,
lacking functionality)
failed (cancelled,
or delivered
and never used)
Based on an 8-year survey of 50,000 software projects by the Standish Group.
why re-architect LOCKSS?
• reduce support + operations costs
• leverage web-scale open-source software
• align w/ web archiving mainstream
• de-silo components + enable external integration
• metadata extraction
• archive access via DOI + OpenURL
• polling + repair protocol
• prepare to evolve w/ the Web
• web services architecture as flexible foundation
integration opportunities
• polling + repair
• repository replication
layer
• other distributed digital
preservation systems
• access
• Dockerized full-text
search for web archives
• DOI + OpenURL access
to web archives
• metadata extraction “A Different Kind of Weave” by Barbara Courouble under CC BY-NC 2.0
aligning with web archiving
Web ARChive (WARC) format compatible technologies
• Heritrix
• OpenWayback
• WarcBase
• Web Archiving Proxy
21
web archiving system APIs (WASAPI)
leveraging community components
development progress
• access WARC-stored
content via:
• DOI
• OpenURL
• URL
• Solr full-text search
• web services:
• metadata extraction
• metadata database
“Milestones” by Dheeraj Nagwani under CC BY-NC-ND 2.0
product roadmap
• 2017
• Docker-ize components
• web harvest framework
• polling + repair web
service
• release to PLNs
• 2018
• IP address + Shibboleth
access via OpenWayback
• OpenWayback format
negotiation framework
• full-text search web
service
• release to GLN
“Printemps, work in progress” by Eric Gjerde under CC BY-NC 2.0
LOCKSS Networks
“Railroad Wye Switch” by Noel Hankamer under CC BY-NC-SA 2.0
Controlled LOCKSS (CLOCKSS)
• what is it?
• library/publisher partnership
• preserve the scholarly record
• 12 globally-distributed nodes
• dark until no longer accessible
• triggered content world-
accessible
• looking forward
• expand capacity
• increase pursuit of long tail
• champion standards to simplify
archiving (e.g., Signposting)
Private LOCKSS Networks (PLNs)
• what are they?
• community of interest
• jointly designate content
• run distributed nodes
• establish governance
• preservation via diverse
technologies, institutions,
networks
• looking forward
• create documentation
• enable self-setup
• support community
collaboration
• preserve web archives
national networks
• what are they?
• in-country preservation
• local stewardship
• perpetual access
• non-consumptive use
• looking forward
• more networks
• preserving national
long-tail content
“1951 World Map” by peonyandthistle under CC BY-NC-ND 2.0
Distributed Preservation
“Catho longtime [explored]” by Bill Collison under CC BY-NC 2.0
distributed preservation landscape
• better understanding of
role of distributed dark
archives
• next logical step
beyond mature local
preservation
• appealing option for
those w/o mature local
preservation
a greater role for LOCKSS?
• bolster existing efforts
• undergird PLN service
providers
• mainstream distributed
digital preservation
“DSCN7867” by tyalis_2 under CC BY-NC-ND 2.0
LOCKSS for web archiving
• growth in web archiving
• centralization in web
archiving
• native WARC support
• logical complement for
web archive
preservation
NDSA: “Web Archiving in the United States”
reliance on service provider
25.40%
60.32%
14.29%
19.51%
63.41%
15.85%
4.81%
63.29%
30.38%
0.00%
10.00%
20.00%
30.00%
40.00%
50.00%
60.00%
70.00%
Local External Both
2011 2013 2015
NDSA: “2016 NDSA Web Archiving Survey”
flat data transfer trend
19.15%
80.85%
20.29%
79.71%
20.27%
79.73%
0.00%
10.00%
20.00%
30.00%
40.00%
50.00%
60.00%
70.00%
80.00%
90.00%
Transfer data Do not transfer data
2011 2013 2015
NDSA: “2016 NDSA Web Archiving Survey”
Recap
“Rearview” by jenkinson2455 under CC BY-NC 2.0
vision
• better ensure the preservation of web archives
• LOCKSS team more actively engaged in community-
supported development efforts
• communities enabled to more easily contribute to
LOCKSS software, or run it w/o our help
• a longer tail of institutions able to capitalize on
distributed digital preservation
• LOCKSS components applied in contexts other than
LOCKSS networks
Questions?
“stanford dish at sunset” by Dan under CC BY-NC-SA 2.0

Contenu connexe

Tendances

From digital to social collections. A short story of collections online.
From digital to social collections. A short story of collections online.From digital to social collections. A short story of collections online.
From digital to social collections. A short story of collections online.
Elena Lagoudi
 
National Digital Library
National Digital LibraryNational Digital Library
National Digital Library
guesta45bc80
 
Please, do not decentralize the Internet (with permissionless) blockchains
Please, do not decentralize the Internet (with permissionless) blockchainsPlease, do not decentralize the Internet (with permissionless) blockchains
Please, do not decentralize the Internet (with permissionless) blockchains
pgarcial
 

Tendances (20)

Digital Library Project Proposal
Digital Library Project ProposalDigital Library Project Proposal
Digital Library Project Proposal
 
Project management report-on Digital Libraries
Project management report-on Digital LibrariesProject management report-on Digital Libraries
Project management report-on Digital Libraries
 
The public library and wikipedia
The public library and wikipediaThe public library and wikipedia
The public library and wikipedia
 
Clicklaw wikibooks for CBA dial-a-law
Clicklaw wikibooks for CBA dial-a-lawClicklaw wikibooks for CBA dial-a-law
Clicklaw wikibooks for CBA dial-a-law
 
Open repositories 2016 floss panel slides
Open repositories 2016 floss panel slidesOpen repositories 2016 floss panel slides
Open repositories 2016 floss panel slides
 
Digital Library
Digital LibraryDigital Library
Digital Library
 
SDX - The Software Defined Exchange
SDX - The Software Defined ExchangeSDX - The Software Defined Exchange
SDX - The Software Defined Exchange
 
Digital Content Management
Digital Content ManagementDigital Content Management
Digital Content Management
 
Evolution of Digital Libraries
Evolution of Digital LibrariesEvolution of Digital Libraries
Evolution of Digital Libraries
 
SDX: Software Defined Exchange
SDX: Software Defined ExchangeSDX: Software Defined Exchange
SDX: Software Defined Exchange
 
From digital to social collections. A short story of collections online.
From digital to social collections. A short story of collections online.From digital to social collections. A short story of collections online.
From digital to social collections. A short story of collections online.
 
Internet Archive and Open Library
Internet Archive and Open LibraryInternet Archive and Open Library
Internet Archive and Open Library
 
Cyberlaw presentation
Cyberlaw presentationCyberlaw presentation
Cyberlaw presentation
 
Corrado -- Establishing the Landscape
Corrado -- Establishing the LandscapeCorrado -- Establishing the Landscape
Corrado -- Establishing the Landscape
 
Introduction to digital libraries - definitions, examples, concepts and trend...
Introduction to digital libraries - definitions, examples, concepts and trend...Introduction to digital libraries - definitions, examples, concepts and trend...
Introduction to digital libraries - definitions, examples, concepts and trend...
 
What's Welsh for Crowdsourcing?: Citizen Science at the National Library of W...
What's Welsh for Crowdsourcing?: Citizen Science at the National Library of W...What's Welsh for Crowdsourcing?: Citizen Science at the National Library of W...
What's Welsh for Crowdsourcing?: Citizen Science at the National Library of W...
 
Calisphere: Broadening Access through DPLA
Calisphere: Broadening Access through DPLACalisphere: Broadening Access through DPLA
Calisphere: Broadening Access through DPLA
 
National Digital Library
National Digital LibraryNational Digital Library
National Digital Library
 
08 chapter 03
08 chapter 0308 chapter 03
08 chapter 03
 
Please, do not decentralize the Internet (with permissionless) blockchains
Please, do not decentralize the Internet (with permissionless) blockchainsPlease, do not decentralize the Internet (with permissionless) blockchains
Please, do not decentralize the Internet (with permissionless) blockchains
 

Similaire à Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program

Institutional repositories, digital asset management, and digitization
Institutional repositories, digital asset management, and digitizationInstitutional repositories, digital asset management, and digitization
Institutional repositories, digital asset management, and digitization
kgerber
 
Vila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxVila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-redux
LIS EPI Meeting
 
Sakai09 Repo Case Study
Sakai09 Repo Case StudySakai09 Repo Case Study
Sakai09 Repo Case Study
jrmdkc
 
Slides anu talkwebarchivingaug2012
Slides anu talkwebarchivingaug2012Slides anu talkwebarchivingaug2012
Slides anu talkwebarchivingaug2012
Roxanne Missingham
 
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
University of Missouri
 

Similaire à Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program (20)

Unlocking LOCKSS with APIs
Unlocking LOCKSS with APIsUnlocking LOCKSS with APIs
Unlocking LOCKSS with APIs
 
Lots More LOCKSS for Web Archiving: Boons from the LOCKSS Software Re-Archite...
Lots More LOCKSS for Web Archiving: Boons from the LOCKSS Software Re-Archite...Lots More LOCKSS for Web Archiving: Boons from the LOCKSS Software Re-Archite...
Lots More LOCKSS for Web Archiving: Boons from the LOCKSS Software Re-Archite...
 
การประยุกต์ใช้ DSpace Open Source ในการจัดการความรู้ขององค์กร
การประยุกต์ใช้ DSpace Open Source ในการจัดการความรู้ขององค์กรการประยุกต์ใช้ DSpace Open Source ในการจัดการความรู้ขององค์กร
การประยุกต์ใช้ DSpace Open Source ในการจัดการความรู้ขององค์กร
 
Web and Twitter Archiving at the Library of Congress
Web and Twitter Archiving at the Library of CongressWeb and Twitter Archiving at the Library of Congress
Web and Twitter Archiving at the Library of Congress
 
Why Not Lots of Copies Keep(ing) Software Safe?
Why Not Lots of Copies Keep(ing) Software Safe?Why Not Lots of Copies Keep(ing) Software Safe?
Why Not Lots of Copies Keep(ing) Software Safe?
 
Institutional repositories, digital asset management, and digitization
Institutional repositories, digital asset management, and digitizationInstitutional repositories, digital asset management, and digitization
Institutional repositories, digital asset management, and digitization
 
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
 
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway ProtocolExposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
 
Social Networking Extensions for EPrints
Social Networking Extensions for EPrintsSocial Networking Extensions for EPrints
Social Networking Extensions for EPrints
 
The Tale of Two Deployments: Greenfield and Monolith Apps with Docker Enterpr...
The Tale of Two Deployments: Greenfield and Monolith Apps with Docker Enterpr...The Tale of Two Deployments: Greenfield and Monolith Apps with Docker Enterpr...
The Tale of Two Deployments: Greenfield and Monolith Apps with Docker Enterpr...
 
Vila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxVila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-redux
 
Building Web Archiving Technology, Together
Building Web Archiving Technology, TogetherBuilding Web Archiving Technology, Together
Building Web Archiving Technology, Together
 
Sakai09 Repo Case Study
Sakai09 Repo Case StudySakai09 Repo Case Study
Sakai09 Repo Case Study
 
Slides anu talkwebarchivingaug2012
Slides anu talkwebarchivingaug2012Slides anu talkwebarchivingaug2012
Slides anu talkwebarchivingaug2012
 
OpenGLAM in museums: Linked Open Data and Wikipedia
OpenGLAM in museums: Linked Open Data and WikipediaOpenGLAM in museums: Linked Open Data and Wikipedia
OpenGLAM in museums: Linked Open Data and Wikipedia
 
Impact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationImpact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and Education
 
suresh oclc (3).pptx
suresh oclc (3).pptxsuresh oclc (3).pptx
suresh oclc (3).pptx
 
Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...
Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...
Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...
 
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
 

Plus de nullhandle

Plus de nullhandle (20)

Understanding Legal Use Cases for Web Archives
Understanding Legal Use Cases for Web ArchivesUnderstanding Legal Use Cases for Web Archives
Understanding Legal Use Cases for Web Archives
 
Interoperability and Technical Collaboration for Web and Social Media Archiving
Interoperability and Technical Collaboration for Web and Social Media ArchivingInteroperability and Technical Collaboration for Web and Social Media Archiving
Interoperability and Technical Collaboration for Web and Social Media Archiving
 
Rethinking Web Archiving Quality Assurance for Impact, Scalability, and Susta...
Rethinking Web Archiving Quality Assurance for Impact, Scalability, and Susta...Rethinking Web Archiving Quality Assurance for Impact, Scalability, and Susta...
Rethinking Web Archiving Quality Assurance for Impact, Scalability, and Susta...
 
2015 NDSA Web Archiving Survey Report Highlights
2015 NDSA Web Archiving Survey Report Highlights2015 NDSA Web Archiving Survey Report Highlights
2015 NDSA Web Archiving Survey Report Highlights
 
Collection Development for Selective Web Archiving
Collection Development for Selective Web ArchivingCollection Development for Selective Web Archiving
Collection Development for Selective Web Archiving
 
WASAPI Web Archive Data Transfer APIs
WASAPI Web Archive Data Transfer APIsWASAPI Web Archive Data Transfer APIs
WASAPI Web Archive Data Transfer APIs
 
Outreach to Campus Webmasters for a Better Web, and Better Web Archiving
Outreach to Campus Webmasters for a Better Web, and Better Web ArchivingOutreach to Campus Webmasters for a Better Web, and Better Web Archiving
Outreach to Campus Webmasters for a Better Web, and Better Web Archiving
 
Measure All the (Web Archiving) Things!
Measure All the (Web Archiving) Things!Measure All the (Web Archiving) Things!
Measure All the (Web Archiving) Things!
 
A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey R...
A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey R...A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey R...
A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey R...
 
Campaign Web Archives to Support Multi-Institutional Research
Campaign Web Archives to Support Multi-Institutional ResearchCampaign Web Archives to Support Multi-Institutional Research
Campaign Web Archives to Support Multi-Institutional Research
 
2013 NDSA Web Archiving Survey Report Highlights
2013 NDSA Web Archiving Survey Report Highlights2013 NDSA Web Archiving Survey Report Highlights
2013 NDSA Web Archiving Survey Report Highlights
 
Considerations for Strategic Web Archive Collection Development
Considerations for Strategic Web Archive Collection DevelopmentConsiderations for Strategic Web Archive Collection Development
Considerations for Strategic Web Archive Collection Development
 
Boiling the Ocean, Together: Web Archive Collection Development in a Global C...
Boiling the Ocean, Together: Web Archive Collection Development in a Global C...Boiling the Ocean, Together: Web Archive Collection Development in a Global C...
Boiling the Ocean, Together: Web Archive Collection Development in a Global C...
 
Advocating for Web Archivability
Advocating for Web ArchivabilityAdvocating for Web Archivability
Advocating for Web Archivability
 
Building Archivable Websites
Building Archivable WebsitesBuilding Archivable Websites
Building Archivable Websites
 
Link Persistence, Website Persistence
Link Persistence, Website PersistenceLink Persistence, Website Persistence
Link Persistence, Website Persistence
 
From Seed to Harvest: Web Archiving Program Considerations for SUL
From Seed to Harvest: Web Archiving Program Considerations for SULFrom Seed to Harvest: Web Archiving Program Considerations for SUL
From Seed to Harvest: Web Archiving Program Considerations for SUL
 
A Survey of Research Prospects for more Manageable Personal Digital Photo Col...
A Survey of Research Prospects for more Manageable Personal Digital Photo Col...A Survey of Research Prospects for more Manageable Personal Digital Photo Col...
A Survey of Research Prospects for more Manageable Personal Digital Photo Col...
 
Tool Academy: Web Archiving
Tool Academy: Web ArchivingTool Academy: Web Archiving
Tool Academy: Web Archiving
 
Using Wayback Machine for Research
Using Wayback Machine for ResearchUsing Wayback Machine for Research
Using Wayback Machine for Research
 

Dernier

Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
soniya singh
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Sheetaleventcompany
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
Diya Sharma
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
soniya singh
 
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
 
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Dernier (20)

Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirt
 
Al Barsha Night Partner +0567686026 Call Girls Dubai
Al Barsha Night Partner +0567686026 Call Girls  DubaiAl Barsha Night Partner +0567686026 Call Girls  Dubai
Al Barsha Night Partner +0567686026 Call Girls Dubai
 
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
 
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
 
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
 
VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...
VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...
VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
 
(+971568250507 ))# Young Call Girls in Ajman By Pakistani Call Girls in ...
(+971568250507  ))#  Young Call Girls  in Ajman  By Pakistani Call Girls  in ...(+971568250507  ))#  Young Call Girls  in Ajman  By Pakistani Call Girls  in ...
(+971568250507 ))# Young Call Girls in Ajman By Pakistani Call Girls in ...
 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
 
𓀤Call On 7877925207 𓀤 Ahmedguda Call Girls Hot Model With Sexy Bhabi Ready Fo...
𓀤Call On 7877925207 𓀤 Ahmedguda Call Girls Hot Model With Sexy Bhabi Ready Fo...𓀤Call On 7877925207 𓀤 Ahmedguda Call Girls Hot Model With Sexy Bhabi Ready Fo...
𓀤Call On 7877925207 𓀤 Ahmedguda Call Girls Hot Model With Sexy Bhabi Ready Fo...
 
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
 
Real Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtReal Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirt
 
VVVIP Call Girls In Connaught Place ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Connaught Place ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...VVVIP Call Girls In Connaught Place ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Connaught Place ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
 

Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program

  • 1. LOTS OF COPIES KEEP STUFF SAFE Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program Nicholas Taylor (@nullhandle) Program Manager for LOCKSS and Web Archiving Stanford University Libraries CNI Fall 2016 Membership Meeting 12 December 2016
  • 2. why more LOCKSS? • mature, community- validated technology • research-based + built to a specific threat model • web-centric preservation for web-centric scholarship • community-centric preservation for collective challenges + opportunities • robust, distributed digital preservation “Cologne Love Padlocks” by orkomedix under CC BY-NC-SA 2.0
  • 3. Program History “Grant Park” by xelipe under CC BY-NC-SA 2.0
  • 4. inception • a serials librarian + a computer scientist • print journals → Web • conserve library’s role as preserver • collect from publishers’ websites • preserve w/ cheap, distributed, library- managed hardware • disseminate when unavailable from publisher Chris Dobson: “From Bright Idea to Beta Test”
  • 5. philosophy + focus • lots of copies keep stuff safe • preservation is an active community effort • lots of communities keep stuff safe • enable communities to preserve + access their scholarly record “Le Penseur” by Ian Abbott under CC BY-NC-SA 2.0
  • 6. present day • financially self- sustaining • tens of networks • hundreds of institutions • all types of content “LOCKSS | Lots of Copies Keep Stuff Safe”
  • 7. looking forward • organizational changes • software evolution • LOCKSS networks • distributed digital preservation “Looking for a brighter Future?” by Vincent Brassinne under CC BY-NC-ND 2.0
  • 8. “Olympic Relay Handoff” by Dr. Mark Kubert under CC BY-NC-ND 2.0 Organizational Changes
  • 9. David + Vicky American Library Association: “Victoria Reich and David S.H. Rosenthal”
  • 10. personal introduction • 10 years in research libraries: • Stanford University Libraries (2013 – present) • Library of Congress (2010 – 2013) • U.S. Supreme Court (2007 – 2010) • professional background: • web archives • digital library services • library technology • what I care about: • scalability + sustainability of PLNs, CLOCKSS • mainstreaming LOCKSS for digital preservation • building collaborative technical communities
  • 11. SUL Web Archiving • end-to-end service: • collect • preserve • make accessible • make discoverable • integrate w/ collection development • use cases: • scholarly inputs/outputs • institutional legacy/compliance • government information Internet Archive: “Stanford University Homepage”
  • 12. LOCKSS + DLSS administrativa • LOCKSS integrating w/ SUL Digital Library Systems & Services (DLSS) • led by Tom Cramer, Director & Associate University Librarian • LOCKSS + SUL Web Archiving, under Nicholas Taylor “SPO.101514.SLIDERlathrop.jpg” by Michael Hong
  • 13. LOCKSS + DLSS synergies • realize operational efficiencies • adopt, drive shared engineering best practices • promote API-oriented architectures • streamline repository → PLN data hand-offs • contribute upstream to shared tools • broaden, diversify community outreach
  • 14. Software Evolution “Why we love our macs” by Jason Corneveaux under CC BY-NC-ND 2.0
  • 15. new functionality • supported by Mellon Foundation grant • ingest/harvest • form-filling • AJAX • dissemination • Memento • Shibboleth • preservation • polling performance “1.13.09: versatility” by Team Dalog under CC BY 2.0
  • 16. new architecture • existing functionality • discrete components as web services • incorporate external software “San Francisco Oakland Bay Bridge, East Spans New and Old” by Shanan under CC BY-NC 2.0
  • 17. web services imperative 1. “All teams will henceforth expose their data and functionality through service interfaces.” 2. “Teams must communicate with each other through these interfaces.” 3. “There will be no other form of interprocess communication allowed: no direct linking, no direct reads of another team's data store, no shared-memory model, no back-doors whatsoever.” 4. “All service interfaces, without exception, must be designed from the ground up to be externalizable. That is to say, the team must plan and design to be able to expose the interface to developers in the outside world.” 5. “Anyone who doesn't do this will be fired.” Steve Yegge: “Stevey's Google Platforms Rant”
  • 18. risk of large projects small projects (< $1 million) 4% 20% 76% large projects (> $10 million) 38% 52% 10% Standish Group: “Chaos Manifesto 2013: Think Big, Act Small” successful (on time, on budget) challenged (late, over budget, lacking functionality) failed (cancelled, or delivered and never used) Based on an 8-year survey of 50,000 software projects by the Standish Group.
  • 19. why re-architect LOCKSS? • reduce support + operations costs • leverage web-scale open-source software • align w/ web archiving mainstream • de-silo components + enable external integration • metadata extraction • archive access via DOI + OpenURL • polling + repair protocol • prepare to evolve w/ the Web • web services architecture as flexible foundation
  • 20. integration opportunities • polling + repair • repository replication layer • other distributed digital preservation systems • access • Dockerized full-text search for web archives • DOI + OpenURL access to web archives • metadata extraction “A Different Kind of Weave” by Barbara Courouble under CC BY-NC 2.0
  • 21. aligning with web archiving Web ARChive (WARC) format compatible technologies • Heritrix • OpenWayback • WarcBase • Web Archiving Proxy 21
  • 22. web archiving system APIs (WASAPI)
  • 24. development progress • access WARC-stored content via: • DOI • OpenURL • URL • Solr full-text search • web services: • metadata extraction • metadata database “Milestones” by Dheeraj Nagwani under CC BY-NC-ND 2.0
  • 25. product roadmap • 2017 • Docker-ize components • web harvest framework • polling + repair web service • release to PLNs • 2018 • IP address + Shibboleth access via OpenWayback • OpenWayback format negotiation framework • full-text search web service • release to GLN “Printemps, work in progress” by Eric Gjerde under CC BY-NC 2.0
  • 26. LOCKSS Networks “Railroad Wye Switch” by Noel Hankamer under CC BY-NC-SA 2.0
  • 27. Controlled LOCKSS (CLOCKSS) • what is it? • library/publisher partnership • preserve the scholarly record • 12 globally-distributed nodes • dark until no longer accessible • triggered content world- accessible • looking forward • expand capacity • increase pursuit of long tail • champion standards to simplify archiving (e.g., Signposting)
  • 28. Private LOCKSS Networks (PLNs) • what are they? • community of interest • jointly designate content • run distributed nodes • establish governance • preservation via diverse technologies, institutions, networks • looking forward • create documentation • enable self-setup • support community collaboration • preserve web archives
  • 29. national networks • what are they? • in-country preservation • local stewardship • perpetual access • non-consumptive use • looking forward • more networks • preserving national long-tail content “1951 World Map” by peonyandthistle under CC BY-NC-ND 2.0
  • 30. Distributed Preservation “Catho longtime [explored]” by Bill Collison under CC BY-NC 2.0
  • 31. distributed preservation landscape • better understanding of role of distributed dark archives • next logical step beyond mature local preservation • appealing option for those w/o mature local preservation
  • 32. a greater role for LOCKSS? • bolster existing efforts • undergird PLN service providers • mainstream distributed digital preservation “DSCN7867” by tyalis_2 under CC BY-NC-ND 2.0
  • 33. LOCKSS for web archiving • growth in web archiving • centralization in web archiving • native WARC support • logical complement for web archive preservation NDSA: “Web Archiving in the United States”
  • 34. reliance on service provider 25.40% 60.32% 14.29% 19.51% 63.41% 15.85% 4.81% 63.29% 30.38% 0.00% 10.00% 20.00% 30.00% 40.00% 50.00% 60.00% 70.00% Local External Both 2011 2013 2015 NDSA: “2016 NDSA Web Archiving Survey”
  • 35. flat data transfer trend 19.15% 80.85% 20.29% 79.71% 20.27% 79.73% 0.00% 10.00% 20.00% 30.00% 40.00% 50.00% 60.00% 70.00% 80.00% 90.00% Transfer data Do not transfer data 2011 2013 2015 NDSA: “2016 NDSA Web Archiving Survey”
  • 37. vision • better ensure the preservation of web archives • LOCKSS team more actively engaged in community- supported development efforts • communities enabled to more easily contribute to LOCKSS software, or run it w/o our help • a longer tail of institutions able to capitalize on distributed digital preservation • LOCKSS components applied in contexts other than LOCKSS networks
  • 38. Questions? “stanford dish at sunset” by Dan under CC BY-NC-SA 2.0