SlideShare une entreprise Scribd logo
1  sur  99
Télécharger pour lire hors ligne
Some store to remember, some store to forget
About me

Søren Schaffstein
CEO of dkd Internet Service GmbH
Frankfurt, Germany
The problem

What this is all about
Storage capacity is ever increasing
Prices for storage are falling
How large is large?
Size references

A simple text: an average Wikipedia article ≈ 3.78 kB (no markup)
Lots of text: complete Wikipedia ≈ 13.5 GB (text only, no markup)
An average image (12MP) ≈ 1.3 MB (JPG 90% quality; 24bit/pixel)
An average movie stored on Blu-ray Disc ≈ 25.48 GB
1955 – The IBM 355

Capacity: 12 MB
Cost: 6,233.33 USD/MB

✘
3,250

0

✘
9

0

0.16 kB
1970 – The IBM 3330

Capacity: 100 MB
Cost: 259.70 USD/MB

✘
27,089

0

✘
76

0

3.94 kB
1988 – Seagate ST-238

Capacity: 30 MB
Cost: 9.97 USD/MB

✘
8,126

0

✘
23

0

102.71 kB
2000 – Western Digital WD600AB

Capacity: 60 GB
Cost: 0.00275 USD/MB

16,644,063

4

47,261

2

363.64 MB
2010 – Seagate ST32000542AS

Capacity: 2 TB
Cost: 0.0000450 USD/MB
≈ 5 cent/GB

541,798,941

148

1,538,461

76

21.7 GB
2013 – NSA

Capacity: ∞
Cost: free

✘
∞

∞

∞

∞

it’s free :)
Cool!

Let’s store everything, then!
Or, maybe not...

There’s a lot more costs
Retrieval
Maintenance
Indexing
Updates
We need to keep our information

Accessible
Usable
Useful
Let’s start to forget!

The concept of Memory Buoyancy
Memory Buoyancy

time

memory
Memory Buoyancy
Memory Buoyancy
The ForgetIT Project

A short overview
ForgetIT project overview

Consortium of 11 partners
Project start was in February 2013
3 years of research & development
http://www.forgetit-project.eu
The ForgetIT project is funded by the EC within the 7th Framework
Programme under the objective "Digital Preservation"
(GA 600826).
Project Partners 1/2

Centre for Research and Technology Hellas
dkd Internet Service GmbH
Deutsches Forschungszentrum für Künstliche Intelligenz GmbH
Eurix Srl
Gottfried Wilhelm Leibniz Universität Hannover
Project Partners 2/2

IBM Israel - Science and Technology Ltd
Luleå Tekniska Universitet
The Chancellor, Masters and Scholars of the University of Oxford
The University of Edinburgh
The University of Sheffield
Turk Telekomunikasyon AS
Inspiring people to share!

TYPO3 is the CMS used for the organisational use cases
TYPO3 was chosen because it’s Open Source
We want to raise awareness on the matter of preservation
We will publish our modules under open source licenses
ForgetIT core concepts
Managed Forgetting
Synergetic Preservation
Contextualised Remembering
Do you preserve?
What is preservation?

“Preservation — The protection of cultural
property through activities that minimize
chemical and physical deterioration and
damage and that prevent loss of informational
content. The primary goal of preservation is to
prolong the existence of cultural property.”
Preservation 101
Problems are caused by

storage medium (disks, tapes, DVD, etc.)
Problems are caused by

storage medium (disks, tapes, DVD, etc.)
format of the data
Problems are caused by

storage medium (disks, tapes, DVD, etc.)
format of the data
availability of the software or operating system
possible encryption
Digital Dark Age
“The digital dark age is a possible future
situation where it will be difficult or impossible
to read historical electronic documents and
multimedia, because they have been stored in
an obsolete and obscure file format.”

Wikipedia
Preserving a website is not trivial

What do want you preserve?
Content only?
Content and Design?
How often? Stock prices vs. Company History page
How do you deal with browser differences?
How do you preserve functionality? E.g. insurance fee calculator
Preservation Value
Preservation Value

~ 5,000 €
Preservation Value

~ 200,000 €
The ForgetIT Use Cases

Private
Organisational
Personal Preservation

A personal use case:
How to organise an ever growing picture collection
Organisational Preservation

Typical use cases in the daily work with TYPO3-driven company
websites.
Organisational Use Cases

Digital Asset Management
Versioning
Archiving a complete Website
Individual genres and their specific requirements
Example: Press Release
Press Release Example

An organisational use case
Elements of a Press Release

text
image
links
documents
Meta information

Presseinformationen Spielwarenmesse
Global Toy Conference Now on Saturday at the
Spielwarenmesse
* Customised programme for retailers: “How to get
your customer into the shop”
* Conference will take place for the 5th time in
Nuremberg on 1 February 2014
All around the world, retailers are wondering how
they can still get their customers in their shops
in the age of the Internet – because competition
for the sale of consumer goods online is growing
dramatically. With the topic “How to Get
Customers into Your Shop – Successful Pricing,
Presentation and Selling” the Global Toy
Conference of the Spielwarenmesse demonstrates
what parameters business owners can adjust for
the future. The conference will take place for
the first time in the St Petersburg hall in the
NCC East on Saturday. The new earlier date means
that more international retailers can take
advantage of the knowledge on offer at the toy
industry's leading trade fair – from 9 a.m. to 4
p.m. on 1 February 2014.
...
Translations

German

English
…
Delete

Keep

Archive

Levels of significance

legal value
Action: keep for legal time

present value
Action: Keep for x days

archive value
Action: keep forever

trigger value
Action: Check significance
meta info

media
meta info

media
meta info
move

media

copy
refer

Content Management System
meta info

media

meta info

media
asset

meta info

meta info

external
Digital Asset (DAM)

etc.

editable
content
meta info

media
asset

structure
(code, users,
plugins,
extensions,
etc.)
meta info

internal
Info Level 4, etc.
dynamic

Info Level 3

(semi)automatic

Info Level 2

static

Info Level 1

meta info

media

meta info

meta info

Output

meta info

Archive 1

media
asset

etc.

Archive 2

editable
content
meta info

media
asset

structure
(code, users,
plugins,
extensions,
etc.
meta info

Delete
Info Level 4, etc.
dynamic

Info Level 3

(semi)automatic

Info Level 2

static

Info Level 1

meta info

media

meta info

meta info

Output

meta info

Archive 1

media
asset

etc.

Archive 2

editable
content
meta info

media
asset

structure
(code, users,
plugins,
extensions,
etc.
meta info

Delete
Info Level 4, etc.
dynamic

Info Level 3

(semi)automatic

Info Level 2

static

Info Level 1

meta info

media

meta info

meta info

Output

meta info

Archive 1

media
asset

etc.

Archive 2

editable
content
meta info

media
asset

structure
(code, users,
plugins,
extensions,
etc.
meta info

Delete
Info Level 4, etc.
dynamic

Info Level 3

(semi)automatic

Info Level 2

static

Info Level 1

meta info

media

meta info

meta info

Output

meta info

Archive 1

media
asset

etc.

Archive 2

editable
content
meta info

media
asset

structure
(code, users,
plugins,
extensions,
etc.
meta info

Delete
Info Level 4, etc.
dynamic

Info Level 3

(semi)automatic

Info Level 2

static

Info Level 1

meta info

media

meta info

meta info

Output

meta info

Archive 1

media
asset

etc.

Archive 2

editable
content
meta info

media
asset

structure
(code, users,
plugins,
extensions,
etc.
meta info

Delete
T-CM (Todays Content Management)

F-CM (Future Content Management)

L4

L4

L3

L3

L2

L2

L1

L1

Retrieve Service

Archive 1

Archive 2

Delete
The Information Lifecycle

of a press release
Information Lifecycle

Collect

Create

Process

Publish

Analyse

Archive
Collect
Create
Process
Publish
Analyse
Archive
Information Lifecycle

Annotations

Collect

Create

Process

Publish

Analyse

Archive
Example Press Release

Annotation (text)

Annotation (image)

global toy
conference,
conference,
podium, speaker,
lights
Do you remember?

A game about forgetting.
Do you remember the details?

Which ocean was the ForgetIT Team examining?
Do you remember the details?

Which ocean was the ForgetIT Team examining?
Mediterranean Sea
Do you remember the details?

Which ocean was the ForgetIT Team examining?
Mediterranean Sea
How many people of the ForgetIT Team were carrying a bag?
Do you remember the details?

Which ocean was the ForgetIT Team examining?
Mediterranean Sea
How many people of the ForgetIT Team were carrying a bag?
Do you remember the details?

Which ocean was the ForgetIT Team examining?
Mediterranean Sea
How many people of the ForgetIT Team were carrying a bag?
How many barcodes are on the Western Digital WD600AB?
Do you remember the details?

Which ocean was the ForgetIT Team examining?
Mediterranean Sea
How many people of the ForgetIT Team were carrying a bag?
How many barcodes are on the Western Digital WD600AB?
Do you remember the details?

Which ocean was the ForgetIT Team examining?
Mediterranean Sea
How many people of the ForgetIT Team were carrying a bag?
How many barcodes are on the Western Digital WD600AB?
How many pictures in the shoebox image are mostly blue?
Do you remember the details?

Which ocean was the ForgetIT Team examining?
Mediterranean Sea
How many people of the ForgetIT Team were carrying a bag?
How many barcodes are on the Western Digital WD600AB?
How many pictures in the shoebox image are mostly blue?
Next steps

or how you can participate
We’d love to see you participate!

Reflect your thoughts with us
Take our short survey: http://tinyurl.com/forgetit-webarchiving
Tell us your use cases
Join the development of TYPO3 features
Don’t forget them!
We’d love to discuss them with
you ... and a beer or two...
Thank you for your attention!
References

Sources, Books, Images
References (Sources) 1/2

Size of Wikipedia (as of 2013-10-04): https://en.wikipedia.org/wiki/
Wikipedia:Size_comparisons
Average JPG size: http://web.forret.com/tools/megapixel.asp?
title=12+Megapixel+camera&width=4000&height=3000
Average movie size: http://answers.yahoo.com/question/index?
qid=20110807095141AABGQm8
Storage Prices: http://www.jcmit.com/diskprice.htm
References (Sources) 2/2

Forget IT Website: http://www.forgetit-project.eu
Preservation: http://unfacilitated.preservation101.org/session1/
expl_whatis-definitions.asp
Digital Dark Age: https://en.wikipedia.org/wiki/Digital_dark_age
References (Books)

Delete: The Virtue of Forgetting in the Digital Age, Viktor MayerSchönberger
References (Images) 1/8

“About me”: all images by Søren Schaffstein
“ForgetIT Team” by Søren Schaffstein
“The Problem/Knot”: http://www.istockphoto.com/stockphoto-8933647-rope-with-knot.php
“1 Dollar”: http://www.istockphoto.com/stock-photo-17830696-fandollars-isolated-on-white.php
Starbucks Cups: http://5feetonagoodday.files.wordpress.com/
2012/01/starbucks-coffee-cups-sizes-tall-grande-venti-trenta.jpg
References (Images) 2/8

IBM 355: http://www-03.ibm.com/ibm/history/exhibits/storage/
storage_355.html
IBM 3330: http://www-03.ibm.com/ibm/history/exhibits/storage/
storage_3330.html
Seagate ST-238: http://www.redlop.de/bilder/produkte/gross/
Seagate-WREN-5-ST4702N-702-MB-.png
Western Digital WD600AB: http://www.junek.de/thomas/bilder/
WD600AB.jpg
References (Images) 3/8

Seagate ST32000542AS: http://bilder.afterbuy.de/images/ZZNLZ/
seagatesata.jpg
Finger “Forget”: http://www.istockphoto.com/stock-photo-7252836string-finger-reminder-on-white.php
Memory Buoyancy: http://www.istockphoto.com/stockphoto-16244755-fishing-hook-underwater.php?st=0320b45
Fish: http://www.istockphoto.com/stock-photo-14623368-gold-fishand-piranha.php
References (Images) 4/8

Game pieces by Søren Schaffstein
Managed Forgetting: http://www.istockphoto.com/stockphoto-3533508-colorful-memos.php?st=0320b45
Synergetic Preservation: http://www.istockphoto.com/stockphoto-13301920-goldfish-jump.php
Contextualised Remembering: http://www.istockphoto.com/stockphoto-14370511-shoebox-of-old-photos-too.php
References (Images) 5/8

Cans: http://www.istockphoto.com/stock-photo-16948268-threemetallic-goods-can-with-key.php
5 1/4” Disk: https://secure.flickr.com/photos/twicepix/4330813840/
sizes/z/in/photostream/
5 1/4” Disk Drawing: https://secure.flickr.com/photos/
flattop341/2094771560/sizes/z/in/photostream/
Ami Pro: http://www.os2museum.com/wp/?attachment_id=99
Digital Dark Age by Søren Schaffstein
References (Images) 6/8

Gauges: http://www.istockphoto.com/stock-photo-9059088-oldgauges.php
Golf Car: http://www.netzeitung.de/default/337276.html#
Golf Car Papers: http://www.motor-talk.de/news/das-heilige-blechwieder-unterm-hammer-t4421282.html
Create: http://hdwallsize.com/wp-content/uploads/2013/04/
Abstract-Art-Wallpaper-Dekstop.jpg
References (Images) 7/8

Process by Søren Schaffstein
Publish: http://www.istockphoto.com/stock-photo-25712828-britishdog-reading.php?st=e5bf164
Analyse: http://www.istockphoto.com/stock-photo-28297160laboratory-experimental-testing.php?st=239c76e
Archive: http://www.istockphoto.com/stock-photo-18865341-oldwooden-card-catalogue-with-one-opened-drawer.php
References (Images) 8/8

Shoes: http://www.istockphoto.com/stock-photo-2457744-what-syour-walking-style.php?st=e12d3d2
Questions: http://www.istockphoto.com/stock-photo-17686236decision-making.php

Contenu connexe

Similaire à ForgetIT – Some store to remember, some store to forget

ForgetIT Project TYPO3Camp Milano 2014
ForgetIT Project TYPO3Camp Milano 2014ForgetIT Project TYPO3Camp Milano 2014
ForgetIT Project TYPO3Camp Milano 2014Olivier Dobberkau
 
Personal Digital Archiving Initiatives at the Library of Congress
Personal Digital Archiving Initiatives at the Library of CongressPersonal Digital Archiving Initiatives at the Library of Congress
Personal Digital Archiving Initiatives at the Library of Congresslljohnston
 
Puglia marac-file formats-20111020
Puglia marac-file formats-20111020Puglia marac-file formats-20111020
Puglia marac-file formats-20111020MARAC Bethlehem PC
 
Navigating the Analog Waves: Digitizing Audio Cassettes for Your Collection
Navigating the Analog Waves: Digitizing Audio Cassettes for Your CollectionNavigating the Analog Waves: Digitizing Audio Cassettes for Your Collection
Navigating the Analog Waves: Digitizing Audio Cassettes for Your CollectionKay Gregg
 
Personal Digital Preservation
Personal Digital PreservationPersonal Digital Preservation
Personal Digital Preservationsouslapoussiere
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projectsac2182
 
Managing large and complex data sets
Managing large and complex data setsManaging large and complex data sets
Managing large and complex data setsdata_management
 
Digital Preservation Best Practices: Lessons Learned From Across the Pond
Digital Preservation Best Practices: Lessons Learned From Across the PondDigital Preservation Best Practices: Lessons Learned From Across the Pond
Digital Preservation Best Practices: Lessons Learned From Across the PondBenoit Pauwels
 
Digital Presentation Best Practices: Lessons Learned From Across the Pond
Digital Presentation Best Practices: Lessons Learned From Across the PondDigital Presentation Best Practices: Lessons Learned From Across the Pond
Digital Presentation Best Practices: Lessons Learned From Across the PondULB - Bibliothèques
 
Death of Disk Panel Session - HEC-FSIO Workshop
Death of Disk Panel Session - HEC-FSIO WorkshopDeath of Disk Panel Session - HEC-FSIO Workshop
Death of Disk Panel Session - HEC-FSIO WorkshopErik Riedel
 
Cost, Risk, Loss and other fun things
Cost, Risk, Loss and other fun things Cost, Risk, Loss and other fun things
Cost, Risk, Loss and other fun things PrestoCentre
 
2015 05-27-congrés archivoscatalunya
2015 05-27-congrés archivoscatalunya2015 05-27-congrés archivoscatalunya
2015 05-27-congrés archivoscatalunyaJosé Carlos Ramalho
 
Webinar: Designing Storage and Apps to Enable Data Monetization
Webinar: Designing Storage and Apps to Enable Data MonetizationWebinar: Designing Storage and Apps to Enable Data Monetization
Webinar: Designing Storage and Apps to Enable Data MonetizationStorage Switzerland
 

Similaire à ForgetIT – Some store to remember, some store to forget (20)

ForgetIT Project TYPO3Camp Milano 2014
ForgetIT Project TYPO3Camp Milano 2014ForgetIT Project TYPO3Camp Milano 2014
ForgetIT Project TYPO3Camp Milano 2014
 
Personal Digital Archiving Initiatives at the Library of Congress
Personal Digital Archiving Initiatives at the Library of CongressPersonal Digital Archiving Initiatives at the Library of Congress
Personal Digital Archiving Initiatives at the Library of Congress
 
Puglia marac-file formats-20111020
Puglia marac-file formats-20111020Puglia marac-file formats-20111020
Puglia marac-file formats-20111020
 
Navigating the Analog Waves: Digitizing Audio Cassettes for Your Collection
Navigating the Analog Waves: Digitizing Audio Cassettes for Your CollectionNavigating the Analog Waves: Digitizing Audio Cassettes for Your Collection
Navigating the Analog Waves: Digitizing Audio Cassettes for Your Collection
 
Personal Digital Preservation
Personal Digital PreservationPersonal Digital Preservation
Personal Digital Preservation
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projects
 
Managing large and complex data sets
Managing large and complex data setsManaging large and complex data sets
Managing large and complex data sets
 
MyLifeBits van Microsoft
MyLifeBits van MicrosoftMyLifeBits van Microsoft
MyLifeBits van Microsoft
 
Trm Introduction
Trm IntroductionTrm Introduction
Trm Introduction
 
Preserve or preserve not
Preserve or preserve notPreserve or preserve not
Preserve or preserve not
 
Ex chapter7 questions
Ex chapter7   questionsEx chapter7   questions
Ex chapter7 questions
 
Speaker: Gustavo Murillo Lopez, Mexico
Speaker: Gustavo Murillo Lopez, MexicoSpeaker: Gustavo Murillo Lopez, Mexico
Speaker: Gustavo Murillo Lopez, Mexico
 
diskmfr
diskmfrdiskmfr
diskmfr
 
Digital Preservation Best Practices: Lessons Learned From Across the Pond
Digital Preservation Best Practices: Lessons Learned From Across the PondDigital Preservation Best Practices: Lessons Learned From Across the Pond
Digital Preservation Best Practices: Lessons Learned From Across the Pond
 
Digital Presentation Best Practices: Lessons Learned From Across the Pond
Digital Presentation Best Practices: Lessons Learned From Across the PondDigital Presentation Best Practices: Lessons Learned From Across the Pond
Digital Presentation Best Practices: Lessons Learned From Across the Pond
 
Death of Disk Panel Session - HEC-FSIO Workshop
Death of Disk Panel Session - HEC-FSIO WorkshopDeath of Disk Panel Session - HEC-FSIO Workshop
Death of Disk Panel Session - HEC-FSIO Workshop
 
Chap72&73
Chap72&73Chap72&73
Chap72&73
 
Cost, Risk, Loss and other fun things
Cost, Risk, Loss and other fun things Cost, Risk, Loss and other fun things
Cost, Risk, Loss and other fun things
 
2015 05-27-congrés archivoscatalunya
2015 05-27-congrés archivoscatalunya2015 05-27-congrés archivoscatalunya
2015 05-27-congrés archivoscatalunya
 
Webinar: Designing Storage and Apps to Enable Data Monetization
Webinar: Designing Storage and Apps to Enable Data MonetizationWebinar: Designing Storage and Apps to Enable Data Monetization
Webinar: Designing Storage and Apps to Enable Data Monetization
 

Plus de Søren Schaffstein

Data Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into GoldData Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into GoldSøren Schaffstein
 
Indoctrinatr – Open Source PDF generation service
Indoctrinatr – Open Source PDF generation serviceIndoctrinatr – Open Source PDF generation service
Indoctrinatr – Open Source PDF generation serviceSøren Schaffstein
 
Let's play Work – wie Sie mit Gamification Ihre Nutzer involvieren und so Ihr...
Let's play Work – wie Sie mit Gamification Ihre Nutzer involvieren und so Ihr...Let's play Work – wie Sie mit Gamification Ihre Nutzer involvieren und so Ihr...
Let's play Work – wie Sie mit Gamification Ihre Nutzer involvieren und so Ihr...Søren Schaffstein
 

Plus de Søren Schaffstein (6)

Data Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into GoldData Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into Gold
 
Indoctrinatr – Open Source PDF generation service
Indoctrinatr – Open Source PDF generation serviceIndoctrinatr – Open Source PDF generation service
Indoctrinatr – Open Source PDF generation service
 
Let's play Work – wie Sie mit Gamification Ihre Nutzer involvieren und so Ihr...
Let's play Work – wie Sie mit Gamification Ihre Nutzer involvieren und so Ihr...Let's play Work – wie Sie mit Gamification Ihre Nutzer involvieren und so Ihr...
Let's play Work – wie Sie mit Gamification Ihre Nutzer involvieren und so Ihr...
 
Lets play TYPO3
Lets play TYPO3Lets play TYPO3
Lets play TYPO3
 
About the TYPO3 Association
About the TYPO3 AssociationAbout the TYPO3 Association
About the TYPO3 Association
 
Scheduling tasks in TYPO3
Scheduling tasks in TYPO3Scheduling tasks in TYPO3
Scheduling tasks in TYPO3
 

Dernier

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 

Dernier (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

ForgetIT – Some store to remember, some store to forget

  • 1. Some store to remember, some store to forget
  • 2. About me Søren Schaffstein CEO of dkd Internet Service GmbH Frankfurt, Germany
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8. The problem What this is all about
  • 9. Storage capacity is ever increasing Prices for storage are falling
  • 10. How large is large?
  • 11. Size references A simple text: an average Wikipedia article ≈ 3.78 kB (no markup) Lots of text: complete Wikipedia ≈ 13.5 GB (text only, no markup) An average image (12MP) ≈ 1.3 MB (JPG 90% quality; 24bit/pixel) An average movie stored on Blu-ray Disc ≈ 25.48 GB
  • 12. 1955 – The IBM 355 Capacity: 12 MB Cost: 6,233.33 USD/MB ✘ 3,250 0 ✘ 9 0 0.16 kB
  • 13. 1970 – The IBM 3330 Capacity: 100 MB Cost: 259.70 USD/MB ✘ 27,089 0 ✘ 76 0 3.94 kB
  • 14. 1988 – Seagate ST-238 Capacity: 30 MB Cost: 9.97 USD/MB ✘ 8,126 0 ✘ 23 0 102.71 kB
  • 15. 2000 – Western Digital WD600AB Capacity: 60 GB Cost: 0.00275 USD/MB 16,644,063 4 47,261 2 363.64 MB
  • 16. 2010 – Seagate ST32000542AS Capacity: 2 TB Cost: 0.0000450 USD/MB ≈ 5 cent/GB 541,798,941 148 1,538,461 76 21.7 GB
  • 17. 2013 – NSA Capacity: ∞ Cost: free ✘ ∞ ∞ ∞ ∞ it’s free :)
  • 19. Or, maybe not... There’s a lot more costs Retrieval Maintenance Indexing Updates
  • 20. We need to keep our information Accessible Usable Useful
  • 21. Let’s start to forget! The concept of Memory Buoyancy
  • 25. The ForgetIT Project A short overview
  • 26. ForgetIT project overview Consortium of 11 partners Project start was in February 2013 3 years of research & development http://www.forgetit-project.eu The ForgetIT project is funded by the EC within the 7th Framework Programme under the objective "Digital Preservation" (GA 600826).
  • 27. Project Partners 1/2 Centre for Research and Technology Hellas dkd Internet Service GmbH Deutsches Forschungszentrum für Künstliche Intelligenz GmbH Eurix Srl Gottfried Wilhelm Leibniz Universität Hannover
  • 28. Project Partners 2/2 IBM Israel - Science and Technology Ltd Luleå Tekniska Universitet The Chancellor, Masters and Scholars of the University of Oxford The University of Edinburgh The University of Sheffield Turk Telekomunikasyon AS
  • 29. Inspiring people to share! TYPO3 is the CMS used for the organisational use cases TYPO3 was chosen because it’s Open Source We want to raise awareness on the matter of preservation We will publish our modules under open source licenses
  • 35. What is preservation? “Preservation — The protection of cultural property through activities that minimize chemical and physical deterioration and damage and that prevent loss of informational content. The primary goal of preservation is to prolong the existence of cultural property.” Preservation 101
  • 36. Problems are caused by storage medium (disks, tapes, DVD, etc.)
  • 37. Problems are caused by storage medium (disks, tapes, DVD, etc.) format of the data
  • 38. Problems are caused by storage medium (disks, tapes, DVD, etc.) format of the data availability of the software or operating system possible encryption
  • 40. “The digital dark age is a possible future situation where it will be difficult or impossible to read historical electronic documents and multimedia, because they have been stored in an obsolete and obscure file format.” Wikipedia
  • 41. Preserving a website is not trivial What do want you preserve? Content only? Content and Design? How often? Stock prices vs. Company History page How do you deal with browser differences? How do you preserve functionality? E.g. insurance fee calculator
  • 45. The ForgetIT Use Cases Private Organisational
  • 46. Personal Preservation A personal use case: How to organise an ever growing picture collection
  • 47. Organisational Preservation Typical use cases in the daily work with TYPO3-driven company websites.
  • 48. Organisational Use Cases Digital Asset Management Versioning Archiving a complete Website Individual genres and their specific requirements Example: Press Release
  • 49. Press Release Example An organisational use case
  • 50. Elements of a Press Release text image links documents
  • 51. Meta information Presseinformationen Spielwarenmesse Global Toy Conference Now on Saturday at the Spielwarenmesse * Customised programme for retailers: “How to get your customer into the shop” * Conference will take place for the 5th time in Nuremberg on 1 February 2014 All around the world, retailers are wondering how they can still get their customers in their shops in the age of the Internet – because competition for the sale of consumer goods online is growing dramatically. With the topic “How to Get Customers into Your Shop – Successful Pricing, Presentation and Selling” the Global Toy Conference of the Spielwarenmesse demonstrates what parameters business owners can adjust for the future. The conference will take place for the first time in the St Petersburg hall in the NCC East on Saturday. The new earlier date means that more international retailers can take advantage of the knowledge on offer at the toy industry's leading trade fair – from 9 a.m. to 4 p.m. on 1 February 2014. ...
  • 53.
  • 54. Delete Keep Archive Levels of significance legal value Action: keep for legal time present value Action: Keep for x days archive value Action: keep forever trigger value Action: Check significance
  • 58. meta info media meta info media asset meta info meta info external Digital Asset (DAM) etc. editable content meta info media asset structure (code, users, plugins, extensions, etc.) meta info internal
  • 59. Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  • 60. Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  • 61. Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  • 62. Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  • 63. Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  • 64. T-CM (Todays Content Management) F-CM (Future Content Management) L4 L4 L3 L3 L2 L2 L1 L1 Retrieve Service Archive 1 Archive 2 Delete
  • 65. The Information Lifecycle of a press release
  • 74. Example Press Release Annotation (text) Annotation (image) global toy conference, conference, podium, speaker, lights
  • 75. Do you remember? A game about forgetting.
  • 76. Do you remember the details? Which ocean was the ForgetIT Team examining?
  • 77. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea
  • 78. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag?
  • 79. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag?
  • 80. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB?
  • 81. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB?
  • 82. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB? How many pictures in the shoebox image are mostly blue?
  • 83. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB? How many pictures in the shoebox image are mostly blue?
  • 84. Next steps or how you can participate
  • 85. We’d love to see you participate! Reflect your thoughts with us Take our short survey: http://tinyurl.com/forgetit-webarchiving Tell us your use cases Join the development of TYPO3 features
  • 86. Don’t forget them! We’d love to discuss them with you ... and a beer or two...
  • 87. Thank you for your attention!
  • 89. References (Sources) 1/2 Size of Wikipedia (as of 2013-10-04): https://en.wikipedia.org/wiki/ Wikipedia:Size_comparisons Average JPG size: http://web.forret.com/tools/megapixel.asp? title=12+Megapixel+camera&width=4000&height=3000 Average movie size: http://answers.yahoo.com/question/index? qid=20110807095141AABGQm8 Storage Prices: http://www.jcmit.com/diskprice.htm
  • 90. References (Sources) 2/2 Forget IT Website: http://www.forgetit-project.eu Preservation: http://unfacilitated.preservation101.org/session1/ expl_whatis-definitions.asp Digital Dark Age: https://en.wikipedia.org/wiki/Digital_dark_age
  • 91. References (Books) Delete: The Virtue of Forgetting in the Digital Age, Viktor MayerSchönberger
  • 92. References (Images) 1/8 “About me”: all images by Søren Schaffstein “ForgetIT Team” by Søren Schaffstein “The Problem/Knot”: http://www.istockphoto.com/stockphoto-8933647-rope-with-knot.php “1 Dollar”: http://www.istockphoto.com/stock-photo-17830696-fandollars-isolated-on-white.php Starbucks Cups: http://5feetonagoodday.files.wordpress.com/ 2012/01/starbucks-coffee-cups-sizes-tall-grande-venti-trenta.jpg
  • 93. References (Images) 2/8 IBM 355: http://www-03.ibm.com/ibm/history/exhibits/storage/ storage_355.html IBM 3330: http://www-03.ibm.com/ibm/history/exhibits/storage/ storage_3330.html Seagate ST-238: http://www.redlop.de/bilder/produkte/gross/ Seagate-WREN-5-ST4702N-702-MB-.png Western Digital WD600AB: http://www.junek.de/thomas/bilder/ WD600AB.jpg
  • 94. References (Images) 3/8 Seagate ST32000542AS: http://bilder.afterbuy.de/images/ZZNLZ/ seagatesata.jpg Finger “Forget”: http://www.istockphoto.com/stock-photo-7252836string-finger-reminder-on-white.php Memory Buoyancy: http://www.istockphoto.com/stockphoto-16244755-fishing-hook-underwater.php?st=0320b45 Fish: http://www.istockphoto.com/stock-photo-14623368-gold-fishand-piranha.php
  • 95. References (Images) 4/8 Game pieces by Søren Schaffstein Managed Forgetting: http://www.istockphoto.com/stockphoto-3533508-colorful-memos.php?st=0320b45 Synergetic Preservation: http://www.istockphoto.com/stockphoto-13301920-goldfish-jump.php Contextualised Remembering: http://www.istockphoto.com/stockphoto-14370511-shoebox-of-old-photos-too.php
  • 96. References (Images) 5/8 Cans: http://www.istockphoto.com/stock-photo-16948268-threemetallic-goods-can-with-key.php 5 1/4” Disk: https://secure.flickr.com/photos/twicepix/4330813840/ sizes/z/in/photostream/ 5 1/4” Disk Drawing: https://secure.flickr.com/photos/ flattop341/2094771560/sizes/z/in/photostream/ Ami Pro: http://www.os2museum.com/wp/?attachment_id=99 Digital Dark Age by Søren Schaffstein
  • 97. References (Images) 6/8 Gauges: http://www.istockphoto.com/stock-photo-9059088-oldgauges.php Golf Car: http://www.netzeitung.de/default/337276.html# Golf Car Papers: http://www.motor-talk.de/news/das-heilige-blechwieder-unterm-hammer-t4421282.html Create: http://hdwallsize.com/wp-content/uploads/2013/04/ Abstract-Art-Wallpaper-Dekstop.jpg
  • 98. References (Images) 7/8 Process by Søren Schaffstein Publish: http://www.istockphoto.com/stock-photo-25712828-britishdog-reading.php?st=e5bf164 Analyse: http://www.istockphoto.com/stock-photo-28297160laboratory-experimental-testing.php?st=239c76e Archive: http://www.istockphoto.com/stock-photo-18865341-oldwooden-card-catalogue-with-one-opened-drawer.php
  • 99. References (Images) 8/8 Shoes: http://www.istockphoto.com/stock-photo-2457744-what-syour-walking-style.php?st=e12d3d2 Questions: http://www.istockphoto.com/stock-photo-17686236decision-making.php