SlideShare une entreprise Scribd logo
1  sur  88
File formats and registries Manfred Thaller, University at Cologne October 2 nd , 2007
[object Object]
An image
An image 6 rows 5 columns
5 rows 6 columns
An image 1 == yellow 0 == red 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
An image 1 == violett  0 == green 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
An image Store: 1,1,1,1,1,1,0,0,0,1,1,1,0,1,1,1,1,0,1,1,1,1,0,1,1,1,1,1,1,1 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
An image Store: 6,1,3,0,3,11,0,4,1,1,0,4,1,1,0,7,1 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
An image Store: 1,1,1,1,1,1,0,0,0,1,1,1,0,1,1,1,1,0,1,1,1,1,0,1,1,1,1,1,1,1 Uncompressed 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
An image Store: 6,1,3,0,3,1,1,0,4,1,1,0,4,1,1,0,7,1 (Compressed)Run Length Encoded 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
An image Store: SetSize: 5 by 6 SetBackgroundColor: Blue SetForegroundColor: Red SetLetterHeight: 4 MoveTo: 3,5 DrawLetter: T 1,1 2,1 3,1 4,1 5,1 1,2 2,2 3,2 4,2 5,2 1,3 2,3 3,3 4,3 5,3 1,4 2,4 3,4 4,4 5,4 1,5 2,5 3,5 4,5 5,5 1,6 2,6 3,6 4,6 5,6
An image 6 rows 5 columns 1 == yellow 0 == red Uncompressed
An image dimensions 1 == yellow 0 == red Uncompressed
An image dimensions photogrammetric interpretation Uncompressed
An image dimensions photogrammetric interpretation compression
An image <basic information> <rendering information> <storage information>
An image <basic information> (implicit / explicit) <rendering information> (implicit / explicit) <storage information> (implicit / explicit) …  and the data?
An image <basic information> (implicit / explicit) <rendering information> (implicit / explicit) <storage information> (implicit / explicit) …  and the data?
An image Data either as data stream  1,1,1,1,1,1, 0,0,0,1,1,1, 0,1,1,1,1,0, 1,1,1,1,0,1, 1,1,1,1,1,1 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
An image Data either as data stream  or as processing instructions SetSize: 5 by 6 SetBackgroundColor: Yellow SetForegroundColor: Red SetLetterHeight: 4 MoveTo: 3,5 DrawLetter: T 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
File format <basic information> <rendering information> <storage information> <data>
File format <basic information> What to do? <rendering information> <storage information> <data>
File format <basic information> What to do? <rendering information> How to do it? <storage information> <data>
File format <basic information> What to do? <rendering information> How to do it? <storage information> How to move it from persistent to deployed form? <data>
File format <basic information> What to do? <rendering information> How to do it? <storage information> How to move it from persistent to deployed form? <data> What to deploy?
File format <basic information> What to do? <rendering information> How to do it? <storage information> How to move it from persistent to deployed form? <data> What to deploy?
File format <basic information> Mandatory <rendering information> Useful <storage information> Historical <data> Mandatory
File format A deterministic specification how the properties of a digital object can reversibly be converted into a linear bytestream (bitstream).
File format: TIFF
File format: PDF 1 0 obj <<  /Type /Page  /Parent 281 0 R  /Resources 2 0 R  /Contents 3 0 R  /StructParents 2  /MediaBox [ 0 0 612 792 ]  /CropBox [ 0 0 612 792 ]  /Rotate 0  >>  endobj
File format: PDF 2 0 obj <<  /ProcSet [ /PDF /Text ]  /Font << /TT2 292 0 R /TT4 288 0 R >>  /ExtGState << /GS1 300 0 R >>  /ColorSpace << /Cs6 289 0 R >>  >>  endobj
File format: PDF 3 0 obj << /Length 4605 /Filter /FlateDecode >>  stream H‰„WÛŽÛÈ}×Wô#Œ4jR”¨`±Àø ™Í&quot; ¶(²5j›&quot;¹lräý‘|oêÖ-j —‹ udTÙÂ…fPnˆ¿ìþ>Ó›Ež²ÝÕ˽âä”uª2i*<<v ú[Óžk9Q‰¼‡x»XTP{ ‹ ±/[i²½Ö)}ÔÏö&ªÙH;<Cµ …  and about 4000 bytes more ŠøL&quot;È÷ےƐ¬JYØÂm]j¥Ýqõ¥ÏººÕ™·²ôÒ·Ûº¤–÷.u-kP0 4“øTxM<é識9uôøˆòLi¦ØoTÖ m–;ǯ÷¤ÿlÕºvéU—Ë ±¤Lm°gŸˆu1Åëu5l3¯’¢O %òËTîü7?ìNdh endstream endobj
File format: XML (here: SVG) <?xml version=&quot;1.0&quot; encoding=&quot;UTF-16&quot;?> <svg:svg width=&quot;800&quot; height=&quot;1000&quot; xmlns:svg=&quot;http://www.w3.org ... <svg:rect x=&quot;0&quot; y=&quot;0&quot; width=&quot;800&quot; height=&quot;1000&quot; fill=&quot;white&quot; /> <svg:g transform=&quot;translate(-140,0)&quot;> <svg:line x1=&quot;600&quot; y1=&quot;20&quot; x2=&quot;500&quot; y2=&quot;20&quot; stroke=&quot;black&quot; … <svg:text x=&quot;600&quot; y=&quot;28.8&quot; font-size=&quot;6&quot; fill=&quot;black&quot; … </svg:g> <svg:g transform=&quot;translate(-140,0)&quot;> <svg:text x=&quot;500&quot; y=&quot;24.4&quot;> <svg:tspan font-size=&quot;4&quot; fill=&quot;black&quot;>Leiste</svg:tspan> </svg:text> </svg:g> <svg:defs> <svg:g id=&quot;halbeSaeuleLeiste0&quot;>
File format: XML (here SVG)
File format: XML (ETH: “column XML”) <?xml version=&quot;1.0&quot; encoding=&quot;UTF-8&quot;?> <Autor name=&quot;Vitruv&quot;> <Ordnung name=&quot;Ionisch&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;&quot; TBz=&quot;&quot; TBn=&quot;&quot; … <Element name=&quot;Gebaelk&quot; original=&quot;&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;&quot; … <Element name=&quot;Gesims&quot; original=&quot;corona&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;&quot; … <Element name=&quot;Leiste&quot; original=&quot;&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;0.03&quot; … <Element name=&quot;Kyma&quot; original=&quot;sima&quot; THz=&quot;&quot; THn=&quot;&quot; … <Element name=&quot;Leiste&quot; original=&quot;&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;0.017&quot; … <Element name=&quot;Kyma_reversa&quot; original=&quot;cymatium&quot; THz=&quot;&quot; … <Element name=&quot;Platte&quot; original=&quot;corona&quot; THz=&quot;&quot; THn=&quot;&quot; … <Element name=&quot;Leiste&quot; original=&quot;&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;0.017&quot; … <Element name=&quot;Kyma_reversa&quot; original=&quot;cymatium&quot; THz=&quot;&quot; … <hElement name=&quot;Band&quot; typ=&quot;1&quot; dx=&quot;0.048&quot; r=&quot;0.019&quot;/> <hElement name=&quot;Band&quot; typ=&quot;1&quot; dx=&quot;0.048&quot; r=&quot;0.019&quot;/> </Element>
Files and Preservation ,[object Object],[object Object]
Bit rot An Image file before ….
Bit rot ... and after one  byte is changed.
Bit rot ... and after one  byte is changed. Undetectable by software.
Bit rot Processing dictionary   Payload 002 004 234 123 234 156 127 178 221 221
Bit rot One byte is damaged, one byte cannot be displayed correctly. 002 004 234 123 234 156 127 xxx 221 221
Bit rot One byte is damaged, ten bytes cannot be displayed correctly. 002 xxx 234 123 234 156 127 178 221 221
Result:  http://www.cflr.beniculturali.it/Progetti/Fixit.php www.cflr.beniculturali.it Franco Liberati [email_address] Università di Roma “La Sapienza”  Dipartimento Informatica  Centro Fotoriproduzione  Legatoria e Restauro Paolo Buonora [email_address]
Paolo on JPEG JPEG2000 more robust against bit rot than TIFF.
Paolo on JPEG JPEG2000 more robust against bit rot than TIFF. So, to stinulate more empiricism …
Obsolescence ,[object Object],[object Object],[object Object],[object Object]
Recommended formats: text http://www.fcla.edu/digitalArchive/pdfs/recFormats.pdf High confidence Medium confidence Low confidence ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Recommended formats: bitmap / raster image http://www.fcla.edu/digitalArchive/pdfs/recFormats.pdf High confidence Medium confidence Low confidence ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Recommended formats: vector graphics http://www.fcla.edu/digitalArchive/pdfs/recFormats.pdf High confidence Medium confidence Low confidence ,[object Object],[object Object],[object Object],[object Object],[object Object]
Recommended formats: audio http://www.fcla.edu/digitalArchive/pdfs/recFormats.pdf High confidence Medium confidence Low confidence ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Recommended formats: video http://www.fcla.edu/digitalArchive/pdfs/recFormats.pdf High confidence Medium confidence Low confidence ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Recommended formats: “data base” http://www.fcla.edu/digitalArchive/pdfs/recFormats.pdf High confidence Medium confidence Low confidence ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Recommended formats: 3D (“virtual reality”) http://www.fcla.edu/digitalArchive/pdfs/recFormats.pdf High confidence Medium confidence Low confidence ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What kind of file is this?  ,[object Object],[object Object],[object Object]
What kind of file is this?  ,[object Object],[object Object],[object Object]
What kind of file is this?  Two ways to identify a file: (b) By internal characteristics („magic number“, „signature“). A TIFF file begins with … Bytes 0-1: The byte order used within the file. Legal values are: “ II” (4949.H) / “MM” (4D4D.H) Bytes 2-3 An arbitrary but carefully chosen number (42) that further identifies the file as a TIFF file.
What kind of file is this?  ,[object Object],[object Object],[object Object],[object Object]
What kind of file is this?  The following 4 transparencies are a quotation from http://hul.harvard.edu/gdfr (see below).
Why Do We Need a Registry? ,[object Object],[object Object],[object Object],Global Digital Format Registry DSpace User Group, March 2004
Potential Use Cases ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Global Digital Format Registry DSpace User Group, March 2004
Repository Format Dependencies  Using the OAIS Reference Model Global Digital Format Registry DSpace User Group, March 2004
What’s Wrong with MIME Types? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Global Digital Format Registry DSpace User Group, March 2004
File format registries - URLs PRONOM : http://www.nationalarchives.gov.uk/pronom/ (does not only rely on extensions) Global Digital Format Registry : http://hul.harvard.edu/gdfr (predominantly project description) FileExt : http://filext.com (predominantly links to software)
Exercise I: A few experiments Group 1 Aist ė  Abromaityt ė Tomasz Jablonski Aadi Kaljuvee Jurat ė  Kuprien ė Violeta Meiliūnait ė
Exercise I: A few experiments Group 2 Libor Coufal Edvardas Germanas Hamid Rofoogaran Laima Šiudikiene Egl ė  Žvinyt ė
Exercise I: A few experiments Group 3 Renata Balandien ė Thomas Guignard Edgars Jekabsons Elona Malaiškien ė Bjorn Ragnolf Ronning
Exercise I: A few experiments Group 4 Gražina Deveikyt ė Raimondas Malaiška Filip Kwiatek Marija Prokopčik Piret Randmae Jelena Saikovič
PART II – Formats in PLANETS: File characteristics
PART II – Formats in PLANETS: File characteristics ,[object Object],[object Object],[object Object]
[object Object],2017 0,99% Tooth of Time Extractor Format specified in XCEL Comparer  XCDL 2017 XCDL 2007
[object Object],png 0,93% Migrator Extractor tiff XCEL png XCEL ... XCEL ... XCEL Comparer  png XCDL tiff XCDL
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],<xcdl> <object id=&quot;o1&quot; > <normData id=&quot;nd1&quot; > ... </normData> <property id=&quot;p1&quot; source=&quot;raw&quot; cat=&quot;descr&quot; > <name> compression </name> <valueSet id=&quot;i_i1_s6&quot; > <rawValue>0 </rawValue> <labValue>...</labValue> <dataRef ind=&quot;normAll&quot; /> <propRel/> </valueSet> </property> <property id=&quot;p2&quot; source=&quot;raw&quot; cat=&quot;descr&quot; > <name> height </name> <valueSet id=&quot;i_i1_s3&quot; > <rawValue>0 0 1 ad </rawValue> <labValue> <val>429</val> <type>uint32</type> </labValue> <dataRef ind=&quot;normAll&quot; /> <propRel/> </valueSet> </property> <property id=&quot;p3&quot; source=&quot;raw&quot; cat=&quot;descr&quot; > <name> imageType </name> .....
 
Confession
Confession Computer science does not really know what information is.
Computer science does not really know what information is. It is pretty good at representing and processing it, though.
Representations &  migrations III == 3 ==  γ ‘ == ●●● Four representations of the idea / concept / model  three
Representations &  migrations I divided by III == 1 / 3 == 1.3333? I divided by III == 1 / 3 == 1.3 periodic Some ideas are  handled  more precisely by Some  thinkers  than others.
Representations &  migrations 48 bit images on 24 and on 48 bit graphics cards. Some data is  processed  more adequately by some  equipment  than others
Representations &  migrations A model for information before and after a migration must therefore potentially represent all information there, irrespective of the possibility to process it in a given environment.
XCEL / XCDL Languages are being processed … …  development focus currently: dynamic handling of format specific algorithms.
  XCEL / XCDL: image model (1) A pixel cube … Each pixel: MSB (channel 1), … LSB (channel 1), … MSB (channel n), … LSB (channel n), MSB (aux 1), … LSB (aux 1), … MSB (aux m), … LSB (aux m)
  XCEL / XCDL: image model (2) A pixel cube … Accompanied by  rendering info  plus deployment info.
XCEL / XCDL: image model - example <property id=&quot;p4&quot; source=&quot;raw&quot; cat=&quot;descr&quot; > <name>imageType</name> <valueSet id=&quot;i_i1_s5&quot; > <rawValue>2</rawValue> <labValue> <val>truecolour</val> <type>fixedLabel</type> </labValue> <dataRef ind=&quot;normAll&quot; /> <propRel/> </valueSet> </property>
  XCEL / XCDL: text model A text (= <object>) is  composed of - data (<normData>) plus - interpretations of data according to the underlying format specification (=properties; <property>).
  XCEL / XCDL: text model - example This  is a  text <refData id=&quot;1&quot;>54 68 69 73 20 69 73 20 61 20 74 65 78 74</refData> … <property> <name>fontsize</name> <rawVal> <val>00 18</val> <type>unsignedInt8</type> </rawVal> <dataRef> <!-- property refers to discrete part of reference data- -> <ref id=&quot;1&quot; start=&quot;0&quot; end=&quot;3&quot;/> <ref id=&quot;1&quot; start=“10&quot; end=&quot;12&quot;/> </dataRef> </property>
Exercise II: Abstract modelling Group 1: maps Group 2: music Group 3: excel sheets Group 4: „books“ … ever heard of FRBR?

Contenu connexe

Similaire à Trm 02 10 07vilnius

TAUS USER CONFERENCE 2009, Normalization of translation memories
TAUS USER CONFERENCE 2009, Normalization of translation memoriesTAUS USER CONFERENCE 2009, Normalization of translation memories
TAUS USER CONFERENCE 2009, Normalization of translation memoriesTAUS - The Language Data Network
 
Digital + Container List
Digital + Container ListDigital + Container List
Digital + Container Listguest53eac8
 
Faster! Faster! Accelerate your business with blazing prototypes
Faster! Faster! Accelerate your business with blazing prototypesFaster! Faster! Accelerate your business with blazing prototypes
Faster! Faster! Accelerate your business with blazing prototypesOSCON Byrum
 
Data Portability - Fronteers Meetup
Data Portability - Fronteers MeetupData Portability - Fronteers Meetup
Data Portability - Fronteers MeetupAndreas - Creten
 
SIPOC Model PowerPoint Presentation Slides
SIPOC Model PowerPoint Presentation SlidesSIPOC Model PowerPoint Presentation Slides
SIPOC Model PowerPoint Presentation SlidesSlideTeam
 
Generic Setup De-Mystified
Generic Setup De-MystifiedGeneric Setup De-Mystified
Generic Setup De-MystifiedClayton Parker
 
Significant Characteristics In Planets Manfred Thaller
Significant Characteristics In Planets Manfred ThallerSignificant Characteristics In Planets Manfred Thaller
Significant Characteristics In Planets Manfred ThallerDigitalPreservationEurope
 
Eficiency and Low Cost: Pro Tips for you to save 50% of your money with Googl...
Eficiency and Low Cost: Pro Tips for you to save 50% of your money with Googl...Eficiency and Low Cost: Pro Tips for you to save 50% of your money with Googl...
Eficiency and Low Cost: Pro Tips for you to save 50% of your money with Googl...Daniel Cukier
 
IPO Framework PowerPoint Presentation Slides
IPO Framework PowerPoint Presentation SlidesIPO Framework PowerPoint Presentation Slides
IPO Framework PowerPoint Presentation SlidesSlideTeam
 
“LATEST CONSTRUCTION EQUIPMENTS USED IN CIVIL ENGINEERING"
“LATEST CONSTRUCTION EQUIPMENTS USED IN CIVIL ENGINEERING"“LATEST CONSTRUCTION EQUIPMENTS USED IN CIVIL ENGINEERING"
“LATEST CONSTRUCTION EQUIPMENTS USED IN CIVIL ENGINEERING"ROHANVADURLE
 
Towards the Digital Archive – Challenges and Best Practice: A Look at Digitiz...
Towards the Digital Archive – Challenges and Best Practice: A Look at Digitiz...Towards the Digital Archive – Challenges and Best Practice: A Look at Digitiz...
Towards the Digital Archive – Challenges and Best Practice: A Look at Digitiz...Axiell ALM
 
AiCore Brochure 27-Mar-2023-205529.pdf
AiCore Brochure 27-Mar-2023-205529.pdfAiCore Brochure 27-Mar-2023-205529.pdf
AiCore Brochure 27-Mar-2023-205529.pdfAjayRawat829497
 
Api anti patterns
Api anti patternsApi anti patterns
Api anti patternsMike Pearce
 
OSCON 2004: XML and Apache
OSCON 2004: XML and ApacheOSCON 2004: XML and Apache
OSCON 2004: XML and ApacheTed Leung
 
basic knowledge abot html
basic knowledge abot htmlbasic knowledge abot html
basic knowledge abot htmlAnkit Dubey
 
Getting more out of Matplotlib with GR
Getting more out of Matplotlib with GRGetting more out of Matplotlib with GR
Getting more out of Matplotlib with GRJosef Heinen
 
Banner to Moodle Integration
Banner to Moodle IntegrationBanner to Moodle Integration
Banner to Moodle Integrationeamerril
 

Similaire à Trm 02 10 07vilnius (20)

TAUS USER CONFERENCE 2009, Normalization of translation memories
TAUS USER CONFERENCE 2009, Normalization of translation memoriesTAUS USER CONFERENCE 2009, Normalization of translation memories
TAUS USER CONFERENCE 2009, Normalization of translation memories
 
HTML5: 5 Quick Wins
HTML5:  5 Quick WinsHTML5:  5 Quick Wins
HTML5: 5 Quick Wins
 
Digital + Container List
Digital + Container ListDigital + Container List
Digital + Container List
 
Faster! Faster! Accelerate your business with blazing prototypes
Faster! Faster! Accelerate your business with blazing prototypesFaster! Faster! Accelerate your business with blazing prototypes
Faster! Faster! Accelerate your business with blazing prototypes
 
Data Portability - Fronteers Meetup
Data Portability - Fronteers MeetupData Portability - Fronteers Meetup
Data Portability - Fronteers Meetup
 
SIPOC Model PowerPoint Presentation Slides
SIPOC Model PowerPoint Presentation SlidesSIPOC Model PowerPoint Presentation Slides
SIPOC Model PowerPoint Presentation Slides
 
Stc2011
Stc2011Stc2011
Stc2011
 
Generic Setup De-Mystified
Generic Setup De-MystifiedGeneric Setup De-Mystified
Generic Setup De-Mystified
 
Significant Characteristics In Planets Manfred Thaller
Significant Characteristics In Planets Manfred ThallerSignificant Characteristics In Planets Manfred Thaller
Significant Characteristics In Planets Manfred Thaller
 
Eficiency and Low Cost: Pro Tips for you to save 50% of your money with Googl...
Eficiency and Low Cost: Pro Tips for you to save 50% of your money with Googl...Eficiency and Low Cost: Pro Tips for you to save 50% of your money with Googl...
Eficiency and Low Cost: Pro Tips for you to save 50% of your money with Googl...
 
IPO Framework PowerPoint Presentation Slides
IPO Framework PowerPoint Presentation SlidesIPO Framework PowerPoint Presentation Slides
IPO Framework PowerPoint Presentation Slides
 
“LATEST CONSTRUCTION EQUIPMENTS USED IN CIVIL ENGINEERING"
“LATEST CONSTRUCTION EQUIPMENTS USED IN CIVIL ENGINEERING"“LATEST CONSTRUCTION EQUIPMENTS USED IN CIVIL ENGINEERING"
“LATEST CONSTRUCTION EQUIPMENTS USED IN CIVIL ENGINEERING"
 
Towards the Digital Archive – Challenges and Best Practice: A Look at Digitiz...
Towards the Digital Archive – Challenges and Best Practice: A Look at Digitiz...Towards the Digital Archive – Challenges and Best Practice: A Look at Digitiz...
Towards the Digital Archive – Challenges and Best Practice: A Look at Digitiz...
 
AiCore Brochure 27-Mar-2023-205529.pdf
AiCore Brochure 27-Mar-2023-205529.pdfAiCore Brochure 27-Mar-2023-205529.pdf
AiCore Brochure 27-Mar-2023-205529.pdf
 
Api anti patterns
Api anti patternsApi anti patterns
Api anti patterns
 
OSCON 2004: XML and Apache
OSCON 2004: XML and ApacheOSCON 2004: XML and Apache
OSCON 2004: XML and Apache
 
basic knowledge abot html
basic knowledge abot htmlbasic knowledge abot html
basic knowledge abot html
 
Html5 Overview
Html5 OverviewHtml5 Overview
Html5 Overview
 
Getting more out of Matplotlib with GR
Getting more out of Matplotlib with GRGetting more out of Matplotlib with GR
Getting more out of Matplotlib with GR
 
Banner to Moodle Integration
Banner to Moodle IntegrationBanner to Moodle Integration
Banner to Moodle Integration
 

Plus de DigitalPreservationEurope

Digital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigitalPreservationEurope
 
Scalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross KingScalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross KingDigitalPreservationEurope
 
Preservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian UpshallPreservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian UpshallDigitalPreservationEurope
 
Preservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore MelePreservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore MeleDigitalPreservationEurope
 

Plus de DigitalPreservationEurope (20)

Infrastructure Training Session
Infrastructure Training SessionInfrastructure Training Session
Infrastructure Training Session
 
Drm Training Session
Drm Training SessionDrm Training Session
Drm Training Session
 
2009 Barcelona Wepreserve Nestor
2009 Barcelona Wepreserve Nestor2009 Barcelona Wepreserve Nestor
2009 Barcelona Wepreserve Nestor
 
Trusted Repositories
Trusted RepositoriesTrusted Repositories
Trusted Repositories
 
Preservation Metadata
Preservation MetadataPreservation Metadata
Preservation Metadata
 
An Introduction to Digital Preservation
An Introduction to Digital PreservationAn Introduction to Digital Preservation
An Introduction to Digital Preservation
 
Digital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and Requirements
 
The Planets Preservation Planning workflow
The Planets Preservation Planning workflowThe Planets Preservation Planning workflow
The Planets Preservation Planning workflow
 
Preservation Metadata, Michael Day, DCC
Preservation Metadata, Michael Day, DCCPreservation Metadata, Michael Day, DCC
Preservation Metadata, Michael Day, DCC
 
PLATTER - Jan Hutar
PLATTER - Jan HutarPLATTER - Jan Hutar
PLATTER - Jan Hutar
 
Sustainability Clive Billenness
Sustainability Clive  BillennessSustainability Clive  Billenness
Sustainability Clive Billenness
 
Shaman Project Hemmje
Shaman Project  HemmjeShaman Project  Hemmje
Shaman Project Hemmje
 
Scalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross KingScalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross King
 
Risks Benefits And Motivations Seamus Ross
Risks Benefits And Motivations Seamus RossRisks Benefits And Motivations Seamus Ross
Risks Benefits And Motivations Seamus Ross
 
Representation Information Steve Rankin
Representation Information Steve RankinRepresentation Information Steve Rankin
Representation Information Steve Rankin
 
Preservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian UpshallPreservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian Upshall
 
Preservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore MelePreservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore Mele
 
Platter Colin Rosenthal
Platter Colin RosenthalPlatter Colin Rosenthal
Platter Colin Rosenthal
 
Planets Testbed Brian Aitken
Planets Testbed Brian AitkenPlanets Testbed Brian Aitken
Planets Testbed Brian Aitken
 
Oais Based Information Flow Esther Conway
Oais Based Information Flow Esther ConwayOais Based Information Flow Esther Conway
Oais Based Information Flow Esther Conway
 

Dernier

Cracking the Cultural Competence Code.pptx
Cracking the Cultural Competence Code.pptxCracking the Cultural Competence Code.pptx
Cracking the Cultural Competence Code.pptxWorkforce Group
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear RegressionRavindra Nath Shukla
 
Value Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and painsValue Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and painsP&CO
 
Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...Roland Driesen
 
Famous Olympic Siblings from the 21st Century
Famous Olympic Siblings from the 21st CenturyFamous Olympic Siblings from the 21st Century
Famous Olympic Siblings from the 21st Centuryrwgiffor
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdfRenandantas16
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...amitlee9823
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Lviv Startup Club
 
Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...Roland Driesen
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Serviceritikaroy0888
 
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...lizamodels9
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...anilsa9823
 
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779Delhi Call girls
 
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...amitlee9823
 
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒anilsa9823
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfPaul Menig
 
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfAdmir Softic
 

Dernier (20)

Cracking the Cultural Competence Code.pptx
Cracking the Cultural Competence Code.pptxCracking the Cultural Competence Code.pptx
Cracking the Cultural Competence Code.pptx
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear Regression
 
Value Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and painsValue Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and pains
 
Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...
 
Famous Olympic Siblings from the 21st Century
Famous Olympic Siblings from the 21st CenturyFamous Olympic Siblings from the 21st Century
Famous Olympic Siblings from the 21st Century
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
 
Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Service
 
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
 
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
 
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
 
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdf
 
Forklift Operations: Safety through Cartoons
Forklift Operations: Safety through CartoonsForklift Operations: Safety through Cartoons
Forklift Operations: Safety through Cartoons
 
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
 

Trm 02 10 07vilnius

  • 1. File formats and registries Manfred Thaller, University at Cologne October 2 nd , 2007
  • 2.
  • 4. An image 6 rows 5 columns
  • 5. 5 rows 6 columns
  • 6. An image 1 == yellow 0 == red 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
  • 7. An image 1 == violett 0 == green 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
  • 8. An image Store: 1,1,1,1,1,1,0,0,0,1,1,1,0,1,1,1,1,0,1,1,1,1,0,1,1,1,1,1,1,1 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
  • 9. An image Store: 6,1,3,0,3,11,0,4,1,1,0,4,1,1,0,7,1 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
  • 10. An image Store: 1,1,1,1,1,1,0,0,0,1,1,1,0,1,1,1,1,0,1,1,1,1,0,1,1,1,1,1,1,1 Uncompressed 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
  • 11. An image Store: 6,1,3,0,3,1,1,0,4,1,1,0,4,1,1,0,7,1 (Compressed)Run Length Encoded 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
  • 12. An image Store: SetSize: 5 by 6 SetBackgroundColor: Blue SetForegroundColor: Red SetLetterHeight: 4 MoveTo: 3,5 DrawLetter: T 1,1 2,1 3,1 4,1 5,1 1,2 2,2 3,2 4,2 5,2 1,3 2,3 3,3 4,3 5,3 1,4 2,4 3,4 4,4 5,4 1,5 2,5 3,5 4,5 5,5 1,6 2,6 3,6 4,6 5,6
  • 13. An image 6 rows 5 columns 1 == yellow 0 == red Uncompressed
  • 14. An image dimensions 1 == yellow 0 == red Uncompressed
  • 15. An image dimensions photogrammetric interpretation Uncompressed
  • 16. An image dimensions photogrammetric interpretation compression
  • 17. An image <basic information> <rendering information> <storage information>
  • 18. An image <basic information> (implicit / explicit) <rendering information> (implicit / explicit) <storage information> (implicit / explicit) … and the data?
  • 19. An image <basic information> (implicit / explicit) <rendering information> (implicit / explicit) <storage information> (implicit / explicit) … and the data?
  • 20. An image Data either as data stream 1,1,1,1,1,1, 0,0,0,1,1,1, 0,1,1,1,1,0, 1,1,1,1,0,1, 1,1,1,1,1,1 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
  • 21. An image Data either as data stream or as processing instructions SetSize: 5 by 6 SetBackgroundColor: Yellow SetForegroundColor: Red SetLetterHeight: 4 MoveTo: 3,5 DrawLetter: T 1 1 1 1 1 1 0 0 0 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 1 1
  • 22. File format <basic information> <rendering information> <storage information> <data>
  • 23. File format <basic information> What to do? <rendering information> <storage information> <data>
  • 24. File format <basic information> What to do? <rendering information> How to do it? <storage information> <data>
  • 25. File format <basic information> What to do? <rendering information> How to do it? <storage information> How to move it from persistent to deployed form? <data>
  • 26. File format <basic information> What to do? <rendering information> How to do it? <storage information> How to move it from persistent to deployed form? <data> What to deploy?
  • 27. File format <basic information> What to do? <rendering information> How to do it? <storage information> How to move it from persistent to deployed form? <data> What to deploy?
  • 28. File format <basic information> Mandatory <rendering information> Useful <storage information> Historical <data> Mandatory
  • 29. File format A deterministic specification how the properties of a digital object can reversibly be converted into a linear bytestream (bitstream).
  • 31. File format: PDF 1 0 obj << /Type /Page /Parent 281 0 R /Resources 2 0 R /Contents 3 0 R /StructParents 2 /MediaBox [ 0 0 612 792 ] /CropBox [ 0 0 612 792 ] /Rotate 0 >> endobj
  • 32. File format: PDF 2 0 obj << /ProcSet [ /PDF /Text ] /Font << /TT2 292 0 R /TT4 288 0 R >> /ExtGState << /GS1 300 0 R >> /ColorSpace << /Cs6 289 0 R >> >> endobj
  • 33. File format: PDF 3 0 obj << /Length 4605 /Filter /FlateDecode >> stream H‰„WÛŽÛÈ}×Wô#Œ4jR”¨`±Àø ™Í&quot; ¶(²5j›&quot;¹lräý‘|oêÖ-j —‹ udTÙÂ…fPnˆ¿ìþ>Ó›Ež²ÝÕ˽âä”uª2i*<<v ú[Óžk9Q‰¼‡x»XTP{ ‹ ±/[i²½Ö)}ÔÏö&ªÙH;<Cµ … and about 4000 bytes more ŠøL&quot;È÷ےƐ¬JYØÂm]j¥Ýqõ¥ÏººÕ™·²ôÒ·Ûº¤–÷.u-kP0 4“øTxM<é識9uôøˆòLi¦ØoTÖ m–;ǯ÷¤ÿlÕºvéU—Ë ±¤Lm°gŸˆu1Åëu5l3¯’¢O %òËTîü7?ìNdh endstream endobj
  • 34. File format: XML (here: SVG) <?xml version=&quot;1.0&quot; encoding=&quot;UTF-16&quot;?> <svg:svg width=&quot;800&quot; height=&quot;1000&quot; xmlns:svg=&quot;http://www.w3.org ... <svg:rect x=&quot;0&quot; y=&quot;0&quot; width=&quot;800&quot; height=&quot;1000&quot; fill=&quot;white&quot; /> <svg:g transform=&quot;translate(-140,0)&quot;> <svg:line x1=&quot;600&quot; y1=&quot;20&quot; x2=&quot;500&quot; y2=&quot;20&quot; stroke=&quot;black&quot; … <svg:text x=&quot;600&quot; y=&quot;28.8&quot; font-size=&quot;6&quot; fill=&quot;black&quot; … </svg:g> <svg:g transform=&quot;translate(-140,0)&quot;> <svg:text x=&quot;500&quot; y=&quot;24.4&quot;> <svg:tspan font-size=&quot;4&quot; fill=&quot;black&quot;>Leiste</svg:tspan> </svg:text> </svg:g> <svg:defs> <svg:g id=&quot;halbeSaeuleLeiste0&quot;>
  • 35. File format: XML (here SVG)
  • 36. File format: XML (ETH: “column XML”) <?xml version=&quot;1.0&quot; encoding=&quot;UTF-8&quot;?> <Autor name=&quot;Vitruv&quot;> <Ordnung name=&quot;Ionisch&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;&quot; TBz=&quot;&quot; TBn=&quot;&quot; … <Element name=&quot;Gebaelk&quot; original=&quot;&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;&quot; … <Element name=&quot;Gesims&quot; original=&quot;corona&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;&quot; … <Element name=&quot;Leiste&quot; original=&quot;&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;0.03&quot; … <Element name=&quot;Kyma&quot; original=&quot;sima&quot; THz=&quot;&quot; THn=&quot;&quot; … <Element name=&quot;Leiste&quot; original=&quot;&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;0.017&quot; … <Element name=&quot;Kyma_reversa&quot; original=&quot;cymatium&quot; THz=&quot;&quot; … <Element name=&quot;Platte&quot; original=&quot;corona&quot; THz=&quot;&quot; THn=&quot;&quot; … <Element name=&quot;Leiste&quot; original=&quot;&quot; THz=&quot;&quot; THn=&quot;&quot; MH=&quot;0.017&quot; … <Element name=&quot;Kyma_reversa&quot; original=&quot;cymatium&quot; THz=&quot;&quot; … <hElement name=&quot;Band&quot; typ=&quot;1&quot; dx=&quot;0.048&quot; r=&quot;0.019&quot;/> <hElement name=&quot;Band&quot; typ=&quot;1&quot; dx=&quot;0.048&quot; r=&quot;0.019&quot;/> </Element>
  • 37.
  • 38. Bit rot An Image file before ….
  • 39. Bit rot ... and after one byte is changed.
  • 40. Bit rot ... and after one byte is changed. Undetectable by software.
  • 41. Bit rot Processing dictionary Payload 002 004 234 123 234 156 127 178 221 221
  • 42. Bit rot One byte is damaged, one byte cannot be displayed correctly. 002 004 234 123 234 156 127 xxx 221 221
  • 43. Bit rot One byte is damaged, ten bytes cannot be displayed correctly. 002 xxx 234 123 234 156 127 178 221 221
  • 44. Result: http://www.cflr.beniculturali.it/Progetti/Fixit.php www.cflr.beniculturali.it Franco Liberati [email_address] Università di Roma “La Sapienza” Dipartimento Informatica Centro Fotoriproduzione Legatoria e Restauro Paolo Buonora [email_address]
  • 45. Paolo on JPEG JPEG2000 more robust against bit rot than TIFF.
  • 46. Paolo on JPEG JPEG2000 more robust against bit rot than TIFF. So, to stinulate more empiricism …
  • 47.
  • 48.
  • 49.
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57. What kind of file is this? Two ways to identify a file: (b) By internal characteristics („magic number“, „signature“). A TIFF file begins with … Bytes 0-1: The byte order used within the file. Legal values are: “ II” (4949.H) / “MM” (4D4D.H) Bytes 2-3 An arbitrary but carefully chosen number (42) that further identifies the file as a TIFF file.
  • 58.
  • 59. What kind of file is this? The following 4 transparencies are a quotation from http://hul.harvard.edu/gdfr (see below).
  • 60.
  • 61.
  • 62. Repository Format Dependencies Using the OAIS Reference Model Global Digital Format Registry DSpace User Group, March 2004
  • 63.
  • 64. File format registries - URLs PRONOM : http://www.nationalarchives.gov.uk/pronom/ (does not only rely on extensions) Global Digital Format Registry : http://hul.harvard.edu/gdfr (predominantly project description) FileExt : http://filext.com (predominantly links to software)
  • 65. Exercise I: A few experiments Group 1 Aist ė Abromaityt ė Tomasz Jablonski Aadi Kaljuvee Jurat ė Kuprien ė Violeta Meiliūnait ė
  • 66. Exercise I: A few experiments Group 2 Libor Coufal Edvardas Germanas Hamid Rofoogaran Laima Šiudikiene Egl ė Žvinyt ė
  • 67. Exercise I: A few experiments Group 3 Renata Balandien ė Thomas Guignard Edgars Jekabsons Elona Malaiškien ė Bjorn Ragnolf Ronning
  • 68. Exercise I: A few experiments Group 4 Gražina Deveikyt ė Raimondas Malaiška Filip Kwiatek Marija Prokopčik Piret Randmae Jelena Saikovič
  • 69. PART II – Formats in PLANETS: File characteristics
  • 70.
  • 71.
  • 72.
  • 73.
  • 74.  
  • 76. Confession Computer science does not really know what information is.
  • 77. Computer science does not really know what information is. It is pretty good at representing and processing it, though.
  • 78. Representations & migrations III == 3 == γ ‘ == ●●● Four representations of the idea / concept / model three
  • 79. Representations & migrations I divided by III == 1 / 3 == 1.3333? I divided by III == 1 / 3 == 1.3 periodic Some ideas are handled more precisely by Some thinkers than others.
  • 80. Representations & migrations 48 bit images on 24 and on 48 bit graphics cards. Some data is processed more adequately by some equipment than others
  • 81. Representations & migrations A model for information before and after a migration must therefore potentially represent all information there, irrespective of the possibility to process it in a given environment.
  • 82. XCEL / XCDL Languages are being processed … … development focus currently: dynamic handling of format specific algorithms.
  • 83. XCEL / XCDL: image model (1) A pixel cube … Each pixel: MSB (channel 1), … LSB (channel 1), … MSB (channel n), … LSB (channel n), MSB (aux 1), … LSB (aux 1), … MSB (aux m), … LSB (aux m)
  • 84. XCEL / XCDL: image model (2) A pixel cube … Accompanied by rendering info plus deployment info.
  • 85. XCEL / XCDL: image model - example <property id=&quot;p4&quot; source=&quot;raw&quot; cat=&quot;descr&quot; > <name>imageType</name> <valueSet id=&quot;i_i1_s5&quot; > <rawValue>2</rawValue> <labValue> <val>truecolour</val> <type>fixedLabel</type> </labValue> <dataRef ind=&quot;normAll&quot; /> <propRel/> </valueSet> </property>
  • 86. XCEL / XCDL: text model A text (= <object>) is composed of - data (<normData>) plus - interpretations of data according to the underlying format specification (=properties; <property>).
  • 87. XCEL / XCDL: text model - example This is a text <refData id=&quot;1&quot;>54 68 69 73 20 69 73 20 61 20 74 65 78 74</refData> … <property> <name>fontsize</name> <rawVal> <val>00 18</val> <type>unsignedInt8</type> </rawVal> <dataRef> <!-- property refers to discrete part of reference data- -> <ref id=&quot;1&quot; start=&quot;0&quot; end=&quot;3&quot;/> <ref id=&quot;1&quot; start=“10&quot; end=&quot;12&quot;/> </dataRef> </property>
  • 88. Exercise II: Abstract modelling Group 1: maps Group 2: music Group 3: excel sheets Group 4: „books“ … ever heard of FRBR?