SlideShare une entreprise Scribd logo
1  sur  29
Significant characteristics in Planets


            Manfred Thaller
           Universität zu* Köln

          *University at not of Cologne
What are “significant characteristics”?

Those properties of a digital file which have to
  be known to enable the processing of the file
  within a specific setup.
Why extract them by software?

To create technical metadata as required by
  organizational models for long term
  preservation. (NLNZ)
Within Planets …

… served by solutions to identify formats:
  formats registry / PRONOM / DROID.

… and a solution for extracting and processing
  such characteristics: XCL.
A Vision
            Extractor                   tiff XCDL
  tiff

                                  93%
Migrator                                Comparator



  png      tiff XCEL   png XCEL         png XCDL
A Vision
 Extractor




                    Comparator
Appropriate XCELs

                        C-Set
Why automate?

1 million objects: use one second for each.

== 16666.7 minutes == 277.8 hours

== 11.57 working days of a computer

== 34.7 8-hour days for a Human

== 7 working weeks
Why automate?

1 million objects: use five minutes for each.

== 416 666.7 hours

== 52 803.4 8-hour days for a Human
Why automate?

Assumption: Preservation is only feasible, if the
content of two digital objects can be compared
without human intervention, giving a numerical
estimate of their degree of similarity.
Demo
Abstract solution I
(1) Language to represent the complete content of a digital object.
    XCDL
(2) Language to describe any machine readable format in a formal
     language.
    XCEL
(3) Software to extract the content of a file based upon a description
     as under (2) and express it in the language as specified under (1).
    “extractor”
(4) Software to compare two such content descriptions.
     “comparator”
<XCELDocument...>           ...                       <xcdl>
<formatDescription>....                                 <object id=quot;o1quot; >
<symbol identifier=quot;ID01_I01_I01_S02quot;                       <normData id=quot;nd1quot; > ... </normData>
     originalName=quot;height“ interpretation=quot;uint32quot;>
                                                            <property id=quot;p1quot; source=quot;rawquot;
    <range><startposition xsi:type=quot;sequential“>         cat=quot;descrquot; >
     </startposition>
       <length xsi:type=quot;fixedquot;>4</length></range>               <name>   compression</name>
     <name>height</name>                                       <valueSet id=quot;i_i1_s6quot; >
                                                                   <rawValue>0 </rawValue>
</symbol>
                                                                   <labValue>...</labValue>
<symbol identifier=quot;ID01_I01_I01_S04quot;
     originalName=quot;colourTypequot;>                                    <dataRef ind=quot;normAllquot; />
   <range>                                                         <propRel/>
     <startposition xsi:type=quot;sequentialquot;>                     </valueSet>
     </startposition>                                      </property>
     <length xsi:type=quot;fixedquot;>1</length></range>           <property id=quot;p2quot; source=quot;rawquot;
   <valueInterpretation>                                cat=quot;descrquot; >
         <valueLabel>greyscale</valueLabel>                      <name>   height</name>
         <value>0</value></valueinterpretation>
                                                               <valueSet id=quot;i_i1_s3quot; >
    <name>imageType</name>                                         <rawValue>0 0 1 ad </rawValue>
</symbol>                                                          <labValue>
<symbol identifier=quot;ID01_I01_I01_S05quot;                                  <val>429</val>
     originalName=quot;compressionMethodquot;>                                 <type>uint32</type>
   <range>                                                         </labValue>
       <startposition xsi:type=quot;sequential“>                       <dataRef ind=quot;normAllquot; />
     </startposition>                                              <propRel/>
        <length                                                </valueSet>
     xsi:type=quot;fixedquot;>1</length></range>                   </property>
   <valueInterpretation>                                   <property id=quot;p3quot; source=quot;rawquot;
     <valueLabel>zlibDeflateInflate</valueLabel>        cat=quot;descrquot; >
     <value>0</value></valueInterpretation>
                                                                 <name>   imageType</name>
   <name>compression</name>
                                                         .....
</symbol>...
<request2>                                      <property id=quot;2quot;
    <measurementRequest>                           name=quot;imageHeightquot;
          <source name=quot;XCDL1.xmlquot;/>               unit=quot;pixelquot;
          <target name=quot;XCDL2.xmlquot;/>               compStatus=quot;completequot;>
        <property id=quot;45quot; name=quot;rgbPalettequot;>
                                                   <values type=quot;intquot;>
            <metric id=quot;10quot;
     name=quot;hammingDistancequot;/>                         <src>32</src>
     </property>
        <property id=quot;300quot; name=quot;normDataquot;>
                                                      <tar>32</tar>
            <metric id=quot;10quot;                        </values>
     name=quot;hammingDistancequot;/>
            <metric id=quot;50quot; name=quot;RMSEquot;/>          <metric id=quot;200quot;
        </property>                                name=quot;equalquot; result=quot;truequot;/>
         <property id=quot;2quot;                          <metric id=quot;201quot;
     name=quot;imageHeightquot; unit=quot;pixelquot;>              name=quot;intDiffquot; result=quot;0quot;/>
     <metric id=quot;200quot; name=quot;equalquot;/>               <metric id=quot;210quot;
     <metric id=quot;201quot; name=quot;intDiffquot;/>             name=quot;percDevquot;
     <metric id=quot;210quot; name=quot;percDevquot;/>             result=quot;0.000000quot;/>
          </property>                           </property>
        <property id=quot;30quot; name=quot;imageWidthquot;
     unit=quot;pixelquot;>
            <metric id=quot;200quot; name=quot;equalquot;/>
            <metric id=quot;201quot; name=quot;intDiffquot;/>
            <metric id=quot;210quot; name=quot;percDevquot;/>
        </property>
Abstract solution I
(1) Language to represent the complete content of a digital object.
    XCDL
(2) Language to describe any machine readable format in a formal
     language.
    XCEL
(3) Software to extract the content of a file based upon a description
     as under (2) and express it in the language as specified under (1).
    “extractor”
(4) Software to compare two such content descriptions.
     “comparator”
Are the following two items equal:




          VIII  8
eight   eight


VIII  8
otto

       eight   eight


otto   VIII  8
otto                   acht

       eight   eight


otto   VIII  8       acht
8.0
otto                         acht

       eight         eight


otto   VIII  8             acht
Information model: „an image“

otto                                   acht

           eight           eight


otto     VIII  8                     acht
information model: „an image“

 format ontology: „what terms are
used in formats to describe image
            properties“


   VIII  8
Information model: „what is an image“

  Format ontology: „what terms are
  used in formats to describe image
              properties“

Extraction language: “how to get the
terms describing an image out of a file”
Abstract solution II
(1) A theoretical model of information (not: data) types – “image”,
    “text”, “audio” ...

(2) Ontologies, which map existing file format terminologies onto
    these model.

(3) A language – XCDL – which allows to express the content of files
    in different formats using the vocabulary of the ontologies and the
    “grammar” of the information model.
XCDL

eXtensible Characterisation Definition
  Language

Purpose: Describe the contents of a file in
  terms of an abstract model.
XCDL: text model (1)

A text (= <object>) is composed of
data (= <normData>) plus
interpretations of data according to the
underlying format specification (=
<property>).
XCDL: text model (2)

Or, one level of abstraction higher, a text
is composed of content carrying tokens,
accompanied by rendering info plus
deployment info plus historical info.
This text   is a

<refData id=quot;1quot;>54 68 69 73 20 69 73 20 61 20 74 65 78 74</refData>
…
<property>
<name>fontsize</name>
<rawVal>
<val>48</val>
<type>unsignedInt8</type>
</rawVal>
<dataRef> <!-- property refers to discrete part of reference data-->
<ref id=quot;1quot; start=quot;0quot; end=quot;3quot;/>
<ref id=quot;1quot; start=“10quot; end=quot;12quot;/>
</dataRef>
</property>
This text   is a

<refData id=quot;1quot;>54 68 69 73 20 69 73 20 61 20 74 65 78 74</refData>
…
<property>
<name>fontsize</name>
<rawVal>
<val>48</val>
<type>unsignedInt8</type>
</rawVal>
<dataRef> <!-- property refers to discrete part of reference data-->
<ref id=quot;1quot; start=quot;0quot; end=quot;3quot;/>
<ref id=quot;1quot; start=“10quot; end=quot;12quot;/>
</dataRef>
</property>
Thank you!

     Questions?

Manfred.thaller@uni-koeln.de

Contenu connexe

En vedette

Moon Powerpoint
Moon PowerpointMoon Powerpoint
Moon Powerpointmbla3081
 
Moon Formation And Features
Moon Formation And FeaturesMoon Formation And Features
Moon Formation And FeaturesTeach5ch
 
the outer planets
the outer planetsthe outer planets
the outer planetsBob Smullen
 
The Outer Planets
The Outer PlanetsThe Outer Planets
The Outer PlanetsEric C
 
Pluto Example PowerPoint
Pluto Example PowerPointPluto Example PowerPoint
Pluto Example PowerPointjasmu1mn
 
Pluto The Dwarf Planet
Pluto The Dwarf PlanetPluto The Dwarf Planet
Pluto The Dwarf PlanetJon Dav
 
Best ppt on solar system
Best ppt on solar systemBest ppt on solar system
Best ppt on solar systemMake Megenius
 

En vedette (11)

Moon Powerpoint
Moon PowerpointMoon Powerpoint
Moon Powerpoint
 
My planets´activity
My planets´activityMy planets´activity
My planets´activity
 
Moon Formation And Features
Moon Formation And FeaturesMoon Formation And Features
Moon Formation And Features
 
the outer planets
the outer planetsthe outer planets
the outer planets
 
The Outer Planets
The Outer PlanetsThe Outer Planets
The Outer Planets
 
Planets Lesson
Planets LessonPlanets Lesson
Planets Lesson
 
Pluto Example PowerPoint
Pluto Example PowerPointPluto Example PowerPoint
Pluto Example PowerPoint
 
SunCyles_GT
SunCyles_GTSunCyles_GT
SunCyles_GT
 
Pluto The Dwarf Planet
Pluto The Dwarf PlanetPluto The Dwarf Planet
Pluto The Dwarf Planet
 
Best ppt on solar system
Best ppt on solar systemBest ppt on solar system
Best ppt on solar system
 
Our Moon
Our MoonOur Moon
Our Moon
 

Similaire à Significant Characteristics In Planets Manfred Thaller

Csphtp1 18
Csphtp1 18Csphtp1 18
Csphtp1 18HUST
 
Everything You Always Wanted To Know About XML But Were Afraid To Ask
Everything You Always Wanted To Know About XML But Were Afraid To AskEverything You Always Wanted To Know About XML But Were Afraid To Ask
Everything You Always Wanted To Know About XML But Were Afraid To AskRichard Davis
 
OpenSocial - GTUG Stockholm Meeting Oct 1 2009
OpenSocial - GTUG Stockholm Meeting Oct 1 2009OpenSocial - GTUG Stockholm Meeting Oct 1 2009
OpenSocial - GTUG Stockholm Meeting Oct 1 2009Jacob Gyllenstierna
 
07 Collada Overview
07 Collada Overview07 Collada Overview
07 Collada Overviewjohny2008
 
3 xml namespaces and xml schema
3   xml namespaces and xml schema3   xml namespaces and xml schema
3 xml namespaces and xml schemagauravashq
 
JWU Guest Talk: JavaScript and AJAX
JWU Guest Talk: JavaScript and AJAXJWU Guest Talk: JavaScript and AJAX
JWU Guest Talk: JavaScript and AJAXHilary Mason
 
CSIS 138 Javascript Class1
CSIS 138 Javascript Class1CSIS 138 Javascript Class1
CSIS 138 Javascript Class1Teresa Pelkie
 
Cleveland Silverlight Firestarter - XAML Basics
Cleveland Silverlight Firestarter - XAML BasicsCleveland Silverlight Firestarter - XAML Basics
Cleveland Silverlight Firestarter - XAML BasicsSarah Dutkiewicz
 
Ajax On S2 Odp
Ajax On S2 OdpAjax On S2 Odp
Ajax On S2 Odpghessler
 
IBM Lotus Notes Domino XPages and XPages for Mobile
IBM Lotus Notes Domino XPages and XPages for MobileIBM Lotus Notes Domino XPages and XPages for Mobile
IBM Lotus Notes Domino XPages and XPages for MobileChris Toohey
 
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010Sergey Ilinsky
 
Java Script
Java ScriptJava Script
Java Scriptsiddaram
 

Similaire à Significant Characteristics In Planets Manfred Thaller (20)

Csphtp1 18
Csphtp1 18Csphtp1 18
Csphtp1 18
 
Everything You Always Wanted To Know About XML But Were Afraid To Ask
Everything You Always Wanted To Know About XML But Were Afraid To AskEverything You Always Wanted To Know About XML But Were Afraid To Ask
Everything You Always Wanted To Know About XML But Were Afraid To Ask
 
Ajax ons2
Ajax ons2Ajax ons2
Ajax ons2
 
OpenSocial - GTUG Stockholm Meeting Oct 1 2009
OpenSocial - GTUG Stockholm Meeting Oct 1 2009OpenSocial - GTUG Stockholm Meeting Oct 1 2009
OpenSocial - GTUG Stockholm Meeting Oct 1 2009
 
JQuery 101
JQuery 101JQuery 101
JQuery 101
 
Xml encryption
Xml encryptionXml encryption
Xml encryption
 
07 Collada Overview
07 Collada Overview07 Collada Overview
07 Collada Overview
 
3 xml namespaces and xml schema
3   xml namespaces and xml schema3   xml namespaces and xml schema
3 xml namespaces and xml schema
 
Using Forms in Share
Using Forms in ShareUsing Forms in Share
Using Forms in Share
 
JWU Guest Talk: JavaScript and AJAX
JWU Guest Talk: JavaScript and AJAXJWU Guest Talk: JavaScript and AJAX
JWU Guest Talk: JavaScript and AJAX
 
Front End on Rails
Front End on RailsFront End on Rails
Front End on Rails
 
CSIS 138 Javascript Class1
CSIS 138 Javascript Class1CSIS 138 Javascript Class1
CSIS 138 Javascript Class1
 
Ridingapachecamel
RidingapachecamelRidingapachecamel
Ridingapachecamel
 
Cleveland Silverlight Firestarter - XAML Basics
Cleveland Silverlight Firestarter - XAML BasicsCleveland Silverlight Firestarter - XAML Basics
Cleveland Silverlight Firestarter - XAML Basics
 
Embedded Metadata working group
Embedded Metadata working groupEmbedded Metadata working group
Embedded Metadata working group
 
Ajax On S2 Odp
Ajax On S2 OdpAjax On S2 Odp
Ajax On S2 Odp
 
IBM Lotus Notes Domino XPages and XPages for Mobile
IBM Lotus Notes Domino XPages and XPages for MobileIBM Lotus Notes Domino XPages and XPages for Mobile
IBM Lotus Notes Domino XPages and XPages for Mobile
 
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010
 
XML and DTD
XML and DTDXML and DTD
XML and DTD
 
Java Script
Java ScriptJava Script
Java Script
 

Plus de DigitalPreservationEurope

Digital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigitalPreservationEurope
 
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...Building A Sustainable Model for Digital Preservation Services, Clive Billenn...
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...DigitalPreservationEurope
 
Scalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross KingScalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross KingDigitalPreservationEurope
 
Preservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian UpshallPreservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian UpshallDigitalPreservationEurope
 
Preservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore MelePreservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore MeleDigitalPreservationEurope
 

Plus de DigitalPreservationEurope (20)

Infrastructure Training Session
Infrastructure Training SessionInfrastructure Training Session
Infrastructure Training Session
 
Drm Training Session
Drm Training SessionDrm Training Session
Drm Training Session
 
2009 Barcelona Wepreserve Nestor
2009 Barcelona Wepreserve Nestor2009 Barcelona Wepreserve Nestor
2009 Barcelona Wepreserve Nestor
 
Trusted Repositories
Trusted RepositoriesTrusted Repositories
Trusted Repositories
 
Preservation Metadata
Preservation MetadataPreservation Metadata
Preservation Metadata
 
An Introduction to Digital Preservation
An Introduction to Digital PreservationAn Introduction to Digital Preservation
An Introduction to Digital Preservation
 
Introduction to Planets
Introduction to PlanetsIntroduction to Planets
Introduction to Planets
 
Digital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and Requirements
 
The Planets Preservation Planning workflow
The Planets Preservation Planning workflowThe Planets Preservation Planning workflow
The Planets Preservation Planning workflow
 
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...Building A Sustainable Model for Digital Preservation Services, Clive Billenn...
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...
 
Preservation Metadata, Michael Day, DCC
Preservation Metadata, Michael Day, DCCPreservation Metadata, Michael Day, DCC
Preservation Metadata, Michael Day, DCC
 
PLATTER - Jan Hutar
PLATTER - Jan HutarPLATTER - Jan Hutar
PLATTER - Jan Hutar
 
Sustainability Clive Billenness
Sustainability Clive  BillennessSustainability Clive  Billenness
Sustainability Clive Billenness
 
Shaman Project Hemmje
Shaman Project  HemmjeShaman Project  Hemmje
Shaman Project Hemmje
 
Scalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross KingScalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross King
 
Risks Benefits And Motivations Seamus Ross
Risks Benefits And Motivations Seamus RossRisks Benefits And Motivations Seamus Ross
Risks Benefits And Motivations Seamus Ross
 
Representation Information Steve Rankin
Representation Information Steve RankinRepresentation Information Steve Rankin
Representation Information Steve Rankin
 
Preservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian UpshallPreservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian Upshall
 
Preservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore MelePreservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore Mele
 
Platter Colin Rosenthal
Platter Colin RosenthalPlatter Colin Rosenthal
Platter Colin Rosenthal
 

Dernier

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 

Dernier (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

Significant Characteristics In Planets Manfred Thaller

  • 1. Significant characteristics in Planets Manfred Thaller Universität zu* Köln *University at not of Cologne
  • 2. What are “significant characteristics”? Those properties of a digital file which have to be known to enable the processing of the file within a specific setup.
  • 3. Why extract them by software? To create technical metadata as required by organizational models for long term preservation. (NLNZ)
  • 4. Within Planets … … served by solutions to identify formats: formats registry / PRONOM / DROID. … and a solution for extracting and processing such characteristics: XCL.
  • 5. A Vision Extractor tiff XCDL tiff 93% Migrator Comparator png tiff XCEL png XCEL png XCDL
  • 6. A Vision Extractor Comparator Appropriate XCELs C-Set
  • 7. Why automate? 1 million objects: use one second for each. == 16666.7 minutes == 277.8 hours == 11.57 working days of a computer == 34.7 8-hour days for a Human == 7 working weeks
  • 8. Why automate? 1 million objects: use five minutes for each. == 416 666.7 hours == 52 803.4 8-hour days for a Human
  • 9. Why automate? Assumption: Preservation is only feasible, if the content of two digital objects can be compared without human intervention, giving a numerical estimate of their degree of similarity.
  • 10. Demo
  • 11. Abstract solution I (1) Language to represent the complete content of a digital object. XCDL (2) Language to describe any machine readable format in a formal language. XCEL (3) Software to extract the content of a file based upon a description as under (2) and express it in the language as specified under (1). “extractor” (4) Software to compare two such content descriptions. “comparator”
  • 12. <XCELDocument...> ... <xcdl> <formatDescription>.... <object id=quot;o1quot; > <symbol identifier=quot;ID01_I01_I01_S02quot; <normData id=quot;nd1quot; > ... </normData> originalName=quot;height“ interpretation=quot;uint32quot;> <property id=quot;p1quot; source=quot;rawquot; <range><startposition xsi:type=quot;sequential“> cat=quot;descrquot; > </startposition> <length xsi:type=quot;fixedquot;>4</length></range> <name> compression</name> <name>height</name> <valueSet id=quot;i_i1_s6quot; > <rawValue>0 </rawValue> </symbol> <labValue>...</labValue> <symbol identifier=quot;ID01_I01_I01_S04quot; originalName=quot;colourTypequot;> <dataRef ind=quot;normAllquot; /> <range> <propRel/> <startposition xsi:type=quot;sequentialquot;> </valueSet> </startposition> </property> <length xsi:type=quot;fixedquot;>1</length></range> <property id=quot;p2quot; source=quot;rawquot; <valueInterpretation> cat=quot;descrquot; > <valueLabel>greyscale</valueLabel> <name> height</name> <value>0</value></valueinterpretation> <valueSet id=quot;i_i1_s3quot; > <name>imageType</name> <rawValue>0 0 1 ad </rawValue> </symbol> <labValue> <symbol identifier=quot;ID01_I01_I01_S05quot; <val>429</val> originalName=quot;compressionMethodquot;> <type>uint32</type> <range> </labValue> <startposition xsi:type=quot;sequential“> <dataRef ind=quot;normAllquot; /> </startposition> <propRel/> <length </valueSet> xsi:type=quot;fixedquot;>1</length></range> </property> <valueInterpretation> <property id=quot;p3quot; source=quot;rawquot; <valueLabel>zlibDeflateInflate</valueLabel> cat=quot;descrquot; > <value>0</value></valueInterpretation> <name> imageType</name> <name>compression</name> ..... </symbol>...
  • 13. <request2> <property id=quot;2quot; <measurementRequest> name=quot;imageHeightquot; <source name=quot;XCDL1.xmlquot;/> unit=quot;pixelquot; <target name=quot;XCDL2.xmlquot;/> compStatus=quot;completequot;> <property id=quot;45quot; name=quot;rgbPalettequot;> <values type=quot;intquot;> <metric id=quot;10quot; name=quot;hammingDistancequot;/> <src>32</src> </property> <property id=quot;300quot; name=quot;normDataquot;> <tar>32</tar> <metric id=quot;10quot; </values> name=quot;hammingDistancequot;/> <metric id=quot;50quot; name=quot;RMSEquot;/> <metric id=quot;200quot; </property> name=quot;equalquot; result=quot;truequot;/> <property id=quot;2quot; <metric id=quot;201quot; name=quot;imageHeightquot; unit=quot;pixelquot;> name=quot;intDiffquot; result=quot;0quot;/> <metric id=quot;200quot; name=quot;equalquot;/> <metric id=quot;210quot; <metric id=quot;201quot; name=quot;intDiffquot;/> name=quot;percDevquot; <metric id=quot;210quot; name=quot;percDevquot;/> result=quot;0.000000quot;/> </property> </property> <property id=quot;30quot; name=quot;imageWidthquot; unit=quot;pixelquot;> <metric id=quot;200quot; name=quot;equalquot;/> <metric id=quot;201quot; name=quot;intDiffquot;/> <metric id=quot;210quot; name=quot;percDevquot;/> </property>
  • 14. Abstract solution I (1) Language to represent the complete content of a digital object. XCDL (2) Language to describe any machine readable format in a formal language. XCEL (3) Software to extract the content of a file based upon a description as under (2) and express it in the language as specified under (1). “extractor” (4) Software to compare two such content descriptions. “comparator”
  • 15. Are the following two items equal: VIII  8
  • 16. eight eight VIII  8
  • 17. otto eight eight otto VIII  8
  • 18. otto acht eight eight otto VIII  8 acht
  • 19. 8.0 otto acht eight eight otto VIII  8 acht
  • 20. Information model: „an image“ otto acht eight eight otto VIII  8 acht
  • 21. information model: „an image“ format ontology: „what terms are used in formats to describe image properties“ VIII  8
  • 22. Information model: „what is an image“ Format ontology: „what terms are used in formats to describe image properties“ Extraction language: “how to get the terms describing an image out of a file”
  • 23. Abstract solution II (1) A theoretical model of information (not: data) types – “image”, “text”, “audio” ... (2) Ontologies, which map existing file format terminologies onto these model. (3) A language – XCDL – which allows to express the content of files in different formats using the vocabulary of the ontologies and the “grammar” of the information model.
  • 24. XCDL eXtensible Characterisation Definition Language Purpose: Describe the contents of a file in terms of an abstract model.
  • 25. XCDL: text model (1) A text (= <object>) is composed of data (= <normData>) plus interpretations of data according to the underlying format specification (= <property>).
  • 26. XCDL: text model (2) Or, one level of abstraction higher, a text is composed of content carrying tokens, accompanied by rendering info plus deployment info plus historical info.
  • 27. This text is a <refData id=quot;1quot;>54 68 69 73 20 69 73 20 61 20 74 65 78 74</refData> … <property> <name>fontsize</name> <rawVal> <val>48</val> <type>unsignedInt8</type> </rawVal> <dataRef> <!-- property refers to discrete part of reference data--> <ref id=quot;1quot; start=quot;0quot; end=quot;3quot;/> <ref id=quot;1quot; start=“10quot; end=quot;12quot;/> </dataRef> </property>
  • 28. This text is a <refData id=quot;1quot;>54 68 69 73 20 69 73 20 61 20 74 65 78 74</refData> … <property> <name>fontsize</name> <rawVal> <val>48</val> <type>unsignedInt8</type> </rawVal> <dataRef> <!-- property refers to discrete part of reference data--> <ref id=quot;1quot; start=quot;0quot; end=quot;3quot;/> <ref id=quot;1quot; start=“10quot; end=quot;12quot;/> </dataRef> </property>
  • 29. Thank you! Questions? Manfred.thaller@uni-koeln.de