PDF/A – A standard for document archiving
No. 2/2006

Dipl. Inf. Reinhold Müller-Meernach, Röttenbach
Dr. Uwe Wächter, Roßdorf

SEAL Systems
info@sealsystems.com
www.sealsystems.com
DOCUMENT MANAGEMENT




The »leader of the pack« TIFF/G4 has got competition. With PDF/A, a new standard for long-term archiving of electronic documents has now been defined. Checks on existing document archives show that a large number of the PDF files archived there do not even meet the minimum requirements of the new standard. But this is no longer a reason to panic.

Paper archives have been, and are still being, replaced by digital storage, and the number of electronically created documents is growing constantly. For long-term archiving of these documents, standards are valuable if well-defined reproducibility and distribution are to be supported over a long period of time. The monochrome raster format TIFF/G4 has been the de facto standard for more than ten years. For text-heavy documents (such as those from Office applications), the Portable Document Format from Adobe, PDF for short, has become established as an application-neutral exchange format. With PDF/A, there is now a standard that fixes a subset of the PDF specification to make PDF files particularly suitable for archiving.

Fig. 1: Investigations show that almost no PDF files in existing archives conform to PDF/A. (Fig.: Seal Systems AG, Röttenbach)

ISO 19005-1 is based on the »PDF Reference 1.4« from Adobe. It makes PDF 1.4 more precise and defines whether each of its properties is mandatory, recommended, restricted or prohibited. On this basis, two levels of PDF/A are distinguished: level A (PDF/A-1a) and level B (PDF/A-1b).

Level B is important for archiving

Level B deals mainly with preserving the visual appearance over long periods of time. This requires that all the information needed for reproduction is contained in the file itself: all text, graphics, images, fonts and colour information. References to external sources, such as other files, images, websites or external fonts, violate the PDF/A standard.

An especially important characteristic of PDF/A is the embedding of fonts. Only this ensures that a document can be printed in exactly the same way many years later, without relying on font definitions on a computer or printer. PDF can also show its advantage over TIFF/G4 through its colour rendering. However, this conforms to the standard only if the PDF file can be printed directly on any colour printer. To this end, device-independent colour definitions are stored in the file and converted only at print time.

Simple and reliable reproduction can be hindered by protection mechanisms and by certain compression and encoding methods. These techniques are therefore also prohibited for PDF/A-conforming files.

Image overlays are often used deliberately in certain applications to achieve particular effects for the viewer. Examples are transparency, colour blending and background stamps. Many PDF-generating processes cannot reproduce these features 1:1, so with PDF/A they must be avoided.

Secure archiving in line with the standard

Secure archiving in line with the standard means that the stored files can still be used even if the system that manages them becomes corrupted. PDF/A-conforming files must therefore comply with the standard's requirements on metadata.

Fig. 2: With test and correction procedures for PDF/A, data stocks can be inspected and modified as required. (Fig.: Seal Systems AG, Röttenbach)

The Portable Document Format makes it possible to store graphical content in several alternative representations at once. This allows an improved display on different screens (PC, handheld or PDA) or adaptation to the user (German or English). However, because reproduction is ambiguous with this method, the feature is not permitted by ISO 19005-1.

Level A includes the requirements of level B and additionally standardises the
properties for content, structure and semantics. This makes it possible to re-extract parts or information from PDF documents at a later point in time. This level also specifies how Unicode fonts must be handled. Work is already under way on an extension of the standard, which is named 19005-2 and is based on »Adobe PDF Reference 1.6«.

PDF/A level A covers the complete standard

Every international standard is a compromise between the interest groups involved and their requirements, which can partly contradict one another. Existing procedures and local regulations should be taken into account; on the other hand, new technical possibilities should not be ruled out either. Specifying every detail to the maximum can make a standard unusable in operational practice. It should therefore be checked whether company standards can also be defined that take practicability and compatibility with existing procedures into account. Such a company standard adopts definitions from the ISO standard and combines them with comprehensible instructions that all members of the company can use.

Define minimum standards

The past has shown that even individual industries can agree on a common understanding and procedure.

If a company decides on PDF as a reliable document format for long-term archiving, the next logical question is: is every PDF allowed, or must it satisfy certain minimum requirements? The ISO standard for PDF/A can help in answering this question and in defining the minimum standards.

Fig. 3: Test logs provide users and IT managers with information about the quality of the data stock. (Fig.: Seal Systems AG, Röttenbach)

Once this question is answered, the next steps can be taken. It must be clarified which procedures guarantee that these minimum requirements are met. It must also be decided how to proceed with existing stock. Finally, it must be specified who is responsible for inspecting these processes and ensuring compliance.

There are now countless software tools for creating PDF files. The best known is Acrobat from Adobe. Besides many conversion applications from third-party suppliers, a number of applications can export PDF directly. In future, this should also be possible for the Office products from Microsoft. However, investigations show that some PDFs created in this way do not even meet the standard PDF specification, and so certainly fall short of the stricter ISO 19005-1.

Only in very few cases are PDF files created solely within the company using an inspected tool. PDF is an exchange format, so it is highly probable that considerable data stocks stem from other, uncheckable sources. Business partners, the internet and emails are examples. For these reasons, it makes more sense for conformity to the standard to be inspected by the responsible body within the archiving organisation.

Fig. 4: PDF/A inspections can be integrated into existing Document Management Systems (DMS) and Product Data Management Systems. (Fig.: Seal Systems AG, Röttenbach)

Test programs are now available with which PDF files can be inspected for conformity with configurable ISO and company standards. The result of an inspection is always a confirmation of
conformity or a rejection. In the latter case, a qualified analysis should take place so that the creator can be given targeted instructions.

An alternative to rejection, however, is the automated correction of a PDF file to bring it into conformity with the standard. Frequently observed incompatibilities, such as missing font embedding, can be corrected in this way with minimal effort. To safeguard processes, the timing of a conformity inspection is decisive.

The first and best time is certainly the generation process. For unknown documents or unsecured generation processes, a simple checking procedure is provided on the desktop. Both methods make it easier for the parties concerned to carry out an inspection but do not force them to do so.

It is therefore recommended that manufacturers and operators of Document Management Systems (DMS), Enterprise Content Management Systems (ECM) and archiving solutions provide a suitable interface through which test methods can be integrated.

If all archiving and conversion processes then pass through this interface, the PDF/A inspection becomes mandatory. Even for existing PDF archives, a one-off or regular inspection is recommended. A first run provides information about the quality of the data stock, from which the subsequent steps can be derived. Some of the data can be corrected, some cannot.

Fig. 5: The diagram shows the integration of the PDF/A methods into the SAP document management system. (Fig.: Seal Systems AG, Röttenbach)

PDF/A – an archiving format with a future

If the sources are known, it may be possible to supply a new version that conforms to the standard. Initial experience with reference deliveries from industrial customers has shown that almost no PDF file met the PDF/A-1b definition.

The most frequent errors are (in this order) missing metadata, missing font embedding, colour management and protection mechanisms. However, such weaknesses can be corrected automatically with suitable tools. The Portable Document Format is becoming more powerful and extensive with every new version. 3D visualisations, form processing, digital signatures, change management and pre-print inspection are only part of the PDF application spectrum. Its use as a largely simple exchange format suggests its use as an archiving format as well. The technical requirements here are lower, but the legal ones are higher.

With PDF/A, a standard has now been adopted with which the risks and future costs of long-term archiving can be minimised. There are tools to generate, inspect and correct PDF/A files. As a result, the new standard will rapidly become established as a practical alternative.

Fig. 6: The data format PDF/A is classed as a standard with which both the risks and, above all, the expense of long-term archiving can be minimised.
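
The inspection step described in the article (a result that is either a confirmation or a rejection, with findings in the order of the most frequent errors) can be sketched roughly as follows. This is a hypothetical byte-level pre-check and not a PDF/A validator: the function names `precheck_pdfa` and `inspect_pdf` and the marker strings searched for are assumptions made for illustration, and anything stored inside compressed streams is invisible to it. Real test programs parse the complete file structure.

```python
from typing import List, Tuple


def precheck_pdfa(data: bytes) -> List[str]:
    """Return rough findings for a PDF byte string (hypothetical heuristic).

    Findings follow the article's order of frequent errors: metadata,
    font embedding, colour management, protection mechanisms.
    """
    findings: List[str] = []

    # Assumption: a PDF/A file identifies itself in its XMP metadata
    # through a "pdfaid:part" entry.
    if b"pdfaid:part" not in data:
        findings.append("missing PDF/A metadata")

    # Assumption: a font descriptor without any /FontFile, /FontFile2 or
    # /FontFile3 entry in the file hints at a font that is not embedded.
    if b"/FontDescriptor" in data and b"/FontFile" not in data:
        findings.append("font not embedded")

    # Assumption: device-dependent colour spaces need an output intent
    # so that colours stay reproducible.
    device_spaces = (b"/DeviceRGB", b"/DeviceCMYK", b"/DeviceGray")
    uses_device_colour = any(name in data for name in device_spaces)
    if uses_device_colour and b"/OutputIntents" not in data:
        findings.append("device colour without output intent")

    # An /Encrypt entry indicates a protection mechanism.
    if b"/Encrypt" in data:
        findings.append("protection mechanism")

    return findings


def inspect_pdf(data: bytes) -> Tuple[str, List[str]]:
    """Map the findings to the two possible inspection results."""
    findings = precheck_pdfa(data)
    return ("rejected" if findings else "confirmed", findings)


if __name__ == "__main__":
    # Synthetic sample bytes, not a real PDF file.
    sample = b"%PDF-1.4 /Encrypt 9 0 R /FontDescriptor 5 0 R /DeviceRGB"
    verdict, problems = inspect_pdf(sample)
    print(verdict, problems)
```

Returning the findings together with the verdict mirrors the article's point that a rejection should come with a qualified analysis, so that the creator of the file receives targeted instructions or an automated correction can be attempted.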


  • 1. PDF/A – A standard for document archiving. Dipl. Inf. Reinhold Müller-Meernach, Röttenbach; Dr. Uwe Wächter, Roßdorf. No. 2/2006. SEAL Systems, info@sealsystems.com, www.sealsystems.com
  • 2. DOCUMENT MANAGEMENT. PDF/A – A standard for document archiving. Dipl. Inf. Reinhold Müller-Meernach, Röttenbach; Dr. Uwe Wächter, Roßdorf. The »leader of the pack« TIFF/G4 has got competition. With PDF/A, a new standard for long-term archiving of electronic documents has now been defined. Checks on existing document archives show that a large number of the PDF files archived there do not even meet the minimum requirements of the new standard. But this is no longer a reason to panic.
  • 3. Paper archives have been and are being replaced by digital storage, and the number of electronically created documents is growing constantly. For long-term archiving of these documents, standards are beneficial if well-defined reproducibility and distribution are to be supported over a long period of time. The monochrome raster format TIFF/G4 has been the de facto standard for more than ten years. For text-heavy documents (such as those from Office applications), the Portable Document Format from Adobe, PDF for short, has become established as an application-neutral exchange format. With PDF/A, there is now a standard that fixes a subset of the PDF specification to make PDF files particularly suitable for archiving.

    Fig. 1: Investigations show that almost no PDF files in existing archives conform to PDF/A. (Fig.: Seal Systems AG, Röttenbach)

    ISO 19005-1 is based on the »PDF Reference 1.4« from Adobe. It makes PDF 1.4 more precise and defines whether each of its properties is obligatory, recommended, limited or prohibited. This makes it possible to distinguish two levels of PDF/A: A (PDF/A-1a) and B (PDF/A-1b).

    Level B is important for archiving

    Level B deals mainly with preserving the external appearance over long periods of time. For this, all the information needed for reproduction must be contained in the file itself. This concerns, for example, all texts, graphics, images, fonts and colour information. References to external sources, such as further files, images, websites or external fonts, contradict the PDF/A norm.

    An especially important characteristic of PDF/A is the embedding of fonts. Only this can ensure that a document can be printed in exactly the same way after many years, without relying on font definitions on a computer or printer. PDF can also demonstrate its advantage over TIFF/G4 through its colour representation. However, this only conforms to the standard if the PDF file can be printed directly on any colour printer. To this end, device-independent colour definitions are saved in the file and are only converted when printing.

    Simple and safe reproduction can be prevented by protection mechanisms, certain compression methods and encodings. Therefore, these techniques are also prohibited for PDF/A-conforming files.

    Fig. 2: With test and correction procedures for PDF/A, data stocks can be viewed and modified as required. (Fig.: Seal Systems AG, Röttenbach)

    Image overlays are frequently used in certain applications to create particular effects for the viewer. Examples of this are transparency, colour mixing and background stamping. Many PDF-generating processes cannot represent these characteristics 1:1. Therefore, they must be avoided with PDF/A.

    Secure archiving in line with the norm

    Secure archiving in line with the norm means that the saved files can still be used if the administration system fails. Therefore, PDF/A-conforming files must contain metadata.

    The Portable Document Format makes it possible to save several alternative representations of a graphic simultaneously. This allows an improved display on different screens (PC, handheld or PDA) or adaptation to the user (German or English). However, as the reproduction is ambiguous with this method, this function contradicts ISO 19005-1.

    Level A standardises, in addition to the requirements of level B, characteristics that define the
  • 4. properties for content, structure and semantics. This makes it possible to re-extract parts or information from the PDF documents at a later point in time. Furthermore, this level specifies how a Unicode font must be handled. Work is already under way on an extension of this norm, which is named 19005-2 and is based on the »Adobe PDF Reference 1.6«.

    PDF/A level A covers the complete norm

    Every international norm is a compromise between the interest groups concerned and their requirements, which can be contradictory in parts. Existing procedures and local regulations should be taken into account. On the other hand, new technical possibilities should not be ruled out either. Maximum specification of all details can lead to unusability in operational practice. Therefore, it must be checked whether company standards can also be defined that take into account practicability and compatibility with existing procedures. Such a company standard adopts definitions from the ISO norm and combines them with comprehensible instructions that can be used by all company members.

    Define minimum standards

    The past has shown that even individual industries can agree on a standardised understanding and procedure. If a company decides on PDF as a reliable document format for long-term archiving, the next logical question is: is every PDF allowed, or must it satisfy certain minimum requirements? The ISO norm for PDF/A can help in answering this question and defining the minimum standards.

    When this question is answered, the next steps can be taken. It must be clarified which procedures guarantee that these minimum requirements are complied with. In addition, it must be decided how to proceed with any old stock. And finally, it must be specified who is responsible for inspecting these processes and ensuring compliance.

    Fig. 3: Test logs provide users and IT managers with information about the quality of the data stock. (Fig.: Seal Systems AG, Röttenbach)

    There are now countless software tools for creating PDF files. The best known is Acrobat from Adobe. As well as many conversion applications from third-party suppliers, there are a number of applications that can export PDFs directly. In the future, this should also be possible for the Office products from Microsoft. However, investigations show that some PDFs created in this way do not even meet the standard PDF specification, and so definitely fall short of the stricter ISO 19005-1.

    Fig. 4: PDF/A inspections can be integrated into existing Document Management Systems (DMS) and Product Data Management Systems. (Fig.: Seal Systems AG, Röttenbach)

    Only in a very small number of cases are PDF files created solely within the company with an inspected tool. PDF is an exchange format, so the probability is high that considerable data stocks stem from other, uncheckable sources. Business partners, the internet and emails are examples of this. For these reasons, it makes more sense for conformity with the standard to be inspected by the responsible body within the archiving organisation. Nowadays there are test programs with which PDF files can be inspected for compatibility with configurable ISO and company standards. The result of an inspection is always a confirmation of
  • 5. conformity or a rejection. In the latter case, a qualified analysis should take place so that the creator can be given targeted instructions.

    However, an alternative to rejection can also be the automated correction of a PDF file to norm conformity. Frequently observed incompatibilities, such as missing font embedding, can then be corrected with a minimum of effort. To safeguard processes, the timing of a conformity inspection is decisive. The first and best time is definitely the generation process. For unknown documents or unsecured generation processes, a simple checking procedure is provided on the desktop. Both methods make it easier for the parties concerned to carry out an inspection but do not force them to do so. Therefore, it is recommended that manufacturers and operators of Document Management Systems (DMS), Enterprise Content Management Systems (ECM) and archiving solutions provide a suitable interface through which test methods can be integrated. If all archiving and conversion processes then pass through this interface, the PDF/A inspection becomes obligatory.

    Fig. 5: The diagram shows the integration of the PDF/A methods into the SAP document management system. (Fig.: Seal Systems AG, Röttenbach)

    Even for existing PDF archives, a one-off or regular inspection is recommended. A first run provides information about the quality of the data stock. Subsequent steps can then be derived from this. Part of the data can be corrected, another part cannot.

    PDF/A – an archiving format with a future

    If the sources are known, it may be possible to obtain a new norm-conforming version. First experiences with reference deliveries from industrial customers have shown that almost no PDF file met the PDF/A-1b definition. The most frequent errors are (in this order) missing metadata, no font embedding, colour management and protection mechanisms. However, any weaknesses can be corrected automatically with suitable tools.

    The Portable Document Format is becoming more powerful and extensive with every new version. 3D visualisations, form processing, digital signatures, change management and pre-print inspection are only parts of the PDF application spectrum. Its extensive use as a simple exchange format also suggests it as an archiving format. The technical requirements here are lower, but the legal ones are higher.

    Fig. 6: The data format PDF/A is classed as a norm with which both the risks and, above all, the expense of long-term archiving can be minimised.

    With PDF/A, a norm has now been passed with which the risks and future expense of long-term archiving can be minimised. There are tools to generate, inspect and adjust PDF/A files. As a result, the new standard will rapidly become established as a practical alternative.
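    The inspection workflow described in the article (confirm conformity, correct automatically where possible, otherwise reject, and log the quality of the data stock) could be sketched roughly as follows. This is a minimal illustration in Python, not the SEAL Systems implementation: the names `Finding`, `InspectionReport`, `triage` and `summarise` are hypothetical, the findings are assumed to come from some external PDF/A test program, and which error categories count as automatically correctable is an assumption made purely for the example.

```python
from collections import Counter
from dataclasses import dataclass, field
from enum import Enum


class Finding(Enum):
    """Error categories named in the article, in its reported order of frequency."""
    MISSING_METADATA = "missing metadata"
    FONT_NOT_EMBEDDED = "no font embedding"
    COLOUR_MANAGEMENT = "colour management"
    PROTECTION = "protection mechanism"


# Assumption for illustration only: which categories a correction tool can fix.
AUTO_CORRECTABLE = {Finding.MISSING_METADATA, Finding.FONT_NOT_EMBEDDED}


@dataclass
class InspectionReport:
    """Hypothetical result of running an external PDF/A test program on one file."""
    filename: str
    findings: list = field(default_factory=list)


def triage(report):
    """Return 'conforming', 'correct' (automated correction) or 'reject'."""
    if not report.findings:
        return "conforming"
    if all(f in AUTO_CORRECTABLE for f in report.findings):
        return "correct"
    return "reject"


def summarise(reports):
    """Build a simple test log: outcome counts and error categories by frequency."""
    outcomes = Counter(triage(r) for r in reports)
    errors = Counter(f for r in reports for f in r.findings)
    return outcomes, errors.most_common()


if __name__ == "__main__":
    stock = [
        InspectionReport("drawing.pdf"),
        InspectionReport("offer.pdf", [Finding.MISSING_METADATA]),
        InspectionReport("manual.pdf", [Finding.MISSING_METADATA, Finding.PROTECTION]),
    ]
    outcomes, errors = summarise(stock)
    print(dict(outcomes))
    for finding, count in errors:
        print(f"{finding.value}: {count}")
```

    Run once over an existing archive, such a summary corresponds to the "first run" the article recommends; hooked into the archiving interface of a DMS, the same triage step would make the inspection obligatory for every incoming file.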