SlideShare une entreprise Scribd logo
1  sur  18
ISO 19157 – A Unified Metadata Model for
Data Quality
Ted Habermann
Director of Earth Science
The HDF Group
thabermann@hdfgroup.org
1
The Global Change Master Directory
(GCMD) and the EOS Clearinghouse
(ECHO) both include data quality
information?
Can this information be mapped to
ISO 19157?
GCMD ECHO
ISO
?Presented May 21, 2014 at the Earth Science
Information Partnership Information Quality Cluster
Metadata in Multiple Dialects
Documentation
Repository
ISO 19115-1,
19115-2, 19119
19157 and
extensions
THREDDS
HDF, netCDF
(NcML)
FGDC,
Data.Gov
SensorML
WCS, WMS,
WFS, SOS
Open
Provenance
Model, PROV
DIF, ECS,
ECHO
KML
GCMD Data Quality
3
The <Quality> field allows the author to
provide information about the quality of
the data or any quality assurance
procedures followed in producing the
data described in the metadata.
GCMD Quality Examples
4
<Quality>
Due to the lack of high resolution data available over the region for
1993-94, it has been hard to validate the product. However the maps of
burnt areas correspond well with active fire maps for the
region. Where large [>3km] scars are found, the detection is more
reliable. In areas of small scars more problems are involved. It is
hoped that the 1994-95 data set will cover the whole of the study area
and be calibrated by high resolution data.
</Quality> <Quality>
QA performed by CDIAC One of the roles of the Carbon Dioxide
Information Analysis Center (CDIAC) is quality assurance (QA) of
data. The QA process is an important component of the value-added
concept of assuring accurate, usable information for researchers,
because data received by CDIAC are rarely in condition for immediate
distribution, regardless of source.
</Quality> <Quality>
The fire training-set may also have been biased against savanna and
savanna woodland fires since their detection is more difficult than in
humid, forst environments with cool background temperatures
[Malingreau, 1990]. There may, therefore, be an under-sampling of
fires in these warmer background environments.
</Quality> <Quality>
Note that Data File 12, Report #2, TASK 2 (Auclair et al., 1994a) is a
Quality Assurance and Quality Control chapter for the areas of Canada,
Alaska, United States (48 states), with range estimates of validation
and error, a listing of discussions with experts in the field and a
review of the draft of data files.
</Quality>
ISO Data Quality Results
DQ_ConformanceResult
+ specification : CI_Citation
+ explanation : CharacterString
+ pass : Boolean
DQ_CoverageResultDQ_DescriptiveResult
+ statement : CharacterString
<<Abstract>>
DQ_Result
+ dateTime [0..1] : DateTime
+ resultScope [0..1] : DQ_Scope
DQ_QuantitativeResult
+ valueType [0..1] : RecordType
+ valueUnit : UnitOfMeasure
+ value [1..*] : Record
ISO 19157 includes four kinds of quality results.
It includes a new kind of report (DQ_DescriptiveResult) that includes a simple text
description of the result of the quality test.
ISO Data Quality Results
DQ_ConformanceResult
+ specification : CI_Citation
+ explanation : CharacterString
+ pass : Boolean
DQ_CoverageResultDQ_DescriptiveResult
+ statement : CharacterString
<<Abstract>>
DQ_Result
+ dateTime [0..1] : DateTime
+ resultScope [0..1] : DQ_Scope
DQ_QuantitativeResult
+ valueType [0..1] : RecordType
+ valueUnit : UnitOfMeasure
+ value [1..*] : Record
<Quality>
QA performed by CDIAC One of the roles of the Carbon Dioxide
Information Analysis Center (CDIAC) is quality assurance (QA) of
data. The QA process is an important component of the value-added
concept of assuring accurate, usable information for researchers,
because data received by CDIAC are rarely in condition for immediate
distribution, regardless of source.
</Quality>
The DQ_DescriptiveResult
matches the GCMD quality
element exactly.
GCMD Quality Examples
7
<Quality>
Due to the lack of high resolution data available over the region for
1993-94, it has been hard to validate the product. However the maps of
burnt areas cor- respond well with active fire maps for the
region. Where large [>3km] scars are found, the detection is more
reliable. In areas of small scars more problems are involved. It is
hoped that the 1994-95 data set will cover the whole of the study area
and be calibrated by high resolution data.
</Quality> <Quality>
QA performed by CDIAC One of the roles of the Carbon Dioxide
Information Analysis Center (CDIAC) is quality assurance (QA) of
data. The QA process is an important component of the value-added
concept of assuring accurate, usable information for researchers,
because data received by CDIAC are rarely in condition for immediate
distribution, regardless of source.
</Quality> <Quality>
The fire training-set may also have been biased against savanna and
savanna woodland fires since their detection is more difficult than in
humid, forst environments with cool background temperatures
[Malingreau, 1990]. There may, therefore, be an under-sampling of
fires in these warmer background environments.
</Quality> <Quality>
Note that Data File 12, Report #2, TASK 2 (Auclair et al., 1994a) is a
Quality Assurance and Quality Control chapter for the areas of Canada,
Alaska, United States (48 states), with range estimates of validation
and error, a listing of discussions with experts in the field and a
review of the draft of data files.
</Quality>
Standalone Quality Reports
DQ_Element
+ standaloneQualityReportDetails [0..1] : CharacterString
DQ_DataQuality
+ scope : DQ_Scope
+ report 1..*
DQ_StandaloneQualityReportInformation
+ reportReference: CI_Citation
+ abstract : CharacterString
+ standaloneQualityReport 0..1
ISO 19157 acknowledges that important quality information can exist in standalone
reports that may not fit easily into the ISO conceptual model. These standalone
reports are cited with abstracts that summarize the results for the user without
neccessitating access to the cited report.
ECHO Quality Information
ECHO includes two kinds of quality information
ECHO Quality Information
ECHO includes two kinds of quality information
QAStats – Standard measures for all products
QAPercentMissingData - Granule level % missing data. This attribute can be
repeated for individual parameters within a granule.
QAPercentOutOfBoundsData – Granule level % out of bounds data. This attribute
can be repeated for individual parameters within a granule.
QAPercentInterpolatedData – Granule level % interpolated data. This attribute
can be repeated for individual parameters within a granule.
QAPercentCloudCover – This attribute is used to characterize the cloud cover
amount of a granule. This attribute may be repeated for individual parameters
within a granule. (Note - there may be more than one way to define a cloud or it's
effects within a product containing several parameters; i.e. this attribute may be
parameter specific)
ECHO Quality Information
ECHO includes two kinds of quality information
QAFlags – Classes of quality measures with product specific implementations
AutomaticQualityFlag – The granule level flag applying generally to the granule and specifically to
parameters the granule level. When applied to parameter, the flag refers to the quality of that parameter
for the granule (as applicable). The parameters determining whether the flag is set are defined by the
developer and documented in the Quality Flag Explanation.
AutomaticQualityFlagExplanation – A text explanation of the criteria used to set automatic quality flag,
including thresholds or other criteria.
OperationalQualityFlag – The granule level flag applying both generally to a granule and specifically to
parameters at the granule level. When applied to parameter, the flag refers to the quality of that
parameter for the granule (as applicable). The parameters determining whether the flag is set are defined
by the developers and documented in the Operational Quality Flag Explanation.
OperationalQualityFlagExplanation – A text explanation of the criteria used to set operational quality
flag; including thresholds or other criteria.
ScienceQualityFlag – Granule level flag applying to a granule, and specifically to parameters. When
applied to parameter, the flag refers to the quality of that parameter for the granule (as applicable). The
parameters determining whether the flag is set are defined by the developers and documented in the
Science Quality Flag Explanation.
ScienceQualityFlagExplanation – A text explanation of the criteria used to set science quality flag;
including thresholds or other criteria.
ECHO Data Quality Measures
DQ_MeasureReference
+ measureIdentification [0..1] : MD_Identifier
+ nameOfMeasure [0..*] : CharacterString
+ measureDescription [0..1] : CharacterString
if measureIdentification is not provided,
then nameOfMeasure shall be provided
+ measure 0..1
ISO 19157 data quality measure references identify measures in several ways and
provides a brief description of the measure.
<<Abstract>>
DQ_Element
+ measure [0..*] : DQ_MeasureReference
+ evaluationMethod [0..1] : DQ_EvaluationMethod
+ result [0..1] : DQ_Result
QAPercentMissingData
QAPercentOutOfBoundsData
QAPercentInterpolatedData
QAPercentCloudCover
AutomaticQualityFlag
OperationalQualityFlag
ScienceQualityFlag
AutomaticQualityFlagExplanation
OperationalQualityFlagExplanation
ScienceQualityFlagExplanation
Data Quality Measures
DQM_Measure
+ measureIdentifier : MD_Identifier
+ name : CharacterString
+ alias [0..*] : CharacterString
+ sourceReference [0..*] : CI_Citation
+ elementName [1..*] : TypeName
+ definition : CharacterString
+ description [0..1] : DQM_Description
+ valueType : TypeName
+ valueStructure [0..1] : DQM_ValueStructure
+ example [0..*] : DQM_Description
DQM_BasicMeasure
+ name : CharacterString
+ definition : CharacterString
+ example : DQM_Description [0..1]
+ valueType : TypeName
<<CodeList>>
DQ_ValueStructure
+ bag
+ set
+ sequence
+ table
+ matrix
+ coverage
DQM_Parameter
+ name : CharacterString
+ definition : CharacterString
+ description: DQM_Description [0..1]
+ valueType : TypeName
+ valueStructure [0..1] : DQM_ValueStructure
DQM_Description
+ textDescription: CharacterString
+ extendedDescription [0..1] : MD_BrowseGraphic
DQ_MeasureReference
+ measureIdentification [0..1] : MD_Identifier
+ nameOfMeasure [0..*] : CharacterString
+ measureDescription [0..1] : CharacterString
if measureIdentification is not provided,
then nameOfMeasure shall be provided
+ measure 0..1
ISO 19157 data quality measure properties are in a
separate object and provide a much more complete
description of the measures.
<<Abstract>>
DQ_Element
+ measure [0..*] : DQ_MeasureReference
+ evaluationMethod [0..1] : DQ_EvaluationMethod
+ result [0..1] : DQ_Result
ECHO Quality Measure Descriptions
14
AutomaticQualityFlagExplanation
(2289)
66% parameter is produced correctly
7% No automatic quality assessment is performed in the PGE
5%
Based on percentage of product that is good. Suspect used where true
quality is not known.
4% Automatic quality determination software not yet implemented
2% QA flag explanation
ScienceQualityFlagExplanation
(346)
OperationalQualityFlagExplanation
(238)
35% Passed
14%
Passed,parameter passed the specified operational test. Inferred Pass,parameter
terminated with warnings. Failed parameter terminated with fatal errors.
11% Not Investigated
10%
Passed,parameter passed the specified science test. Inferred Pass,parameter
terminated with warnings for specified science test. Failed parameter terminated
with fatal errors for specified science test.
10%
See http://landweb.nascom.nasa.gov/cgi-
bin/QA_WWW/qaFlagPage.cgi?sat=aqua for the product Science Quality status.
8% Validated, see http://disc.gsfc.nasa.gov/Aura/MLS/ for quality document
6%
See http://landweb.nascom.nasa.gov/cgi-
bin/QA_WWW/qaFlagPage.cgi?sat=terra for the product Science Quality status.
5%
An updated science quality flag and explanation is put in the product .met file
when a granule has been evaluated. The flag value in this file, Not Investigated, is
an automatic default that is put into every granule during production.
ISO Data Quality Results
DQ_ConformanceResult
+ specification : CI_Citation
+ explanation : CharacterString
+ pass : Boolean
DQ_CoverageResultDQ_DescriptiveResult
+ statement : CharacterString
<<Abstract>>
DQ_Result
+ dateTime [0..1] : DateTime
+ resultScope [0..1] : DQ_Scope
DQ_QuantitativeResult
+ valueType [0..1] : RecordType
+ valueUnit : UnitOfMeasure
+ value [1..*] : Record
Passed
Suspect
Failed
Not Investigated
Inferred Passed
Being Investigated
16
GCMD ECHO
ISO
YES!
Questions?
17
Questions?
tedhabermann@hdfgroup.org
Acknowledgements
This work was partially supported by contract number NNG10HP02C from NASA.
Any opinions, findings, conclusions, or recommendations expressed in this material are
those of the author and do not necessarily reflect the views of NASA or The HDF Group.

Contenu connexe

En vedette

This Year's Model: Metadata Models and Representations - Tech Forum 2014 - Gr...
This Year's Model: Metadata Models and Representations - Tech Forum 2014 - Gr...This Year's Model: Metadata Models and Representations - Tech Forum 2014 - Gr...
This Year's Model: Metadata Models and Representations - Tech Forum 2014 - Gr...BookNet Canada
 
How to Create a Metadata Schema that Doesn't suck in 5 Easy Steps
How to Create a Metadata Schema that Doesn't suck in 5 Easy Steps How to Create a Metadata Schema that Doesn't suck in 5 Easy Steps
How to Create a Metadata Schema that Doesn't suck in 5 Easy Steps Emily Kolvitz
 
Model metadata sheet - WAMSI
Model metadata sheet - WAMSIModel metadata sheet - WAMSI
Model metadata sheet - WAMSIegovindia
 
What the future holds: Metadata and Document Repositories Stanislav Nazarenk...
What the future holds: Metadata and Document Repositories  Stanislav Nazarenk...What the future holds: Metadata and Document Repositories  Stanislav Nazarenk...
What the future holds: Metadata and Document Repositories Stanislav Nazarenk...Institute of Public Health in Ireland
 
Alfresco Day BeNelux: The success of Alfresco
Alfresco Day BeNelux: The success of AlfrescoAlfresco Day BeNelux: The success of Alfresco
Alfresco Day BeNelux: The success of AlfrescoAlfresco Software
 
Customizing the Document Library
Customizing the Document LibraryCustomizing the Document Library
Customizing the Document LibraryAlfresco Software
 
PLAT-13 Metadata Extraction and Transformation
PLAT-13 Metadata Extraction and TransformationPLAT-13 Metadata Extraction and Transformation
PLAT-13 Metadata Extraction and TransformationAlfresco Software
 
Alfresco Day Benelux Hogeschool Inholland Records Management application
Alfresco Day Benelux Hogeschool Inholland Records Management applicationAlfresco Day Benelux Hogeschool Inholland Records Management application
Alfresco Day Benelux Hogeschool Inholland Records Management applicationAlfresco Software
 
Alfresco Day BeNelux: Customer Success Showcase - Saxion Hogescholen
Alfresco Day BeNelux: Customer Success Showcase - Saxion HogescholenAlfresco Day BeNelux: Customer Success Showcase - Saxion Hogescholen
Alfresco Day BeNelux: Customer Success Showcase - Saxion HogescholenAlfresco Software
 
Enterprise Knowledge - Taxonomy Design Best Practices and Methodology
Enterprise Knowledge - Taxonomy Design Best Practices and MethodologyEnterprise Knowledge - Taxonomy Design Best Practices and Methodology
Enterprise Knowledge - Taxonomy Design Best Practices and MethodologyEnterprise Knowledge
 
Taxonomies and Metadata in Information Architecture
Taxonomies and Metadata in Information ArchitectureTaxonomies and Metadata in Information Architecture
Taxonomies and Metadata in Information ArchitectureAccess Innovations, Inc.
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata managementOpen Data Support
 

En vedette (17)

Ariyalur
AriyalurAriyalur
Ariyalur
 
This Year's Model: Metadata Models and Representations - Tech Forum 2014 - Gr...
This Year's Model: Metadata Models and Representations - Tech Forum 2014 - Gr...This Year's Model: Metadata Models and Representations - Tech Forum 2014 - Gr...
This Year's Model: Metadata Models and Representations - Tech Forum 2014 - Gr...
 
How to Create a Metadata Schema that Doesn't suck in 5 Easy Steps
How to Create a Metadata Schema that Doesn't suck in 5 Easy Steps How to Create a Metadata Schema that Doesn't suck in 5 Easy Steps
How to Create a Metadata Schema that Doesn't suck in 5 Easy Steps
 
Model metadata sheet - WAMSI
Model metadata sheet - WAMSIModel metadata sheet - WAMSI
Model metadata sheet - WAMSI
 
What the future holds: Metadata and Document Repositories Stanislav Nazarenk...
What the future holds: Metadata and Document Repositories  Stanislav Nazarenk...What the future holds: Metadata and Document Repositories  Stanislav Nazarenk...
What the future holds: Metadata and Document Repositories Stanislav Nazarenk...
 
Document repositories-and-metadata
Document repositories-and-metadataDocument repositories-and-metadata
Document repositories-and-metadata
 
Alfresco Day BeNelux: The success of Alfresco
Alfresco Day BeNelux: The success of AlfrescoAlfresco Day BeNelux: The success of Alfresco
Alfresco Day BeNelux: The success of Alfresco
 
Customizing the Document Library
Customizing the Document LibraryCustomizing the Document Library
Customizing the Document Library
 
PLAT-13 Metadata Extraction and Transformation
PLAT-13 Metadata Extraction and TransformationPLAT-13 Metadata Extraction and Transformation
PLAT-13 Metadata Extraction and Transformation
 
Alfresco Day Benelux Hogeschool Inholland Records Management application
Alfresco Day Benelux Hogeschool Inholland Records Management applicationAlfresco Day Benelux Hogeschool Inholland Records Management application
Alfresco Day Benelux Hogeschool Inholland Records Management application
 
Alfresco Day BeNelux: Customer Success Showcase - Saxion Hogescholen
Alfresco Day BeNelux: Customer Success Showcase - Saxion HogescholenAlfresco Day BeNelux: Customer Success Showcase - Saxion Hogescholen
Alfresco Day BeNelux: Customer Success Showcase - Saxion Hogescholen
 
CBAP® Preparation Course
CBAP® Preparation CourseCBAP® Preparation Course
CBAP® Preparation Course
 
Taxonomy And Metadata
Taxonomy And MetadataTaxonomy And Metadata
Taxonomy And Metadata
 
Enterprise Knowledge - Taxonomy Design Best Practices and Methodology
Enterprise Knowledge - Taxonomy Design Best Practices and MethodologyEnterprise Knowledge - Taxonomy Design Best Practices and Methodology
Enterprise Knowledge - Taxonomy Design Best Practices and Methodology
 
Taxonomies and Metadata in Information Architecture
Taxonomies and Metadata in Information ArchitectureTaxonomies and Metadata in Information Architecture
Taxonomies and Metadata in Information Architecture
 
Metadata Workshop
Metadata WorkshopMetadata Workshop
Metadata Workshop
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata management
 

Similaire à Can ISO 19157 support current NASA data quality metadata?

20050314 specification based regression test selection with risk analysis
20050314 specification based regression test selection with risk analysis20050314 specification based regression test selection with risk analysis
20050314 specification based regression test selection with risk analysisWill Shen
 
Foucher2002
Foucher2002Foucher2002
Foucher2002adakua
 
Qualification of analytical instrument of FTIR
Qualification of analytical instrument of FTIR Qualification of analytical instrument of FTIR
Qualification of analytical instrument of FTIR KUNDLAJAYALAKSHMI
 
(ATS6-APP08) ADQM Solution Deployment
(ATS6-APP08) ADQM Solution Deployment(ATS6-APP08) ADQM Solution Deployment
(ATS6-APP08) ADQM Solution DeploymentBIOVIA
 
Deployment of Debug and Trace for features in RISC-V Core
Deployment of Debug and Trace for features in RISC-V CoreDeployment of Debug and Trace for features in RISC-V Core
Deployment of Debug and Trace for features in RISC-V CoreIRJET Journal
 
Statistical Process Monitoring using R Software
Statistical Process Monitoring using R SoftwareStatistical Process Monitoring using R Software
Statistical Process Monitoring using R SoftwareUniversity of Sargodha
 
Automatic Target Detection using Maximum Average Correlation Height Filter an...
Automatic Target Detection using Maximum Average Correlation Height Filter an...Automatic Target Detection using Maximum Average Correlation Height Filter an...
Automatic Target Detection using Maximum Average Correlation Height Filter an...IRJET Journal
 
Analysis of quality metadata in the GEOSS Clearinghouse
Analysis of quality metadata in the GEOSS ClearinghouseAnalysis of quality metadata in the GEOSS Clearinghouse
Analysis of quality metadata in the GEOSS ClearinghousePaula Díaz
 
Huong dan cu the svm
Huong dan cu the svmHuong dan cu the svm
Huong dan cu the svmtaikhoan262
 
Assessment ArchitectureSecurity Assessment Plan (SAP) System Name.docx
Assessment ArchitectureSecurity Assessment Plan (SAP) System Name.docxAssessment ArchitectureSecurity Assessment Plan (SAP) System Name.docx
Assessment ArchitectureSecurity Assessment Plan (SAP) System Name.docxfestockton
 
Soft quality & standards
Soft quality & standardsSoft quality & standards
Soft quality & standardsPrince Bhanwra
 

Similaire à Can ISO 19157 support current NASA data quality metadata? (20)

Ops a la_carte_rdt_broch
Ops a la_carte_rdt_brochOps a la_carte_rdt_broch
Ops a la_carte_rdt_broch
 
Module Owb Tuning
Module Owb TuningModule Owb Tuning
Module Owb Tuning
 
Software metrics
Software metricsSoftware metrics
Software metrics
 
20050314 specification based regression test selection with risk analysis
20050314 specification based regression test selection with risk analysis20050314 specification based regression test selection with risk analysis
20050314 specification based regression test selection with risk analysis
 
Foucher2002
Foucher2002Foucher2002
Foucher2002
 
IoT and M2M Safety and Security
IoT and M2M Safety and Security 	IoT and M2M Safety and Security
IoT and M2M Safety and Security
 
Maestro_Abstract
Maestro_AbstractMaestro_Abstract
Maestro_Abstract
 
Qualification of analytical instrument of FTIR
Qualification of analytical instrument of FTIR Qualification of analytical instrument of FTIR
Qualification of analytical instrument of FTIR
 
(ATS6-APP08) ADQM Solution Deployment
(ATS6-APP08) ADQM Solution Deployment(ATS6-APP08) ADQM Solution Deployment
(ATS6-APP08) ADQM Solution Deployment
 
Deployment of Debug and Trace for features in RISC-V Core
Deployment of Debug and Trace for features in RISC-V CoreDeployment of Debug and Trace for features in RISC-V Core
Deployment of Debug and Trace for features in RISC-V Core
 
Statistical Process Monitoring using R Software
Statistical Process Monitoring using R SoftwareStatistical Process Monitoring using R Software
Statistical Process Monitoring using R Software
 
.doc
.doc.doc
.doc
 
.doc
.doc.doc
.doc
 
.doc
.doc.doc
.doc
 
Automatic Target Detection using Maximum Average Correlation Height Filter an...
Automatic Target Detection using Maximum Average Correlation Height Filter an...Automatic Target Detection using Maximum Average Correlation Height Filter an...
Automatic Target Detection using Maximum Average Correlation Height Filter an...
 
Analysis of quality metadata in the GEOSS Clearinghouse
Analysis of quality metadata in the GEOSS ClearinghouseAnalysis of quality metadata in the GEOSS Clearinghouse
Analysis of quality metadata in the GEOSS Clearinghouse
 
Guide
GuideGuide
Guide
 
Huong dan cu the svm
Huong dan cu the svmHuong dan cu the svm
Huong dan cu the svm
 
Assessment ArchitectureSecurity Assessment Plan (SAP) System Name.docx
Assessment ArchitectureSecurity Assessment Plan (SAP) System Name.docxAssessment ArchitectureSecurity Assessment Plan (SAP) System Name.docx
Assessment ArchitectureSecurity Assessment Plan (SAP) System Name.docx
 
Soft quality & standards
Soft quality & standardsSoft quality & standards
Soft quality & standards
 

Plus de Ted Habermann

Translation proofing
Translation proofingTranslation proofing
Translation proofingTed Habermann
 
New ISO Geospatial Metadata Capabilities
New ISO Geospatial Metadata CapabilitiesNew ISO Geospatial Metadata Capabilities
New ISO Geospatial Metadata CapabilitiesTed Habermann
 
The HDF Product Designer – Interoperability in the First Mile
The HDF Product Designer – Interoperability in the First MileThe HDF Product Designer – Interoperability in the First Mile
The HDF Product Designer – Interoperability in the First MileTed Habermann
 
Hdf Augmentation: Interoperability in the Last Mile
Hdf Augmentation: Interoperability in the Last MileHdf Augmentation: Interoperability in the Last Mile
Hdf Augmentation: Interoperability in the Last MileTed Habermann
 
Citations in ISO Metadata
Citations in ISO MetadataCitations in ISO Metadata
Citations in ISO MetadataTed Habermann
 
Translation proofing
Translation proofingTranslation proofing
Translation proofingTed Habermann
 
ESDIS and International Standards
ESDIS and International StandardsESDIS and International Standards
ESDIS and International StandardsTed Habermann
 
Wikis, Rubrics and Views: An Integrated Approach to Improving Documentation
Wikis, Rubrics and Views: An Integrated Approach to Improving DocumentationWikis, Rubrics and Views: An Integrated Approach to Improving Documentation
Wikis, Rubrics and Views: An Integrated Approach to Improving DocumentationTed Habermann
 
Metadata Evaluation and Improvement
Metadata Evaluation and ImprovementMetadata Evaluation and Improvement
Metadata Evaluation and ImprovementTed Habermann
 
Metadata For Humans and Machines
Metadata For Humans and MachinesMetadata For Humans and Machines
Metadata For Humans and MachinesTed Habermann
 
ESDIS Metadata Archive
ESDIS Metadata ArchiveESDIS Metadata Archive
ESDIS Metadata ArchiveTed Habermann
 
Earth Science Platform
Earth Science PlatformEarth Science Platform
Earth Science PlatformTed Habermann
 

Plus de Ted Habermann (13)

Translation proofing
Translation proofingTranslation proofing
Translation proofing
 
New ISO Geospatial Metadata Capabilities
New ISO Geospatial Metadata CapabilitiesNew ISO Geospatial Metadata Capabilities
New ISO Geospatial Metadata Capabilities
 
The HDF Product Designer – Interoperability in the First Mile
The HDF Product Designer – Interoperability in the First MileThe HDF Product Designer – Interoperability in the First Mile
The HDF Product Designer – Interoperability in the First Mile
 
Hdf Augmentation: Interoperability in the Last Mile
Hdf Augmentation: Interoperability in the Last MileHdf Augmentation: Interoperability in the Last Mile
Hdf Augmentation: Interoperability in the Last Mile
 
Citations in ISO Metadata
Citations in ISO MetadataCitations in ISO Metadata
Citations in ISO Metadata
 
Translation proofing
Translation proofingTranslation proofing
Translation proofing
 
ESDIS and International Standards
ESDIS and International StandardsESDIS and International Standards
ESDIS and International Standards
 
Wikis, Rubrics and Views: An Integrated Approach to Improving Documentation
Wikis, Rubrics and Views: An Integrated Approach to Improving DocumentationWikis, Rubrics and Views: An Integrated Approach to Improving Documentation
Wikis, Rubrics and Views: An Integrated Approach to Improving Documentation
 
Hdf Inside
Hdf InsideHdf Inside
Hdf Inside
 
Metadata Evaluation and Improvement
Metadata Evaluation and ImprovementMetadata Evaluation and Improvement
Metadata Evaluation and Improvement
 
Metadata For Humans and Machines
Metadata For Humans and MachinesMetadata For Humans and Machines
Metadata For Humans and Machines
 
ESDIS Metadata Archive
ESDIS Metadata ArchiveESDIS Metadata Archive
ESDIS Metadata Archive
 
Earth Science Platform
Earth Science PlatformEarth Science Platform
Earth Science Platform
 

Dernier

Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyDrAnita Sharma
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 

Dernier (20)

Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 

Can ISO 19157 support current NASA data quality metadata?

  • 1. ISO 19157 – A Unified Metadata Model for Data Quality Ted Habermann Director of Earth Science The HDF Group thabermann@hdfgroup.org 1 The Global Change Master Directory (GCMD) and the EOS Clearinghouse (ECHO) both include data quality information? Can this information be mapped to ISO 19157? GCMD ECHO ISO ?Presented May 21, 2014 at the Earth Science Information Partnership Information Quality Cluster
  • 2. Metadata in Multiple Dialects Documentation Repository ISO 19115-1, 19115-2, 19119 19157 and extensions THREDDS HDF, netCDF (NcML) FGDC, Data.Gov SensorML WCS, WMS, WFS, SOS Open Provenance Model, PROV DIF, ECS, ECHO KML
  • 3. GCMD Data Quality 3 The <Quality> field allows the author to provide information about the quality of the data or any quality assurance procedures followed in producing the data described in the metadata.
  • 4. GCMD Quality Examples 4 <Quality> Due to the lack of high resolution data available over the region for 1993-94, it has been hard to validate the product. However the maps of burnt areas correspond well with active fire maps for the region. Where large [>3km] scars are found, the detection is more reliable. In areas of small scars more problems are involved. It is hoped that the 1994-95 data set will cover the whole of the study area and be calibrated by high resolution data. </Quality> <Quality> QA performed by CDIAC One of the roles of the Carbon Dioxide Information Analysis Center (CDIAC) is quality assurance (QA) of data. The QA process is an important component of the value-added concept of assuring accurate, usable information for researchers, because data received by CDIAC are rarely in condition for immediate distribution, regardless of source. </Quality> <Quality> The fire training-set may also have been biased against savanna and savanna woodland fires since their detection is more difficult than in humid, forst environments with cool background temperatures [Malingreau, 1990]. There may, therefore, be an under-sampling of fires in these warmer background environments. </Quality> <Quality> Note that Data File 12, Report #2, TASK 2 (Auclair et al., 1994a) is a Quality Assurance and Quality Control chapter for the areas of Canada, Alaska, United States (48 states), with range estimates of validation and error, a listing of discussions with experts in the field and a review of the draft of data files. </Quality>
  • 5. ISO Data Quality Results DQ_ConformanceResult + specification : CI_Citation + explanation : CharacterString + pass : Boolean DQ_CoverageResultDQ_DescriptiveResult + statement : CharacterString <<Abstract>> DQ_Result + dateTime [0..1] : DateTime + resultScope [0..1] : DQ_Scope DQ_QuantitativeResult + valueType [0..1] : RecordType + valueUnit : UnitOfMeasure + value [1..*] : Record ISO 19157 includes four kinds of quality results. It includes a new kind of report (DQ_DescriptiveResult) that includes a simple text description of the result of the quality test.
  • 6. ISO Data Quality Results DQ_ConformanceResult + specification : CI_Citation + explanation : CharacterString + pass : Boolean DQ_CoverageResultDQ_DescriptiveResult + statement : CharacterString <<Abstract>> DQ_Result + dateTime [0..1] : DateTime + resultScope [0..1] : DQ_Scope DQ_QuantitativeResult + valueType [0..1] : RecordType + valueUnit : UnitOfMeasure + value [1..*] : Record <Quality> QA performed by CDIAC One of the roles of the Carbon Dioxide Information Analysis Center (CDIAC) is quality assurance (QA) of data. The QA process is an important component of the value-added concept of assuring accurate, usable information for researchers, because data received by CDIAC are rarely in condition for immediate distribution, regardless of source. </Quality> The DQ_DescriptiveResult matches the GCMD quality element exactly.
  • 7. GCMD Quality Examples 7 <Quality> Due to the lack of high resolution data available over the region for 1993-94, it has been hard to validate the product. However the maps of burnt areas cor- respond well with active fire maps for the region. Where large [>3km] scars are found, the detection is more reliable. In areas of small scars more problems are involved. It is hoped that the 1994-95 data set will cover the whole of the study area and be calibrated by high resolution data. </Quality> <Quality> QA performed by CDIAC One of the roles of the Carbon Dioxide Information Analysis Center (CDIAC) is quality assurance (QA) of data. The QA process is an important component of the value-added concept of assuring accurate, usable information for researchers, because data received by CDIAC are rarely in condition for immediate distribution, regardless of source. </Quality> <Quality> The fire training-set may also have been biased against savanna and savanna woodland fires since their detection is more difficult than in humid, forst environments with cool background temperatures [Malingreau, 1990]. There may, therefore, be an under-sampling of fires in these warmer background environments. </Quality> <Quality> Note that Data File 12, Report #2, TASK 2 (Auclair et al., 1994a) is a Quality Assurance and Quality Control chapter for the areas of Canada, Alaska, United States (48 states), with range estimates of validation and error, a listing of discussions with experts in the field and a review of the draft of data files. </Quality>
  • 8. Standalone Quality Reports DQ_Element + standaloneQualityReportDetails [0..1] : CharacterString DQ_DataQuality + scope : DQ_Scope + report 1..* DQ_StandaloneQualityReportInformation + reportReference: CI_Citation + abstract : CharacterString + standaloneQualityReport 0..1 ISO 19157 acknowledges that important quality information can exist in standalone reports that may not fit easily into the ISO conceptual model. These standalone reports are cited with abstracts that summarize the results for the user without neccessitating access to the cited report.
  • 9. ECHO Quality Information ECHO includes two kinds of quality information
  • 10. ECHO Quality Information ECHO includes two kinds of quality information QAStats – Standard measures for all products QAPercentMissingData - Granule level % missing data. This attribute can be repeated for individual parameters within a granule. QAPercentOutOfBoundsData – Granule level % out of bounds data. This attribute can be repeated for individual parameters within a granule. QAPercentInterpolatedData – Granule level % interpolated data. This attribute can be repeated for individual parameters within a granule. QAPercentCloudCover – This attribute is used to characterize the cloud cover amount of a granule. This attribute may be repeated for individual parameters within a granule. (Note - there may be more than one way to define a cloud or it's effects within a product containing several parameters; i.e. this attribute may be parameter specific)
  • 11. ECHO Quality Information ECHO includes two kinds of quality information QAFlags – Classes of quality measures with product specific implementations AutomaticQualityFlag – The granule level flag applying generally to the granule and specifically to parameters the granule level. When applied to parameter, the flag refers to the quality of that parameter for the granule (as applicable). The parameters determining whether the flag is set are defined by the developer and documented in the Quality Flag Explanation. AutomaticQualityFlagExplanation – A text explanation of the criteria used to set automatic quality flag, including thresholds or other criteria. OperationalQualityFlag – The granule level flag applying both generally to a granule and specifically to parameters at the granule level. When applied to parameter, the flag refers to the quality of that parameter for the granule (as applicable). The parameters determining whether the flag is set are defined by the developers and documented in the Operational Quality Flag Explanation. OperationalQualityFlagExplanation – A text explanation of the criteria used to set operational quality flag; including thresholds or other criteria. ScienceQualityFlag – Granule level flag applying to a granule, and specifically to parameters. When applied to parameter, the flag refers to the quality of that parameter for the granule (as applicable). The parameters determining whether the flag is set are defined by the developers and documented in the Science Quality Flag Explanation. ScienceQualityFlagExplanation – A text explanation of the criteria used to set science quality flag; including thresholds or other criteria.
  • 12. ECHO Data Quality Measures DQ_MeasureReference + measureIdentification [0..1] : MD_Identifier + nameOfMeasure [0..*] : CharacterString + measureDescription [0..1] : CharacterString if measureIdentification is not provided, then nameOfMeasure shall be provided + measure 0..1 ISO 19157 data quality measure references identify measures in several ways and provides a brief description of the measure. <<Abstract>> DQ_Element + measure [0..*] : DQ_MeasureReference + evaluationMethod [0..1] : DQ_EvaluationMethod + result [0..1] : DQ_Result QAPercentMissingData QAPercentOutOfBoundsData QAPercentInterpolatedData QAPercentCloudCover AutomaticQualityFlag OperationalQualityFlag ScienceQualityFlag AutomaticQualityFlagExplanation OperationalQualityFlagExplanation ScienceQualityFlagExplanation
  • 13. Data Quality Measures DQM_Measure + measureIdentifier : MD_Identifier + name : CharacterString + alias [0..*] : CharacterString + sourceReference [0..*] : CI_Citation + elementName [1..*] : TypeName + definition : CharacterString + description [0..1] : DQM_Description + valueType : TypeName + valueStructure [0..1] : DQM_ValueStructure + example [0..*] : DQM_Description DQM_BasicMeasure + name : CharacterString + definition : CharacterString + example : DQM_Description [0..1] + valueType : TypeName <<CodeList>> DQ_ValueStructure + bag + set + sequence + table + matrix + coverage DQM_Parameter + name : CharacterString + definition : CharacterString + description: DQM_Description [0..1] + valueType : TypeName + valueStructure [0..1] : DQM_ValueStructure DQM_Description + textDescription: CharacterString + extendedDescription [0..1] : MD_BrowseGraphic DQ_MeasureReference + measureIdentification [0..1] : MD_Identifier + nameOfMeasure [0..*] : CharacterString + measureDescription [0..1] : CharacterString if measureIdentification is not provided, then nameOfMeasure shall be provided + measure 0..1 ISO 19157 data quality measure properties are in a separate object and provide a much more complete description of the measures. <<Abstract>> DQ_Element + measure [0..*] : DQ_MeasureReference + evaluationMethod [0..1] : DQ_EvaluationMethod + result [0..1] : DQ_Result
  • 14. ECHO Quality Measure Descriptions 14 AutomaticQualityFlagExplanation (2289) 66% parameter is produced correctly 7% No automatic quality assessment is performed in the PGE 5% Based on percentage of product that is good. Suspect used where true quality is not known. 4% Automatic quality determination software not yet implemented 2% QA flag explanation ScienceQualityFlagExplanation (346) OperationalQualityFlagExplanation (238) 35% Passed 14% Passed,parameter passed the specified operational test. Inferred Pass,parameter terminated with warnings. Failed parameter terminated with fatal errors. 11% Not Investigated 10% Passed,parameter passed the specified science test. Inferred Pass,parameter terminated with warnings for specified science test. Failed parameter terminated with fatal errors for specified science test. 10% See http://landweb.nascom.nasa.gov/cgi- bin/QA_WWW/qaFlagPage.cgi?sat=aqua for the product Science Quality status. 8% Validated, see http://disc.gsfc.nasa.gov/Aura/MLS/ for quality document 6% See http://landweb.nascom.nasa.gov/cgi- bin/QA_WWW/qaFlagPage.cgi?sat=terra for the product Science Quality status. 5% An updated science quality flag and explanation is put in the product .met file when a granule has been evaluated. The flag value in this file, Not Investigated, is an automatic default that is put into every granule during production.
  • 15. ISO Data Quality Results DQ_ConformanceResult + specification : CI_Citation + explanation : CharacterString + pass : Boolean DQ_CoverageResultDQ_DescriptiveResult + statement : CharacterString <<Abstract>> DQ_Result + dateTime [0..1] : DateTime + resultScope [0..1] : DQ_Scope DQ_QuantitativeResult + valueType [0..1] : RecordType + valueUnit : UnitOfMeasure + value [1..*] : Record Passed Suspect Failed Not Investigated Inferred Passed Being Investigated
  • 18. Acknowledgements This work was partially supported by contract number NNG10HP02C from NASA. Any opinions, findings, conclusions, or recommendations expressed in this material are those of the author and do not necessarily reflect the views of NASA or The HDF Group.