2016 SDMX Experts meeting, National Accounts business case (validation, data cooperation) and its SDMX implementation through shared services, Daniel Suranyi, Alvaro Diez Soto
This document discusses Eurostat's plans to implement SDMX standards to improve data sharing and reuse across statistical domains. Currently, Eurostat statistical production is organized in "stovepipes" by domain, using different conventions and IT tools. Eurostat aims to establish shared statistical services and an interoperability architecture using common standards like SDMX to enable cross-domain data usage, increase transparency, and allow efficient sharing of IT resources. SDMX tools will be implemented for tasks like metadata management, data validation, loading, and dissemination to achieve these goals and benefits like reduced production time, greater transparency, and economies of scale.
Similaire à 2016 SDMX Experts meeting, National Accounts business case (validation, data cooperation) and its SDMX implementation through shared services, Daniel Suranyi, Alvaro Diez Soto
InstantAtlasfor Local Information Systems (Bristol Dec09)John Maslen
Similaire à 2016 SDMX Experts meeting, National Accounts business case (validation, data cooperation) and its SDMX implementation through shared services, Daniel Suranyi, Alvaro Diez Soto (20)
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
2016 SDMX Experts meeting, National Accounts business case (validation, data cooperation) and its SDMX implementation through shared services, Daniel Suranyi, Alvaro Diez Soto
1. Eurostat
National Accounts business case
(validation, data cooperation) and
its SDMX implementation through
shared services
October, 2016 1SDMX Experts meeting, Mexico
Daniel SURANYI
Eurostat, Directorate C: “National Accounts, prices & key indicators”
Alvaro DIEZ SOTO
Eurostat, Unit B3: “IT solutions for statistical production”
3. Eurostat
Official statistics: the challenge…
3
More timely policies
Commercial providers
GDP T+30
Official reference
Cross-domain usage
Shrinking resources
€ 🙎
5. Eurostat
Stovepipe production: the reality…
5
• Customised for a specific domain
• Conventions used within domains / surveys
• Hampering cross-domain usage
• Leading to low level of transparency
• Not possible to share IT tools efficiently
• Difficult to share data across domains / organisations
8. Eurostat
Target: flexible use of statistical services
11
• Customised for a specific domain
• Conventions used within domains / surveys
• Hampering cross-domain usage
• Leading to low level of transparency
• Not possible to share IT tools efficiently
• Difficult to share data across domains / organisations
9. Eurostat
Target: flexible use of statistical services
12
• Customised for a specific domain
• Conventions used within domains / surveys
• Hampering cross-domain usage
• Leading to low level of transparency
• Not possible to share IT tools efficiently
• Difficult to share data across domains / organisations
10. Eurostat
Target: flexible use of statistical services
13
• Architecture for cross-domain usage
• Standards used across domains / surveys
• Enabling cross-domain usage
• Leading to transparency
• Encouraged to share IT tools efficiently
• Facilitates sharing data across domains / organisations
11. Eurostat
The big picture: using standards
GSBPM
Process step
categories
GSIM
Reference
information
model
☑ Statistical
Production
CSPA
Service specification
SDMX
Data ModellingVTL
Validation
expressions
12. SDMX compliance
• Valid SDMX-ML file
• Coded according to the DSD
• Mandatory fields present
• Correct data types
• Dataflow definition
Basic logical checks
• Sender ID and REF_AREA
• Table ID is present
• Value "NaN" and OBS_STATUS
• EMBARGO_DATE and CONF_STATUS
• PRICES and REF_YEAR_PRICE
Basic content checks
• Missing or unexpected series
• Hole in series
• Zero values
• Negative values
General plausibility and
consistency (within file)
• Additivity of breakdowns
• Outliers
• Consistency between prices
• Unadjusted and adjusted series
Advanced plausability and
consistency (across files)
• Revisions
• Quarterly versus Annual
• Same series across tables
Cross-domain or source
checks
• Balance of Payments
• Trade statistics
• Labour market statistics
• Data pulished by NSI or IO
SDMX
Registry
Structural
Validation
Content
Validation
VTL
Repository
?
14. Eurostat
Eurostat SDMX tools
October, 2016 17SDMX Experts meeting, Mexico
Where is SDMX?
Metadata design and build
Data compliance
Collection
Validation
Data loading and storage
Disseminate
DSW; SDMX Registry
SDMX Converter; SDMX-RI
EDAMIS; SDMX-RI;
Census/Data Hub; ESS-MH
STRUVAL; CONVAL
DLBB
DSWS; SDMX-RI; Census Hub
SDMX tools Input Used in
More information here
15. Eurostat
Our target - Interoperability Architecture
Support and service improvements
EU Public Licence and others
Community and forum pages
Shared/collaborative development
October, 2016 18SDMX Experts meeting, Mexico
Security and availability
Shared services, housing, hosting
Auditing & Operations
Modular & interoperable
Reference architecture
Strategy
16. Eurostat
Implementation: use of SDMX tools
19
SDMX Converter
SDMX-RI
Web & Test Client
EDAMIS
SDMX-RI
Web Services
Struval, Conval
18. Eurostat
Standardisation incl. SDMX deliver…
22
Shorter throughput
from producer to user
Transparent
• production process
• validation rules
Economy of
scale & scope
€ 🙌
SDMX is used in different processes related to data reporting, dissemination and exchange. These processes are in compliance with the structure and phases defined under GSBPM model
Data Structure Wizard - Data modeling, GUI
SDMX Registry - Data modeling, Central repository, artefact exchange, WS
SDMX Converter - Create/Convert SDMX data from files, GUI, API, CLI, WS
SDMX RI - Create/Convert data from DB, artefact and data reporting, data dissemination, set of tools, GUI, API, WS
DSWS - Disseminate data, uses SDMX RI WS
ESS-MH - Reference Metadata reporting tool
CENSUS/Data HUB - Single Dissemination point for MS Census 2011/2021 data. Data Hub – SDMX query generator and SDMX compliant web service client, for data retrieval, display and dispatching. Both are clinet implementations on the SDMX RI WS
STRUVAL - Validate data, validation reports, WS and API
Each tool creates an input for the next step, while the product is used also all along the chain/process, ensuring reusability, single maintenance and storage, centralised management and the same input all along the chain
Follow the approach of building blocks; cross border framework; reusable solutions with low maintenance cost; enhancements easy to implement as plug-ins; similar to MS software and compatible with environment; follows common components and reference framework/architecture established in the standard.
SDMX is used in different processes related to data reporting, dissemination and exchange. These processes are in compliance with the structure and phases defined under GSBPM model
Data Structure Wizard - Data modeling, GUI
SDMX Registry - Data modeling, Central repository, artefact exchange, WS
SDMX Converter - Create/Convert SDMX data from files, GUI, API, CLI, WS
SDMX RI - Create/Convert data from DB, artefact and data reporting, data dissemination, set of tools, GUI, API, WS
DSWS - Disseminate data, uses SDMX RI WS
ESS-MH - Reference Metadata reporting tool
CENSUS/Data HUB - Single Dissemination point for MS Census 2011/2021 data. Data Hub – SDMX query generator and SDMX compliant web service client, for data retrieval, display and dispatching. Both are clinet implementations on the SDMX RI WS
STRUVAL - Validate data, validation reports, WS and API
Each tool creates an input for the next step, while the product is used also all along the chain/process, ensuring reusability, single maintenance and storage, centralised management and the same input all along the chain