SlideShare a Scribd company logo
1 of 19
Download to read offline
1
Copyright © 1991 ‐ 2016 R20/Consultancy B.V., The 
Hague, The Netherlands. All rights reserved. No 
part of this material may be reproduced, stored in 
a retrieval system, or transmitted in any form or by 
any means, electronic, mechanical, photographic, 
or otherwise, without the explicit written 
permission of the copyright owners.
Data Quality and 
Governance in a 
Data‐Obsessed World
by
Rick F. van der Lans
R20/Consultancy BV
Twitter @rick_vanderlans
www.r20.nl
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 2
Rick F. van der Lans
Rick F. van der Lans is an independent consultant, lecturer, and author. He
specializes in data warehousing, business intelligence, database technology,
and data virtualization. He is managing director of R20/Consultancy B.V.. Rick
has been involved in various projects in which data warehousing, and
integration technology was applied.
Rick van der Lans is an internationally acclaimed lecturer. He has lectured
professionally for the last twenty five years in many of the European and
Middle East countries, the USA, South America, and in Australia. He has been
invited by several major software vendors to present keynote speeches.
He is the author of several books on computing, including his new Data
Virtualization for Business Intelligence Systems. Some of these books are
available in different languages. Books such as the popular Introduction to
SQL is available in English, Dutch, Italian, Chinese, and German and is sold
world wide. He also authored The SQL Guide to Ingres and SQL for MySQL
Developers.
As author for TechTarget.com and BeyeNetwork.com, writer of whitepapers,
chairman for the annual European Enterprise Data and Business Intelligence
Conference, and as columnist for a few IT magazines, he has close contacts
with many vendors.
R20/Consultancy B.V. is located in The Hague, The Netherlands, www.r20.nl. You can get in touch with Rick via:
Email: rick@r20.nl
Twitter: @Rick_vanderlans
LinkedIn: http://www.linkedin.com/pub/rick-van-der-lans/9/207/223
2
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 3
Economic Resources
Economic resources = Factors of
production
Primary resources: land, labor, and
capital
• primary factors facilitate production but
neither become part of the product
Secondary resources: materials and
energy
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 4
The New Economic Resource: Data
3
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 5
Usage of Production Data is Changing
Data is used for reporting
Data is used for forecasting and predictions
Data is used for improving business
processes
Data is used for improving customer care
Data is used for product personalization
Data is used by customers and suppliers
Data is used …
Before
Now
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 6
The Importance of Data Quality
The quality of raw products determines
the quality of end products
The quality of labor determines the
quality of end products
Likewise …
The quality of data determines the
quality of an organization’s products and
efficiency
4
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 7
Data Quality is Key
Source: Experian Data Quality, 2015; see https://www.edq.com/uk/resources/papers/global-data-quality-research/
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 8
The Classic Data Warehouse Architecture
ETL ETLETL
Source
systems
Data martsData
warehouse
Staging
area
Analytics &
reporting
5
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 9
The Classic Data Warehouse Architecture
ETLETL
Source
systems
Data martsData
warehouse
Staging
area
Analytics &
reporting
Data Cleansing
ETL
Manual corrections
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 10
“Old” Requirements
No need for real-time data in reports
• There was time to spend on data cleansing
No need for high-quality data in
production systems
Only internally-produced data used
for reporting
Mostly internal users
All reports developed by IT
specialists
6
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 11
New Requirements
Reporting and analytics requires real-
time data
External users, such as customers and
suppliers
Mixing of internal with external data
Machine-generated data
Self-service development of reports
…
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 12
Operational Business Intelligence
Web analytics: Which ad or product to present now
Security: Face recognition real-time
Factories: Changing machine settings based on real-
time events
Call Centers: Predict the chance of churning and
predict which service or upgrade to offer
Incorrect data can lead to the wrong reaction
7
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 13
The Chain is Too Long for Real‐time Reporting
ETL ETLETL
Source
systems
Data martsData
warehouse
Staging
area
Operational
Analytics &
reporting
Too many steps and too much copying
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 14
The Chain is Too Long for Real‐time Reporting
ETL ETLETL
Source
systems
Data martsData
warehouse
Staging
area
Classic
Analytics &
reporting
Operational
BI reports
8
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 15
Customer‐Driven BI
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 16
Real‐Time Reporting for Customers
9
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 17
Real‐Time Analytics for Customers
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 18
High Data Quality
is Crucial for
Customer‐Driven BI
10
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 19
Streaming Data
Producers
of data
Storage of
streaming data
Consumers
of data
Listener
Listener
Listener
Listener
Stream
processor
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 20
Data Streaming for Operational BI
ETL ETLETL
Source
systems
Data martsStaging
area
Analytics &
reporting
Data
warehouse
Producers
of data Consumers
of data
Stream
processor
?
11
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 21
Self‐Service BI Continues
Self-Service Data
Visualization
Self-Service Analytics
Self-Service ETL
Self-Service Data
Preparation
Self-Service …
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 22
Self‐Service Data Preparation
Non-technical interface for
studying data files
Easy way of defining rules
Data is fixed by defining
filters, not by changing data
in source systems
Relationship with data
blending
Users are def ining t heir own
dat a qualit y rules
12
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 23
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 24
Open Data is Available in Abundance
13
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 25
External Data Integration by IT?
ETL ETLETL
Source
systems
Data martsData
warehouse
Staging
area
Analytics &
reporting
Social
media data
Open data
Spreadsheets
ETL
ETL ETL
?
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 26
External Data Integration by Users
ETL ETLETL
Source
systems Data marts
Data
warehouse
Staging
area Self‐Service
Analytics
Social
media data
Open data
Spreadsheets
?
14
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 27
Raising the Data Quality Bar
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 28
Option 1: Do Nothing
15
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 29
Option 2: 
Old Technology
For New Applications 
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 30
Option 3:
Adopt New Technology, 
but Stick to Old Ideas
16
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 31
Recommendations (1)
Data quality is not only relevant
for reporting and analytics
Data has become a primary
economic resource
Data quality improves reporting
results, but has operational
business impact as well
Poor data quality can be as
damaging to an organization as
other poor-quality resources
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 32
Recommendations (2)
Presenting poor data quality to
customers and suppliers will
reflect poorly on an organization
Poor data quality may lower trust
in the organization
17
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 33
Recommendations (3)
Move data quality checks
upstream
Develop new production systems
with data quality checks built-in
Use new architectures
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 34
ETL ETLETL
Source
systems
Data martsStaging
area
Analytics &
reporting
Data
warehouse
Shortening the Chain
ETLETL
ETL
18
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 35
Recommendations (4)
A dat a st rat egy is essential for
implementing an adequate data
quality program, not an option
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 36
What is Data Strategy?
A single, unified, organization-wide plan …
… for the use of corporate data …
… as a vital asset for strategic and
operational decision-making.
Investing in a formal data strategy lends
much needed intentionality around critical
data related issues, such as data quality,
metadata, performance, data distribution,
organization, ownership, security, privacy,
etc.
Source: Capstone Consulting, January 2009
19
Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 37
Data Quality

More Related Content

Similar to Data Quality and Governance in a Data Obsessed World

Big Data LDN 2016: Case Studies of Business Transformation through Big Data
Big Data LDN 2016: Case Studies of Business Transformation through Big DataBig Data LDN 2016: Case Studies of Business Transformation through Big Data
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
Matt Stubbs
 
UX STRAT Europe 2019: Rob van der Haar
UX STRAT Europe 2019: Rob van der HaarUX STRAT Europe 2019: Rob van der Haar
UX STRAT Europe 2019: Rob van der Haar
UX STRAT
 

Similar to Data Quality and Governance in a Data Obsessed World (20)

Big Data Expo 2015 - R20 Six Big Myths of Big Data
Big Data Expo 2015 - R20 Six Big Myths of Big DataBig Data Expo 2015 - R20 Six Big Myths of Big Data
Big Data Expo 2015 - R20 Six Big Myths of Big Data
 
Why Data Virtualization? By Rick van der Lans
Why Data Virtualization? By Rick van der LansWhy Data Virtualization? By Rick van der Lans
Why Data Virtualization? By Rick van der Lans
 
Become Agile with Data Modeling
Become Agile with Data ModelingBecome Agile with Data Modeling
Become Agile with Data Modeling
 
THE INDUSTRY'S FIRST VIRTUAL EVENT IN ROMANIA - Why Data Virtualization is a ...
THE INDUSTRY'S FIRST VIRTUAL EVENT IN ROMANIA - Why Data Virtualization is a ...THE INDUSTRY'S FIRST VIRTUAL EVENT IN ROMANIA - Why Data Virtualization is a ...
THE INDUSTRY'S FIRST VIRTUAL EVENT IN ROMANIA - Why Data Virtualization is a ...
 
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
Big Data LDN 2016: Case Studies of Business Transformation through Big DataBig Data LDN 2016: Case Studies of Business Transformation through Big Data
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
 
Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...
Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...
Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...
 
Hortonworks laurie maclachlan
Hortonworks laurie maclachlanHortonworks laurie maclachlan
Hortonworks laurie maclachlan
 
Capgemini’s Data WARP: Accelerate your Journey to Insights
Capgemini’s Data WARP: Accelerate your Journey to InsightsCapgemini’s Data WARP: Accelerate your Journey to Insights
Capgemini’s Data WARP: Accelerate your Journey to Insights
 
Are You Killing the Benefits of Your Data Lake?
Are You Killing the Benefits of Your Data Lake?Are You Killing the Benefits of Your Data Lake?
Are You Killing the Benefits of Your Data Lake?
 
business analytics 2016 - lean principles implementing your data platform
business analytics 2016 -  lean principles implementing your data platformbusiness analytics 2016 -  lean principles implementing your data platform
business analytics 2016 - lean principles implementing your data platform
 
Otto - Combinning recommendations and dynamic pricing: an offer you can't ref...
Otto - Combinning recommendations and dynamic pricing: an offer you can't ref...Otto - Combinning recommendations and dynamic pricing: an offer you can't ref...
Otto - Combinning recommendations and dynamic pricing: an offer you can't ref...
 
Big Data Refinery: Distilling Value for User-Driven Analytics
Big Data Refinery: Distilling Value for User-Driven AnalyticsBig Data Refinery: Distilling Value for User-Driven Analytics
Big Data Refinery: Distilling Value for User-Driven Analytics
 
How to Keep SAP Projects on Schedule with B2B Managed Services
How to Keep SAP Projects on Schedule with B2B Managed Services How to Keep SAP Projects on Schedule with B2B Managed Services
How to Keep SAP Projects on Schedule with B2B Managed Services
 
Big Data Enabled: How YARN Changes the Game
Big Data Enabled: How YARN Changes the GameBig Data Enabled: How YARN Changes the Game
Big Data Enabled: How YARN Changes the Game
 
EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...
EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...
EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...
 
Trivadis TechEvent 2016 DWH Modernization – in the Age of Big Data by Gregor ...
Trivadis TechEvent 2016 DWH Modernization – in the Age of Big Data by Gregor ...Trivadis TechEvent 2016 DWH Modernization – in the Age of Big Data by Gregor ...
Trivadis TechEvent 2016 DWH Modernization – in the Age of Big Data by Gregor ...
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
 
O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...
O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...
O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...
 
Data-informed Experience Design
Data-informed Experience DesignData-informed Experience Design
Data-informed Experience Design
 
UX STRAT Europe 2019: Rob van der Haar
UX STRAT Europe 2019: Rob van der HaarUX STRAT Europe 2019: Rob van der Haar
UX STRAT Europe 2019: Rob van der Haar
 

More from ibi

More from ibi (20)

Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar
 
Data Monetization Expert Session Webinar
Data Monetization Expert Session WebinarData Monetization Expert Session Webinar
Data Monetization Expert Session Webinar
 
Embedded Analytics Expert Session Webinar
Embedded Analytics Expert Session Webinar Embedded Analytics Expert Session Webinar
Embedded Analytics Expert Session Webinar
 
Predictive and Prescriptive Analytics Expert Session Webinar
Predictive  and Prescriptive Analytics Expert Session Webinar Predictive  and Prescriptive Analytics Expert Session Webinar
Predictive and Prescriptive Analytics Expert Session Webinar
 
Internet of Things (IoT) Expert Session Webinar
Internet of Things (IoT) Expert Session WebinarInternet of Things (IoT) Expert Session Webinar
Internet of Things (IoT) Expert Session Webinar
 
Artificial Intelligence Expert Session Webinar
Artificial Intelligence Expert Session Webinar Artificial Intelligence Expert Session Webinar
Artificial Intelligence Expert Session Webinar
 
Celebrating Women Today and Everyday
Celebrating Women Today and EverydayCelebrating Women Today and Everyday
Celebrating Women Today and Everyday
 
The Value of Improved Clinical Information Management for Payers
The Value of Improved Clinical Information Management for PayersThe Value of Improved Clinical Information Management for Payers
The Value of Improved Clinical Information Management for Payers
 
Five Hot Trends for 2018
Five Hot Trends for 2018Five Hot Trends for 2018
Five Hot Trends for 2018
 
What Employees Think of Working at Information Builders
What Employees Think of Working at Information BuildersWhat Employees Think of Working at Information Builders
What Employees Think of Working at Information Builders
 
What Customers Are Saying About Information Builders
What Customers Are Saying About Information BuildersWhat Customers Are Saying About Information Builders
What Customers Are Saying About Information Builders
 
Accelerating Your Move to Value-Based Care
Accelerating Your Move to Value-Based CareAccelerating Your Move to Value-Based Care
Accelerating Your Move to Value-Based Care
 
Top 10 Reasons to Work at Information Builders
Top 10 Reasons to Work at Information BuildersTop 10 Reasons to Work at Information Builders
Top 10 Reasons to Work at Information Builders
 
Five Critical Success Factors for Embedded Analytics
Five Critical Success Factors for Embedded AnalyticsFive Critical Success Factors for Embedded Analytics
Five Critical Success Factors for Embedded Analytics
 
Why Attend Summit 2017?
Why Attend Summit 2017?Why Attend Summit 2017?
Why Attend Summit 2017?
 
Data Discovery and Governance
Data Discovery and GovernanceData Discovery and Governance
Data Discovery and Governance
 
Solving the BI Adoption Challenge With Report Consolidation
Solving the BI Adoption Challenge With Report ConsolidationSolving the BI Adoption Challenge With Report Consolidation
Solving the BI Adoption Challenge With Report Consolidation
 
What the Data Says...About Elections
What the Data Says...About ElectionsWhat the Data Says...About Elections
What the Data Says...About Elections
 
Transforming Healthcare: Improving Decision Support with Your Partners
Transforming Healthcare: Improving Decision Support with Your PartnersTransforming Healthcare: Improving Decision Support with Your Partners
Transforming Healthcare: Improving Decision Support with Your Partners
 
UX & Design Thinking for BI Applications
UX & Design Thinking for BI ApplicationsUX & Design Thinking for BI Applications
UX & Design Thinking for BI Applications
 

Recently uploaded

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Recently uploaded (20)

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 

Data Quality and Governance in a Data Obsessed World

  • 1. 1 Copyright © 1991 ‐ 2016 R20/Consultancy B.V., The  Hague, The Netherlands. All rights reserved. No  part of this material may be reproduced, stored in  a retrieval system, or transmitted in any form or by  any means, electronic, mechanical, photographic,  or otherwise, without the explicit written  permission of the copyright owners. Data Quality and  Governance in a  Data‐Obsessed World by Rick F. van der Lans R20/Consultancy BV Twitter @rick_vanderlans www.r20.nl Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 2 Rick F. van der Lans Rick F. van der Lans is an independent consultant, lecturer, and author. He specializes in data warehousing, business intelligence, database technology, and data virtualization. He is managing director of R20/Consultancy B.V.. Rick has been involved in various projects in which data warehousing, and integration technology was applied. Rick van der Lans is an internationally acclaimed lecturer. He has lectured professionally for the last twenty five years in many of the European and Middle East countries, the USA, South America, and in Australia. He has been invited by several major software vendors to present keynote speeches. He is the author of several books on computing, including his new Data Virtualization for Business Intelligence Systems. Some of these books are available in different languages. Books such as the popular Introduction to SQL is available in English, Dutch, Italian, Chinese, and German and is sold world wide. He also authored The SQL Guide to Ingres and SQL for MySQL Developers. As author for TechTarget.com and BeyeNetwork.com, writer of whitepapers, chairman for the annual European Enterprise Data and Business Intelligence Conference, and as columnist for a few IT magazines, he has close contacts with many vendors. R20/Consultancy B.V. is located in The Hague, The Netherlands, www.r20.nl. You can get in touch with Rick via: Email: rick@r20.nl Twitter: @Rick_vanderlans LinkedIn: http://www.linkedin.com/pub/rick-van-der-lans/9/207/223
  • 2. 2 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 3 Economic Resources Economic resources = Factors of production Primary resources: land, labor, and capital • primary factors facilitate production but neither become part of the product Secondary resources: materials and energy Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 4 The New Economic Resource: Data
  • 3. 3 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 5 Usage of Production Data is Changing Data is used for reporting Data is used for forecasting and predictions Data is used for improving business processes Data is used for improving customer care Data is used for product personalization Data is used by customers and suppliers Data is used … Before Now Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 6 The Importance of Data Quality The quality of raw products determines the quality of end products The quality of labor determines the quality of end products Likewise … The quality of data determines the quality of an organization’s products and efficiency
  • 4. 4 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 7 Data Quality is Key Source: Experian Data Quality, 2015; see https://www.edq.com/uk/resources/papers/global-data-quality-research/ Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 8 The Classic Data Warehouse Architecture ETL ETLETL Source systems Data martsData warehouse Staging area Analytics & reporting
  • 5. 5 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 9 The Classic Data Warehouse Architecture ETLETL Source systems Data martsData warehouse Staging area Analytics & reporting Data Cleansing ETL Manual corrections Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 10 “Old” Requirements No need for real-time data in reports • There was time to spend on data cleansing No need for high-quality data in production systems Only internally-produced data used for reporting Mostly internal users All reports developed by IT specialists
  • 6. 6 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 11 New Requirements Reporting and analytics requires real- time data External users, such as customers and suppliers Mixing of internal with external data Machine-generated data Self-service development of reports … Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 12 Operational Business Intelligence Web analytics: Which ad or product to present now Security: Face recognition real-time Factories: Changing machine settings based on real- time events Call Centers: Predict the chance of churning and predict which service or upgrade to offer Incorrect data can lead to the wrong reaction
  • 7. 7 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 13 The Chain is Too Long for Real‐time Reporting ETL ETLETL Source systems Data martsData warehouse Staging area Operational Analytics & reporting Too many steps and too much copying Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 14 The Chain is Too Long for Real‐time Reporting ETL ETLETL Source systems Data martsData warehouse Staging area Classic Analytics & reporting Operational BI reports
  • 8. 8 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 15 Customer‐Driven BI Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 16 Real‐Time Reporting for Customers
  • 9. 9 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 17 Real‐Time Analytics for Customers Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 18 High Data Quality is Crucial for Customer‐Driven BI
  • 10. 10 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 19 Streaming Data Producers of data Storage of streaming data Consumers of data Listener Listener Listener Listener Stream processor Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 20 Data Streaming for Operational BI ETL ETLETL Source systems Data martsStaging area Analytics & reporting Data warehouse Producers of data Consumers of data Stream processor ?
  • 11. 11 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 21 Self‐Service BI Continues Self-Service Data Visualization Self-Service Analytics Self-Service ETL Self-Service Data Preparation Self-Service … Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 22 Self‐Service Data Preparation Non-technical interface for studying data files Easy way of defining rules Data is fixed by defining filters, not by changing data in source systems Relationship with data blending Users are def ining t heir own dat a qualit y rules
  • 12. 12 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 23 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 24 Open Data is Available in Abundance
  • 13. 13 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 25 External Data Integration by IT? ETL ETLETL Source systems Data martsData warehouse Staging area Analytics & reporting Social media data Open data Spreadsheets ETL ETL ETL ? Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 26 External Data Integration by Users ETL ETLETL Source systems Data marts Data warehouse Staging area Self‐Service Analytics Social media data Open data Spreadsheets ?
  • 14. 14 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 27 Raising the Data Quality Bar Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 28 Option 1: Do Nothing
  • 15. 15 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 29 Option 2:  Old Technology For New Applications  Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 30 Option 3: Adopt New Technology,  but Stick to Old Ideas
  • 16. 16 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 31 Recommendations (1) Data quality is not only relevant for reporting and analytics Data has become a primary economic resource Data quality improves reporting results, but has operational business impact as well Poor data quality can be as damaging to an organization as other poor-quality resources Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 32 Recommendations (2) Presenting poor data quality to customers and suppliers will reflect poorly on an organization Poor data quality may lower trust in the organization
  • 17. 17 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 33 Recommendations (3) Move data quality checks upstream Develop new production systems with data quality checks built-in Use new architectures Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 34 ETL ETLETL Source systems Data martsStaging area Analytics & reporting Data warehouse Shortening the Chain ETLETL ETL
  • 18. 18 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 35 Recommendations (4) A dat a st rat egy is essential for implementing an adequate data quality program, not an option Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 36 What is Data Strategy? A single, unified, organization-wide plan … … for the use of corporate data … … as a vital asset for strategic and operational decision-making. Investing in a formal data strategy lends much needed intentionality around critical data related issues, such as data quality, metadata, performance, data distribution, organization, ownership, security, privacy, etc. Source: Capstone Consulting, January 2009
  • 19. 19 Copyright © 1991 - 2016 R20/Consultancy B.V., The Hague, The Netherlands 37 Data Quality