DWH-Ahsan Abdullah
Data Warehousing
Lecture-21
Introduction to Data Quality Management (DQM)
Virtual University of Pakistan
Ahsan Abdullah
Assoc. Prof. & Head
Center for Agro-Informatics Research
www.nu.edu.pk/cairindex.asp
National University of Computers & Emerging Sciences, Islamabad
Email: ahsan101@yahoo.com
Introduction to Data Quality Management (DQM)
What is Quality? Informally
Some things are better than others, i.e. they are of higher quality. How much “better” is better?
Is the right item the best item to purchase? How about after the purchase?
What is quality of service? The bank example
What is Quality? Formally
“Quality is conformance to requirements”
P. Crosby, “Quality is Free” 1979
“Degree of excellence”
Webster’s Third New International Dictionary
What is Quality? Examples from Auto Industry
Quality means meeting customer’s needs,
not necessarily exceeding them.
Quality means improving things customers
care about, because that makes their lives
easier and more comfortable.
Why an example from the auto industry?
What is Data Quality?
Muhammad Khan
Height = 5’8”
Weight = 160 lbs
Gender = Male
Age = 35 yrs
Emp_ID = 440
All data is an abstraction of something real
What is Data?
What is Data Quality?
Intrinsic Data Quality
Electronic reproduction of reality.
Realistic Data Quality
Degree of utility or value of data to business.
Data Quality & Organizations
Intelligent Learning Organization:
High-quality data is an open, shared resource with value-adding processes.
Dysfunctional Learning Organization:
Low-quality data is a proprietary resource with cost-adding processes.
Orr’s Laws of Data Quality
Law #1 - “Data that is not used cannot be correct!”
Law #2 - “Data quality is a function of its use, not its collection!”
Law #3 - “Data will be no better than its most stringent use!”
Law #4 - “Data quality problems increase with the age of the system!”
Law #5 - “The less likely something is to occur, the more traumatic it will be when it happens!”
Total Quality Management (TQM)
Philosophy of involving all for systematic and continuous improvement.
It is customer oriented. Why?
TQM incorporates the concepts of product quality, process control, quality assurance, and quality improvement.
Quality assurance is NOT quality improvement.
Co$t of fixing data quality
[Figure: cost of achieving quality (vertical axis) against quality level, from lowest to highest (horizontal axis), showing an exponential rise in cost.]
• Defect minimization is economical.
• Defect elimination is very, very expensive.
Co$t of Data Quality Defects
• Controllable Costs
  - Recurring costs for analyzing, correcting, and preventing data errors
• Resultant Costs
  - Internal and external failure costs of business opportunities missed.
• Equipment & Training Costs
Where is data quality critical?
Almost everywhere; some examples:
Marketing communications.
Customer matching.
Retail house-holding.
Combining MIS systems after acquisition.
Characteristics or Dimensions of Data Quality
Data Quality Characteristic | Definition
Accuracy | Qualitatively assessing lack of error, high accuracy corresponding to small error.
Completeness | The degree to which values are present in the attributes that require them.
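The two definitions above can be turned into simple ratios. The following is a minimal sketch, not a standard formula: it assumes accuracy is measured as the share of present values that exactly match an independently verified value, and the sample ages are hypothetical.

```python
from typing import Optional

# Hypothetical sample: recorded ages vs. independently verified ages.
# None marks a missing value in an attribute that requires one.
recorded = [35, 42, None, 29, 51]
verified = [35, 40, 33, 29, 51]


def completeness(values: list[Optional[int]]) -> float:
    """Fraction of required values that are present."""
    present = sum(v is not None for v in values)
    return present / len(values)


def accuracy(values: list[Optional[int]], truth: list[int]) -> float:
    """Fraction of present values that match the verified value (assumed measure)."""
    pairs = [(v, t) for v, t in zip(values, truth) if v is not None]
    correct = sum(v == t for v, t in pairs)
    return correct / len(pairs)


print(completeness(recorded))        # 4 of 5 values present
print(accuracy(recorded, verified))  # 3 of 4 present values correct
```

Measuring accuracy only over present values keeps the two dimensions separate, so a missing value lowers completeness without also counting as an error.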
Completeness Vs Accuracy
95% accurate and 100% complete
OR
100% accurate and 95% complete
Which is better?
Depends on (i) the data quality tolerances, (ii) the corresponding application, and (iii) the cost of achieving that data quality vs. (iv) the business value.
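One way to see why the answer depends on the application is to attach a cost to each kind of defect. The sketch below is illustrative only: the record count and the per-record costs of a wrong value and a missing value are hypothetical numbers, not figures from the lecture.

```python
def defect_cost(n_records: int, accuracy_pct: int, completeness_pct: int,
                cost_wrong: int, cost_missing: int) -> int:
    """Total cost of wrong and missing values under assumed per-record costs."""
    present = n_records * completeness_pct // 100
    missing = n_records - present
    wrong = present * (100 - accuracy_pct) // 100
    return wrong * cost_wrong + missing * cost_missing


# Option A: 95% accurate, 100% complete. Option B: 100% accurate, 95% complete.
# Application 1 (hypothetical): a wrong value hurts more than a missing one.
a1 = defect_cost(1000, 95, 100, cost_wrong=10, cost_missing=2)
b1 = defect_cost(1000, 100, 95, cost_wrong=10, cost_missing=2)

# Application 2 (hypothetical): a missing value hurts more than a wrong one.
a2 = defect_cost(1000, 95, 100, cost_wrong=1, cost_missing=5)
b2 = defect_cost(1000, 100, 95, cost_wrong=1, cost_missing=5)

print(a1, b1)  # option B is cheaper for application 1
print(a2, b2)  # option A is cheaper for application 2
```

With the same two data sets, the preferred option flips when the relative cost of wrong versus missing values changes, which is the point of the slide.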
Characteristics or Dimensions of Data Quality
Data Quality Characteristic | Definition
Consistency | A measure of the degree to which a set of data satisfies a set of constraints.
Timeliness | A measure of how current or up to date the data is.
Uniqueness | The state of being only one of its kind or being without an equal or parallel.
Interpretability | The extent to which data is in appropriate languages, symbols, and units, and the definitions are clear.
Accessibility | The extent to which data is available, or easily and quickly retrievable.
Objectivity | The extent to which data is unbiased, unprejudiced, and impartial.
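Three of these dimensions (consistency, timeliness, uniqueness) lend themselves to simple automated checks. The following is a minimal sketch under stated assumptions: the records, field names, constraint rules, and freshness threshold are all hypothetical and chosen only for illustration.

```python
from datetime import date

# Hypothetical employee records; field names are illustrative only.
records = [
    {"emp_id": 440, "age": 35, "gender": "Male", "updated": date(2024, 1, 10)},
    {"emp_id": 441, "age": -4, "gender": "Female", "updated": date(2023, 6, 1)},
    {"emp_id": 440, "age": 35, "gender": "Male", "updated": date(2024, 1, 10)},
]


def consistency(rows: list[dict]) -> float:
    """Share of rows that satisfy every constraint (assumed rules)."""
    def ok(r: dict) -> bool:
        return 0 <= r["age"] <= 120 and r["gender"] in {"Male", "Female"}
    return sum(ok(r) for r in rows) / len(rows)


def timeliness(rows: list[dict], as_of: date, max_age_days: int) -> float:
    """Share of rows updated within max_age_days before as_of."""
    fresh = sum((as_of - r["updated"]).days <= max_age_days for r in rows)
    return fresh / len(rows)


def uniqueness(rows: list[dict], key: str) -> float:
    """Ratio of distinct key values to total rows (1.0 means no duplicates)."""
    return len({r[key] for r in rows}) / len(rows)


print(consistency(records))                          # 2 of 3 rows pass the rules
print(timeliness(records, date(2024, 3, 1), 180))    # 2 of 3 rows are fresh
print(uniqueness(records, "emp_id"))                 # 2 distinct ids in 3 rows
```

Interpretability, accessibility, and objectivity are harder to score mechanically and usually need human judgment, so they are left out of this sketch.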

chapter 5.pptx: drainage and irrigation engineering
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 

Lecture 21

  • 1. DWH-Ahsan Abdullah 1 Data Warehousing Lecture-21: Introduction to Data Quality Management (DQM). Virtual University of Pakistan. Ahsan Abdullah, Assoc. Prof. & Head, Center for Agro-Informatics Research, www.nu.edu.pk/cairindex.asp, National University of Computers & Emerging Sciences, Islamabad. Email: ahsan101@yahoo.com
  • 2. Introduction to Data Quality Management (DQM)
  • 3. What is Quality? Informally. Some things are better than others, i.e. they are of higher quality. How much “better” is better? Is the right item the best item to purchase? How about after the purchase? What is quality of service? The bank example.
  • 4. What is Quality? Formally. “Quality is conformance to requirements” (P. Crosby, “Quality is Free”, 1979). “Degree of excellence” (Webster’s Third New International Dictionary).
  • 5. What is Quality? Examples from Auto Industry. Quality means meeting customer’s needs, not necessarily exceeding them. Quality means improving things customers care about, because that makes their lives easier and more comfortable. Why example from auto-industry?
  • 6. What is Data Quality? What is Data? All data is an abstraction of something real. Example: Muhammad Khan, Height = 5’8”, Weight = 160 lbs, Gender = Male, Age = 35 yrs, Emp_ID = 440.
  • 7. What is Data Quality? Intrinsic Data Quality: electronic reproduction of reality. Realistic Data Quality: degree of utility or value of data to business.
  • 8. Data Quality & Organizations. The intelligent learning organization: high-quality data is an open, shared resource with value-adding processes. The dysfunctional learning organization: low-quality data is a proprietary resource with cost-adding processes.
  • 9. Orr’s Laws of Data Quality. Law #1: “Data that is not used cannot be correct!” Law #2: “Data quality is a function of its use, not its collection!” Law #3: “Data will be no better than its most stringent use!” Law #4: “Data quality problems increase with the age of the system!” Law #5: “The less likely something is to occur, the more traumatic it will be when it happens!”
  • 10. Total Quality Management (TQM). A philosophy of involving everyone in systematic and continuous improvement. It is customer oriented. Why? TQM incorporates the concepts of product quality, process control, quality assurance, and quality improvement. Quality assurance is NOT quality improvement.
  • 11. Co$t of fixing data quality. [Figure: cost of achieving quality plotted against quality, from lowest quality to highest quality, showing an exponential rise in cost.] Defect minimization is economical. Defect elimination is very, very expensive.
  • 12. Co$t of Data Quality Defects. Controllable costs: recurring costs for analyzing, correcting, and preventing data errors. Resultant costs: internal and external failure costs of business opportunities missed. Equipment & training costs.
  • 13. Where is data quality critical? Almost everywhere. Some examples: marketing communications; customer matching; retail house-holding; combining MIS systems after an acquisition.
  • 14. Characteristics or Dimensions of Data Quality (characteristic: definition). Accuracy: qualitatively assessing lack of error, high accuracy corresponding to small error. Completeness: the degree to which values are present in the attributes that require them.
  • 15. Completeness vs. Accuracy. 95% accurate and 100% complete, OR 100% accurate and 95% complete: which is better? It depends on (i) the data quality tolerances, (ii) the corresponding application, and (iii) the cost of achieving that data quality vs. (iv) the business value.
  • 16. Characteristics or Dimensions of Data Quality (characteristic: definition). Consistency: a measure of the degree to which a set of data satisfies a set of constraints. Timeliness: a measure of how current or up to date the data is. Uniqueness: the state of being only one of its kind or being without an equal or parallel. Interpretability: the extent to which data is in appropriate languages, symbols, and units, and the definitions are clear. Accessibility: the extent to which data is available, or easily and quickly retrievable. Objectivity: the extent to which data is unbiased, unprejudiced, and impartial.
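
Three of the dimensions defined in the slides (completeness, uniqueness, consistency) lend themselves to simple ratios. The following is a minimal Python sketch, not something taken from the lecture: the function names, the sample records other than the Muhammad Khan row, and the age constraint are all hypothetical, and a missing value is assumed not to violate a constraint.

```python
from typing import Callable, Optional

Record = dict[str, Optional[object]]

# Hypothetical sample data; only the first row echoes the slide example.
REQUIRED = ["emp_id", "name", "age", "gender"]
records: list[Record] = [
    {"emp_id": 440, "name": "Muhammad Khan", "age": 35, "gender": "Male"},
    {"emp_id": 441, "name": "A. Example", "age": None, "gender": "Female"},
    {"emp_id": 441, "name": "B. Example", "age": 210, "gender": "Male"},
    {"emp_id": 443, "name": "C. Example", "age": 28, "gender": None},
]


def completeness(rows: list[Record], required: list[str]) -> float:
    """Fraction of required attribute slots that actually hold a value."""
    total = len(rows) * len(required)
    if total == 0:
        return 1.0
    present = sum(1 for r in rows for a in required if r.get(a) is not None)
    return present / total


def uniqueness(rows: list[Record], key: str) -> float:
    """Fraction of non-missing key values that are distinct."""
    values = [r.get(key) for r in rows if r.get(key) is not None]
    if not values:
        return 1.0
    return len(set(values)) / len(values)


def consistency(rows: list[Record],
                constraints: list[Callable[[Record], bool]]) -> float:
    """Fraction of records that satisfy every constraint."""
    if not rows:
        return 1.0
    ok = sum(1 for r in rows if all(c(r) for c in constraints))
    return ok / len(rows)


def plausible_age(r: Record) -> bool:
    """Assumed constraint: age, when present, lies between 0 and 120."""
    age = r.get("age")
    return age is None or (isinstance(age, int) and 0 <= age <= 120)


print(completeness(records, REQUIRED))        # 14 of 16 slots filled
print(uniqueness(records, "emp_id"))          # 3 distinct ids out of 4
print(consistency(records, [plausible_age]))  # 3 of 4 rows pass
```

Accuracy is left out on purpose: measuring it needs a trusted reference to compare against, which this sketch does not assume.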
