Data Quality
View Point
Maveric Systems
Overview
The quality of the data used in business operations is a critical component of enterprise success. With the
exponential rise in the ways data is generated and consumed, organizations are increasingly focusing on
ensuring data quality. Studies indicate that fewer than 50% of IT decision makers have confidence in
their organization’s data quality initiatives, although more than 90% acknowledge the growing importance and
volume of the data they will have to grapple with in the future.
This whitepaper aims to shed light on the importance of data quality and the means of assuring it, focusing
on key facets and tools. Ensuring data quality in today’s Big Data environment is not only necessary but
critical to an organization’s survival.
Data Quality Framework
A robust framework for addressing data quality needs to cover the following six focus areas: Profile, Cleanup,
Standardize, Monitor, Fix and Report.
Profile: Analyze the data to check conformance to the data quality dimensions outlined below.
• Completeness: A field-level check (e.g., missing data) or data-level check (e.g., a missing line item for an
order) to ensure that all requisite details are available and that key fields are populated with relevant data
• Conformity: Validation that the data conforms to the required formats, i.e., that values match the
attributes and structure of the target field
• Consistency: A check that values in one data set are consistent with the same values in another data set. This
can be within a table (e.g., inconsistency between the Title and Gender columns) or across tables (e.g., aggregates
and parent/child relationships)
• Accuracy: A check that the data correctly represents what it is intended to describe. Inaccuracy may arise
from typos, the use of acronyms or the untimely arrival of data
• Duplication: A check at field level or record level to ensure that the same data is not stored more than once
within a table
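The profiling dimensions above can be sketched as simple checks. The following is a minimal Python illustration, not the whitepaper's own tooling; the sample records, the field names, the six-digit postcode format and the Title-to-Gender mapping are all assumptions made for the example. Accuracy is left out because it usually requires comparison against a trusted reference source.

```python
import re
from collections import Counter

# Hypothetical sample records; the field names are assumptions for illustration.
records = [
    {"id": 1, "title": "Mr", "gender": "M", "postcode": "600032"},
    {"id": 2, "title": "Ms", "gender": "", "postcode": "60003"},
    {"id": 3, "title": "Mr", "gender": "F", "postcode": "600017"},
    {"id": 1, "title": "Mr", "gender": "M", "postcode": "600032"},
]

POSTCODE_PATTERN = re.compile(r"\d{6}")  # assumed format: exactly six digits
TITLE_TO_GENDER = {"Mr": "M", "Ms": "F", "Mrs": "F"}  # assumed mapping


def profile(rows):
    """Return the ids of rows that fail each data quality dimension."""
    id_counts = Counter(row["id"] for row in rows)
    return {
        # Completeness: a required field is empty.
        "completeness": [r["id"] for r in rows if not r["gender"]],
        # Conformity: the value does not match the required format.
        "conformity": [r["id"] for r in rows
                       if not POSTCODE_PATTERN.fullmatch(r["postcode"])],
        # Consistency: Title and Gender disagree within the same row.
        "consistency": [r["id"] for r in rows
                        if r["gender"]
                        and TITLE_TO_GENDER.get(r["title"]) != r["gender"]],
        # Duplication: the same id is stored more than once.
        "duplication": sorted(i for i, n in id_counts.items() if n > 1),
    }


report = profile(records)
```

Each entry in `report` lists the record ids that failed one dimension, which is the kind of output a later monitoring or reporting step could consume.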
Cleanup: Revamp existing data to improve its quality, using the following methods:
• Lookup: Identify a clean data set and use it as a lookup to clean up the data. The lookup may be within a table
(e.g., a Gender column holding values like M, Male, Mr, etc.) or against another table (a master table or a
manually created lookup table). The same method can be used to fill in missing values (e.g., populating the
Gender column based on Title, or vice versa)
• Auto/Manual: Create scripts based on requirements to fix the data issues. However, certain data issues cannot
be fixed by script and must be handled manually, e.g., when mandatory columns are empty but the data is stored
in a free-text column such as a ‘Comments’ column
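As an illustration of the lookup method, the sketch below standardizes a Gender column against a small lookup table and falls back to the Title column when Gender is missing. The lookup contents and the column names are assumptions for the example, not values taken from the whitepaper.

```python
# Assumed lookup table mapping raw values to a clean, canonical code.
GENDER_LOOKUP = {"m": "M", "male": "M", "mr": "M",
                 "f": "F", "female": "F", "ms": "F", "mrs": "F"}


def clean_gender(row):
    """Return a copy of row with Gender standardized or filled from Title."""
    cleaned = dict(row)
    raw = (row.get("gender") or "").strip().lower()
    if raw in GENDER_LOOKUP:
        cleaned["gender"] = GENDER_LOOKUP[raw]
    else:
        # Missing or unrecognized value: fall back to the Title column.
        title = (row.get("title") or "").strip().rstrip(".").lower()
        cleaned["gender"] = GENDER_LOOKUP.get(title)  # None if still unknown
    return cleaned


rows = [
    {"title": "Mr", "gender": "Male"},
    {"title": "Mrs.", "gender": ""},
    {"title": "Dr", "gender": None},
]
cleaned_rows = [clean_gender(r) for r in rows]
```

Rows that remain `None` after the lookup (the "Dr" row here) are the ones that would need manual attention.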
Standardize: Regulate the data to improve accuracy through the following:
• Geo-coding: Ensure that addresses conform to geographical standards, for example by standardizing
address columns to meet postal rules (e.g., USPS for US addresses) and by obtaining and storing the latitude and
longitude of the physical location from the standardized address
• Matching/Linking: At times, the full details of a field can only be assembled from multiple data sources, or
duplicate data may closely resemble an existing value without matching it exactly. To address these issues
and create accurate data, we need business logic that matches such data and removes the
anomalies
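One simple way to express such matching logic is approximate string comparison. The sketch below uses `difflib.SequenceMatcher` from the Python standard library; the normalization rules, the 0.85 threshold and the sample names are assumptions for illustration, and production matching typically needs domain-specific rules on top.

```python
from difflib import SequenceMatcher


def normalize(name):
    """Lowercase the name and replace punctuation with single spaces."""
    kept = "".join(ch if ch.isalnum() else " " for ch in name.lower())
    return " ".join(kept.split())


def find_near_duplicates(names, threshold=0.85):
    """Return index pairs whose normalized similarity meets the threshold."""
    normalized = [normalize(n) for n in names]
    pairs = []
    for i in range(len(normalized)):
        for j in range(i + 1, len(normalized)):
            score = SequenceMatcher(None, normalized[i], normalized[j]).ratio()
            if score >= threshold:
                pairs.append((i, j))
    return pairs


# Hypothetical customer names.
customers = ["Acme Bank Ltd.", "acme  bank ltd", "Globex Finance"]
matches = find_near_duplicates(customers)
```

The pairwise loop is quadratic in the number of records, so for large tables the candidates are usually narrowed first (for example, by postcode) before comparing.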
Monitor: Establish systems or business processes to track the quality of incoming data. Tracking
mechanisms include the following:
• Business rules: Create quality checks based on business rules (e.g., daily transactions cannot exceed a
specific amount per account), execute the checks periodically in line with requirements, and raise an alert
when a check fails
• Trending/Threshold: Analyze the data to identify its trend (e.g., the number of customers created per day should
be between ‘x’ and ‘y’, or the sum of all loan amounts disbursed on a day should be between ‘x’ and ‘y’). Quality
checks can then be built from this analysis and run periodically as required. The boundary values used for
the trend checks also need to be revised periodically, based on the patterns observed
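The two monitoring mechanisms can be sketched as follows. The limit of 10,000, the range of 50 to 200 (standing in for ‘x’ and ‘y’), the record layout and the alert wording are all assumptions made for this example.

```python
from collections import defaultdict

DAILY_LIMIT = 10_000            # assumed per-account daily transaction limit
NEW_CUSTOMER_RANGE = (50, 200)  # assumed lower and upper bounds ('x' and 'y')


def check_daily_limit(transactions, limit=DAILY_LIMIT):
    """Business rule: the total per account per day must not exceed the limit."""
    totals = defaultdict(float)
    for txn in transactions:
        totals[(txn["account"], txn["date"])] += txn["amount"]
    return sorted(key for key, total in totals.items() if total > limit)


def check_threshold(value, bounds):
    """Trending/threshold rule: the value must fall within [low, high]."""
    low, high = bounds
    return low <= value <= high


# Hypothetical transactions for one day.
transactions = [
    {"account": "A1", "date": "2024-01-05", "amount": 6_000},
    {"account": "A1", "date": "2024-01-05", "amount": 5_000},
    {"account": "B2", "date": "2024-01-05", "amount": 9_500},
]

alerts = []
for account, date in check_daily_limit(transactions):
    alerts.append(f"Daily limit exceeded: account {account} on {date}")
if not check_threshold(12, NEW_CUSTOMER_RANGE):
    alerts.append("New customer count outside expected range")
```

Keeping the bounds in named constants (or a configuration table) makes it easier to revise them periodically as the observed patterns change.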
Fix: Clean up existing data to improve quality. This has to be done manually: for each data quality failure
alert, analyze the affected data set and identify the root cause (missing data, data duplication, data
corruption, incorrectly implemented business logic, etc.). Before doing so, check whether the alert
is a false alarm or genuinely requires attention.
Report: Build a reporting system that gives a better understanding of the quality of data, using the following means:
• Dashboard: Shows the quality checks being run, their execution outcomes, and the issues
identified and resolved. In addition to the current status, it also provides information on trends over a
period of time
• Email: An email-based alert system should be established to deliver the results of each data quality check
execution. The system should be configurable at various levels
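A minimal sketch of the email alert, assuming check results arrive as a list of dictionaries with `check` and `passed` keys (a hypothetical layout). It only builds the message with the standard library's `email.message.EmailMessage`; the addresses are placeholders, and sending (for example via `smtplib`) is left out.

```python
from email.message import EmailMessage


def build_alert_email(results, sender, recipient):
    """Build (but do not send) a summary email for a run of quality checks."""
    failed = [r for r in results if not r["passed"]]
    message = EmailMessage()
    message["Subject"] = (
        f"Data quality: {len(failed)} of {len(results)} checks failed"
    )
    message["From"] = sender
    message["To"] = recipient
    # One line per check, with failures clearly marked.
    lines = [
        f"{'PASS' if r['passed'] else 'FAIL'}  {r['check']}" for r in results
    ]
    message.set_content("\n".join(lines))
    return message


# Hypothetical results from one execution of the checks.
results = [
    {"check": "daily_limit", "passed": False},
    {"check": "new_customer_count", "passed": True},
]
alert_email = build_alert_email(
    results, "dq-monitor@example.com", "data-team@example.com"
)
```

The same `results` list could feed a dashboard, so that the email and the dashboard always report from one source.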
ABOUT MAVERIC
Started in 2000, Maveric Systems is a leading provider of IT Lifecycle Assurance with expertise across
requirements to release. With a strong focus on the Banking and Telecom sectors, Maveric has built a business on
the principles of deep domain expertise and innovation. Maveric’s client portfolio includes a wide array of
renowned banks, financial institutions, insurance companies, leading software product companies and telecom
companies.
Maveric Systems Limited (Corporate Office): “Lords Tower” Block 1, 2nd Floor, Plot No 1&2 NP, Jawaharlal Nehru
Road, Thiru Vi Ka Industrial Estate, Ekkatuthangal, Chennai 600032
India | Singapore | Saudi Arabia | UAE | UK | USA
Write to us at info@maveric-systems.com | www.maveric-systems.com
Data Quality Maturity Model & Tools
Using the data quality framework, organizations can implement and execute initiatives that have
a profound effect on the quality of data. However, organizations need to be aware of their current level of data
quality maturity and take steps to move towards higher levels. The maturity model
provides a snapshot of how data quality maturity initiatives evolve.
An organization needs to move towards the final stage, the managed and optimizing level of
data quality. A plethora of tools, manual approaches and customized solutions are available to help firms move
from one maturity stage to the next. These include profiling and checking tools such as SQL scripting, UI and
dashboard technologies such as HTML, CSS and PHP, and third-party end-to-end solutions such as Informatica Data
Quality, IBM QualityStage, Talend Data Quality and Open Source Data Quality.
Conclusion
Ascertaining and ensuring the quality of data is paramount for business success. With personalized customer
outreach mechanisms growing in prominence, dependence on data to understand customer preferences and
buying behavior has become more important than ever. Add to this the exponential increase in the amount of data
generated across multiple devices and access mechanisms, and the importance of data quality cannot be
overemphasized. The framework and maturity model presented in this article serve only as a guide for becoming
a more data-quality-focused organization. Understanding the importance of data quality, assessing the current
level of data quality maturity and taking steps to move towards the highest level all depend on the
individual organization. Businesses that understand this and move on to execution are the ones that will succeed
in today’s highly competitive landscape.

  • 2. 2Maveric Systems Overview Quality of data for business operations is considered to be a critical component of enterprise success. With the exponential rise in ways and means by which data is generated and consumed, organizations are more and more focusing on ensuring data quality. Studies indicate that fewer than 50% of IT decision makers have confidence in their organization’s data quality initiatives, although more than 90% acknowledge the growing importance and volumes of data that they have to grapple with in future. This whitepaper aims to throw light on the importance of data quality and means to assure quality of data, focusing on key facets and tools. Ensuring data quality in today’s environment of Big data is not only necessary but also critical to an organization’s survival. Data Quality Data Quality Framework A robust framework that can address data quality needs to look at the following 6 focus areas. Profile: Analyze the data to ensure conformance to data quality dimensions outlined below  Completeness: Field level (e.g., missing data) or data level (e.g., missing line item for an order) check to ensure all requisite details are available. This will ensure that the key fields are populated with relevant data  Conformity: Validation to ensure whether the data conforms to the required formats i.e. whether values are correlating with the attributes and structure of the target field  Consistency: Check to ensure values in one data set are consistent with the same values in another data set. This can be within table (e.g., inconsistency between Title and Gender columns) or across tables (e.g., aggregates and parent/child relationship)  Accuracy: Aim to ensure the data represents its intended purpose/requirement. Data inaccuracy may happen due to typo, using acronyms or untimely arrival of data  Duplication: Check carried out at field level or record level to ensure same data set is not stored more than once within a table
  • 3. Cleanup: Revamp existing data to improve data quality using the following methods:
    - Lookup: Identify a clean data set and use it as a lookup to clean up the data. The lookup may be within a table (e.g., a Gender column holding values like M, Male, Mr, etc.) or against another table (a master table or a manually created lookup table). This method can also be used to fill in missing values (e.g., populating the Gender column based on Title, or vice versa).
    - Auto/Manual: Create scripts based on requirements to fix the data issues. However, some data issues cannot be fixed by either method, for example when a mandatory column is empty but the data is held in a free-text column such as ‘Comments’.

    Standardize: Regulate the data to improve accuracy through the following:
    - Geo-coding: Ensure conformance to geographical addresses by standardizing address columns to meet postal rules (e.g., USPS for US addresses), then obtaining and storing the latitude and longitude of the physical location from the standardized address.
    - Matching/Linking: Details about a field may be available from multiple data sources, or duplicate data may exist that does not match an existing value exactly but relates closely to it. To address these issues and create accurate data, business logic is needed that matches such data and removes anomalies.

    Monitor: Establish systems or business processes to track the quality of incoming data. Tracking mechanisms include the following:
    - Business rules: Create quality checks based on business rules (e.g., the daily transaction total cannot exceed a specific amount per account), execute the checks periodically in line with requirements, and raise an alert when a check fails.
    - Trending/Threshold: Analyze the data to identify its trend (e.g., daily customer creation should be between ‘x’ and ‘y’, or the sum of all loan amounts disbursed in a day should be between ‘x’ and ‘y’). Quality checks can then be created from this analysis and run periodically as required. The boundary values should also be revised periodically based on the patterns observed.

    Fix: Clean up existing data to improve quality. This has to be done manually: for each data quality failure alert, analyze the data set and identify the root cause (missing data, data duplication, data corruption, incorrectly implemented business logic, etc.). Before investigating, confirm whether the alert is a false alarm or genuinely requires attention.

    Report: Build a reporting system to give a better understanding of the quality of data, using the following means:
    - Dashboard: Provides details of the quality checks being run, their execution outcomes, and the issues identified and resolved. In addition to the current status, it shows trends over a period of time.
    - Email: An email-based alert system should be established to report the results of each data quality check execution. The system should be configurable at various levels.
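As a rough illustration of the Lookup cleanup and the two Monitor mechanisms described above, the following sketch uses plain Python. The lookup table contents, function names, transaction tuple layout, and limit values are all hypothetical; a real implementation would read from the organization's own master tables and alerting system.

```python
# Assumed lookup table mapping variant spellings to a canonical Gender value.
GENDER_LOOKUP = {
    "m": "M", "male": "M", "mr": "M",
    "f": "F", "female": "F", "ms": "F", "mrs": "F",
}


def clean_gender(value, title=None):
    """Lookup cleanup: standardize Gender, falling back to Title if it is missing."""
    for candidate in (value, title):
        key = (candidate or "").strip().lower().rstrip(".")
        if key in GENDER_LOOKUP:
            return GENDER_LOOKUP[key]
    return None  # cannot be fixed by lookup; needs manual review


def check_daily_limit(transactions, limit):
    """Business rule: flag (account, day) pairs whose daily total exceeds the limit.

    `transactions` is an iterable of (account, day, amount) tuples (assumed layout).
    """
    totals = {}
    for account, day, amount in transactions:
        totals[(account, day)] = totals.get((account, day), 0) + amount
    return sorted(key for key, total in totals.items() if total > limit)


def check_threshold(value, low, high):
    """Trending/threshold: True when the observed value lies within [low, high]."""
    return low <= value <= high


if __name__ == "__main__":
    print(clean_gender("Male"))              # standardized via lookup
    print(clean_gender("", title="Mrs."))    # filled in from Title
    txns = [("A1", "2024-01-01", 600), ("A1", "2024-01-01", 500), ("A2", "2024-01-01", 300)]
    for account, day in check_daily_limit(txns, limit=1000):
        print(f"ALERT: {account} exceeded the daily limit on {day}")
    if not check_threshold(250, low=100, high=200):
        print("ALERT: daily customer creation outside the expected range")
```

The `low` and `high` bounds stand in for the ‘x’ and ‘y’ values in the text and would be recalculated periodically from observed trends.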
  • 4. ABOUT MAVERIC

    Started in 2000, Maveric Systems is a leading provider of IT Lifecycle Assurance with expertise across requirements to release. With a strong focus on the Banking and Telecom sectors, Maveric has built a business on the principles of deep domain expertise and innovation. Maveric’s client portfolio includes a wide array of renowned banks, financial institutions, insurance companies, leading software product companies and telecom companies.

    Maveric Systems Limited (Corporate Office): “Lords Tower” Block 1, 2nd Floor, Plot No 1&2 NP, Jawaharlal Nehru Road, Thiru Vi Ka Industrial Estate, Ekkatuthangal, Chennai 600032
    India | Singapore | Saudi Arabia | UAE | UK | USA
    Write to us at info@maveric-systems.com | www.maveric-systems.com

    Data Quality Maturity Model & Tools

    Using the data quality framework, organizations can implement and execute initiatives that have a profound effect on the quality of their data. However, organizations need to be aware of their current level of data quality maturity and take steps to move towards higher levels. The maturity model outlined below provides a snapshot of the evolution of data quality maturity initiatives. An organization needs to move towards the final stage, the managed and optimizing level of data quality.

    A plethora of tools, manual approaches and customized solutions are available to help firms move from one maturity stage to the next. These include profiling and checking tools like SQL scripting; UI and dashboard technologies like HTML, CSS and PHP; and various third-party end-to-end solutions like Informatica Data Quality, IBM QualityStage, Talend Data Quality and Open Source Data Quality.

    Conclusion

    Ascertaining and ensuring the quality of data is paramount for business success. With personalized customer outreach mechanisms increasing in prominence, dependence on data to understand customer preferences and buying behavior has become more important than ever. Add to this the exponential increase in the amount of data generated across multiple devices and access mechanisms, and the importance of data quality cannot be overemphasized.

    The framework and maturity model provided in this article serve only as a guide for becoming a more data-quality-focused organization. Understanding the importance of data quality, assessing the current level of data quality maturity, and taking steps to move towards the highest level all depend on the individual organization. Businesses that understand this and move on to execution are the ones that will succeed in today’s highly competitive landscape.