SlideShare une entreprise Scribd logo
1  sur  14
Presented by Gopalakrishnan K
KG Data Solutions
gopalk@kgds.org
 What Is A Data Warehouse?
 History
 Current scenario
 Characteristics
 Operational Database vs. Data Warehouse
 Architecture
 Data Model
Gopal K KGDS
 The term "data warehouse" refers to a special
type of database that acts as the central
repository for company data. It can be thought of
as a database archive that is segregated from the
operational databases, and used primarily for
reporting and data mining purposes.
 The relational database revolution in the early
1980s ushered in an era of improved access to
the valuable information contained deep within
data. Still improvements were needed.
 It was soon discovered that databases modeled
to be efficient at transactional processing were
not always optimized for complex reporting or
analytical needs
 Inmon champions the large centralized Data Warehouse approach
leveraging solid relational design principles. His Corporate
Information Factory remains an example of this "top down"
philosophy.
 Kimball, on the other hand, favors the development of individual
data marts at the departmental level that get integrated together
using the Information Bus architecture. This "bottom up" approach
dovetails nicely with Kimball's preference for star-schema modeling
Many of the current changes in today's data industry also affect Data
Warehousing. Cloud storage and high-velocity, real-time data analysis
being two obvious factors playing a role in the practice's evolution. On
the end-user side, web-based and mobile access to decision support or
reporting data is a major requirement on many projects. Advances in
the practice of ontology have enhanced the capabilities of ETL systems
to parse information out of unstructured as well as structured data
sources
 Subject-oriented
The data in the database is organized so that all the data elements
relating to the same real-world event or object are linked together.
 Time-variant
The changes to the data in the database are tracked and recorded
so that reports can be produced showing changes over time.
 Non-volatile
Data in the database is never over-written or deleted. Once
committed, the data is static, read-only, but retained for future
reporting.
 Integrated
The database contains data from most or all of an organization's
operational applications, and that this data is made consistent.
 The processing load of reporting reduced the
response time of the operational systems.
 The database designs of operational systems
were not optimized for information analysis and
reporting.
 Most organizations had more than one
operational system, so company-wide reporting
could not be supported from a single system.
 Development of reports in operational systems
often required writing specific computer
programs which was slow and expensive.
 Consolidation of data from a wide variety of data
sources.
 Ability to analyze data beyond the level of
standard monitoring reports.
 Operational response time unaffected.
Data warehouse presentation
Data warehouse presentation
Data warehouse presentation

Contenu connexe

Tendances

Data warehouseconceptsandarchitecture
Data warehouseconceptsandarchitectureData warehouseconceptsandarchitecture
Data warehouseconceptsandarchitecture
samaksh1982
 
Hds ucp sap hana infographic v6[1]
Hds ucp  sap hana infographic v6[1]Hds ucp  sap hana infographic v6[1]
Hds ucp sap hana infographic v6[1]
Barbara Götz
 
Data flow in Extraction of ETL data warehousing
Data flow in Extraction of ETL data warehousingData flow in Extraction of ETL data warehousing
Data flow in Extraction of ETL data warehousing
Dr. Dipti Patil
 

Tendances (20)

data warehousing
data warehousingdata warehousing
data warehousing
 
Managing Data Integration Initiatives
Managing Data Integration InitiativesManaging Data Integration Initiatives
Managing Data Integration Initiatives
 
Datos iO Product Overview
Datos iO Product OverviewDatos iO Product Overview
Datos iO Product Overview
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
 
Data warehouseconceptsandarchitecture
Data warehouseconceptsandarchitectureData warehouseconceptsandarchitecture
Data warehouseconceptsandarchitecture
 
Sridhar-Profile-0117
Sridhar-Profile-0117Sridhar-Profile-0117
Sridhar-Profile-0117
 
7 data warehouse & marts
7 data warehouse & marts7 data warehouse & marts
7 data warehouse & marts
 
How Yellowbrick Data Integrates to Existing Environments Webcast
How Yellowbrick Data Integrates to Existing Environments WebcastHow Yellowbrick Data Integrates to Existing Environments Webcast
How Yellowbrick Data Integrates to Existing Environments Webcast
 
Denodo DataFest 2017: Business Needs for a Fast Data Strategy
Denodo DataFest 2017: Business Needs for a Fast Data StrategyDenodo DataFest 2017: Business Needs for a Fast Data Strategy
Denodo DataFest 2017: Business Needs for a Fast Data Strategy
 
The Big Data Analytics Ecosystem at LinkedIn
The Big Data Analytics Ecosystem at LinkedInThe Big Data Analytics Ecosystem at LinkedIn
The Big Data Analytics Ecosystem at LinkedIn
 
Data mining and data warehousing
Data mining and data warehousingData mining and data warehousing
Data mining and data warehousing
 
Hds ucp sap hana infographic v6[1]
Hds ucp  sap hana infographic v6[1]Hds ucp  sap hana infographic v6[1]
Hds ucp sap hana infographic v6[1]
 
IT Category Purchasing Managers Opportunity for Savings with Non Relational S...
IT Category Purchasing Managers Opportunity for Savings with Non Relational S...IT Category Purchasing Managers Opportunity for Savings with Non Relational S...
IT Category Purchasing Managers Opportunity for Savings with Non Relational S...
 
Gartner Cool Vendor Report 2014
Gartner Cool Vendor Report 2014Gartner Cool Vendor Report 2014
Gartner Cool Vendor Report 2014
 
Etl elt simplified
Etl elt simplifiedEtl elt simplified
Etl elt simplified
 
Data warehouse
Data warehouse Data warehouse
Data warehouse
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Data flow in Extraction of ETL data warehousing
Data flow in Extraction of ETL data warehousingData flow in Extraction of ETL data warehousing
Data flow in Extraction of ETL data warehousing
 
Taming the ETL beast: How LinkedIn uses metadata to run complex ETL flows rel...
Taming the ETL beast: How LinkedIn uses metadata to run complex ETL flows rel...Taming the ETL beast: How LinkedIn uses metadata to run complex ETL flows rel...
Taming the ETL beast: How LinkedIn uses metadata to run complex ETL flows rel...
 
Why shift from ETL to ELT?
Why shift from ETL to ELT?Why shift from ETL to ELT?
Why shift from ETL to ELT?
 

En vedette

Jennifer_Goldberg_QuittingFeature
Jennifer_Goldberg_QuittingFeatureJennifer_Goldberg_QuittingFeature
Jennifer_Goldberg_QuittingFeature
Jennifer Goldberg
 
Social Media position paper
Social Media position paperSocial Media position paper
Social Media position paper
Shelly Lawrence
 
Managing warehouse operations. How to manage and run warehouse operations by ...
Managing warehouse operations. How to manage and run warehouse operations by ...Managing warehouse operations. How to manage and run warehouse operations by ...
Managing warehouse operations. How to manage and run warehouse operations by ...
Omar Youssef
 

En vedette (20)

Immigrate to canada from hyderabad
Immigrate to canada from hyderabadImmigrate to canada from hyderabad
Immigrate to canada from hyderabad
 
Qué es una webquest
Qué es una webquest Qué es una webquest
Qué es una webquest
 
Jennifer_Goldberg_QuittingFeature
Jennifer_Goldberg_QuittingFeatureJennifer_Goldberg_QuittingFeature
Jennifer_Goldberg_QuittingFeature
 
Gestão empreendedora de mídia social
Gestão empreendedora de mídia socialGestão empreendedora de mídia social
Gestão empreendedora de mídia social
 
Presentation zse drukarska-karolina&dominika
Presentation zse drukarska-karolina&dominikaPresentation zse drukarska-karolina&dominika
Presentation zse drukarska-karolina&dominika
 
Dicas e truques no photoshop
Dicas e truques no photoshopDicas e truques no photoshop
Dicas e truques no photoshop
 
ashutosh_1401
ashutosh_1401ashutosh_1401
ashutosh_1401
 
Syphilis congenital
Syphilis congenital Syphilis congenital
Syphilis congenital
 
Volume benda putar cincin untuk diupload di slide share
Volume benda putar cincin untuk diupload di slide shareVolume benda putar cincin untuk diupload di slide share
Volume benda putar cincin untuk diupload di slide share
 
Творческий проект "Кулинарные истории"Телегиной Е.
Творческий проект "Кулинарные истории"Телегиной Е.Творческий проект "Кулинарные истории"Телегиной Е.
Творческий проект "Кулинарные истории"Телегиной Е.
 
Творческий проект "Хранение информации"
Творческий проект "Хранение информации"Творческий проект "Хранение информации"
Творческий проект "Хранение информации"
 
"Кулайка"
 "Кулайка" "Кулайка"
"Кулайка"
 
Творческая работа "Мы за ЗОЖ!"
Творческая работа "Мы за ЗОЖ!"Творческая работа "Мы за ЗОЖ!"
Творческая работа "Мы за ЗОЖ!"
 
Творческий проект "Кулинарный поединок"
Творческий проект "Кулинарный поединок"Творческий проект "Кулинарный поединок"
Творческий проект "Кулинарный поединок"
 
Творческая работа "Реки на просторах Томской области"
Творческая работа "Реки на просторах Томской области"Творческая работа "Реки на просторах Томской области"
Творческая работа "Реки на просторах Томской области"
 
Sampling - Stratified vs Cluster
Sampling - Stratified vs ClusterSampling - Stratified vs Cluster
Sampling - Stratified vs Cluster
 
Come un territorio diventa creativo. Una lezione veneziana 11 01 17 Andrea Po...
Come un territorio diventa creativo. Una lezione veneziana 11 01 17 Andrea Po...Come un territorio diventa creativo. Una lezione veneziana 11 01 17 Andrea Po...
Come un territorio diventa creativo. Una lezione veneziana 11 01 17 Andrea Po...
 
Parametric vs Non-Parametric
Parametric vs Non-ParametricParametric vs Non-Parametric
Parametric vs Non-Parametric
 
Social Media position paper
Social Media position paperSocial Media position paper
Social Media position paper
 
Managing warehouse operations. How to manage and run warehouse operations by ...
Managing warehouse operations. How to manage and run warehouse operations by ...Managing warehouse operations. How to manage and run warehouse operations by ...
Managing warehouse operations. How to manage and run warehouse operations by ...
 

Similaire à Data warehouse presentation

Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Material
obieefans
 

Similaire à Data warehouse presentation (20)

Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 
Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Material
 
Modern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleModern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | Qubole
 
Top 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdfTop 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdf
 
Data Warehouse Basic Guide
Data Warehouse Basic GuideData Warehouse Basic Guide
Data Warehouse Basic Guide
 
Benefits of a data lake
Benefits of a data lake Benefits of a data lake
Benefits of a data lake
 
DW 101
DW 101DW 101
DW 101
 
Oracle sql plsql & dw
Oracle sql plsql & dwOracle sql plsql & dw
Oracle sql plsql & dw
 
TOPIC 9 data warehousing and data mining.pdf
TOPIC 9 data warehousing and data mining.pdfTOPIC 9 data warehousing and data mining.pdf
TOPIC 9 data warehousing and data mining.pdf
 
data warehousing and data mining (1).pdf
data warehousing and data mining (1).pdfdata warehousing and data mining (1).pdf
data warehousing and data mining (1).pdf
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
A Comparitive Study Of ETL Tools
A Comparitive Study Of ETL ToolsA Comparitive Study Of ETL Tools
A Comparitive Study Of ETL Tools
 
Fbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_servicesFbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_services
 
The technology of the business data lake
The technology of the business data lakeThe technology of the business data lake
The technology of the business data lake
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
BI Architecture in support of data quality
BI Architecture in support of data qualityBI Architecture in support of data quality
BI Architecture in support of data quality
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 
Data processing in Industrial Systems course notes after week 5
Data processing in Industrial Systems course notes after week 5Data processing in Industrial Systems course notes after week 5
Data processing in Industrial Systems course notes after week 5
 
Implementation of Data Marts in Data ware house
Implementation of Data Marts in Data ware houseImplementation of Data Marts in Data ware house
Implementation of Data Marts in Data ware house
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining for
 

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Dernier (20)

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 

Data warehouse presentation

  • 1. Presented by Gopalakrishnan K KG Data Solutions gopalk@kgds.org
  • 2.  What Is A Data Warehouse?  History  Current scenario  Characteristics  Operational Database vs. Data Warehouse  Architecture  Data Model Gopal K KGDS
  • 3.  The term "data warehouse" refers to a special type of database that acts as the central repository for company data. It can be thought of as a database archive that is segregated from the operational databases, and used primarily for reporting and data mining purposes.
  • 4.  The relational database revolution in the early 1980s ushered in an era of improved access to the valuable information contained deep within data. Still improvements were needed.  It was soon discovered that databases modeled to be efficient at transactional processing were not always optimized for complex reporting or analytical needs
  • 5.  Inmon champions the large centralized Data Warehouse approach leveraging solid relational design principles. His Corporate Information Factory remains an example of this "top down" philosophy.  Kimball, on the other hand, favors the development of individual data marts at the departmental level that get integrated together using the Information Bus architecture. This "bottom up" approach dovetails nicely with Kimball's preference for star-schema modeling
  • 6. Many of the current changes in today's data industry also affect Data Warehousing. Cloud storage and high-velocity, real-time data analysis being two obvious factors playing a role in the practice's evolution. On the end-user side, web-based and mobile access to decision support or reporting data is a major requirement on many projects. Advances in the practice of ontology have enhanced the capabilities of ETL systems to parse information out of unstructured as well as structured data sources
  • 7.  Subject-oriented The data in the database is organized so that all the data elements relating to the same real-world event or object are linked together.  Time-variant The changes to the data in the database are tracked and recorded so that reports can be produced showing changes over time.
  • 8.  Non-volatile Data in the database is never over-written or deleted. Once committed, the data is static, read-only, but retained for future reporting.  Integrated The database contains data from most or all of an organization's operational applications, and that this data is made consistent.
  • 9.  The processing load of reporting reduced the response time of the operational systems.  The database designs of operational systems were not optimized for information analysis and reporting.
  • 10.  Most organizations had more than one operational system, so company-wide reporting could not be supported from a single system.  Development of reports in operational systems often required writing specific computer programs which was slow and expensive.
  • 11.  Consolidation of data from a wide variety of data sources.  Ability to analyze data beyond the level of standard monitoring reports.  Operational response time unaffected.