SlideShare une entreprise Scribd logo
1  sur  21
PRESENTED BY

GANESH DHARESHWAR
ABHIJAN GHOSH
What is data warehousing?
 data warehouse is a
 database used for reporting
 and analysis
 Integrated collection of
  ENTERPRISE-WIDE DATA, oriented to
  decision making
 Provides strategic information
 Performing Information analysis that could
  not done by operating system
Need for data warehousing
Maintain data history
Even if the source transaction systems do not.
 Integrate data from multiple source systems,
Improve data quality by providing consistent
 codes and descriptions
Provides a flexible, conducive and interactive
 source of strategic information
Performing Information analysis that could not
 done by operating system
Data Rich, but Information Poor
• Data is stored, not explored :
  by its volume and complexity
  it represents a burden,
  not a support
• Data overload results in
  uninformed decisions,
  contradictory information,
  higher overhead,
  wrong decisions,
  increased costs
• Data is not designed and
  is not structured for
  successful management
  decision making
Improving Decision Making

                  Decisions




             DataInformation
                  Warehouse




                    Data



                               5
Operational data stores
Data focuses on transaction functions
 such as bank card withdrawals
  and deposits
It is organised by application         ODS
 It contains the current values
It supports day-to-day operational decision
 supports information
 it is detailed , nonredundant and updateable
Informational data stores
 Itis organised around subject
 such as customer, product
It is
 summarized, archived, derived
Data is static until refreshed
Data is nonupdateable
Difference between operational
    &informational data stores
                    Operational                 Informational
                    Data                        data
Data content        Current value               Summarized, archived,
                                                derived

Data organization   By application              By subject
Data stability      Dynamic                     Static until refreshed
Data structure      Optimized for transaction   Optimized for complex
                                                Queries
Access frequency    High                        Medium to low
Access type         Read/update/delete          Read/aggregate
                    Field by field              Added to
Response time       Subsecond(<1s) to2-3s       Several second to minute
Data warehousing is defined as

 A data warehouse is a subject-oriented, integrated,
  time-variant, non-volatile collection of data in
  support of management decision
 A data warehouse is designed for easy access by
  users to large amounts of information, and data
  access is typically supported by specialized analytical
  tools and applications.
Data Warehouse Characteristics
It is database designed for analytical
 tasks, using data from multiple application
It supports a relatively small numbers of users
 with relatively long interaction
Its content is periodically updated
It contains current and historical data to
 provide a historical perspective of information
It contains a few large tables
Integrated
    • Data is stored once in a single integrated location
                   (e.g. insurance company)

                 Auto Policy
                 Processing                  Data Warehouse
                  System                     Database

Customer
                Fire Policy
data            Processing
stored           System
in several
databases
                                              Subject = Customer
                 FACTS, LIFE
              Commercial, Accounting
                  Applications

                                                               12
Time - Variant
• Data is stored as a series of snapshots or views which record how it is
  collected across time.
 Data Warehouse Data


                        Time              Data
                       {
                          Key

        Data is tagged with some element of time - creation date, as of
         date, etc.
        Data is available on-line for long periods of time for trend
         analysis and forecasting. For example, five or more years

                                                                           13
Non-Volatile

• Existing data in the warehouse is not overwritten or
   updated.                                                   External
                                                              Sources


                          Production                             Data
                          Databases                              Warehouse
                                                   Data          Database
Production
                                                 Warehouse
Applications
                                                Environment

                                       • Load
               • Update
               • Insert                                        • Read-Only
               • Delete

                                                                             14
Subject Oriented
       • Example for an insurance company :
Applications Area                               Data Warehouse
                            Auto and Fire
                               Policy
   Commercial
                             Processing     Customer             Policy
     and Life
                              Systems
    Insurance
     Systems


                                                       Data
                     Data

                              Claims
                                            Losses               Premium
 Accounting                 Processing
  System         Billing      System
                 System


                                                                          15
Data Warehouse Architecture
It is based on a
 relational database
 management system
 server that function
 as the central repository
 for informational data
Operational System           Data Warehouse




                                                     Ad-hoc
                                                    Reporting




        Conversion
        & Interface                                OLAP
                                                   Cubes




                                                   Canned
                                                   Reports

        ODS           Staging Area
                                      Data Marts


                                                           17
Data Warehouse Architecture
The source data for it is operational application
During processing data is transformed into an
 integrated structure and format
The transformation process may involve
 conversion, summarization, filtering and
 condensation of data
References:
Introduction to data warehousing
.wikipedia.org/wiki/Data_warehouse
www.slideshare.net/datacleaners11/datawar
 ehousingppt
www.4shared.com/office/pLEWhceH/Data_W
 arehousing.html
www.cse.iitb.ac.in/dbms/Data/Talks/krithi-
 talk-impact.ppt
Introduction to data warehousing

Contenu connexe

Tendances

Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture janani thirupathi
 
Data Warehouse Basic Guide
Data Warehouse Basic GuideData Warehouse Basic Guide
Data Warehouse Basic Guidethomasmary607
 
Data Warehouse Architectures
Data Warehouse ArchitecturesData Warehouse Architectures
Data Warehouse ArchitecturesTheju Paul
 
Introduction To Data Warehousing
Introduction To Data WarehousingIntroduction To Data Warehousing
Introduction To Data WarehousingAlex Meadows
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Miningidnats
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSINGKing Julian
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesFellowBuddy.com
 
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse conceptsobieefans
 
Data warehousing and data mart
Data warehousing and data martData warehousing and data mart
Data warehousing and data martAmit Sarkar
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data WarehouseSOMASUNDARAM T
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etlAashish Rathod
 
Data warehouse design
Data warehouse designData warehouse design
Data warehouse designines beltaief
 
Architecture of data mining system
Architecture of data mining systemArchitecture of data mining system
Architecture of data mining systemramya marichamy
 
Data Warehouse Modeling
Data Warehouse ModelingData Warehouse Modeling
Data Warehouse Modelingvivekjv
 
Project Presentation on Data WareHouse
Project Presentation on Data WareHouseProject Presentation on Data WareHouse
Project Presentation on Data WareHouseAbhi Bhardwaj
 

Tendances (20)

DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
 
Data Warehouse Basic Guide
Data Warehouse Basic GuideData Warehouse Basic Guide
Data Warehouse Basic Guide
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data Warehouse Architectures
Data Warehouse ArchitecturesData Warehouse Architectures
Data Warehouse Architectures
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Introduction To Data Warehousing
Introduction To Data WarehousingIntroduction To Data Warehousing
Introduction To Data Warehousing
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture Notes
 
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 
Data warehousing and data mart
Data warehousing and data martData warehousing and data mart
Data warehousing and data mart
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
 
Data warehouse design
Data warehouse designData warehouse design
Data warehouse design
 
Architecture of data mining system
Architecture of data mining systemArchitecture of data mining system
Architecture of data mining system
 
Data Warehouse Modeling
Data Warehouse ModelingData Warehouse Modeling
Data Warehouse Modeling
 
Project Presentation on Data WareHouse
Project Presentation on Data WareHouseProject Presentation on Data WareHouse
Project Presentation on Data WareHouse
 
Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
 

En vedette

Definition of Information System
Definition of Information SystemDefinition of Information System
Definition of Information SystemLansey Wegner
 
Pharmacists Licensure Exam Modules
Pharmacists Licensure Exam ModulesPharmacists Licensure Exam Modules
Pharmacists Licensure Exam ModulesCristina Joy Reyes
 
Application of fourier series
Application of fourier seriesApplication of fourier series
Application of fourier seriesGirish Dhareshwar
 
General Chemistry and Inorganic Pharmaceutical Chemistry Module 1 Pharmacist ...
General Chemistry and Inorganic Pharmaceutical Chemistry Module 1 Pharmacist ...General Chemistry and Inorganic Pharmaceutical Chemistry Module 1 Pharmacist ...
General Chemistry and Inorganic Pharmaceutical Chemistry Module 1 Pharmacist ...Senyora Ouf'ra
 
Basics of bridge construction
Basics of bridge constructionBasics of bridge construction
Basics of bridge constructionsymphonyjo
 
Bridge
Bridge Bridge
Bridge illpa
 
Common drugs and antidotes
Common drugs and antidotesCommon drugs and antidotes
Common drugs and antidotesAdel Regellana
 
Information system
Information systemInformation system
Information systemhiddensoul
 
Using span tables as1684 2
Using span tables   as1684 2Using span tables   as1684 2
Using span tables as1684 2jbusse
 

En vedette (20)

As 1684.4 2010 residential timber-framed construction simplified - non-cyclon...
As 1684.4 2010 residential timber-framed construction simplified - non-cyclon...As 1684.4 2010 residential timber-framed construction simplified - non-cyclon...
As 1684.4 2010 residential timber-framed construction simplified - non-cyclon...
 
Magnev train real
Magnev train real Magnev train real
Magnev train real
 
As 1684.2 2010 residential timber-framed construction non-cyclonic areas
As 1684.2 2010 residential timber-framed construction non-cyclonic areasAs 1684.2 2010 residential timber-framed construction non-cyclonic areas
As 1684.2 2010 residential timber-framed construction non-cyclonic areas
 
Definition of Information System
Definition of Information SystemDefinition of Information System
Definition of Information System
 
Pharmacists Licensure Exam Modules
Pharmacists Licensure Exam ModulesPharmacists Licensure Exam Modules
Pharmacists Licensure Exam Modules
 
Roof Framing
Roof FramingRoof Framing
Roof Framing
 
Application of fourier series
Application of fourier seriesApplication of fourier series
Application of fourier series
 
Class antidotes
Class antidotesClass antidotes
Class antidotes
 
General Chemistry and Inorganic Pharmaceutical Chemistry Module 1 Pharmacist ...
General Chemistry and Inorganic Pharmaceutical Chemistry Module 1 Pharmacist ...General Chemistry and Inorganic Pharmaceutical Chemistry Module 1 Pharmacist ...
General Chemistry and Inorganic Pharmaceutical Chemistry Module 1 Pharmacist ...
 
Antidote
AntidoteAntidote
Antidote
 
Expert Systems
Expert SystemsExpert Systems
Expert Systems
 
Basics of bridge construction
Basics of bridge constructionBasics of bridge construction
Basics of bridge construction
 
Bridge
Bridge Bridge
Bridge
 
Bridges
BridgesBridges
Bridges
 
Common drugs and antidotes
Common drugs and antidotesCommon drugs and antidotes
Common drugs and antidotes
 
types of bridges
types of bridgestypes of bridges
types of bridges
 
6.expert systems
6.expert systems6.expert systems
6.expert systems
 
Information system
Information systemInformation system
Information system
 
Types of bridges.pptx 1
Types of bridges.pptx 1Types of bridges.pptx 1
Types of bridges.pptx 1
 
Using span tables as1684 2
Using span tables   as1684 2Using span tables   as1684 2
Using span tables as1684 2
 

Similaire à Introduction to data warehousing

Enterprise Data Warehousing Positioning
Enterprise Data Warehousing PositioningEnterprise Data Warehousing Positioning
Enterprise Data Warehousing PositioningEdenH6
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...DATAVERSITY
 
Data warehouse
Data warehouseData warehouse
Data warehouseRajThakuri
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 
Oracle: Fundamental Of Dw
Oracle: Fundamental Of DwOracle: Fundamental Of Dw
Oracle: Fundamental Of Dworacle content
 
20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.pptSumathiG8
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Cana Ko
 
Manish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biManish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biA P
 
20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.pptSamPrem3
 
20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.pptPalaniKumarR2
 
Prague data management meetup 2017-02-28
Prague data management meetup 2017-02-28Prague data management meetup 2017-02-28
Prague data management meetup 2017-02-28Martin Bém
 
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAININGDATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAININGDatawarehouse Trainings
 

Similaire à Introduction to data warehousing (20)

DW 101
DW 101DW 101
DW 101
 
Enterprise Data Warehousing Positioning
Enterprise Data Warehousing PositioningEnterprise Data Warehousing Positioning
Enterprise Data Warehousing Positioning
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
DWBASIC.ppt
DWBASIC.pptDWBASIC.ppt
DWBASIC.ppt
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
Oracle: Fundamental Of DW
Oracle: Fundamental Of DWOracle: Fundamental Of DW
Oracle: Fundamental Of DW
 
Oracle: Fundamental Of Dw
Oracle: Fundamental Of DwOracle: Fundamental Of Dw
Oracle: Fundamental Of Dw
 
Dbm630_lecture02-03
Dbm630_lecture02-03Dbm630_lecture02-03
Dbm630_lecture02-03
 
Dbm630_Lecture02-03
Dbm630_Lecture02-03Dbm630_Lecture02-03
Dbm630_Lecture02-03
 
20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831
 
Manish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biManish tripathi-ea-dw-bi
Manish tripathi-ea-dw-bi
 
20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt
 
20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt
 
Prague data management meetup 2017-02-28
Prague data management meetup 2017-02-28Prague data management meetup 2017-02-28
Prague data management meetup 2017-02-28
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Dwh basics datastage online training
Dwh basics datastage online trainingDwh basics datastage online training
Dwh basics datastage online training
 
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAININGDATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
 

Introduction to data warehousing

  • 2. What is data warehousing?  data warehouse is a database used for reporting and analysis  Integrated collection of ENTERPRISE-WIDE DATA, oriented to decision making  Provides strategic information  Performing Information analysis that could not done by operating system
  • 3. Need for data warehousing Maintain data history Even if the source transaction systems do not. Integrate data from multiple source systems, Improve data quality by providing consistent codes and descriptions Provides a flexible, conducive and interactive source of strategic information Performing Information analysis that could not done by operating system
  • 4. Data Rich, but Information Poor • Data is stored, not explored : by its volume and complexity it represents a burden, not a support • Data overload results in uninformed decisions, contradictory information, higher overhead, wrong decisions, increased costs • Data is not designed and is not structured for successful management decision making
  • 5. Improving Decision Making Decisions DataInformation Warehouse Data 5
  • 6. Operational data stores Data focuses on transaction functions such as bank card withdrawals and deposits It is organised by application ODS  It contains the current values It supports day-to-day operational decision supports information  it is detailed , nonredundant and updateable
  • 7. Informational data stores  Itis organised around subject such as customer, product It is summarized, archived, derived Data is static until refreshed Data is nonupdateable
  • 8. Difference between operational &informational data stores Operational Informational Data data Data content Current value Summarized, archived, derived Data organization By application By subject Data stability Dynamic Static until refreshed Data structure Optimized for transaction Optimized for complex Queries Access frequency High Medium to low Access type Read/update/delete Read/aggregate Field by field Added to Response time Subsecond(<1s) to2-3s Several second to minute
  • 9. Data warehousing is defined as  A data warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management decision  A data warehouse is designed for easy access by users to large amounts of information, and data access is typically supported by specialized analytical tools and applications.
  • 10.
  • 11. Data Warehouse Characteristics It is database designed for analytical tasks, using data from multiple application It supports a relatively small numbers of users with relatively long interaction Its content is periodically updated It contains current and historical data to provide a historical perspective of information It contains a few large tables
  • 12. Integrated • Data is stored once in a single integrated location (e.g. insurance company) Auto Policy Processing Data Warehouse System Database Customer Fire Policy data Processing stored System in several databases Subject = Customer FACTS, LIFE Commercial, Accounting Applications 12
  • 13. Time - Variant • Data is stored as a series of snapshots or views which record how it is collected across time. Data Warehouse Data Time Data { Key  Data is tagged with some element of time - creation date, as of date, etc.  Data is available on-line for long periods of time for trend analysis and forecasting. For example, five or more years 13
  • 14. Non-Volatile • Existing data in the warehouse is not overwritten or updated. External Sources Production Data Databases Warehouse Data Database Production Warehouse Applications Environment • Load • Update • Insert • Read-Only • Delete 14
  • 15. Subject Oriented • Example for an insurance company : Applications Area Data Warehouse Auto and Fire Policy Commercial Processing Customer Policy and Life Systems Insurance Systems Data Data Claims Losses Premium Accounting Processing System Billing System System 15
  • 16. Data Warehouse Architecture It is based on a relational database management system server that function as the central repository for informational data
  • 17. Operational System Data Warehouse Ad-hoc Reporting Conversion & Interface OLAP Cubes Canned Reports ODS Staging Area Data Marts 17
  • 18. Data Warehouse Architecture The source data for it is operational application During processing data is transformed into an integrated structure and format The transformation process may involve conversion, summarization, filtering and condensation of data
  • 19.
  • 20. References: Introduction to data warehousing .wikipedia.org/wiki/Data_warehouse www.slideshare.net/datacleaners11/datawar ehousingppt www.4shared.com/office/pLEWhceH/Data_W arehousing.html www.cse.iitb.ac.in/dbms/Data/Talks/krithi- talk-impact.ppt