SlideShare une entreprise Scribd logo
1  sur  34
PRESENTATION
ON
DATA WAREHOUSING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
NITTTR,CHANDIGARH
2
Course Overview
DW Architecture
Data mart
Difference b/w DW And OS
Data warehouse Schema
Types of Data stored in
datawarehouse
Data Warehouse
It is a repository of data used for analysis
and reporting.
The data stored in data warehouse is
loaded from different business source
system.
Data may pass through operational data
store for cleaning before it is used in D/w
for reporting.
3
DW ARCHITECTURE
4
OPERATIONAL SOURCE SYSTEM:
5
 In data warehouse data may be uploaded from one or
many operational source system.
 Suppose retail industry and sale consumer product
generating revenues for a company through different
channel.
DIFFERENT CHANNELS ARE:-
1. SALES
2. ECOM
3. POS
4. MARKETING
Staging Area:-
1. Non Persistent
2. Persistent
6
Non Persistent:- Data from sale, Ecom or
from different channel it just replicate
data in a Non Persistent storage area in a
specific given time(like 8-11 a.m).
7
Operational Data store: It redefined form
of data . It may have data from different
channel in single table or in mulitple table
but in readable format.
This is called Integration Area.
8
Informatica Power Centre:
Access, transform and integrate data from
any system , in any format and deliver the
data throughout the enterprise.
ETL :- is used to put records into data
warehouse.
9
10
ETL 3 Phases:
Extract
Transform
Load
11
Extraction: we extract data from source
system and make it accessible for further
processing.
Extraction Strategies:
o Full extraction
o Partial extraction
12
Transform:
Data extracted into a staging server is a raw data
and can’t be used as it is .
It needs to be cleaned, mapped and
transformed before finally loaded into data
warehouse
13
Basic Transformation Tasks:Basic Transformation Tasks:
Selection
Matching
Data cleaning and enrichment
Consolidation or summarization
Loading:
 In loading we load the data from staging
server into data warehouse.
 Data loading fetches prepare data, applies
it into the data warehouse and store it in
the database
15
Types of loading:
 Initial Load:- It is done when first time data to
be loaded into data warehouse.
 Incremental Load:- applying energy changes
as necessary in a periodic manner.
 Full Refresh:- completely erasing the contents
of one or more tables and reloading with fresh
data.
16
ETL Tools:
Enterprise Software
Informatica
IBM data stage
CloverETL
Microsoft SQL Server
17
18
Why we store data into Data Marts
from Data Warehouse?
In d/w it may have record 10 year old
record .
It is time consuming process.
In Data Mart we create table and store
only summarized data and store a weakly
data.
19
DIFFERENCE BETWEEN DW AND
ODS
20
21
22
Data Warehousing - Schemas
Schema is a logical description of the entire database. It includes
the name and description of records of all record types including all
associated data-items and aggregates. Much like a database, a data
warehouse also requires to maintain a schema. A database uses
relational model, while a data warehouse uses :
•Star schema
•Snowflake schema
• Fact Constellation schema
23
Star Schema:
•Each dimension in a star schema is represented with only one-
dimension table.
•This dimension table contains the set of attributes.
•The following diagram shows the sales data of a company with
respect to the four dimensions, namely time, item, branch, and
location.
24
•There is a fact table at the center. It contains the keys to each of
four dimensions.
•The fact table also contains the attributes, namely dollars sold and
units sold.
25
Snowflake Schema:
•Some dimension tables in the Snowflake schema are normalized.
•The normalization splits up the data into additional tables.
•Unlike Star schema, the dimensions table in a snowflake schema
are normalized. For example, the item dimension table in star
schema is normalized and split into two dimension tables, namely
item and supplier table.
•Now the item dimension table contains the attributes item_key,
item_name, type, brand, and supplier-key.
•The supplier key is linked to the supplier dimension table. The
supplier dimension table contains the attributes supplier_key and
supplier_type.
26
27
Fact Constellation Schema:
•A fact constellation has multiple fact tables. It is also known as
galaxy schema.
•The following diagram shows two fact tables, namely sales and
shipping.
•The sales fact table is same as that in the star schema.
•The shipping fact table has the five dimensions, namely item _ key,
time _ key , shipper _ key , from _ location, to _ location.
•The shipping fact table also contains two measures, namely dollars
sold and units sold.
•It is also possible to share dimension tables between fact tables. For
example, time, item, and location dimension tables are shared
between the sales and shipping fact table.
28
29
Types of Data Stored in a Data
Warehouse:
The term data warehouse is used to distinguish a
database that is used for business analysis
(OLAP) rather than transaction processing
(OLTP)
Your data warehouse will store these types of
data:
• Historical data
•Derived data
• Metadata
30
Historical Data
A data warehouse typically contains several years of
historical data. The amount of data that you decide to
make available depends on available disk space and the
types of analysis that you want to support. This data can
come from your transactional database archives or other
sources.
Some applications might perform analyses that require
data at lower levels than users typically view it. You
will need to check with the application builder or the
application's documentation for those types of data
requirements.
31
Derived Data
Derived data is generated from existing data using a
mathematical operation or a data transformation. It can be
created as part of a database maintenance operation or
generated at run-time in response to a query.
Metadata
Metadata is data that describes the data and schema objects,
and is used by applications to fetch and compute the data
correctly.
32
Useful URLs
Ralph Kimball’s home page
http://www.rkimball.com
Larry Greenfield’s Data Warehouse Information
Center
http://pwp.starnetinc.com/larryg/
Data Warehousing Institute
http://www.dw-institute.com/
OLAP Council
http://www.olapcouncil.com/
33
34
Thanks for your
attention…!!!
Any Queries ??

Contenu connexe

Tendances

Data warehouse presentaion
Data warehouse presentaionData warehouse presentaion
Data warehouse presentaionsridhark1981
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingEyad Manna
 
Dw & etl concepts
Dw & etl conceptsDw & etl concepts
Dw & etl conceptsjeshocarme
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architectureuncleRhyme
 
Introduction To Data Warehousing
Introduction To Data WarehousingIntroduction To Data Warehousing
Introduction To Data WarehousingAlex Meadows
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecturepcherukumalla
 
OLAP OnLine Analytical Processing
OLAP OnLine Analytical ProcessingOLAP OnLine Analytical Processing
OLAP OnLine Analytical ProcessingWalid Elbadawy
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etlAashish Rathod
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional ModelingSunita Sahu
 
1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouseKrish_ver2
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaRadhika Kotecha
 
Modern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemJames Serra
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Miningidnats
 

Tendances (20)

Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data warehouse presentaion
Data warehouse presentaionData warehouse presentaion
Data warehouse presentaion
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Dw & etl concepts
Dw & etl conceptsDw & etl concepts
Dw & etl concepts
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
Introduction To Data Warehousing
Introduction To Data WarehousingIntroduction To Data Warehousing
Introduction To Data Warehousing
 
OLTP vs OLAP
OLTP vs OLAPOLTP vs OLAP
OLTP vs OLAP
 
Data Warehouse 101
Data Warehouse 101Data Warehouse 101
Data Warehouse 101
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
OLAP OnLine Analytical Processing
OLAP OnLine Analytical ProcessingOLAP OnLine Analytical Processing
OLAP OnLine Analytical Processing
 
Data warehousing ppt
Data warehousing pptData warehousing ppt
Data warehousing ppt
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouse
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
Modern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform System
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
 

Similaire à Data warehouse

UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptxUNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptxshruthisweety4
 
Chapter 4. Data Warehousing and On-Line Analytical Processing.ppt
Chapter 4. Data Warehousing and On-Line Analytical Processing.pptChapter 4. Data Warehousing and On-Line Analytical Processing.ppt
Chapter 4. Data Warehousing and On-Line Analytical Processing.pptSubrata Kumer Paul
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olapData Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olap
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olapSalah Amean
 
Data Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.pptData Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.pptMutiaSari53
 
UNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxUNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxDURGADEVIL
 
Data ware house architecture
Data ware house architectureData ware house architecture
Data ware house architectureDeepak Chaurasia
 
Dataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesDataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesInformaticaTrainingClasses
 
Data Warehousing for students educationpptx
Data Warehousing for students educationpptxData Warehousing for students educationpptx
Data Warehousing for students educationpptxjainyshah20
 
Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processingVijayasankariS
 
Data Warehouse By Piyush
Data Warehouse By PiyushData Warehouse By Piyush
Data Warehouse By Piyushastronish
 
Data warehouse - Nivetha Durganathan
Data warehouse - Nivetha DurganathanData warehouse - Nivetha Durganathan
Data warehouse - Nivetha DurganathanNivetha Durganathan
 
Data warehousing
Data warehousingData warehousing
Data warehousingAllen Woods
 
DataWarehousingandAbInitioConcepts.ppt
DataWarehousingandAbInitioConcepts.pptDataWarehousingandAbInitioConcepts.ppt
DataWarehousingandAbInitioConcepts.pptPurnenduMaity2
 
11667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect411667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect4ambujm
 

Similaire à Data warehouse (20)

UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptxUNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
 
Chapter 4. Data Warehousing and On-Line Analytical Processing.ppt
Chapter 4. Data Warehousing and On-Line Analytical Processing.pptChapter 4. Data Warehousing and On-Line Analytical Processing.ppt
Chapter 4. Data Warehousing and On-Line Analytical Processing.ppt
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olapData Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olap
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
 
Data warehouse logical design
Data warehouse logical designData warehouse logical design
Data warehouse logical design
 
Data Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.pptData Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.ppt
 
UNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxUNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docx
 
Data ware house architecture
Data ware house architectureData ware house architecture
Data ware house architecture
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Dataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesDataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClasses
 
Data Warehousing for students educationpptx
Data Warehousing for students educationpptxData Warehousing for students educationpptx
Data Warehousing for students educationpptx
 
Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processing
 
Data Warehouse By Piyush
Data Warehouse By PiyushData Warehouse By Piyush
Data Warehouse By Piyush
 
Chapter 2
Chapter 2Chapter 2
Chapter 2
 
Cs1011 dw-dm-1
Cs1011 dw-dm-1Cs1011 dw-dm-1
Cs1011 dw-dm-1
 
Data warehouse - Nivetha Durganathan
Data warehouse - Nivetha DurganathanData warehouse - Nivetha Durganathan
Data warehouse - Nivetha Durganathan
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
DataWarehousingandAbInitioConcepts.ppt
DataWarehousingandAbInitioConcepts.pptDataWarehousingandAbInitioConcepts.ppt
DataWarehousingandAbInitioConcepts.ppt
 
DMDW 1st module.pdf
DMDW 1st module.pdfDMDW 1st module.pdf
DMDW 1st module.pdf
 
11667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect411667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect4
 
BI Suite Overview
BI Suite OverviewBI Suite Overview
BI Suite Overview
 

Dernier

(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 

Dernier (20)

(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 

Data warehouse

  • 1. PRESENTATION ON DATA WAREHOUSING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING NITTTR,CHANDIGARH
  • 2. 2 Course Overview DW Architecture Data mart Difference b/w DW And OS Data warehouse Schema Types of Data stored in datawarehouse
  • 3. Data Warehouse It is a repository of data used for analysis and reporting. The data stored in data warehouse is loaded from different business source system. Data may pass through operational data store for cleaning before it is used in D/w for reporting. 3
  • 5. OPERATIONAL SOURCE SYSTEM: 5  In data warehouse data may be uploaded from one or many operational source system.  Suppose retail industry and sale consumer product generating revenues for a company through different channel. DIFFERENT CHANNELS ARE:- 1. SALES 2. ECOM 3. POS 4. MARKETING
  • 6. Staging Area:- 1. Non Persistent 2. Persistent 6
  • 7. Non Persistent:- Data from sale, Ecom or from different channel it just replicate data in a Non Persistent storage area in a specific given time(like 8-11 a.m). 7
  • 8. Operational Data store: It redefined form of data . It may have data from different channel in single table or in mulitple table but in readable format. This is called Integration Area. 8
  • 9. Informatica Power Centre: Access, transform and integrate data from any system , in any format and deliver the data throughout the enterprise. ETL :- is used to put records into data warehouse. 9
  • 10. 10
  • 12. Extraction: we extract data from source system and make it accessible for further processing. Extraction Strategies: o Full extraction o Partial extraction 12
  • 13. Transform: Data extracted into a staging server is a raw data and can’t be used as it is . It needs to be cleaned, mapped and transformed before finally loaded into data warehouse 13
  • 14. Basic Transformation Tasks:Basic Transformation Tasks: Selection Matching Data cleaning and enrichment Consolidation or summarization
  • 15. Loading:  In loading we load the data from staging server into data warehouse.  Data loading fetches prepare data, applies it into the data warehouse and store it in the database 15
  • 16. Types of loading:  Initial Load:- It is done when first time data to be loaded into data warehouse.  Incremental Load:- applying energy changes as necessary in a periodic manner.  Full Refresh:- completely erasing the contents of one or more tables and reloading with fresh data. 16
  • 17. ETL Tools: Enterprise Software Informatica IBM data stage CloverETL Microsoft SQL Server 17
  • 18. 18
  • 19. Why we store data into Data Marts from Data Warehouse? In d/w it may have record 10 year old record . It is time consuming process. In Data Mart we create table and store only summarized data and store a weakly data. 19
  • 20. DIFFERENCE BETWEEN DW AND ODS 20
  • 21. 21
  • 22. 22 Data Warehousing - Schemas Schema is a logical description of the entire database. It includes the name and description of records of all record types including all associated data-items and aggregates. Much like a database, a data warehouse also requires to maintain a schema. A database uses relational model, while a data warehouse uses : •Star schema •Snowflake schema • Fact Constellation schema
  • 23. 23 Star Schema: •Each dimension in a star schema is represented with only one- dimension table. •This dimension table contains the set of attributes. •The following diagram shows the sales data of a company with respect to the four dimensions, namely time, item, branch, and location.
  • 24. 24 •There is a fact table at the center. It contains the keys to each of four dimensions. •The fact table also contains the attributes, namely dollars sold and units sold.
  • 25. 25 Snowflake Schema: •Some dimension tables in the Snowflake schema are normalized. •The normalization splits up the data into additional tables. •Unlike Star schema, the dimensions table in a snowflake schema are normalized. For example, the item dimension table in star schema is normalized and split into two dimension tables, namely item and supplier table. •Now the item dimension table contains the attributes item_key, item_name, type, brand, and supplier-key. •The supplier key is linked to the supplier dimension table. The supplier dimension table contains the attributes supplier_key and supplier_type.
  • 26. 26
  • 27. 27 Fact Constellation Schema: •A fact constellation has multiple fact tables. It is also known as galaxy schema. •The following diagram shows two fact tables, namely sales and shipping. •The sales fact table is same as that in the star schema. •The shipping fact table has the five dimensions, namely item _ key, time _ key , shipper _ key , from _ location, to _ location. •The shipping fact table also contains two measures, namely dollars sold and units sold. •It is also possible to share dimension tables between fact tables. For example, time, item, and location dimension tables are shared between the sales and shipping fact table.
  • 28. 28
  • 29. 29 Types of Data Stored in a Data Warehouse: The term data warehouse is used to distinguish a database that is used for business analysis (OLAP) rather than transaction processing (OLTP) Your data warehouse will store these types of data: • Historical data •Derived data • Metadata
  • 30. 30 Historical Data A data warehouse typically contains several years of historical data. The amount of data that you decide to make available depends on available disk space and the types of analysis that you want to support. This data can come from your transactional database archives or other sources. Some applications might perform analyses that require data at lower levels than users typically view it. You will need to check with the application builder or the application's documentation for those types of data requirements.
  • 31. 31 Derived Data Derived data is generated from existing data using a mathematical operation or a data transformation. It can be created as part of a database maintenance operation or generated at run-time in response to a query. Metadata Metadata is data that describes the data and schema objects, and is used by applications to fetch and compute the data correctly.
  • 32. 32 Useful URLs Ralph Kimball’s home page http://www.rkimball.com Larry Greenfield’s Data Warehouse Information Center http://pwp.starnetinc.com/larryg/ Data Warehousing Institute http://www.dw-institute.com/ OLAP Council http://www.olapcouncil.com/
  • 33. 33