2. Problems with early COBOLian data processing
systems.
Data redundancies
From flat file to Table, each entity ultimately becomes
a Table in the physical schema.
Simple O(n²) joins to work with tables
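A minimal Python sketch (names and data are illustrative, not from the slides) of where that cost comes from: a nested-loop join over two unindexed tables compares every row pair, i.e. O(n·m) work, roughly O(n²) when the tables are similar in size.

```python
# Sketch only: a nested-loop join over two unindexed tables compares
# every row pair, which is where the O(n * m) join cost comes from.
def nested_loop_join(left, right, key):
    """Join two lists of row-dicts on `key` by scanning all pairs."""
    out = []
    for l in left:                 # n iterations
        for r in right:            # m iterations for each of them
            if l[key] == r[key]:
                out.append({**l, **r})
    return out

customers = [{"cust_id": 1, "name": "Ali"}, {"cust_id": 2, "name": "Sara"}]
orders = [{"cust_id": 1, "total": 50}, {"cust_id": 1, "total": 20}]

joined = nested_loop_join(customers, orders, "cust_id")
# joined -> two rows, both for customer "Ali"
```

Real DBMSs use smarter join algorithms, but without indexes the pairwise-comparison picture above is the worst case.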
3. ◦ Coupled with normalization, drives all
the redundancy out of the database.
◦ Change (or add or delete) the data at just
one point.
◦ Can be used with indexing for very fast
access.
◦ Resulted in the success of OLTP systems.
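A hedged sketch of the indexing point (hypothetical data): an index, here just a Python dict acting as a hash index, replaces a full-table scan with a single lookup, which is what makes normalized OLTP access so fast.

```python
# Hypothetical meter-reading table: 1000 rows of made-up data.
rows = [{"id": i, "reading": i * 10} for i in range(1000)]

def scan(table, target_id):
    """Unindexed access: touches every row in the worst case."""
    for row in table:
        if row["id"] == target_id:
            return row
    return None

# Build the index once; afterwards each lookup is O(1) expected time.
index = {row["id"]: row for row in rows}

assert scan(rows, 999) is index[999]  # same row, very different cost
```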
4. Let's have a look at a typical ER data model first.
Some Observations:
◦ All tables look alike; as a consequence, it is difficult to
identify:
Which table is more important?
Which is the largest?
Which tables contain numerical measurements of the business?
Which tables contain nearly static descriptive attributes?
5. ◦ Many topologies for the same ER diagram,
all appearing different.
Very hard to visualize and remember.
A large number of possible connections between any
two (or more) tables
[Figure: the same twelve-node graph drawn twice in different layouts —
two isomorphic graphs that appear different.]
6. The Paradox: Trying to make information
accessible using tables resulted in an inability to
query them!
ER and normalization result in a large number of tables
which are:
◦ Hard to understand by the users (DB programmers)
◦ Hard to navigate optimally by DBMS software
Real value of ER is in using tables individually or in
pairs
Too complex for queries that span multiple tables with
a large number of records
7. ER vs. DM
ER: Constituted to optimize OLTP performance.
DM: Constituted to optimize DSS query performance.
ER: Models the micro relationships among data elements.
DM: Models the macro relationships among data elements with an overall deterministic strategy.
ER: A wild variability of the structure of ER models.
DM: All dimensions serve as equal entry points to the fact table.
ER: Very vulnerable to changes in the user's querying habits, because such schemas are asymmetrical.
DM: Changes in users' querying habits can be accommodated by automatic SQL generators.
9. A simpler logical model optimized for decision
support.
Inherently dimensional in nature, with a single
central fact table and a set of smaller
dimensional tables.
Multi-part key for the fact table.
Dimension tables with a single-part PK.
Keys are usually system-generated.
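These properties can be sketched in SQLite (a hedged illustration; the table and column names are invented, not from the slides): dimension tables carry single-part surrogate keys, and the fact table's primary key is a multi-part combination of the dimension foreign keys.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE dim_date (
    date_key INTEGER PRIMARY KEY,   -- system-generated surrogate key
    day TEXT, month TEXT, quarter TEXT, year INTEGER
);
CREATE TABLE dim_item (
    item_key INTEGER PRIMARY KEY,   -- single-part PK
    item_name TEXT, category TEXT, dept TEXT
);
CREATE TABLE fact_sales (
    date_key INTEGER REFERENCES dim_date,
    item_key INTEGER REFERENCES dim_item,
    qty INTEGER, revenue REAL,
    PRIMARY KEY (date_key, item_key)  -- multi-part key of dimension FKs
);
""")
```

Surrogate (system-generated) keys insulate the warehouse from changes to business keys, as the notes below observe.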
11. Results in a star-like structure, called a star schema
or star join.
◦ All relationships mandatory M-1.
◦ Single path between any two levels.
Supports ROLAP operations.
12. Example hierarchy for the Items dimension:
Items → Books (Fiction; Text → Medical, Engg) and Cloths (Men, Women).
Analysts tend to look at the data through a dimension at a
particular "level" in the hierarchy.
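A small Python sketch of that idea (the leaf-level sales figures are made up): an analyst enters the hierarchy at a chosen level, and the leaf values roll up to it.

```python
# The slide's Items hierarchy; children of a node map to sub-hierarchies.
hierarchy = {
    "Books": {"Fiction": {}, "Text": {"Medical": {}, "Engg": {}}},
    "Cloths": {"Men": {}, "Women": {}},
}
# Hypothetical leaf-level sales counts.
sales_by_leaf = {"Fiction": 5, "Medical": 3, "Engg": 2, "Men": 7, "Women": 4}

def rollup(name, children):
    """Sum leaf-level sales under `name` in the hierarchy."""
    if not children:
        return sales_by_leaf.get(name, 0)
    return sum(rollup(c, sub) for c, sub in children.items())

books_total = rollup("Books", hierarchy["Books"])
# books_total -> 10  (Fiction 5 + Medical 3 + Engg 2)
```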
14.
[Figure: a normalized (snowflake-style) schema for retail sales.
Fact tables sale_detail and sale_header connect through M-1
relationships to chains of dimension tables: date → week → month →
quarter → year; store → zone → city → district → division → province;
item_x_cat → cat_x_dept; item_x_splir → SUPPLIER.]
16.
Beauty lies in close correspondence
with the business, evident even to
business users.
17. Dimensional hierarchies are collapsed into a single
table for each dimension. Loss of information?
A single fact table is created from the header and the
detail records, resulting in:
◦ A vastly simplified physical data model!
◦ Fewer tables (versus thousands of tables in some ERP systems).
◦ Fewer joins, resulting in high performance.
◦ Some additional space requirement.
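A hedged illustration of that trade-off (hypothetical date rows): collapsing a day → month → quarter hierarchy into one dimension table repeats the higher levels on every row, and the month-to-quarter dependency is no longer visible in the schema.

```python
# Collapsed date dimension: one row per day, higher levels repeated.
collapsed_dim_date = [
    {"date_key": 1, "day": "2020-01-01", "month": "Jan", "quarter": "Q1"},
    {"date_key": 2, "day": "2020-01-02", "month": "Jan", "quarter": "Q1"},
    {"date_key": 3, "day": "2020-01-03", "month": "Jan", "quarter": "Q1"},
]
# "Jan" is stored once per day instead of once overall -- that is the
# "additional space" cost of collapsing the hierarchy.
extra_month_copies = sum(r["month"] == "Jan" for r in collapsed_dim_date) - 1
# extra_month_copies -> 2
```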
Editor's Notes
There were utility companies that went house to house collecting information such as meter readings. The data was recorded in books, and the information was entered into a computer at a central place. The address remains the same, but the reading changes forever, so the information becomes redundant: if data changes, it needs to be reflected in many places. The solution to this problem was normalization, which is based on ER modeling. The remaining problem was slow joins: the ER diagram was turned into tables, which were joined with other tables to collect the information.
If things were fine, then why do we need DMs? Now look at a schema which is in third normal form; see the next slide.
Some observations about the ER diagram, with the questions mentioned above.
An example from real life: if you go somewhere and want to know which person is the most important, it will be the one with people around him, listening to what he says. But now, can you tell which table is more important? The one with the largest header and few rows of records, or vice versa?
Numerical measurements: e.g. sales data, number of items sold and revenue; the factual data.
Descriptive: data containing dimensional information.
So what is the benefit of the simplicity if it raises more questions at every step?
So all the previous points lead us to the demand for a new representation. This is explained using graph theory:
An ER model can have a different shape depending on the designer; every model looks different. The two graphs above are the same graph in different representations, and the left graph is more difficult to understand. This is the graph isomorphism problem: deciding whether two graphs are the same, which is computationally very hard. The same problem exists with ER diagrams: the models appear different for every problem.
So these complexities take us toward the need for DM.
Paradox: a conflict. An example: you go to a hospital and ask how the operation went; they say the operation was successful but the patient died. What is the benefit of such a successful operation if it could not save the patient's life? That is a paradox.
The problem is complex because of the many tables produced by normalization; in an ERP system these may number in the thousands. The real value of ER modeling shows when you query a single table or a few tables, where you get good performance; but in DSS we by default join many, many tables, so performance suddenly drops. This is the paradox.
So, a comparison of ER against DM.
ER modeling is for OLTP and DM for DSS. Suppose you have a bike and, when building a house, you decide to load the cement onto it: the result is that your bike will be destroyed. But if you do it with a truck, it will never have any ill effect. The problem is using the right thing for the wrong problem.
In DSS we are concerned with a higher level of aggregation, so we do not go into minor details.
ER diagrams differ for the same problem, and when systems are built they show a lot of variation; but in DSS the schema does not normally change. There are smart environments that generate SQL automatically, but they may run into difficulty while optimizing if the schema always changes. In a DM or star schema, it is very difficult to generate the SQL.
The ER schema changes when the business changes, so the SQL-generating tool faces difficulty. But in DSS the schema remains constant even with changes in the business.
The ER model can be simplified using de-normalization and DM.
So what is a DM, and how do we tell that a schema is optimized for the DSS environment?
The slide points.
So the key point is that it is simple, logical, and intuitive: if it is easy for programmers to understand, it assures a better solution. It has two kinds of tables, fact and dimension. Fact tables are large and dimension tables are small. Fact tables store the numerical data, i.e. how much was sold, the sales revenue. The dimension tables hold information about the dimensions, i.e. time, geography, etc.
Keys should be system-generated, not business keys, so that if the business changes, the keys do not need maintenance.
Map business analyst representation to relational model
Data cubes with dimensions and measures
Relational design with tables and 1-M relationships (FKs)
Dimensions to dimension tables
Measures to fact tables
Group fact and dimension tables
Grain: most detailed measure values stored
How do fact and dimension tables connect?
In the form of a star topology where the fact table is in the center.
DM is designed to support ROLAP operations, where we can run on-the-go queries.
Dimensions have hierarchies, e.g. books have fiction and text, but you cannot mix them. The benefit is that the decision maker can enter at a point in the hierarchy to see the details of other levels.
The above task can be done by two schemas.
Stars are simple: rotate, flip, or reposition one and it will not change; but if you do this to a snowflake, you lose the entire meaning.
A star schema represents a complete business process, e.g. sales, purchases, inventory, etc. For each business process we will have a different star.
The star schema of the previous slide; things become simplified.
We create the fact tables with real (physical) records; we do not run the joins at run time. This is the reason that in pivot4j we analyze a physical, real star by placing the dimensions of our requirements, and the MDX is generated automatically. Once a star is created, it does not matter how you analyze it.
Suppose there are a hundred records in each table and four tables are involved in a query that needs a join, and the join returns 40 rows for a specific join query. To retrieve those 40 rows we have computed 100×100×100×100 steps. If those 40 records are instead placed in a fact table which has 1000 total rows, then in the worst case we achieve the correct output in 1000 steps in the star instead of 100,000,000 steps. Ultimately we achieve enormous performance.
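The arithmetic in this note can be checked directly (a toy calculation using the note's own numbers):

```python
# Brute-force join of four 100-row tables examines every row combination.
join_steps = 100 ** 4          # 100,000,000 combinations
# Worst-case scan of a precomputed 1000-row fact table:
fact_scan_steps = 1000
speedup = join_steps // fact_scan_steps
# speedup -> 100000, i.e. the star is five orders of magnitude cheaper
```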
When we get the star schema, we collapse the hierarchies into a single table, i.e. time is now in a single table, meaning we avoid the sub-tables in the form of PK/FK relations. Now the name of a column, say city, will be used in the dimension table instead of an FK. This may result in loss of information: previously every city carried the province's FK, but now we cannot tell the dependency of cities just by looking at the diagram. The disadvantage is that you cannot tell which element is a subset of which element, or what an element's level in the hierarchy is: a loss of information. The benefit is a simple schema with few tables compared to the previous hundreds; another disadvantage is the additional space. A simple example follows on the next slide.