SlideShare une entreprise Scribd logo
1  sur  34
www.netpeach.com
Traditional Business Intelligence
Overview
By:
Netpeach Data Management Team
www.netpeach.com
Business Intelligence Overview
- Definition
- Architecture
- Source systems /OLTP
- ETL process
- Data Warehouses /OLAP
- OLTP vs. OLAP
- ODS and Data Marts
- Data Warehouse Design Approaches
- Dimensional Modeling
- From Enterprise models to Dimensional models
- Schema Types: Star, Snowflake, Fact Constellation
- Conclusion
www.netpeach.com
Definition
The term business intelligence (BI) refers to
technologies, applications and practices for the
collection, integration, analysis, and presentation of
business information.
The purpose of business intelligence is to support
better business decision making. BI systems provide
historical, current, and predictive views of business
operations, most often using data that has been
gathered into a data warehouse or a data mart and
occasionally working from operational data.
www.netpeach.com
BI enables enterprises to
- Measure performance and trends
- Use analytic information strategically
- Unlock the value of its information
- Identify opportunities
- Improve efficiency
- Perform competitive analysis..
- Find the Cause
- Data Mining
- Etc.
www.netpeach.com
Examples
• Cause & predictive analysis: Credit cart annual
fee
• Performance and trends: Region total sales /
our sales
• Competitive: Our sales / competitor sales in a
particular region or a location, etc,
• Right timing: Bank customer accounts (pattern
changes)
• Data mining: market basket analysis
www.netpeach.com
Architecture
ETL
Extract
Clean
Transform
Integrate
Loading
Analysis Services
Reporting Services
OLAP Services
Dashboards /
Scorecards
Alerts /
Notifications
EDW
DM
ODS ODS
DM
Etc.
Staging
/ETL
Source
Systems
Target
Systems
BI Data Presentation
www.netpeach.com
Architecture cont…
Typical BI architecture has the following components:
• A source system, also called Operational system—typically
an online transaction processing (OLTP) system, but other
systems or files that capture or hold data of interest are
also possible.
• An extraction, transformation, and loading (ETL) process.
• A data warehouse—typically an online analytical
processing (OLAP) system.
• A business intelligence platform such as Microstrategy.
www.netpeach.com
Source Systems (OLTP)
An operational system is a term used in data
warehousing to refer to a system that is used to process
the day-to-day transactions of an organization. These
systems are designed so processing of day-to-day
transactions is performed efficiently and the integrity of
the transactional data is preserved.
Sometimes operational systems are referred to as
operational databases, transaction processing systems, or
on-line transaction processing systems (OLTP). In OLTP —
online transaction processing systems relational database
design use the discipline of data modeling and generally
follow the Codd rules of data normalization in order to
ensure absolute data integrity
www.netpeach.com
Source Systems examples
- Account transactions in a Bank
- Sales transactions in a Retail outlet.
- Inventory management transactions in a
warehouse
- Workforce management transactions such as
attendance, vacations, overtime tracking, etc.
- Operational expenditure systems
- External sources such as industry information like
elasticity or demand of a product from a third
part sources in Retail domain.
- Etc.
www.netpeach.com
ETL – Extraction, Transformation and Loading
The Extraction, Transformation, and Loading (ETL) process
represents all the steps necessary to move data from different
source systems to an integrated data warehouse.
The ETL process involves the following steps:
- Data is gathered from various source systems.
- The data is transformed and prepared to be loaded into the
data warehouse. Transformation procedures can include
converting data types and names, eliminating unwanted
data, correcting typographical errors, aggregating data,
filling in incomplete data, and similar processes to
standardize the format and structure of data.
- The data is loaded into the data warehouse.
www.netpeach.com
Data Warehouse / Data Mart (OLAP)
A Data Warehouse, in its simplest perception, is no more than
a collection of the key pieces of information used to manage
and direct the business for the most profitable outcome.
- According to Bill Inmon, “a data warehouse is a
 subject-oriented,
 integrated,
 nonvolatile,
 time-variant
collection of data in support of management decisions”.
- Ralph Kimball states that a data warehouse is “ a copy of
transaction data specifically structured for Query and
Analysis”.
www.netpeach.com
OLAP
OLAP: a category of software tools that provides
analysis of data stored in a database. OLAP tools
enable users to analyze different dimensions of
multidimensional data. For example, it provides
time series and trend analysis views. OLAP often
is used in data mining.
www.netpeach.com
OLAP Analysis
Imagine an organization that manufactures and sells goods in
several states of USA
During the OLAP analysis, the top executives may seek answers
for the following:
- Number of products manufactured.
- Number of products manufactured in a location
- Number of products manufactured on time basis within a
location.
- Number of products manufactured in the current year when
compared to the previous year.
- Sales Dollar value for a particular product.
- Sales Dollar value for a product in a location.
- Sales Dollar value for a product in a year within a location.
- Sales Dollar value for a product in a year within a location sold
or serviced by an employee.
www.netpeach.com
OLTP / OLAP
OLTP FEATURE OLAP
Transactional applications
using a Front-end,
- data capture, modify,
delete
- No direct DB access
PURPOSE Analysis purpose
- Analyse Data
- Read only
- Some times direct access to DB
Operational administrative
staff, Data Entry
operator, database
professional, etc.
TYPE OF USERS Manager, analyst, executive,
executive management
Relational Data Structures DATA STRUCTURES Multidimensional Data Structures
Normalized DBMS DUPLICATED DATA De-Normalized & Normalized DBMS
Many NUMBER OF USERS Few
Predefined operations WORKLOAD AD-HOC queries , Predefined reports
Volatile DATA MODIFICATIONS Update on a regular basis
Small volume (Current Data) DATA Volume Large Volume (Historical Data)
Availability Must be high Response time must be good
www.netpeach.com
DW related: ODS and Data Marts
ODS (Operational Data Store) - This has a broad
enterprise wide scope, but unlike the real
enterprise data warehouse, data is refreshed in
near real time and used for routine business
activity.
Data Mart – is a subset of data warehouse and it
supports a particular region, business unit or
business function
www.netpeach.com
Typical positioning of an ODS
Application 1
Application 2
Application n
ETL
Source Applications
ODS
EDW
Data Mart Data Mart Data Mart
www.netpeach.com
ODS vs DW
ODS DW
It is designed to support
operational monitoring.
It is designed to support
Decision Making Process.
Data is volatile Non-Volatile
Current Data Historical Data
Designed for running the
business
Designed for Analyzing the
business
Follows Normalization Follows de-normalization
Designed using E/R Modeling Using Dimensional Modeling
www.netpeach.com
ODS and DW use case
In a pharmaceutical company
Customer ODS is used for:
- sending new product details,
- promotional activities,
- and scheduling appointments.
DW is used to answer:
- In a month, what is the total value of
medicines prescribed by a Doctor?
- What is our company share
- Is he missing any info from us.
www.netpeach.com
Data Warehouse design approaches
Kimball - Let everybody build what they want when
they want it, we'll integrate it all when and if we
need to. (BOTTOM-UP APPROACH)
Pros: fast to build, quick ROI, nimble
Cons: harder to maintain as an enterprise
resource, often redundant, often difficult to
integrate data marts
Inmon - Don't do anything until you've designed
everything. (TOP-DOWN APPROACH)
Pros: easy to maintain, tightly integrated
Cons: takes way too long to deliver first projects, rigid
www.netpeach.com
Dimensional data modeling
• Dimensional data modeling is
– A logical design technique
that seeks to
– present the data in a standard frame work
that is
– intuitive and allows high-performance access.
• A data model specifically for designing data
warehouses
• The method was developed based on observations of
practice, and in particular, providing data in “user-
friendly” form.
www.netpeach.com
From ER Models to Dimensional Models
A typical process in an enterprise
www.netpeach.com
Sample OLTP data model
www.netpeach.com
Step 1. Classify Entities
Transaction Entities
- An event happened at a point of time
- contains measurements or quantities
Component Entities :
- directly related to a transaction entity
- Component entities answer questions like “who”, “what”, “when”, “where”,
“how” and “why” of a business event.
In a sales application transaction entities are:
Customer: who made the purchase
Product: what was sold
Location: where it was sold
Period: when it was sold
Classification Entities:
- related to component entities by a chain of one-to-many relationships
- represent hierarchies embedded in the data model
www.netpeach.com
Step 2. Identify Hierarchies
• A hierarchy in an Entity Relationship model is any
sequence of entities joined together by one-to-many
relationships, all aligned in the same direction.
www.netpeach.com
Step 3. Produce Dimensional Models
Operators For Producing Dimensional Models
Operator 1: Collapse Hierarchy
Operator 2: Aggregation
There is a wide range of options for producing dimensional
models from an Entity Relationship model.
These include:
Star Schema
Snowflake Schema
Constellation / Integrated Schema
www.netpeach.com
Star Schema
• A fact table is formed for each transaction entity. The
key of the table is the combination of the keys of its
associated component entities.
• A dimension table is formed for each component
entity, by collapsing hierarchically related
classification entities into it.
A star schema consists of one large central table
called the fact table, and a number of smaller tables
called dimension tables which radiate out from the
central table
www.netpeach.com
Sample Star Schema
www.netpeach.com
Snowflake Schema
A snowflake schema is a star schema with all
hierarchies explicitly shown.
www.netpeach.com
Star vs. Snowflake
Star Schema Snowflake
Ease of
maintenance/change:
Has redundant data and hence less easy
to maintain/change
No redundancy and hence more easy to
maintain and change
Ease of Use:
Less complex queries and easy to
understand
More complex queries and hence less
easy to understand
Query Performance:
Less no. of foreign keys and hence lesser
query execution time
More foreign keys-and hence more
query execution time
Space:
Has de-normalized tables hence takes
more space.
Has normalized tables hence takes less
space.
Good for:
Good for data marts with simple
relationships (1:1 or 1:many)
Good to use for data warehouse core to
simplify complex relationships (many :
many)
When to use:
When a dimension hierarchy contains
more levels it is a good practice to use
Star schema as it requires few joins and
improves performance.
When a dimension hierarchy contains
fewer levels and is data volume is
relatively big in size, snowflake is better
as it reduces space and joins.
Star schema does not support many to many relationship between attributes in a
dimension as each dimension is de-normalized into a single table.
www.netpeach.com
Fact constellation schema
The fact constellation architecture contains multiple
fact tables that share many dimension tables
www.netpeach.com
Constellation /Integrated Schema
A constellation schema consists of a set
of star schemas with hierarchically linked fact tables.
www.netpeach.com
Step 4. Evolution and Refinement
• Check if we can Combine any Fact Tables
• Check if we can Combine any Dimension
Tables
• Handling Subtypes
www.netpeach.com
Conclusion
ETL tools
- Informatica
- Data junction
- Data stage
- Ab initio
- SSIS
- Oracle Warehouse
Builder.
- Pentaho
- Talend
- …
OLAP tools
- Business Objects
- Cognos Powerplay
- MicroStrategy
- Hyperion Essbase
- SSAS
- SSRS
- Oracle Express
- Oracle OLAP option
- Tableau
- …
Databases
- Teradata
- Natezza
- Oracle
- SQL Server
- DB2
- SAP Hana
- …
Below are few most popular tools:
www.netpeach.com
Questions & Answers
For any questions or additional information contact nyerram@netpeach.com

Contenu connexe

Tendances

From Backtesting to Live Trading by Vesna Straser at QuantCon 2016
From Backtesting to Live Trading by Vesna Straser at QuantCon 2016From Backtesting to Live Trading by Vesna Straser at QuantCon 2016
From Backtesting to Live Trading by Vesna Straser at QuantCon 2016Quantopian
 
"A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ...
"A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ..."A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ...
"A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ...Quantopian
 
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an..."Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...Quantopian
 
A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016
A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016
A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016Quantopian
 
Quant insti webinar on algorithmic trading for technocrats!
Quant insti webinar on algorithmic trading for technocrats!Quant insti webinar on algorithmic trading for technocrats!
Quant insti webinar on algorithmic trading for technocrats!QuantInsti
 
Report on HIGH FREQUENCY TRADING
Report on HIGH FREQUENCY TRADINGReport on HIGH FREQUENCY TRADING
Report on HIGH FREQUENCY TRADINGVarsha Chauhan
 
"Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ...
"Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ..."Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ...
"Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ...Quantopian
 
Technology Edge in Algo Trading: Traditional Vs Automated Trading System Arch...
Technology Edge in Algo Trading: Traditional Vs Automated Trading System Arch...Technology Edge in Algo Trading: Traditional Vs Automated Trading System Arch...
Technology Edge in Algo Trading: Traditional Vs Automated Trading System Arch...QuantInsti
 
"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha
"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha
"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlphaQuantopian
 
"Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas...
"Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas..."Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas...
"Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas...Quantopian
 
DIY Quant Strategies on Quantopian
DIY Quant Strategies on QuantopianDIY Quant Strategies on Quantopian
DIY Quant Strategies on QuantopianJess Stauth
 
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...Quantopian
 
Technical analysis (2)
Technical analysis (2)Technical analysis (2)
Technical analysis (2)Anand Verma
 
Market Timing, Big Data, and Machine Learning by Xiao Qiao at QuantCon 2016
Market Timing, Big Data, and Machine Learning by Xiao Qiao at QuantCon 2016Market Timing, Big Data, and Machine Learning by Xiao Qiao at QuantCon 2016
Market Timing, Big Data, and Machine Learning by Xiao Qiao at QuantCon 2016Quantopian
 
Risk Management: Maximising Long-Term Growth Presentation
Risk Management: Maximising Long-Term Growth PresentationRisk Management: Maximising Long-Term Growth Presentation
Risk Management: Maximising Long-Term Growth PresentationQuantInsti
 
10 Ways Backtests Lie by Tucker Balch
10 Ways Backtests Lie by Tucker Balch10 Ways Backtests Lie by Tucker Balch
10 Ways Backtests Lie by Tucker BalchQuantopian
 
Dual Momentum Investing by Gary Antonacci QuantCon 2016
Dual Momentum Investing by Gary Antonacci QuantCon 2016Dual Momentum Investing by Gary Antonacci QuantCon 2016
Dual Momentum Investing by Gary Antonacci QuantCon 2016Quantopian
 
Forex lesson 1 forex overview
Forex lesson 1   forex overviewForex lesson 1   forex overview
Forex lesson 1 forex overviewpipsumo traderfx
 
Beware of Low Frequency Data by Ernie Chan, Managing Member, QTS Capital Mana...
Beware of Low Frequency Data by Ernie Chan, Managing Member, QTS Capital Mana...Beware of Low Frequency Data by Ernie Chan, Managing Member, QTS Capital Mana...
Beware of Low Frequency Data by Ernie Chan, Managing Member, QTS Capital Mana...Quantopian
 

Tendances (20)

From Backtesting to Live Trading by Vesna Straser at QuantCon 2016
From Backtesting to Live Trading by Vesna Straser at QuantCon 2016From Backtesting to Live Trading by Vesna Straser at QuantCon 2016
From Backtesting to Live Trading by Vesna Straser at QuantCon 2016
 
"A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ...
"A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ..."A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ...
"A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ...
 
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an..."Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
 
A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016
A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016
A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016
 
Quant insti webinar on algorithmic trading for technocrats!
Quant insti webinar on algorithmic trading for technocrats!Quant insti webinar on algorithmic trading for technocrats!
Quant insti webinar on algorithmic trading for technocrats!
 
Report on HIGH FREQUENCY TRADING
Report on HIGH FREQUENCY TRADINGReport on HIGH FREQUENCY TRADING
Report on HIGH FREQUENCY TRADING
 
"Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ...
"Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ..."Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ...
"Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ...
 
Technology Edge in Algo Trading: Traditional Vs Automated Trading System Arch...
Technology Edge in Algo Trading: Traditional Vs Automated Trading System Arch...Technology Edge in Algo Trading: Traditional Vs Automated Trading System Arch...
Technology Edge in Algo Trading: Traditional Vs Automated Trading System Arch...
 
"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha
"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha
"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha
 
"Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas...
"Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas..."Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas...
"Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas...
 
Quantitative Trading
Quantitative TradingQuantitative Trading
Quantitative Trading
 
DIY Quant Strategies on Quantopian
DIY Quant Strategies on QuantopianDIY Quant Strategies on Quantopian
DIY Quant Strategies on Quantopian
 
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
 
Technical analysis (2)
Technical analysis (2)Technical analysis (2)
Technical analysis (2)
 
Market Timing, Big Data, and Machine Learning by Xiao Qiao at QuantCon 2016
Market Timing, Big Data, and Machine Learning by Xiao Qiao at QuantCon 2016Market Timing, Big Data, and Machine Learning by Xiao Qiao at QuantCon 2016
Market Timing, Big Data, and Machine Learning by Xiao Qiao at QuantCon 2016
 
Risk Management: Maximising Long-Term Growth Presentation
Risk Management: Maximising Long-Term Growth PresentationRisk Management: Maximising Long-Term Growth Presentation
Risk Management: Maximising Long-Term Growth Presentation
 
10 Ways Backtests Lie by Tucker Balch
10 Ways Backtests Lie by Tucker Balch10 Ways Backtests Lie by Tucker Balch
10 Ways Backtests Lie by Tucker Balch
 
Dual Momentum Investing by Gary Antonacci QuantCon 2016
Dual Momentum Investing by Gary Antonacci QuantCon 2016Dual Momentum Investing by Gary Antonacci QuantCon 2016
Dual Momentum Investing by Gary Antonacci QuantCon 2016
 
Forex lesson 1 forex overview
Forex lesson 1   forex overviewForex lesson 1   forex overview
Forex lesson 1 forex overview
 
Beware of Low Frequency Data by Ernie Chan, Managing Member, QTS Capital Mana...
Beware of Low Frequency Data by Ernie Chan, Managing Member, QTS Capital Mana...Beware of Low Frequency Data by Ernie Chan, Managing Member, QTS Capital Mana...
Beware of Low Frequency Data by Ernie Chan, Managing Member, QTS Capital Mana...
 

En vedette

Confluence vs sharepoint compared
Confluence vs sharepoint comparedConfluence vs sharepoint compared
Confluence vs sharepoint comparedNagaraj Yerram
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEZalpa Rathod
 
ISTI 2014 conference non traditional bi
ISTI 2014  conference non traditional biISTI 2014  conference non traditional bi
ISTI 2014 conference non traditional biAlberici Andrea
 
The Developer vs. The Relational Database: How to do it right
The Developer vs. The Relational Database: How to do it rightThe Developer vs. The Relational Database: How to do it right
The Developer vs. The Relational Database: How to do it rightGogula Aryalingam
 
Data Business Model 2017-2019
Data Business Model 2017-2019Data Business Model 2017-2019
Data Business Model 2017-2019Luciano Gregoris
 
Designing and developing Business Process dimensional Model or Data Warehouse
Designing and developing  Business Process dimensional Model  or Data WarehouseDesigning and developing  Business Process dimensional Model  or Data Warehouse
Designing and developing Business Process dimensional Model or Data WarehouseSlava Kokaev
 
From big data overload to business impact
From big data overload to business impactFrom big data overload to business impact
From big data overload to business impactMiguel Garcia
 
SAP EM data model
SAP EM data modelSAP EM data model
SAP EM data modelQ Data USA
 
Data is not a business model moving knowledge to action presentation
Data is not a business model  moving knowledge to action presentationData is not a business model  moving knowledge to action presentation
Data is not a business model moving knowledge to action presentationJennifer van der Meer
 
Data for Business Journalism, NICAR 2012
Data for Business Journalism, NICAR 2012Data for Business Journalism, NICAR 2012
Data for Business Journalism, NICAR 2012Chris Taggart
 
The Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesThe Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesBen Siscovick
 
Trends in Big Data & Business Challenges
Trends in Big Data & Business Challenges   Trends in Big Data & Business Challenges
Trends in Big Data & Business Challenges Experian_US
 
Bônus vip jovanir adriano
Bônus vip jovanir adrianoBônus vip jovanir adriano
Bônus vip jovanir adrianoEvandro Zef
 

En vedette (20)

Confluence vs sharepoint compared
Confluence vs sharepoint comparedConfluence vs sharepoint compared
Confluence vs sharepoint compared
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
OLAP
OLAPOLAP
OLAP
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
SharePoint Alternatives
SharePoint AlternativesSharePoint Alternatives
SharePoint Alternatives
 
ISTI 2014 conference non traditional bi
ISTI 2014  conference non traditional biISTI 2014  conference non traditional bi
ISTI 2014 conference non traditional bi
 
The Developer vs. The Relational Database: How to do it right
The Developer vs. The Relational Database: How to do it rightThe Developer vs. The Relational Database: How to do it right
The Developer vs. The Relational Database: How to do it right
 
Data Business Model 2017-2019
Data Business Model 2017-2019Data Business Model 2017-2019
Data Business Model 2017-2019
 
Designing and developing Business Process dimensional Model or Data Warehouse
Designing and developing  Business Process dimensional Model  or Data WarehouseDesigning and developing  Business Process dimensional Model  or Data Warehouse
Designing and developing Business Process dimensional Model or Data Warehouse
 
From big data overload to business impact
From big data overload to business impactFrom big data overload to business impact
From big data overload to business impact
 
BI Presentation
BI PresentationBI Presentation
BI Presentation
 
3dw
3dw3dw
3dw
 
Issue in Data warehousing and OLAP in E-business
Issue in Data warehousing and OLAP in E-businessIssue in Data warehousing and OLAP in E-business
Issue in Data warehousing and OLAP in E-business
 
SAP EM data model
SAP EM data modelSAP EM data model
SAP EM data model
 
Data is not a business model moving knowledge to action presentation
Data is not a business model  moving knowledge to action presentationData is not a business model  moving knowledge to action presentation
Data is not a business model moving knowledge to action presentation
 
Cognos powerplay
Cognos powerplayCognos powerplay
Cognos powerplay
 
Data for Business Journalism, NICAR 2012
Data for Business Journalism, NICAR 2012Data for Business Journalism, NICAR 2012
Data for Business Journalism, NICAR 2012
 
The Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesThe Business of Big Data - IA Ventures
The Business of Big Data - IA Ventures
 
Trends in Big Data & Business Challenges
Trends in Big Data & Business Challenges   Trends in Big Data & Business Challenges
Trends in Big Data & Business Challenges
 
Bônus vip jovanir adriano
Bônus vip jovanir adrianoBônus vip jovanir adriano
Bônus vip jovanir adriano
 

Similaire à Traditional Data-warehousing / BI overview

Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingwork
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing conceptspcherukumalla
 
Kushal Data Warehousing PPT
Kushal Data Warehousing PPTKushal Data Warehousing PPT
Kushal Data Warehousing PPTKushal Singh
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?RTTS
 
presentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxpresentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxvipush1
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Conceptsraulmisir
 
Gulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And MiningGulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And Mininggulab sharma
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSINGKing Julian
 
Data warehouse
Data warehouseData warehouse
Data warehouseMR Z
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Miningcpjcollege
 
Oracle Hyperion overview
Oracle Hyperion overviewOracle Hyperion overview
Oracle Hyperion overviewClick4learning
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data WarehouseShanthi Mukkavilli
 
DWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxDWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxSalehaMariyam
 

Similaire à Traditional Data-warehousing / BI overview (20)

Msbi by quontra us
Msbi by quontra usMsbi by quontra us
Msbi by quontra us
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
 
Kushal Data Warehousing PPT
Kushal Data Warehousing PPTKushal Data Warehousing PPT
Kushal Data Warehousing PPT
 
Datawarehouse and OLAP
Datawarehouse and OLAPDatawarehouse and OLAP
Datawarehouse and OLAP
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
CTP Data Warehouse
CTP Data WarehouseCTP Data Warehouse
CTP Data Warehouse
 
presentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxpresentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptx
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Concepts
 
Gulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And MiningGulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And Mining
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Chapter 2
Chapter 2Chapter 2
Chapter 2
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Mining
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
Oracle Hyperion overview
Oracle Hyperion overviewOracle Hyperion overview
Oracle Hyperion overview
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
DWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxDWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptx
 
OLAP
OLAPOLAP
OLAP
 

Dernier

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 

Dernier (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

Traditional Data-warehousing / BI overview

  • 2. www.netpeach.com Business Intelligence Overview - Definition - Architecture - Source systems /OLTP - ETL process - Data Warehouses /OLAP - OLTP vs. OLAP - ODS and Data Marts - Data Warehouse Design Approaches - Dimensional Modeling - From Enterprise models to Dimensional models - Schema Types: Star, Snowflake, Fact Constellation - Conclusion
  • 3. www.netpeach.com Definition The term business intelligence (BI) refers to technologies, applications and practices for the collection, integration, analysis, and presentation of business information. The purpose of business intelligence is to support better business decision making. BI systems provide historical, current, and predictive views of business operations, most often using data that has been gathered into a data warehouse or a data mart and occasionally working from operational data.
  • 4. www.netpeach.com BI enables enterprises to - Measure performance and trends - Use analytic information strategically - Unlock the value of its information - Identify opportunities - Improve efficiency - Perform competitive analysis.. - Find the Cause - Data Mining - Etc.
  • 5. www.netpeach.com Examples • Cause & predictive analysis: Credit cart annual fee • Performance and trends: Region total sales / our sales • Competitive: Our sales / competitor sales in a particular region or a location, etc, • Right timing: Bank customer accounts (pattern changes) • Data mining: market basket analysis
  • 6. www.netpeach.com Architecture ETL Extract Clean Transform Integrate Loading Analysis Services Reporting Services OLAP Services Dashboards / Scorecards Alerts / Notifications EDW DM ODS ODS DM Etc. Staging /ETL Source Systems Target Systems BI Data Presentation
  • 7. www.netpeach.com Architecture cont… Typical BI architecture has the following components: • A source system, also called Operational system—typically an online transaction processing (OLTP) system, but other systems or files that capture or hold data of interest are also possible. • An extraction, transformation, and loading (ETL) process. • A data warehouse—typically an online analytical processing (OLAP) system. • A business intelligence platform such as Microstrategy.
  • 8. www.netpeach.com Source Systems (OLTP) An operational system is a term used in data warehousing to refer to a system that is used to process the day-to-day transactions of an organization. These systems are designed so processing of day-to-day transactions is performed efficiently and the integrity of the transactional data is preserved. Sometimes operational systems are referred to as operational databases, transaction processing systems, or on-line transaction processing systems (OLTP). In OLTP — online transaction processing systems relational database design use the discipline of data modeling and generally follow the Codd rules of data normalization in order to ensure absolute data integrity
  • 9. www.netpeach.com Source Systems examples - Account transactions in a Bank - Sales transactions in a Retail outlet. - Inventory management transactions in a warehouse - Workforce management transactions such as attendance, vacations, overtime tracking, etc. - Operational expenditure systems - External sources such as industry information like elasticity or demand of a product from a third part sources in Retail domain. - Etc.
  • 10. www.netpeach.com ETL – Extraction, Transformation and Loading The Extraction, Transformation, and Loading (ETL) process represents all the steps necessary to move data from different source systems to an integrated data warehouse. The ETL process involves the following steps: - Data is gathered from various source systems. - The data is transformed and prepared to be loaded into the data warehouse. Transformation procedures can include converting data types and names, eliminating unwanted data, correcting typographical errors, aggregating data, filling in incomplete data, and similar processes to standardize the format and structure of data. - The data is loaded into the data warehouse.
  • 11. www.netpeach.com Data Warehouse / Data Mart (OLAP) A Data Warehouse, in its simplest perception, is no more than a collection of the key pieces of information used to manage and direct the business for the most profitable outcome. - According to Bill Inmon, “a data warehouse is a  subject-oriented,  integrated,  nonvolatile,  time-variant collection of data in support of management decisions”. - Ralph Kimball states that a data warehouse is “ a copy of transaction data specifically structured for Query and Analysis”.
  • 12. www.netpeach.com OLAP OLAP: a category of software tools that provides analysis of data stored in a database. OLAP tools enable users to analyze different dimensions of multidimensional data. For example, it provides time series and trend analysis views. OLAP often is used in data mining.
  • 13. www.netpeach.com OLAP Analysis Imagine an organization that manufactures and sells goods in several states of USA During the OLAP analysis, the top executives may seek answers for the following: - Number of products manufactured. - Number of products manufactured in a location - Number of products manufactured on time basis within a location. - Number of products manufactured in the current year when compared to the previous year. - Sales Dollar value for a particular product. - Sales Dollar value for a product in a location. - Sales Dollar value for a product in a year within a location. - Sales Dollar value for a product in a year within a location sold or serviced by an employee.
  • 14. www.netpeach.com OLTP / OLAP OLTP FEATURE OLAP Transactional applications using a Front-end, - data capture, modify, delete - No direct DB access PURPOSE Analysis purpose - Analyse Data - Read only - Some times direct access to DB Operational administrative staff, Data Entry operator, database professional, etc. TYPE OF USERS Manager, analyst, executive, executive management Relational Data Structures DATA STRUCTURES Multidimensional Data Structures Normalized DBMS DUPLICATED DATA De-Normalized & Normalized DBMS Many NUMBER OF USERS Few Predefined operations WORKLOAD AD-HOC queries , Predefined reports Volatile DATA MODIFICATIONS Update on a regular basis Small volume (Current Data) DATA Volume Large Volume (Historical Data) Availability Must be high Response time must be good
  • 15. www.netpeach.com DW related: ODS and Data Marts ODS (Operational Data Store) - This has a broad enterprise wide scope, but unlike the real enterprise data warehouse, data is refreshed in near real time and used for routine business activity. Data Mart – is a subset of data warehouse and it supports a particular region, business unit or business function
  • 16. www.netpeach.com Typical positioning of an ODS Application 1 Application 2 Application n ETL Source Applications ODS EDW Data Mart Data Mart Data Mart
  • 17. www.netpeach.com ODS vs DW ODS DW It is designed to support operational monitoring. It is designed to support Decision Making Process. Data is volatile Non-Volatile Current Data Historical Data Designed for running the business Designed for Analyzing the business Follows Normalization Follows de-normalization Designed using E/R Modeling Using Dimensional Modeling
  • 18. www.netpeach.com ODS and DW use case In a pharmaceutical company Customer ODS is used for: - sending new product details, - promotional activities, - and scheduling appointments. DW is used to answer: - In a month, what is the total value of medicines prescribed by a Doctor? - What is our company share - Is he missing any info from us.
  • 19. www.netpeach.com Data Warehouse design approaches Kimball - Let everybody build what they want when they want it, we'll integrate it all when and if we need to. (BOTTOM-UP APPROACH) Pros: fast to build, quick ROI, nimble Cons: harder to maintain as an enterprise resource, often redundant, often difficult to integrate data marts Inmon - Don't do anything until you've designed everything. (TOP-DOWN APPROACH) Pros: easy to maintain, tightly integrated Cons: takes way too long to deliver first projects, rigid
  • 20. www.netpeach.com Dimensional data modeling • Dimensional data modeling is – A logical design technique that seeks to – present the data in a standard frame work that is – intuitive and allows high-performance access. • A data model specifically for designing data warehouses • The method was developed based on observations of practice, and in particular, providing data in “user- friendly” form.
  • 21. www.netpeach.com From ER Models to Dimensional Models A typical process in an enterprise
  • 23. www.netpeach.com Step 1. Classify Entities Transaction Entities - An event happened at a point of time - contains measurements or quantities Component Entities : - directly related to a transaction entity - Component entities answer questions like “who”, “what”, “when”, “where”, “how” and “why” of a business event. In a sales application transaction entities are: Customer: who made the purchase Product: what was sold Location: where it was sold Period: when it was sold Classification Entities: - related to component entities by a chain of one-to-many relationships - represent hierarchies embedded in the data model
  • 24. www.netpeach.com Step 2. Identify Hierarchies • A hierarchy in an Entity Relationship model is any sequence of entities joined together by one-to-many relationships, all aligned in the same direction.
  • 25. www.netpeach.com Step 3. Produce Dimensional Models Operators For Producing Dimensional Models Operator 1: Collapse Hierarchy Operator 2: Aggregation There is a wide range of options for producing dimensional models from an Entity Relationship model. These include: Star Schema Snowflake Schema Constellation / Integrated Schema
  • 26. www.netpeach.com Star Schema • A fact table is formed for each transaction entity. The key of the table is the combination of the keys of its associated component entities. • A dimension table is formed for each component entity, by collapsing hierarchically related classification entities into it. A star schema consists of one large central table called the fact table, and a number of smaller tables called dimension tables which radiate out from the central table
  • 28. www.netpeach.com Snowflake Schema A snowflake schema is a star schema with all hierarchies explicitly shown.
  • 29. www.netpeach.com Star vs. Snowflake Star Schema Snowflake Ease of maintenance/change: Has redundant data and hence less easy to maintain/change No redundancy and hence more easy to maintain and change Ease of Use: Less complex queries and easy to understand More complex queries and hence less easy to understand Query Performance: Less no. of foreign keys and hence lesser query execution time More foreign keys-and hence more query execution time Space: Has de-normalized tables hence takes more space. Has normalized tables hence takes less space. Good for: Good for data marts with simple relationships (1:1 or 1:many) Good to use for data warehouse core to simplify complex relationships (many : many) When to use: When a dimension hierarchy contains more levels it is a good practice to use Star schema as it requires few joins and improves performance. When a dimension hierarchy contains fewer levels and is data volume is relatively big in size, snowflake is better as it reduces space and joins. Star schema does not support many to many relationship between attributes in a dimension as each dimension is de-normalized into a single table.
  • 30. www.netpeach.com Fact constellation schema The fact constellation architecture contains multiple fact tables that share many dimension tables
  • 31. www.netpeach.com Constellation /Integrated Schema A constellation schema consists of a set of star schemas with hierarchically linked fact tables.
  • 32. www.netpeach.com Step 4. Evolution and Refinement • Check if we can Combine any Fact Tables • Check if we can Combine any Dimension Tables • Handling Subtypes
  • 33. www.netpeach.com Conclusion ETL tools - Informatica - Data junction - Data stage - Ab initio - SSIS - Oracle Warehouse Builder. - Pentaho - Talend - … OLAP tools - Business Objects - Cognos Powerplay - MicroStrategy - Hyperion Essbase - SSAS - SSRS - Oracle Express - Oracle OLAP option - Tableau - … Databases - Teradata - Natezza - Oracle - SQL Server - DB2 - SAP Hana - … Below are few most popular tools:
  • 34. www.netpeach.com Questions & Answers For any questions or additional information contact nyerram@netpeach.com