SlideShare a Scribd company logo
1 of 28
DATA CUBES Presented by: Mohammed Siddig Ahmed April , 2011 sudan university
DATA CUBES Data cube is a structure  that enable OLAP  to achieves the multidimensional functionality. The data cube is used to represent data along some measure of interest. Data Cubes are an easy way to look at the data ( allow us to look at complex data in a simple format). Although called a "cube", it can be 2-dimensional, 3-dimensional, or higher-dimensional.
DATA CUBES databases  design s is for OLTP and efficiency in data storage. data cube design is for efficiency in data retrieval (ensures report optimization). The cube is comparable to a table in a relational database.
Dimensions And Measures data cubes have categories of data called dimensions and measures. measure   represents some fact (or number) such as cost or units of service.  dimension  represents descriptive categories of data such as time or location.
Dimensions And Measures
Data Cubes Concepts  Three important concepts associated with data cubes : Slicing.  Dicing. Rotating.
Slicing the term slice most often refers to a two- dimensional page selected from the cube. subset of a multidimensional array corresponding to a single value for one or more members of the dimensions not in the subset.
Slicing Slicing-Wireless Mouse
Slicing Slicing-Asia
Dicing A related operation to slicing . in the case of dicing, we define a subcube of the original space. Dicing provides you the smallest available slice.
Dicing
Rotating Some times called pivoting. Rotating changes the dimensional orientation of the report from the cube data. For example … rotating may consist of swapping the rows and columns, or moving one of the row dimensions into the column dimension or swapping an off-spreadsheet dimension with one of the dimensions in the page display
Rotating
Dimensions represents descriptive categories of data such as time or location. Each dimension includes different levels of categories.
Dimensions
Categories is an item that matches a specific description or classification such as years in a time dimension. Categories can be at different levels of information within a dimension.
Categories parent category is the next higher level of another category in a drill-up path. child category is the next lower level category in a drill-down path.
Categories
Categories
measures The measures are the actual data values that occupy the cells as defined by the dimensions selected. Measures include facts or variables typically stored as numerical fields.
measures
Computed versus Stored Data Cubes The goal is to retrieve the information from the data cube in the most efficient way possible. Three possible solutions are:  Pre-compute all cells in the cube. Pre-compute no cells. Pre-compute some of the cells.
Computed versus Stored Data Cubes If the whole cube is pre-computed Advantage the queries run on the cube will be very fast. Disadvantage pre-computed cube requires a lot of memory.
Computed versus Stored Data Cubes To minimize memory requirements, we can pre-compute none of the cells in the cube. But thequeries on the cube will run more slowly. As a compromise between these two, we can pre-compute only those cells in the cube which will most likely be used for decision support queries.
representation of Totals A simple data cube does not contain totals. The storage of totals increases the size of the data cube but can also decrease the time to make total-based queries. A simple way to represent totals is to add an additional layer on n sides of the n-dimensional data cube.
representation of Totals
REFRENCES http://training.inet.com Data Mining Concepts and techniques , 2E. to NASA’s Data Cube Training presentation . http://projects.cs.dal.ca/panda/datacube.html
The end

More Related Content

What's hot

Data Warehouse Modeling
Data Warehouse ModelingData Warehouse Modeling
Data Warehouse Modeling
vivekjv
 
Multidimentional data model
Multidimentional data modelMultidimentional data model
Multidimentional data model
jagdish_93
 

What's hot (20)

Data Preprocessing
Data PreprocessingData Preprocessing
Data Preprocessing
 
OLAP operations
OLAP operationsOLAP operations
OLAP operations
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
 
Data Mining: Concepts and Techniques (3rd ed.) - Chapter 3 preprocessing
Data Mining:  Concepts and Techniques (3rd ed.)- Chapter 3 preprocessingData Mining:  Concepts and Techniques (3rd ed.)- Chapter 3 preprocessing
Data Mining: Concepts and Techniques (3rd ed.) - Chapter 3 preprocessing
 
Data cube
Data cubeData cube
Data cube
 
Dynamic Itemset Counting
Dynamic Itemset CountingDynamic Itemset Counting
Dynamic Itemset Counting
 
Aggregate fact tables
Aggregate fact tablesAggregate fact tables
Aggregate fact tables
 
Data Warehouse Modeling
Data Warehouse ModelingData Warehouse Modeling
Data Warehouse Modeling
 
Multidimentional data model
Multidimentional data modelMultidimentional data model
Multidimentional data model
 
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALADATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
 
Online analytical processing
Online analytical processingOnline analytical processing
Online analytical processing
 
3. mining frequent patterns
3. mining frequent patterns3. mining frequent patterns
3. mining frequent patterns
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
2.2 decision tree
2.2 decision tree2.2 decision tree
2.2 decision tree
 
Data preprocessing in Data Mining
Data preprocessing in Data MiningData preprocessing in Data Mining
Data preprocessing in Data Mining
 
Association rule mining
Association rule miningAssociation rule mining
Association rule mining
 
Kdd process
Kdd processKdd process
Kdd process
 
5.5 graph mining
5.5 graph mining5.5 graph mining
5.5 graph mining
 
Dbms relational model
Dbms relational modelDbms relational model
Dbms relational model
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 

Viewers also liked (6)

OLAP Cubes: Basic operations
OLAP Cubes: Basic operationsOLAP Cubes: Basic operations
OLAP Cubes: Basic operations
 
MS SQL SERVER: Olap cubes and data mining
MS SQL SERVER: Olap cubes and data miningMS SQL SERVER: Olap cubes and data mining
MS SQL SERVER: Olap cubes and data mining
 
Olap Cube Design
Olap Cube DesignOlap Cube Design
Olap Cube Design
 
Datacube
DatacubeDatacube
Datacube
 
Data Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalizationData Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalization
 
Multi dimensional model vs (1)
Multi dimensional model vs (1)Multi dimensional model vs (1)
Multi dimensional model vs (1)
 

Similar to Data cubes

dataminingpres-150821063129-lva1-app6891 (3).pdf
dataminingpres-150821063129-lva1-app6891 (3).pdfdataminingpres-150821063129-lva1-app6891 (3).pdf
dataminingpres-150821063129-lva1-app6891 (3).pdf
AnilGupta681764
 
Essbase beginner's guide olap fundamental chapter 1
Essbase beginner's guide olap fundamental chapter 1Essbase beginner's guide olap fundamental chapter 1
Essbase beginner's guide olap fundamental chapter 1
Amit Sharma
 
Jovian DATA: A multidimensional database for the cloud
Jovian DATA: A multidimensional database for the cloudJovian DATA: A multidimensional database for the cloud
Jovian DATA: A multidimensional database for the cloud
Bharat Rane
 
Lecture 8 is for best and you should read
Lecture 8 is for best and you should readLecture 8 is for best and you should read
Lecture 8 is for best and you should read
centralcollegepkr
 
Lecture-8-The-GIS-Database-Part-1.ppt
Lecture-8-The-GIS-Database-Part-1.pptLecture-8-The-GIS-Database-Part-1.ppt
Lecture-8-The-GIS-Database-Part-1.ppt
Prabin Pandit
 

Similar to Data cubes (20)

datacub
datacubdatacub
datacub
 
dataminingpres-150821063129-lva1-app6891 (3).pdf
dataminingpres-150821063129-lva1-app6891 (3).pdfdataminingpres-150821063129-lva1-app6891 (3).pdf
dataminingpres-150821063129-lva1-app6891 (3).pdf
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data model
 
Essbase beginner's guide olap fundamental chapter 1
Essbase beginner's guide olap fundamental chapter 1Essbase beginner's guide olap fundamental chapter 1
Essbase beginner's guide olap fundamental chapter 1
 
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
 
mod 2.pdf
mod 2.pdfmod 2.pdf
mod 2.pdf
 
Comparison between cube techniques
Comparison between cube techniquesComparison between cube techniques
Comparison between cube techniques
 
My2dw
My2dwMy2dw
My2dw
 
Data warehouse physical design
Data warehouse physical designData warehouse physical design
Data warehouse physical design
 
Jovian DATA: A multidimensional database for the cloud
Jovian DATA: A multidimensional database for the cloudJovian DATA: A multidimensional database for the cloud
Jovian DATA: A multidimensional database for the cloud
 
Lecture 8 is for best and you should read
Lecture 8 is for best and you should readLecture 8 is for best and you should read
Lecture 8 is for best and you should read
 
Data .pptx
Data .pptxData .pptx
Data .pptx
 
Cassandra data modelling best practices
Cassandra data modelling best practicesCassandra data modelling best practices
Cassandra data modelling best practices
 
Lecture-8-The-GIS-Database-Part-1.ppt
Lecture-8-The-GIS-Database-Part-1.pptLecture-8-The-GIS-Database-Part-1.ppt
Lecture-8-The-GIS-Database-Part-1.ppt
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
Data Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalizationData Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalization
 
Cs437 lecture 7-8
Cs437 lecture 7-8Cs437 lecture 7-8
Cs437 lecture 7-8
 
Bigtable_Paper
Bigtable_PaperBigtable_Paper
Bigtable_Paper
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
1234
12341234
1234
 

Recently uploaded

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Recently uploaded (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

Data cubes

  • 1. DATA CUBES Presented by: Mohammed Siddig Ahmed April , 2011 sudan university
  • 2. DATA CUBES Data cube is a structure that enable OLAP to achieves the multidimensional functionality. The data cube is used to represent data along some measure of interest. Data Cubes are an easy way to look at the data ( allow us to look at complex data in a simple format). Although called a "cube", it can be 2-dimensional, 3-dimensional, or higher-dimensional.
  • 3. DATA CUBES databases design s is for OLTP and efficiency in data storage. data cube design is for efficiency in data retrieval (ensures report optimization). The cube is comparable to a table in a relational database.
  • 4. Dimensions And Measures data cubes have categories of data called dimensions and measures. measure represents some fact (or number) such as cost or units of service. dimension represents descriptive categories of data such as time or location.
  • 6. Data Cubes Concepts Three important concepts associated with data cubes : Slicing. Dicing. Rotating.
  • 7. Slicing the term slice most often refers to a two- dimensional page selected from the cube. subset of a multidimensional array corresponding to a single value for one or more members of the dimensions not in the subset.
  • 10. Dicing A related operation to slicing . in the case of dicing, we define a subcube of the original space. Dicing provides you the smallest available slice.
  • 12. Rotating Some times called pivoting. Rotating changes the dimensional orientation of the report from the cube data. For example … rotating may consist of swapping the rows and columns, or moving one of the row dimensions into the column dimension or swapping an off-spreadsheet dimension with one of the dimensions in the page display
  • 14. Dimensions represents descriptive categories of data such as time or location. Each dimension includes different levels of categories.
  • 16. Categories is an item that matches a specific description or classification such as years in a time dimension. Categories can be at different levels of information within a dimension.
  • 17. Categories parent category is the next higher level of another category in a drill-up path. child category is the next lower level category in a drill-down path.
  • 20. measures The measures are the actual data values that occupy the cells as defined by the dimensions selected. Measures include facts or variables typically stored as numerical fields.
  • 22. Computed versus Stored Data Cubes The goal is to retrieve the information from the data cube in the most efficient way possible. Three possible solutions are: Pre-compute all cells in the cube. Pre-compute no cells. Pre-compute some of the cells.
  • 23. Computed versus Stored Data Cubes If the whole cube is pre-computed Advantage the queries run on the cube will be very fast. Disadvantage pre-computed cube requires a lot of memory.
  • 24. Computed versus Stored Data Cubes To minimize memory requirements, we can pre-compute none of the cells in the cube. But thequeries on the cube will run more slowly. As a compromise between these two, we can pre-compute only those cells in the cube which will most likely be used for decision support queries.
  • 25. representation of Totals A simple data cube does not contain totals. The storage of totals increases the size of the data cube but can also decrease the time to make total-based queries. A simple way to represent totals is to add an additional layer on n sides of the n-dimensional data cube.
  • 27. REFRENCES http://training.inet.com Data Mining Concepts and techniques , 2E. to NASA’s Data Cube Training presentation . http://projects.cs.dal.ca/panda/datacube.html