Business Intelligence
History

●   The term Business Intelligence first appeared in
    1958, used by Hans Peter Luhn, an IBM researcher
●   Originally: an automatic method to provide current
    awareness services to scientists and engineers
●   Current definition: Business Intelligence is a
    combination of processes and technologies for
    gathering, storing, analyzing and providing
    access to information to help enterprise users
    make informed decisions


                                           www.robertomarchetto.com
Main concept

●   Collect data from different sources
●   Integrate and clean up data in a common, easy
    to analyze repository
●   Provide business related analysis for managers
    and decision makers
●   Focus on business, data integration, data
    presentation
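
The steps above (collect, integrate and clean, analyze) can be sketched in a few lines of Python. This is a minimal illustration, not a real BI pipeline: the two sources, their field names, and the `clean`, `integrate` and `revenue_by_customer` helpers are all hypothetical.

```python
import csv
import io
from collections import defaultdict

# Hypothetical source 1: rows exported from a CRM as CSV text.
CRM_CSV = """customer,amount
 Acme ,100.50
globex,20
"""

# Hypothetical source 2: records from an ERP system as dictionaries.
ERP_ROWS = [
    {"client": "ACME", "total": "50"},
    {"client": "Initech", "total": None},  # dirty record: missing amount
]


def clean(name, amount):
    """Normalize one record, or return None if it cannot be used."""
    if name is None or amount is None:
        return None
    return {"customer": name.strip().lower(), "amount": float(amount)}


def integrate():
    """Collect both sources into one common, easy-to-analyze list."""
    raw = [(r["customer"], r["amount"]) for r in csv.DictReader(io.StringIO(CRM_CSV))]
    raw += [(r["client"], r["total"]) for r in ERP_ROWS]
    return [rec for rec in (clean(n, a) for n, a in raw) if rec is not None]


def revenue_by_customer(repository):
    """A business-facing analysis: total amount per customer."""
    totals = defaultdict(float)
    for rec in repository:
        totals[rec["customer"]] += rec["amount"]
    return dict(totals)


repository = integrate()
report = revenue_by_customer(repository)
```

Here " Acme " from the CRM and "ACME" from the ERP end up as the same customer, and the record without an amount is dropped before analysis.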




Data warehouse

●   Bill Inmon: a collection of data in support of
    the decision-making process
    ●   End-user oriented
    ●   Collected from different sources
    ●   Time-dependent
    ●   Data is not editable
●   In theory, the term refers to a group of processes
●   In practice, it is often used to mean the database itself


OLTP: On-Line Transaction Processing

 ●   Commonly used in ERP, CRM systems and
     database applications
 ●   Focus on the transaction level (one invoice, one
     sales order, a search query, etc.)
 ●   Updates and insertions are frequent
 ●   Relational model with many tables, using
     normalization rules




OLAP: On-Line Analytical Processing

●   A system designed for analysis purposes
●   Focused on exploring the data as a whole
●   Once added, data changes much less frequently
●   Dr. Codd's 12 rules for OLAP (1993)
    ●   Multidimensional view
    ●   Intuitive data manipulation
    ●   Dimensions, Facts, Hierarchy levels, Cardinality



On-Line Analytical Processing




Relational OLAP

●   Uses relational database schemas and SQL to
    store and access OLAP cubes
●   Reuse of RDBMS technology
●   Many tools and vendors available
●   SQL can be used directly by many tools
●   Scalability
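
As a rough sketch of the relational approach, the cube can be stored as a star schema (one fact table joined to dimension tables) and queried with plain SQL. The table names, column names and sample rows below are assumptions made for illustration; SQLite from the Python standard library stands in for the RDBMS.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical star schema: one fact table referencing two dimension tables.
conn.executescript("""
CREATE TABLE dim_date  (date_id INTEGER PRIMARY KEY, year INTEGER);
CREATE TABLE dim_store (store_id INTEGER PRIMARY KEY, country TEXT, state TEXT);
CREATE TABLE fact_sales (
    date_id     INTEGER REFERENCES dim_date(date_id),
    store_id    INTEGER REFERENCES dim_store(store_id),
    store_sales REAL
);
INSERT INTO dim_date  VALUES (1, 2002), (2, 2003);
INSERT INTO dim_store VALUES (1, 'USA', 'CA'), (2, 'USA', 'WA');
INSERT INTO fact_sales VALUES (1, 1, 100.0), (1, 2, 40.0), (2, 1, 150.0), (2, 1, 25.0);
""")

# Aggregate the measure by year, restricted to one member of the store dimension.
rows = conn.execute("""
    SELECT d.year, SUM(f.store_sales)
    FROM fact_sales AS f
    JOIN dim_date  AS d ON d.date_id  = f.date_id
    JOIN dim_store AS s ON s.store_id = f.store_id
    WHERE s.country = 'USA' AND s.state = 'CA'
    GROUP BY d.year
    ORDER BY d.year
""").fetchall()
```

Every aggregate is computed at query time by the database engine, which is why this approach reuses existing RDBMS tooling and scales with it.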




Star schema




Memory OLAP, Hybrid OLAP

●   Memory OLAP (MOLAP, usually expanded as Multidimensional
    OLAP) uses optimized multidimensional arrays
●   Requires pre-computation and storage of the cube
    (processing)
●   Often performs better than ROLAP, thanks to better
    caching and multidimensional indexing
●   Compression techniques, statistical indexes
●   Less scalable than ROLAP on high data volumes,
    fewer tools and vendors available
●   Hybrid OLAP (HOLAP) is the combination of ROLAP
    and MOLAP
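
The pre-computation step can be pictured with a small dense array. This is only a sketch under simplifying assumptions: two dimensions, a hypothetical `ALL` slot at the end of each axis holding the totals, and no compression or indexing of the kind real engines use.

```python
YEARS = [2002, 2003]
STATES = ["CA", "WA"]
ALL = -1  # hypothetical convention: the last slot of each axis holds the total

# Hypothetical fact rows: (year, state, store_sales).
facts = [(2002, "CA", 100.0), (2002, "WA", 40.0), (2003, "CA", 150.0), (2003, "CA", 25.0)]


def process_cube(rows):
    """Pre-compute a dense 2-D array with an extra 'all' slot on each axis."""
    cube = [[0.0] * (len(STATES) + 1) for _ in range(len(YEARS) + 1)]
    for year, state, sales in rows:
        y, s = YEARS.index(year), STATES.index(state)
        cube[y][s] += sales      # detail cell
        cube[y][ALL] += sales    # total for the year
        cube[ALL][s] += sales    # total for the state
        cube[ALL][ALL] += sales  # grand total
    return cube


cube = process_cube(facts)


def lookup(year=None, state=None):
    """Answer a query by array indexing only; nothing is aggregated here."""
    y = YEARS.index(year) if year is not None else ALL
    s = STATES.index(state) if state is not None else ALL
    return cube[y][s]
```

After processing, `lookup(state="CA")` or `lookup(year=2002)` is a single array access. The price is the processing step itself and the storage for every pre-computed cell.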

Slowly Changing Dimensions

●   In some Business Intelligence implementations data is
    always added and almost never modified
●   This makes it possible to go back along the timeline
●   For example, if an employee was hired in a given time
    period, you can analyze the data as it was in that
    period, counting exactly the number of employees
●   A common approach to implementing Slowly Changing
    Dimensions is to add some special fields to the
    database records, giving a time-related validity to
    each record
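
One way to picture the validity fields is the sketch below, where each dimension row carries a `valid_from` and a `valid_to` date. The field names, the sample employees and the helper functions are assumptions for illustration, not a prescribed schema.

```python
from datetime import date

# Hypothetical employee dimension: each row carries its own validity interval.
# valid_to=None means the row is still current.
employee_dim = [
    {"employee": "alice", "dept": "sales", "valid_from": date(2009, 1, 1), "valid_to": None},
    {"employee": "bob", "dept": "sales", "valid_from": date(2009, 6, 1), "valid_to": None},
]


def change_department(rows, employee, new_dept, on):
    """Close the current row and append a new one instead of overwriting it."""
    for row in rows:
        if row["employee"] == employee and row["valid_to"] is None:
            row["valid_to"] = on
    rows.append({"employee": employee, "dept": new_dept, "valid_from": on, "valid_to": None})


def headcount(rows, dept, as_of):
    """Count the employees of a department as they were on a given date."""
    return sum(
        1
        for row in rows
        if row["dept"] == dept
        and row["valid_from"] <= as_of
        and (row["valid_to"] is None or as_of < row["valid_to"])
    )


change_department(employee_dim, "bob", "marketing", date(2010, 3, 1))
```

Because the old row is closed rather than overwritten, `headcount(employee_dim, "sales", date(2009, 12, 31))` still sees both employees in sales, while a query dated after the move sees only one.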


MDX

●   Multidimensional Expressions (MDX) is a query
    language for OLAP databases
●   MDX is to OLAP databases what SQL is to OLTP
    databases
●   Powerful for computing indexes and navigating
    through OLAP dimensions
●   SELECT
    {[Measures].[Store Sales]} ON COLUMNS,
    {[Date].[2002], [Date].[2003]} ON ROWS
    FROM Sales
    WHERE ([Store].[USA].[CA])
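
To make the shape of the query concrete, the sketch below mimics in plain Python what it asks for: one measure on the columns, two years on the rows, and a slicer restricting the data to one store member. The fact rows and the `evaluate` helper are hypothetical; a real OLAP engine resolves members and hierarchies itself.

```python
# Hypothetical fact rows: (year, country, state, store_sales).
FACTS = [
    (2002, "USA", "CA", 100.0),
    (2002, "USA", "WA", 40.0),
    (2003, "USA", "CA", 150.0),
    (2003, "USA", "CA", 25.0),
    (2004, "USA", "CA", 60.0),
]


def evaluate(facts, row_years, slicer):
    """Return {year: {"Store Sales": total}} for the sliced facts."""
    grid = {year: {"Store Sales": 0.0} for year in row_years}  # ROWS x COLUMNS
    for year, country, state, sales in facts:
        # The WHERE clause acts as a slicer; the ROWS axis keeps only listed years.
        if (country, state) == slicer and year in grid:
            grid[year]["Store Sales"] += sales
    return grid


result = evaluate(FACTS, row_years=[2002, 2003], slicer=("USA", "CA"))
```

The result is a small grid rather than a flat row set, which is the main difference in outlook between an MDX query and a SQL query.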

Features for a BI platform
●   Data storage, data management
●   Data integration, process scheduling
●   Querying and reporting
●   On-Line Analytical Processing (OLAP)
●   Document management, versioning
●   Statistical computations
●   Microsoft Office or OpenOffice support
●   Ease of use and self-service document creation by
    end users (independence from developers)
Dashboards, KPIs




Geoanalysis




Data Mining

●   Requires a strong background in computational statistics




What-if analysis




Open Source offers

         ●   Reporting
         ●   OLAP
         ●   Charts
         ●   Portal containers
         ●   Data integration tools
         ●   Libraries, CMS,
             scheduler
         ●   Databases

SpagoBI (BI Suite)

         ●   Engineering
             Informatica (Italy)
         ●   Integration of
             components using
             drivers
         ●   Comprehensive
         ●   Full Open Source




Pentaho (BI Suite)

         ●   Pentaho (USA)
         ●   Acquisition instead of
             integration
         ●   Strong marketing
         ●   Commercial and
             Open Source




JasperServer (BI Suite)

            ●   JasperSoft (USA)
            ●   Famous for
                JasperReports
            ●   Easy to use
            ●   Commercial and
                Open Source




Palo (In memory OLAP)

           ●   Jedox (Germany)
           ●   Interesting technology
               (M-OLAP, GPU)
           ●   Excel and OpenOffice
               plugins
           ●   Web spreadsheet and
               reporting
           ●   Open Source and
               Commercial support

Talend (Data Integration)

             ●   Talend (France)
             ●   Named a Gartner "Cool
                 Vendor" for Data
                 Integration
             ●   Data Integration, Data
                 Quality, Data
                 Management, ESB
             ●   Open Source and
                 Commercial support

