Big data
BIG DATA VS DATA WAREHOUSING
A LOOK AT THE VALUE AND DIFFERENCES OF DATA WAREHOUSING AND
BIG DATA
Tshegofatso Mogomotsi
The purpose of this presentation is to outline the value that Big data
and Data warehousing can each contribute to a business, and to
differentiate the two concepts and their benefits.
Tshegofatso Mogomotsi
2016
Overview
• What are Data warehousing, Big data, and Fast data?
• Big data tools
• Use case
• Summary of differences
Defining Data warehousing, Big data and Fast data in
business
• Data warehousing
Data warehouses are usually used to correlate broad business data from various data sources to provide greater
insight into the performance of a business. Data warehouses differ from regular operational databases in that operational
databases are optimized to maintain strict accuracy of data by rapidly updating real-time data. Data warehouses, by
contrast, are designed to give a long-range view of data over time and specialize in gathering data, which
allows for further processing such as data mining (Informatica, 2016).
• Big data
Big data refers to data sets so large or complex that traditional data processing techniques and applications are
inadequate to handle them. Challenges include analysis, storage, transfer, visualization, querying, updating, and information privacy.
The term is also often used simply to mean predictive analytics, user behavior analytics, or certain other advanced
data analytics methods that extract value from data.
• Fast data
Big data grows through a constant stream of incoming data. John Hugg, a software architect, proposes that instead
of simply storing that data to be analyzed later, we may have reached the point where it can be analyzed as it is
ingested while still maintaining extremely high intake rates.
Big data is measured not only by the volume of data but also by its velocity, that is, the volume arriving over time. Velocity
represents working data, immediate status, or data with an ongoing purpose. The best way to capture the value of
incoming data is to react to it the instant it arrives. If you process incoming data in batches, you have already lost
time and, thus, some of the value of the active data.
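To make the contrast with batch processing concrete, here is a minimal Python sketch of reacting to each event as it is ingested. It is an illustration only, not taken from the presentation or from any particular fast data product: the class `SlidingWindowCounter`, the helper `first_alert_time`, the five-second window, the threshold of four and the sample timestamps are all hypothetical.

```python
from collections import deque


class SlidingWindowCounter:
    """Counts events seen in the last `window_seconds`, updated on every event."""

    def __init__(self, window_seconds):
        self.window_seconds = window_seconds
        self._timestamps = deque()

    def ingest(self, timestamp):
        """Record one event and return the number of events still in the window."""
        self._timestamps.append(timestamp)
        # Evict events that have fallen out of the window.
        cutoff = timestamp - self.window_seconds
        while self._timestamps and self._timestamps[0] <= cutoff:
            self._timestamps.popleft()
        return len(self._timestamps)


def first_alert_time(timestamps, window_seconds, threshold):
    """Return the timestamp at which the windowed count first reaches `threshold`."""
    counter = SlidingWindowCounter(window_seconds)
    for ts in timestamps:
        if counter.ingest(ts) >= threshold:
            return ts  # React immediately, at ingestion time.
    return None


# Hypothetical event times in seconds (for example, clicks on one product page).
events = [0.0, 1.0, 1.5, 2.0, 2.2, 9.0]
alert_at = first_alert_time(events, window_seconds=5.0, threshold=4)
print(alert_at)
```

With these sample values the condition is detected at the fourth event. A job that only processed the same events as a batch after the last one arrived would learn of it several seconds later.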
Defining Data warehousing, Big data and Fast data in
business
Deliver business value through the analysis of data
William H. Inmon described a data
warehouse as a subject-oriented,
integrated, time-variant and non-volatile collection of data that
supports management's decision-making
process.
Big data is technology capable of handling
large amounts of data, much of it stored in an
unstructured format. This data, when
captured, manipulated, and analyzed, can
help a corporation gain useful insight.
Fast data is the application of big data analytics
to smaller data sets in real time in order to
solve a problem or create business value. The
goal of fast data is to quickly gather and mine
structured and unstructured data so that action
can be taken.
Big data tools
[Diagram: Big Data at the centre, surrounded by Data storage, Data cleaning, Data mining, Data analysis and Data visualisation]
Below is a view of some of the applications/tools used for Big data
management and processing.
• Data storage and management: Cloudera, MongoDB, Oracle Database (or the Oracle NoSQL Database)
• Data cleaning tools: OpenRefine, DataCleaner
• Data mining tools (predictive analysis): Rapid Miner, IBM SPSS Modeler, Oracle Data Miner GUI
• Data analytics: Oracle R, BigML
• Data visualization: Tableau, Silk
Uses: Case study
• Company ABC is a large South African shoe manufacturing company that
also has retail stores across the African region. It manufactures various
shoe types for the whole family. ABC's annual turnover for the 2015/16
financial year was 16.6 million.
• The company is looking to increase its profit margin by 10 percent in the
2017/18 financial year and, to achieve this, it recently invested in Big
data infrastructure.
Uses: Case study
• Big data
• ABC recently recognized that there is an increasing amount of data that is
not captured in its operational databases, such as clickstream logs, social
feeds, customer support emails, location data from mobile devices and chat
transcripts. Big data systems harness these new sources of data and allow
businesses to analyze and extract business value from these large data sets.
• Example of how Big data systems can add value to ABC
Using Big data tools, the BI team identifies customers who are active on
specific marathon websites, search for information related to
marathons/running, and engage with social feeds related to
marathons/running. The team then uses the data to predict that these customers
may be running a marathon soon, and forwards products and specials on
running shoes to them.
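The selection step in this example can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the record type `CustomerActivity`, its field names, the rule of requiring at least two kinds of running-related activity and the sample customers are all hypothetical. A simple rule stands in here for whatever predictive model the BI team would actually build with its Big data tools.

```python
from dataclasses import dataclass


@dataclass
class CustomerActivity:
    """Hypothetical per-customer counts derived from clickstream and social data."""
    customer_id: str
    marathon_site_visits: int
    running_searches: int
    running_social_engagements: int


def likely_marathon_runners(activities, min_signals=2):
    """Return ids of customers showing at least `min_signals` kinds of running interest."""
    selected = []
    for a in activities:
        # Count how many of the three activity types are present at all.
        signals = sum(
            count > 0
            for count in (
                a.marathon_site_visits,
                a.running_searches,
                a.running_social_engagements,
            )
        )
        if signals >= min_signals:
            selected.append(a.customer_id)
    return selected


# Made-up sample data for illustration.
sample = [
    CustomerActivity("c001", 5, 3, 0),
    CustomerActivity("c002", 0, 1, 0),
    CustomerActivity("c003", 2, 4, 7),
]
targets = likely_marathon_runners(sample)
print(targets)
```

The resulting list of customer ids is what would be handed to marketing so that running-shoe products and specials reach only the customers who show these signals.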
Uses: Case study
• Data warehouse
• ABC’s data warehouse contains data from its company financial systems, its customer
marketing systems, its billing systems, its point-of-sale systems, and so on.
Traditionally, data warehouses source data solely from other databases. The need for a
data warehouse often becomes evident when analytic requirements start to degrade
the ongoing performance of operational databases.
• The data warehouse stores current and historical data and is used for creating analytical
reports for knowledge workers throughout the company. Examples of reports
range from annual and quarterly comparisons and trends to detailed daily sales analysis.
• The data warehouse provides the company with reliable, credible and accessible data
that everyone in the company can rely on.
• Even with a Big data initiative incorporated into ABC’s business, the data warehouse,
built upon a relational database, can continue to be the primary analytic database for
storing much of the company’s core transactional data: financial records, customer data,
point-of-sale data and so forth.
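A quarterly comparison report of the kind mentioned above could be produced with a query along these lines. This is a minimal sketch, using Python's built-in `sqlite3` module as a stand-in for a real warehouse platform. The `sales` table, its columns and the sample rows are hypothetical, and the quarters are calendar quarters rather than ABC's financial-year quarters.

```python
import sqlite3

# In-memory database standing in for the warehouse (assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (sale_date TEXT, store TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [
        ("2015-04-10", "Store A", 1200.0),
        ("2015-05-02", "Store B", 800.0),
        ("2015-08-15", "Store A", 1500.0),
        ("2016-04-20", "Store A", 1400.0),
        ("2016-05-11", "Store B", 1100.0),
    ],
)

# Total sales per calendar quarter, so the same quarter can be compared across years.
quarterly = conn.execute(
    """
    SELECT strftime('%Y', sale_date) AS year,
           (CAST(strftime('%m', sale_date) AS INTEGER) + 2) / 3 AS quarter,
           SUM(amount) AS total
    FROM sales
    GROUP BY year, quarter
    ORDER BY year, quarter
    """
).fetchall()

for year, quarter, total in quarterly:
    print(f"{year} Q{quarter}: {total:.2f}")
```

Because the warehouse keeps historical as well as current data, the same query covers both years and lets the second quarter of 2016 be read directly against the second quarter of 2015.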
Summary of differences
Big data: A Big data solution is a technology, a means to store and manage large amounts of data.
Data warehousing: Data warehousing is an architecture, a way of organizing data so that there is corporate credibility and integrity.

Big data: The scope of Big data extends beyond data found in the corporation (Web, sales, customer contact center, social media, mobile data).
Data warehousing: An enterprise’s data warehouse contains data from its enterprise databases.

Big data: Big data applies an architecture that acquires data from multiple data sources, then organizes and stores that data in a format suitable for analysis.
Data warehousing: Data warehouses do not excel at handling raw, unstructured, or complex data.

Big data: Big data is measured by volume and velocity.
Data warehousing: A data warehouse is measured by volume.

Big data: If unlocked properly, Big data can yield valuable information that leads to better decisions that, in turn, can lead to more revenue, more profitability and increased market share.
Data warehousing: A data warehouse provides a “single version of the truth” for decision making in the corporation. With a data warehouse there is an integrated, granular, historical single point of reference for data in the corporation.

Contenu connexe

Tendances

Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data miningRohit Kumar
 
Snowball Group Whitepaper - Spotlight on Big Data
Snowball Group Whitepaper - Spotlight on Big DataSnowball Group Whitepaper - Spotlight on Big Data
Snowball Group Whitepaper - Spotlight on Big DataSnowball Group
 
Big Data & Analytics perspectives in Banking
Big Data & Analytics perspectives in BankingBig Data & Analytics perspectives in Banking
Big Data & Analytics perspectives in BankingGianpaolo Zampol
 
Intro to big data and applications - day 2
Intro to big data and applications - day 2Intro to big data and applications - day 2
Intro to big data and applications - day 2Parviz Vakili
 
GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378Parag Kapile
 
Augmented analytics will push the analytics adoption
Augmented analytics will push the analytics adoptionAugmented analytics will push the analytics adoption
Augmented analytics will push the analytics adoptionPolestarsolutions
 
Tamr | MDM and the Data Unification Imperative
Tamr | MDM and the Data Unification ImperativeTamr | MDM and the Data Unification Imperative
Tamr | MDM and the Data Unification ImperativeTamr_Inc
 
Business Intelligence Overview
Business Intelligence OverviewBusiness Intelligence Overview
Business Intelligence OverviewClaudio Menozzi
 
Analytics driving innovation and efficiency in Banking
Analytics driving innovation and efficiency in BankingAnalytics driving innovation and efficiency in Banking
Analytics driving innovation and efficiency in BankingGianpaolo Zampol
 
Governing Big Data : Principles and practices
Governing Big Data : Principles and practicesGoverning Big Data : Principles and practices
Governing Big Data : Principles and practicesPiyush Malik
 
Information Technology Data Mining
Information Technology Data MiningInformation Technology Data Mining
Information Technology Data Miningsamiksha sharma
 
Tamr | Making enterprise elephants dance @ boston data festival
Tamr | Making enterprise elephants dance @ boston data festival Tamr | Making enterprise elephants dance @ boston data festival
Tamr | Making enterprise elephants dance @ boston data festival Tamr_Inc
 
What's the Big Deal About Big Data?
What's the Big Deal About Big Data?What's the Big Deal About Big Data?
What's the Big Deal About Big Data?Logi Analytics
 

Tendances (19)

Big data-analytics-ebook
Big data-analytics-ebookBig data-analytics-ebook
Big data-analytics-ebook
 
5 Big Data Use Cases for 2013
5 Big Data Use Cases for 20135 Big Data Use Cases for 2013
5 Big Data Use Cases for 2013
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
 
Big data
Big dataBig data
Big data
 
Snowball Group Whitepaper - Spotlight on Big Data
Snowball Group Whitepaper - Spotlight on Big DataSnowball Group Whitepaper - Spotlight on Big Data
Snowball Group Whitepaper - Spotlight on Big Data
 
Big Data & Analytics perspectives in Banking
Big Data & Analytics perspectives in BankingBig Data & Analytics perspectives in Banking
Big Data & Analytics perspectives in Banking
 
Intro to big data and applications - day 2
Intro to big data and applications - day 2Intro to big data and applications - day 2
Intro to big data and applications - day 2
 
GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378
 
Augmented analytics will push the analytics adoption
Augmented analytics will push the analytics adoptionAugmented analytics will push the analytics adoption
Augmented analytics will push the analytics adoption
 
Tamr | MDM and the Data Unification Imperative
Tamr | MDM and the Data Unification ImperativeTamr | MDM and the Data Unification Imperative
Tamr | MDM and the Data Unification Imperative
 
Business Intelligence Overview
Business Intelligence OverviewBusiness Intelligence Overview
Business Intelligence Overview
 
Data mining
Data miningData mining
Data mining
 
Big data
Big dataBig data
Big data
 
Analytics driving innovation and efficiency in Banking
Analytics driving innovation and efficiency in BankingAnalytics driving innovation and efficiency in Banking
Analytics driving innovation and efficiency in Banking
 
Governing Big Data : Principles and practices
Governing Big Data : Principles and practicesGoverning Big Data : Principles and practices
Governing Big Data : Principles and practices
 
Information Technology Data Mining
Information Technology Data MiningInformation Technology Data Mining
Information Technology Data Mining
 
Tamr | Making enterprise elephants dance @ boston data festival
Tamr | Making enterprise elephants dance @ boston data festival Tamr | Making enterprise elephants dance @ boston data festival
Tamr | Making enterprise elephants dance @ boston data festival
 
What's the Big Deal About Big Data?
What's the Big Deal About Big Data?What's the Big Deal About Big Data?
What's the Big Deal About Big Data?
 
Graph Database
Graph Database  Graph Database
Graph Database
 

En vedette

BlueVia SDK for .NET Overview
BlueVia SDK for .NET OverviewBlueVia SDK for .NET Overview
BlueVia SDK for .NET OverviewBlueVia
 
Inside Wijmo 5, a Large-scale JavaScript Product
Inside Wijmo 5, a Large-scale JavaScript ProductInside Wijmo 5, a Large-scale JavaScript Product
Inside Wijmo 5, a Large-scale JavaScript ProductChris Bannon
 
AngularJs Workshop SDP December 28th 2014
AngularJs Workshop SDP December 28th 2014AngularJs Workshop SDP December 28th 2014
AngularJs Workshop SDP December 28th 2014Ran Wahle
 
Joseph Inbaraj S 11+ years ALM Admin Resume
Joseph Inbaraj S 11+ years ALM Admin ResumeJoseph Inbaraj S 11+ years ALM Admin Resume
Joseph Inbaraj S 11+ years ALM Admin Resumeqtpjoseph
 
AngularJS Services
AngularJS ServicesAngularJS Services
AngularJS ServicesEyal Vardi
 
Cryptography
CryptographyCryptography
Cryptographymilanmath
 
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafDSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafTim Donohue
 
Business Plan: Photography Business
Business Plan: Photography Business Business Plan: Photography Business
Business Plan: Photography Business Moin Sarker
 
Introduction to Network Security
Introduction to Network SecurityIntroduction to Network Security
Introduction to Network SecurityComputing Cage
 

En vedette (11)

BlueVia SDK for .NET Overview
BlueVia SDK for .NET OverviewBlueVia SDK for .NET Overview
BlueVia SDK for .NET Overview
 
Inside Wijmo 5, a Large-scale JavaScript Product
Inside Wijmo 5, a Large-scale JavaScript ProductInside Wijmo 5, a Large-scale JavaScript Product
Inside Wijmo 5, a Large-scale JavaScript Product
 
AngularJs Workshop SDP December 28th 2014
AngularJs Workshop SDP December 28th 2014AngularJs Workshop SDP December 28th 2014
AngularJs Workshop SDP December 28th 2014
 
Other Newsletter Articles - Manufacturing
Other Newsletter Articles - ManufacturingOther Newsletter Articles - Manufacturing
Other Newsletter Articles - Manufacturing
 
AFR_TS_Catalog
AFR_TS_CatalogAFR_TS_Catalog
AFR_TS_Catalog
 
Joseph Inbaraj S 11+ years ALM Admin Resume
Joseph Inbaraj S 11+ years ALM Admin ResumeJoseph Inbaraj S 11+ years ALM Admin Resume
Joseph Inbaraj S 11+ years ALM Admin Resume
 
AngularJS Services
AngularJS ServicesAngularJS Services
AngularJS Services
 
Cryptography
CryptographyCryptography
Cryptography
 
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafDSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
 
Business Plan: Photography Business
Business Plan: Photography Business Business Plan: Photography Business
Business Plan: Photography Business
 
Introduction to Network Security
Introduction to Network SecurityIntroduction to Network Security
Introduction to Network Security
 

Similaire à Big data vs datawarehousing

Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analyticsThe Marketing Distillery
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Aditya205306
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analyticsThe Marketing Distillery
 
BIG DATA & BUSINESS ANALYTICS
BIG DATA & BUSINESS ANALYTICSBIG DATA & BUSINESS ANALYTICS
BIG DATA & BUSINESS ANALYTICSVikram Joshi
 
Data Warehousing as a Service (DWaaS): Faster Business Decision Making
Data Warehousing as a Service (DWaaS): Faster Business Decision MakingData Warehousing as a Service (DWaaS): Faster Business Decision Making
Data Warehousing as a Service (DWaaS): Faster Business Decision MakingKavika Roy
 
Converting Big Data To Smart Data | The Step-By-Step Guide!
Converting Big Data To Smart Data | The Step-By-Step Guide!Converting Big Data To Smart Data | The Step-By-Step Guide!
Converting Big Data To Smart Data | The Step-By-Step Guide!Kavika Roy
 
Addressing Storage Challenges to Support Business Analytics and Big Data Work...
Addressing Storage Challenges to Support Business Analytics and Big Data Work...Addressing Storage Challenges to Support Business Analytics and Big Data Work...
Addressing Storage Challenges to Support Business Analytics and Big Data Work...IBM India Smarter Computing
 
Group 2 Handling and Processing of big data (1).pptx
Group 2 Handling and Processing of big data (1).pptxGroup 2 Handling and Processing of big data (1).pptx
Group 2 Handling and Processing of big data (1).pptxNATASHABANO
 
Business Analytics and Big Data
Business Analytics and Big DataBusiness Analytics and Big Data
Business Analytics and Big DataAbhishek Kapoor
 
Data warehouse
Data warehouseData warehouse
Data warehouseMR Z
 
Big Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperBig Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperExperian
 
Is Your Company Braced Up for handling Big Data
Is Your Company Braced Up for handling Big DataIs Your Company Braced Up for handling Big Data
Is Your Company Braced Up for handling Big Datahimanshu13jun
 

Similaire à Big data vs datawarehousing (20)

Big Data at a Glance
Big Data at a GlanceBig Data at a Glance
Big Data at a Glance
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analytics
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.
 
Big Data analytics best practices
Big Data analytics best practicesBig Data analytics best practices
Big Data analytics best practices
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analytics
 
BIG DATA & BUSINESS ANALYTICS
BIG DATA & BUSINESS ANALYTICSBIG DATA & BUSINESS ANALYTICS
BIG DATA & BUSINESS ANALYTICS
 
Data Warehousing as a Service (DWaaS): Faster Business Decision Making
Data Warehousing as a Service (DWaaS): Faster Business Decision MakingData Warehousing as a Service (DWaaS): Faster Business Decision Making
Data Warehousing as a Service (DWaaS): Faster Business Decision Making
 
Converting Big Data To Smart Data | The Step-By-Step Guide!
Converting Big Data To Smart Data | The Step-By-Step Guide!Converting Big Data To Smart Data | The Step-By-Step Guide!
Converting Big Data To Smart Data | The Step-By-Step Guide!
 
Addressing Storage Challenges to Support Business Analytics and Big Data Work...
Addressing Storage Challenges to Support Business Analytics and Big Data Work...Addressing Storage Challenges to Support Business Analytics and Big Data Work...
Addressing Storage Challenges to Support Business Analytics and Big Data Work...
 
Jn2516891694
Jn2516891694Jn2516891694
Jn2516891694
 
Jn2516891694
Jn2516891694Jn2516891694
Jn2516891694
 
Group 2 Handling and Processing of big data (1).pptx
Group 2 Handling and Processing of big data (1).pptxGroup 2 Handling and Processing of big data (1).pptx
Group 2 Handling and Processing of big data (1).pptx
 
Business Analytics and Big Data
Business Analytics and Big DataBusiness Analytics and Big Data
Business Analytics and Big Data
 
Big Data
Big DataBig Data
Big Data
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Bidata
BidataBidata
Bidata
 
new.pptx
new.pptxnew.pptx
new.pptx
 
Big Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperBig Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White Paper
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
Is Your Company Braced Up for handling Big Data
Is Your Company Braced Up for handling Big DataIs Your Company Braced Up for handling Big Data
Is Your Company Braced Up for handling Big Data
 

Dernier

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Dernier (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Big data vs datawarehousing

  • 1. Big data BIG DATA VS DATA WAREHOUSING A LOOK AT THE VALUE AND DIFFERENCES OF DATA WAREHOUSING AND BIG DATA Tshegofatso Mogomotsi
  • 2. The purpose of the presentation is to outline the value that Big data and Data warehousing can contribute into a business respectively. Differentiate the two concepts and their benefits. Tshegofatso Mogomotsi 2016
  • 3. Overview  What is Data warehousing, Big data, and Fast data  Big data tools  Use Case  Summary of differences
  • 4. Defining Data warehousing, Big data and Fast data in business  Data warehousing Data warehouses are usually used to correspond broad business data from various data sources to provide greater insight into the performance of a business. Data warehouses are different from regular databases in that databases are optimized to maintain strict accuracy of data by rapidly updating real-time data. Unlike relational databases, data warehouses are designed to give a long-range view of data over time and specialize in data gathering which allows for further processed like data mining (Informatica, 2016)  Big data Big data is defined by large or complex data sets that traditional data processing techniques and applications are inadequate. Challenges include analysis, storage, transfer, visualization, querying, updating, and information privacy. The term often refers simply to the use of predictive analytics, user behavior analytics, or certain other advanced data analytics methods that extract value from data.  Fast data Big data grows through a constant stream of incoming data. John Hugg, a software architect, proposes that instead of simply storing that data to be analyzed later, perhaps we've reached the point where it can be analyzed as it's ingested while still maintaining extremely high intake rates. Big data is not only measured by volume of data, it is also measured by volume in terms of time-velocity. Velocity represents working data, immediate status, or data with ongoing purpose. The best way to capture the value of incoming data is to react to it the instant it arrives. If you are processing incoming data in batches, you've already lost time and, thus, the value of the active data.
  • 5. Defining Data warehousing, Big data and Fast data in business: delivering business value through the analysis of data. William H. Inmon described a data warehouse as a subject-oriented, integrated, time-variant, non-volatile collection of data that supports management's decision-making process. Big data is technology capable of handling large amounts of data stored in an unstructured format. This data, when captured, manipulated, and analyzed, can help a corporation gain useful insight. Fast data is the application of big data analytics to smaller data sets in real time in order to solve a problem or create business value. The goal of fast data is to quickly gather and mine structured and unstructured data so that action can be taken.
  • 6. Big data tools
      • Big data: Data storage, Data cleaning, Data mining, Data analysis, Data visualization
      • Below is a view of some of the applications/tools used for Big data management and processing:
      • Data storage and management: Cloudera, MongoDB, Oracle Database (or the Oracle NoSQL Database)
      • Data cleaning tools: OpenRefine, DataCleaner
      • Data mining tools (predictive analysis): RapidMiner, IBM SPSS Modeler, Oracle Data Miner GUI
      • Data analytics: Oracle R, BigML
      • Data visualization: Tableau, Silk
  • 7. Uses: Case study
      • Company ABC is a large South African shoe manufacturer that also has retail stores across the African region. It makes various shoe types for the whole family. ABC's annual turnover for the 2015/16 financial year was 16.6 million.
      • The company is looking to increase its profit margin by 10 percent in the 2017/18 financial year, and to achieve this it recently invested in Big data infrastructure.
  • 8. Uses: Case study, Big data
      • ABC recently recognized that an increasing amount of data is not captured in its operational databases, such as clickstream logs, social feeds, customer support emails, location data from mobile devices, and chat transcripts. Big data systems harness these new sources of data and allow businesses to analyze and extract business value from these large data sets.
      • Example of how Big data systems can add value to ABC: using Big data tools, the BI team identifies customers who are active on specific marathon websites, search for information related to marathons or running, and engage with social feeds related to marathons or running. The team then uses the data to predict that these customers may be running a marathon soon, and forwards running-shoe products and specials to them.
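The marathon example can be illustrated with a deliberately simple sketch. It is a keyword heuristic over made-up events, not the predictive model a BI team would actually build. The event tuples, the `RUNNING_KEYWORDS` set, and the `min_signals` threshold are all assumptions introduced for illustration.

```python
from collections import Counter

# Hypothetical keyword list; a real system would use a trained model instead.
RUNNING_KEYWORDS = {"marathon", "marathons", "running"}


def likely_runners(events, min_signals=2):
    """Return customers with at least `min_signals` running-related events.

    `events` is an iterable of (customer_id, source, text) tuples, where source
    might be "clickstream", "search" or "social" (illustrative values only).
    """
    signals = Counter()
    for customer_id, _source, text in events:
        words = set(text.lower().split())
        if words & RUNNING_KEYWORDS:
            signals[customer_id] += 1
    return sorted(c for c, count in signals.items() if count >= min_signals)


events = [
    ("c1", "clickstream", "viewed marathon training plan"),
    ("c1", "social", "liked post about running shoes"),
    ("c2", "email", "question about an invoice"),
    ("c3", "search", "marathon dates"),
]
print(likely_runners(events))  # ['c1']
```

Only customer `c1` shows interest across more than one source, so only `c1` would receive the running-shoe specials under this threshold.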
  • 9. Uses: Case study, Data warehouse
      • ABC's data warehouse contains data from its financial systems, its customer marketing systems, its billing systems, its point-of-sale systems, and so on. Traditionally, data warehouses source data solely from other databases. The need for a data warehouse often becomes evident when analytic requirements start to degrade the ongoing performance of operational databases.
      • The data warehouse stores current and historical data and is used for creating analytical reports for knowledge workers throughout the company. Reports could range from annual and quarterly comparisons and trends to detailed daily sales analysis.
      • The data warehouse provides reliable, believable and accessible data that everyone in the company can rely on.
      • Even with a Big data initiative incorporated into ABC's business, the data warehouse, built upon a relational database, can continue to be the primary analytic database for storing much of the company's core transactional data: financial records, customer data, point-of-sale data and so forth.
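A quarterly comparison report of the kind described above might look like the following sketch. It uses Python's built-in `sqlite3` module as a stand-in for a relational warehouse. The `sales` table, its columns, and the sample figures are hypothetical and do not come from the case study.

```python
import sqlite3


def build_demo_warehouse():
    """Create an in-memory database with a hypothetical point-of-sale fact table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (sale_date TEXT, store TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO sales VALUES (?, ?, ?)",
        [
            ("2015-01-15", "Gaborone", 100.0),
            ("2015-02-10", "Durban", 50.0),
            ("2015-04-05", "Durban", 70.0),
            ("2016-01-20", "Gaborone", 120.0),
        ],
    )
    return conn


def quarterly_sales(conn):
    """Return (year, quarter, total) rows for comparing quarters across years."""
    query = """
        SELECT strftime('%Y', sale_date) AS year,
               (CAST(strftime('%m', sale_date) AS INTEGER) + 2) / 3 AS quarter,
               SUM(amount) AS total
        FROM sales
        GROUP BY year, quarter
        ORDER BY year, quarter
    """
    return conn.execute(query).fetchall()


conn = build_demo_warehouse()
for row in quarterly_sales(conn):
    print(row)
```

Because the warehouse keeps historical rows as well as current ones, the same query gives both the trend within a year and the comparison of the same quarter across years.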
  • 10. Summary of differences (Big data compared with Data warehousing)
      • Big data: a Big data solution is a technology, a means to store and manage large amounts of data. Data warehousing: an architecture, a way of organizing data so that there is corporate credibility and integrity.
      • Big data: the scope of data goes beyond data found in the corporation (Web, sales, customer contact center, social media, mobile data). Data warehousing: an enterprise's data warehouse contains data from its enterprise databases.
      • Big data: applies an architecture that acquires data from multiple data sources, then organizes and stores that data in a suitable format for analysis. Data warehousing: data warehouses do not excel at handling raw, unstructured, or complex data.
      • Big data: measured by volume and velocity. Data warehousing: measured by volume.
      • Big data: if unlocked properly, the data can yield valuable information that leads to better decisions, which in turn can lead to more revenue, more profitability and increased market share. Data warehousing: provides a "single version of the truth" for decision making in the corporation. With a data warehouse there is an integrated, granular, historical single point of reference for data in the corporation.