SlideShare une entreprise Scribd logo
1  sur  30
Analytics
and Big Data
Analytics
Robin Bloor Ph D
The Sequence of Topics….
1

2
3
4

5

Data Science?
The Nature of
Analytics
Machine Learning Et
Al
The Business
Perspective
The Future
1
What Is Data Science?
 There

is no “data
science.” It’s a
misnomer
 All science is
empirical and involves
data analysis.
 Science implements a
method.
 So do statisticians
What Is A Data Scientist?
 Project

manager
 Qualified statistician
 Domain Business
expert
 Experienced data
architect
 Software engineer
(It’s a team)
Data Scientist v Business Analysts
 Claims

that business
analysts can be data
scientists are dubious
 Good practitioners of
statistics understand
data (from years of
training)
 Software understands
nothing, it simply
implements algorithms
Who Understands Data?
Nevertheless!

You can know more
about a business
from its data than
by any other
means
2

The
Nature
Of
Analytics
The Field of Business Intelligence
Hindsight

• Regular
reporting/operational
BI

Oversight

• Dashboards, OLAP,
BPM, etc.

Insight

• Data
mining, statistical
analysis

Foresight

• Predictive analytics
The Driving Force is Insight
A Process Not An Activity
 Data Analytics is a multidisciplinary end-to-end
process
 Until recently it was a
walled-garden. But recently
the walls were torn down
by…
 Data availability
 Scalable technology
 Open source tools
The Data Analytics Process - Detail
The CRITICAL Workload Issue
 Previously, we viewed
database workloads as
an i/o optimization
problem
 With analytics the
workload is a very
variable mix of i/o
and calculation
 No databases were
built for this – not
even Big Data
databases
3

Machine
Learning
Et Al
Analytical Latencies
1 Data access
2 Data preparation
3 Model development
4 Execution
5 Implementation
6 Model Audit & Update

Speed = value (probably)
The Open Source Dynamic
 The R Language
 Over 1 million
users
 Hadoop and its
Ecosystem
 Reduced latency
for analytics
 Machine Learning
Algorithms
 Raw power
None of these are engineered for performance
Machine Learning Algorithms - 1
 There are many:
 Neural network(s)
 Bayesian networks
 Decisions
trees/random
forests
 Support vector
machines
 K-means
 Clustering
 Regression(s)
 Etc.
Machine Learning Algorithms - 2
 They are not newly
invented
 We did not
previously use them
much because we
never had the
computer power
 Now that we have
the power (at a
price) we can
employ them
Machine Learning Algorithms - 3
 Machine learning
algorithms can check
all possibilities
 We never had the
computer power
 Now that we have
the power (at a
price) we can
employ them
The Impact?
 Machine learning
and processing
power (parallelism)
will change the
data analysis
process
 The analytics team
needs to
understand IT
4

The
Business
Perspective
Business Metamorphosis
 The role of data
analysis has not
changed
 Only the speed has
changed
 The process will
evolve
 It will be disruptive
for incumbent
vendors
The Data Analysis Budget
 Data Analysis is
Business R&D
 The focus is on
business process
 The outcome of
successful R&D is
a changed process
 Think of
manufacturing for
a useful analogy
The Data Analysis Budget
 Data Analysis is
Business R&D
 The focus is on
business process
 The outcome of
successful R&D is
a changed process
 Think of
manufacturing for
a useful analogy
5

The
Future
Non èfinitafino a quando
la signora grassacanta
 Hardware disruption
 Software disruption
 Business process
disruption
 All we know is:
 Analytical
processing will get
faster
 Analytic latencies
will reduce
 Data will continue
to grow
 Analytics will be a
differentiator
In Summary…
1

2
3
4

5

Data Science?
The Nature of
Analytics
Machine Learning Et
Al
The Business
Perspective
The Future
Analytics and Big Data Analytics
Analytics and Big Data Analytics

Contenu connexe

Tendances

What Is Machine Learning? | What Is Machine Learning And How Does It Work? | ...
What Is Machine Learning? | What Is Machine Learning And How Does It Work? | ...What Is Machine Learning? | What Is Machine Learning And How Does It Work? | ...
What Is Machine Learning? | What Is Machine Learning And How Does It Work? | ...
Simplilearn
 
Machine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesMachine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and Techniques
Rui Pedro Paiva
 

Tendances (20)

AI and Managerial Decision Making
AI and Managerial Decision MakingAI and Managerial Decision Making
AI and Managerial Decision Making
 
What Is Machine Learning? | What Is Machine Learning And How Does It Work? | ...
What Is Machine Learning? | What Is Machine Learning And How Does It Work? | ...What Is Machine Learning? | What Is Machine Learning And How Does It Work? | ...
What Is Machine Learning? | What Is Machine Learning And How Does It Work? | ...
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?
 
[DevDay2019] How do I test AI models? - By Minh Hoang, Senior QA Engineer at KMS
[DevDay2019] How do I test AI models? - By Minh Hoang, Senior QA Engineer at KMS[DevDay2019] How do I test AI models? - By Minh Hoang, Senior QA Engineer at KMS
[DevDay2019] How do I test AI models? - By Minh Hoang, Senior QA Engineer at KMS
 
Big Data & Artificial Intelligence
Big Data & Artificial IntelligenceBig Data & Artificial Intelligence
Big Data & Artificial Intelligence
 
Machine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesMachine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and Techniques
 
Machine learning beyond the tech giants
Machine learning beyond the tech giantsMachine learning beyond the tech giants
Machine learning beyond the tech giants
 
Natural language Analysis
Natural language AnalysisNatural language Analysis
Natural language Analysis
 
Machine Learning for Auditors
Machine Learning for AuditorsMachine Learning for Auditors
Machine Learning for Auditors
 
Artificial Intelligence Overview PowerPoint Presentation Slides
Artificial Intelligence Overview PowerPoint Presentation Slides Artificial Intelligence Overview PowerPoint Presentation Slides
Artificial Intelligence Overview PowerPoint Presentation Slides
 
Applied Artificial Intelligence Unit 1 Semester 3 MSc IT Part 2 Mumbai Univer...
Applied Artificial Intelligence Unit 1 Semester 3 MSc IT Part 2 Mumbai Univer...Applied Artificial Intelligence Unit 1 Semester 3 MSc IT Part 2 Mumbai Univer...
Applied Artificial Intelligence Unit 1 Semester 3 MSc IT Part 2 Mumbai Univer...
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Aptage future of ai webinar slides
Aptage future of ai webinar slidesAptage future of ai webinar slides
Aptage future of ai webinar slides
 
Artificial Intelligence: Expert Systems Components
Artificial Intelligence: Expert Systems ComponentsArtificial Intelligence: Expert Systems Components
Artificial Intelligence: Expert Systems Components
 
The Machine Learning Audit
The Machine Learning AuditThe Machine Learning Audit
The Machine Learning Audit
 
Artificial intelligence (ai) and its impact to business
Artificial intelligence (ai) and its impact to businessArtificial intelligence (ai) and its impact to business
Artificial intelligence (ai) and its impact to business
 
Artificial intelligence slides beginners
Artificial intelligence slides beginners Artificial intelligence slides beginners
Artificial intelligence slides beginners
 
Healthcare + AI: Use cases & Challenges
Healthcare + AI: Use cases & ChallengesHealthcare + AI: Use cases & Challenges
Healthcare + AI: Use cases & Challenges
 
Azure Machine Learning
Azure Machine LearningAzure Machine Learning
Azure Machine Learning
 

En vedette

Innovation - The key to enhance Customer Experience
Innovation - The key to enhance Customer Experience Innovation - The key to enhance Customer Experience
Innovation - The key to enhance Customer Experience
SAS Institute India Pvt. Ltd
 

En vedette (7)

Assort Surgical Management Systems
Assort Surgical Management SystemsAssort Surgical Management Systems
Assort Surgical Management Systems
 
Foresight: The Secret Weapon of Strategy
Foresight: The Secret Weapon of StrategyForesight: The Secret Weapon of Strategy
Foresight: The Secret Weapon of Strategy
 
Self Leadership for Influence and Impact
Self Leadership for Influence and ImpactSelf Leadership for Influence and Impact
Self Leadership for Influence and Impact
 
Big data and analytics - Petteri Alahuhta
Big data and analytics - Petteri AlahuhtaBig data and analytics - Petteri Alahuhta
Big data and analytics - Petteri Alahuhta
 
Innovation - The key to enhance Customer Experience
Innovation - The key to enhance Customer Experience Innovation - The key to enhance Customer Experience
Innovation - The key to enhance Customer Experience
 
Positioning Internal Audit for the Future
Positioning Internal Audit for the FuturePositioning Internal Audit for the Future
Positioning Internal Audit for the Future
 
Hindsight, Insight, Foresight - How to increase innovation potential
Hindsight, Insight, Foresight  - How to increase innovation potentialHindsight, Insight, Foresight  - How to increase innovation potential
Hindsight, Insight, Foresight - How to increase innovation potential
 

Similaire à Analytics and Big Data Analytics

Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018
mark madsen
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Simplilearn
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
mark madsen
 
Data analytics presentation- Management career institute
Data analytics presentation- Management career institute Data analytics presentation- Management career institute
Data analytics presentation- Management career institute
PoojaPatidar11
 
Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...
mark madsen
 
Customer Intelligence & Analytics - Part I
Customer Intelligence & Analytics - Part ICustomer Intelligence & Analytics - Part I
Customer Intelligence & Analytics - Part I
Vivastream
 

Similaire à Analytics and Big Data Analytics (20)

Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
 
data science and business analytics
data science and business analyticsdata science and business analytics
data science and business analytics
 
Data science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyData science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi Periasamy
 
Dow Chemical presentation at the Chief Analytics Officer Forum East Coast USA...
Dow Chemical presentation at the Chief Analytics Officer Forum East Coast USA...Dow Chemical presentation at the Chief Analytics Officer Forum East Coast USA...
Dow Chemical presentation at the Chief Analytics Officer Forum East Coast USA...
 
SDD2017 - 03 Abed Ajraou - putting data science in your business a first uti...
SDD2017 - 03 Abed Ajraou  - putting data science in your business a first uti...SDD2017 - 03 Abed Ajraou  - putting data science in your business a first uti...
SDD2017 - 03 Abed Ajraou - putting data science in your business a first uti...
 
Putting data science in your business a first utility feedback
Putting data science in your business a first utility feedbackPutting data science in your business a first utility feedback
Putting data science in your business a first utility feedback
 
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptxDATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
 
Proposed Talk Outline for Pycon2017
Proposed Talk Outline for Pycon2017 Proposed Talk Outline for Pycon2017
Proposed Talk Outline for Pycon2017
 
The Python ecosystem for data science - Landscape Overview
The Python ecosystem for data science - Landscape OverviewThe Python ecosystem for data science - Landscape Overview
The Python ecosystem for data science - Landscape Overview
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
 
The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products
 
Data analytics presentation- Management career institute
Data analytics presentation- Management career institute Data analytics presentation- Management career institute
Data analytics presentation- Management career institute
 
Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...
 
Gse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedGse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-shared
 
Customer Intelligence & Analytics - Part I
Customer Intelligence & Analytics - Part ICustomer Intelligence & Analytics - Part I
Customer Intelligence & Analytics - Part I
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
What is business analytics
What is business analyticsWhat is business analytics
What is business analytics
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptx
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptx
 

Plus de Inside Analysis

Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile World
Inside Analysis
 

Plus de Inside Analysis (20)

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BI
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter Integration
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data Letdown
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On Time
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of Data
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop Adoption
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time Analytics
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of Everything
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global Level
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your Architecture
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the Risk
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big Data
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data Warehouse
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile World
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave Duggal
 
Modus Operandi
Modus OperandiModus Operandi
Modus Operandi
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Analytics and Big Data Analytics

  • 2. The Sequence of Topics…. 1 2 3 4 5 Data Science? The Nature of Analytics Machine Learning Et Al The Business Perspective The Future
  • 3. 1
  • 4. What Is Data Science?  There is no “data science.” It’s a misnomer  All science is empirical and involves data analysis.  Science implements a method.  So do statisticians
  • 5. What Is A Data Scientist?  Project manager  Qualified statistician  Domain Business expert  Experienced data architect  Software engineer (It’s a team)
  • 6. Data Scientist v Business Analysts  Claims that business analysts can be data scientists are dubious  Good practitioners of statistics understand data (from years of training)  Software understands nothing, it simply implements algorithms
  • 8. Nevertheless! You can know more about a business from its data than by any other means
  • 10. The Field of Business Intelligence Hindsight • Regular reporting/operational BI Oversight • Dashboards, OLAP, BPM, etc. Insight • Data mining, statistical analysis Foresight • Predictive analytics
  • 11. The Driving Force is Insight
  • 12. A Process Not An Activity  Data Analytics is a multidisciplinary end-to-end process  Until recently it was a walled-garden. But recently the walls were torn down by…  Data availability  Scalable technology  Open source tools
  • 13. The Data Analytics Process - Detail
  • 14. The CRITICAL Workload Issue  Previously, we viewed database workloads as an i/o optimization problem  With analytics the workload is a very variable mix of i/o and calculation  No databases were built for this – not even Big Data databases
  • 16. Analytical Latencies 1 Data access 2 Data preparation 3 Model development 4 Execution 5 Implementation 6 Model Audit & Update Speed = value (probably)
  • 17. The Open Source Dynamic  The R Language  Over 1 million users  Hadoop and its Ecosystem  Reduced latency for analytics  Machine Learning Algorithms  Raw power None of these are engineered for performance
  • 18. Machine Learning Algorithms - 1  There are many:  Neural network(s)  Bayesian networks  Decisions trees/random forests  Support vector machines  K-means  Clustering  Regression(s)  Etc.
  • 19. Machine Learning Algorithms - 2  They are not newly invented  We did not previously use them much because we never had the computer power  Now that we have the power (at a price) we can employ them
  • 20. Machine Learning Algorithms - 3  Machine learning algorithms can check all possibilities  We never had the computer power  Now that we have the power (at a price) we can employ them
  • 21. The Impact?  Machine learning and processing power (parallelism) will change the data analysis process  The analytics team needs to understand IT
  • 23. Business Metamorphosis  The role of data analysis has not changed  Only the speed has changed  The process will evolve  It will be disruptive for incumbent vendors
  • 24. The Data Analysis Budget  Data Analysis is Business R&D  The focus is on business process  The outcome of successful R&D is a changed process  Think of manufacturing for a useful analogy
  • 25. The Data Analysis Budget  Data Analysis is Business R&D  The focus is on business process  The outcome of successful R&D is a changed process  Think of manufacturing for a useful analogy
  • 27. Non èfinitafino a quando la signora grassacanta  Hardware disruption  Software disruption  Business process disruption  All we know is:  Analytical processing will get faster  Analytic latencies will reduce  Data will continue to grow  Analytics will be a differentiator
  • 28. In Summary… 1 2 3 4 5 Data Science? The Nature of Analytics Machine Learning Et Al The Business Perspective The Future