SlideShare une entreprise Scribd logo
1  sur  26
Télécharger pour lire hors ligne
November 2013

BIG ANALYTICS
THE GOOD & THE VALUE
About Think Big Analytics

¨ 

Formed in 2010 to help clients launch and scale-out Big Data solutions

¨ 

Services include Big Data strategy, training, engineering and data science

¨ 

¨ 

Management Background: Quantcast, Cambridge Technology, Oracle, Sun
Microsystems, Accenture
Blue chip clients, including:
Ø 

Internet Transactions Security Global #1

Ø 

Retail 2 of Global Top 5

Ø 

Banking 4 of Global Top 1; Financial Services 2 of Global Top 5

Ø 

Asset Management Global #1

Ø 

Disk Manufacturing Global #1

Ø 

Social Networking Global #1

CONFIDENTIAL

|

2

2
Think Big Integrated Value

Integrated Value
Advisory
¨ 
¨ 

¨ 

¨ 

Understand true
business needs
Evaluate suitability of
new technologies

¨ 

Provide perspective on
market ideas
¨ 

Ensure engineering
and analytics support
business goals
Help establish realistic
and attainable
objectives
Drive client-specific
innovation

Implement
¨ 

¨ 

¨ 

Understand technology
preferences and
limitations
Assess talent skills and
development needs
Develop deep knowledge
of the data and tools

CONFIDENTIAL

|

3
Big Analytics

Ÿ 
Ÿ 
Ÿ 
Ÿ 

New data
Yielding new opportunities
Enabled by new approaches
With supporting organization

CONFIDENTIAL

|

4
New Data

CONFIDENTIAL

|

5
Nontraditional Formats

Ÿ  Unstructured data != text
- Call logs
- Raw video
- Satellite photos

CONFIDENTIAL

|

6
Exhaust

Ÿ  Byproduct data
Ÿ  Driving interest in the Internet
of Things

Ÿ  Our machines tell a story about
us

CONFIDENTIAL

|

7
Data about Data

Ÿ  Data usage patterns
Ÿ  Driving next generation
organizations
- Data access patterns as KPI
- Systems access as employee
engagement

CONFIDENTIAL

|

8
New Opportunity

CONFIDENTIAL

|

9
Fingerprinting

Ÿ  Unintentional patterns define us
- ATM rhythm
- Botnet synchronization

Ÿ  More connected world exposes
more fingerprints
- Mobile installs and settings +
NFC
- Sensory data at shopping mall
displays

CONFIDENTIAL

|

10
Dark Data Insights

Ÿ 
Ÿ 
Ÿ 
Ÿ 
Ÿ 

It’s back from the dead!
Audit data
Fund manager predictions
Employee logs
Architectural records

CONFIDENTIAL

|

11
New Approaches

CONFIDENTIAL

|

12
Unstructured Analysis

Ÿ  Non-traditional structures
- Path models
- High dimensionality

Ÿ  Text
- POS
- Classification

Ÿ  Images
- Object recognition
- Time differentials

CONFIDENTIAL

|

13
Deep Learning

Ÿ  MapReduce built for
- Bootstrapped models
- Partitioning data by complex
logic

Ÿ  Backpropagation is hard
Ÿ  Feature learning isn’t (always)

CONFIDENTIAL

|

14
Challenges Incorporating Data
Science

CONFIDENTIAL

|

15
Organizational Integration

Ÿ  Traditionally under engineering
Ÿ  Integrated with data creators,
not data consumers

Ÿ  Disconnected from business
priorities

CONFIDENTIAL

|

16
Success Loops

Ÿ  We take BI for granted
- Analysts find novel patterns
- Business sees new trends
- Statistics is balanced by
domain knowledge
- Integration of actors aware of
feasibility, cost, and impact

Ÿ  Where does your data scientist
sit?

CONFIDENTIAL

|

17
Successful Incorporation of Data
Science

CONFIDENTIAL

|

18
Partnership

Ÿ  Business is a partner, not a
customer

Ÿ  New insights, capabilities, and
products are not born in a
vacuum

CONFIDENTIAL

|

19
Cross Functional Teams

Ÿ  Data science is a process, not
a job role

- Engineering
- Research
- Statistics
- Business
- Salesmanship

Ÿ  Successful Big Analytics blends
skills, perspectives, and pushes
boundaries

CONFIDENTIAL

|

20
Measurement

Ÿ  Requires KPI/KRI
Ÿ  Performance metrics
- Direct actions
- Create purpose

CONFIDENTIAL

|

21
Client Success

CONFIDENTIAL

|

22
Example Client Phase 1

Ÿ  First phase: Big Analytics
execution

Ÿ  New methods of Botnet
detection

Ÿ  Led to patent

CONFIDENTIAL

|

23
Example Client Phase 2

Ÿ  Further analysis
- Improvement of Botnet models

Ÿ  Expansion of cross functional
Big Analytic team
- Tool selection
- Training
- Early win identification
- Self-selected group

CONFIDENTIAL

|

24
Example Client Phase 3

Ÿ  Cross-Functional Analytic
Organization

Ÿ 
Ÿ 
Ÿ 
Ÿ 

Governance
Ownership and accountability
Process
Roadmap

CONFIDENTIAL

|

25
Questions?
www.thinkbiganalytics.com
www.linkedin.com/in/danmallinger
@danmallinger

CONFIDENTIAL

|

26

Contenu connexe

Tendances

SpeedTrack Tech Overview 2015
SpeedTrack Tech Overview 2015SpeedTrack Tech Overview 2015
SpeedTrack Tech Overview 2015
Michael Zoltowski
 

Tendances (20)

GraphTour 2020 - Customer Journey with Neo4j Services
GraphTour 2020 - Customer Journey with Neo4j ServicesGraphTour 2020 - Customer Journey with Neo4j Services
GraphTour 2020 - Customer Journey with Neo4j Services
 
"What we learned from 5 years of building a data science software that actual...
"What we learned from 5 years of building a data science software that actual..."What we learned from 5 years of building a data science software that actual...
"What we learned from 5 years of building a data science software that actual...
 
Big data perspective solution & technology
Big data perspective solution & technologyBig data perspective solution & technology
Big data perspective solution & technology
 
Data & Analytics at Scale
Data & Analytics at ScaleData & Analytics at Scale
Data & Analytics at Scale
 
PASS Summit Data Storytelling with R Power BI and AzureML
PASS Summit Data Storytelling with R Power BI and AzureMLPASS Summit Data Storytelling with R Power BI and AzureML
PASS Summit Data Storytelling with R Power BI and AzureML
 
SpeedTrack Tech Overview 2015
SpeedTrack Tech Overview 2015SpeedTrack Tech Overview 2015
SpeedTrack Tech Overview 2015
 
Building a data platform tnt
Building a data platform tntBuilding a data platform tnt
Building a data platform tnt
 
Building up a Data Science Team from Scratch
Building up a Data Science Team from ScratchBuilding up a Data Science Team from Scratch
Building up a Data Science Team from Scratch
 
The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
 
How a global manufacturing company built a data science capability from scratch
How a global manufacturing company built a data science capability from scratchHow a global manufacturing company built a data science capability from scratch
How a global manufacturing company built a data science capability from scratch
 
Vishal resume
Vishal resumeVishal resume
Vishal resume
 
BigData Analysis
BigData AnalysisBigData Analysis
BigData Analysis
 
Knowi Overview: NoSQL Analytics and Business Intelligence
Knowi Overview:  NoSQL Analytics and Business IntelligenceKnowi Overview:  NoSQL Analytics and Business Intelligence
Knowi Overview: NoSQL Analytics and Business Intelligence
 
Dataiku data science studio
Dataiku data science studioDataiku data science studio
Dataiku data science studio
 
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
 
The Role(s) of Data Science in Modern Organizations
The Role(s) of Data Science in Modern OrganizationsThe Role(s) of Data Science in Modern Organizations
The Role(s) of Data Science in Modern Organizations
 
Handoop training in bangalore
Handoop training in bangaloreHandoop training in bangalore
Handoop training in bangalore
 
Week1day2 (1)
Week1day2 (1)Week1day2 (1)
Week1day2 (1)
 
Strategy toolbox for startsups
Strategy toolbox for startsupsStrategy toolbox for startsups
Strategy toolbox for startsups
 

En vedette

January 2015 HUG: Apache Flink: Fast and reliable large-scale data processing
January 2015 HUG: Apache Flink:  Fast and reliable large-scale data processingJanuary 2015 HUG: Apache Flink:  Fast and reliable large-scale data processing
January 2015 HUG: Apache Flink: Fast and reliable large-scale data processing
Yahoo Developer Network
 

En vedette (11)

BIG Data Science: A Path Forward
BIG Data Science:  A Path ForwardBIG Data Science:  A Path Forward
BIG Data Science: A Path Forward
 
Real Time Data Processing Using Spark Streaming
Real Time Data Processing Using Spark StreamingReal Time Data Processing Using Spark Streaming
Real Time Data Processing Using Spark Streaming
 
R+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster Answers
R+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster AnswersR+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster Answers
R+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster Answers
 
Apachecon Europe 2012: Operating HBase - Things you need to know
Apachecon Europe 2012: Operating HBase - Things you need to knowApachecon Europe 2012: Operating HBase - Things you need to know
Apachecon Europe 2012: Operating HBase - Things you need to know
 
R + 15 minutes = Hadoop cluster
R + 15 minutes = Hadoop clusterR + 15 minutes = Hadoop cluster
R + 15 minutes = Hadoop cluster
 
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYC
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYCDan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYC
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYC
 
January 2015 HUG: Apache Flink: Fast and reliable large-scale data processing
January 2015 HUG: Apache Flink:  Fast and reliable large-scale data processingJanuary 2015 HUG: Apache Flink:  Fast and reliable large-scale data processing
January 2015 HUG: Apache Flink: Fast and reliable large-scale data processing
 
Are You Ready for Big Data Big Analytics?
Are You Ready for Big Data Big Analytics? Are You Ready for Big Data Big Analytics?
Are You Ready for Big Data Big Analytics?
 
HBase and Impala Notes - Munich HUG - 20131017
HBase and Impala Notes - Munich HUG - 20131017HBase and Impala Notes - Munich HUG - 20131017
HBase and Impala Notes - Munich HUG - 20131017
 
High Performance Predictive Analytics in R and Hadoop
High Performance Predictive Analytics in R and HadoopHigh Performance Predictive Analytics in R and Hadoop
High Performance Predictive Analytics in R and Hadoop
 
Predictive Analytics using R
Predictive Analytics using RPredictive Analytics using R
Predictive Analytics using R
 

Similaire à Big Analytics: Building Lasting Value

Competitive Advantage from the Data Lake
Competitive Advantage from the Data LakeCompetitive Advantage from the Data Lake
Competitive Advantage from the Data Lake
Argyle Executive Forum
 
Team undiscovered opportunuity analysis report presentation- venture lab 2012
Team undiscovered   opportunuity analysis report presentation- venture lab 2012Team undiscovered   opportunuity analysis report presentation- venture lab 2012
Team undiscovered opportunuity analysis report presentation- venture lab 2012
oceanfree
 

Similaire à Big Analytics: Building Lasting Value (20)

Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATLDan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL
 
IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?
 
Data strategy demistifying data
Data strategy demistifying dataData strategy demistifying data
Data strategy demistifying data
 
Competitive Advantage from the Data Lake
Competitive Advantage from the Data LakeCompetitive Advantage from the Data Lake
Competitive Advantage from the Data Lake
 
Acctiva: expertise in Business Intelligence, Data Warehousing, Data Governance
Acctiva: expertise in Business Intelligence, Data Warehousing, Data GovernanceAcctiva: expertise in Business Intelligence, Data Warehousing, Data Governance
Acctiva: expertise in Business Intelligence, Data Warehousing, Data Governance
 
The Softer Skills Analysts need to make an impact
The Softer Skills Analysts need to make an impactThe Softer Skills Analysts need to make an impact
The Softer Skills Analysts need to make an impact
 
Building a Winning Roadmap for Analytics
Building a Winning Roadmap for AnalyticsBuilding a Winning Roadmap for Analytics
Building a Winning Roadmap for Analytics
 
Big Data Strategies
Big Data StrategiesBig Data Strategies
Big Data Strategies
 
ThoughtWorks: Monetising Open Banking
ThoughtWorks: Monetising Open Banking  ThoughtWorks: Monetising Open Banking
ThoughtWorks: Monetising Open Banking
 
Agile BI success factors
Agile BI success factorsAgile BI success factors
Agile BI success factors
 
Scaling Your Enterprise With Data Science
Scaling Your Enterprise With Data ScienceScaling Your Enterprise With Data Science
Scaling Your Enterprise With Data Science
 
From 'I think' to 'I know'
From 'I think' to 'I know'From 'I think' to 'I know'
From 'I think' to 'I know'
 
Team undiscovered opportunuity analysis report presentation- venture lab 2012
Team undiscovered   opportunuity analysis report presentation- venture lab 2012Team undiscovered   opportunuity analysis report presentation- venture lab 2012
Team undiscovered opportunuity analysis report presentation- venture lab 2012
 
Big data analytics and innovation
Big data analytics and innovationBig data analytics and innovation
Big data analytics and innovation
 
From Customer Insights to Action
From Customer Insights to ActionFrom Customer Insights to Action
From Customer Insights to Action
 
ASAS 2014 - Klasien Postma
ASAS 2014 - Klasien PostmaASAS 2014 - Klasien Postma
ASAS 2014 - Klasien Postma
 
Developing a Modernization Strategy: Evaluating the Options by Chris Koppe
Developing a Modernization Strategy: Evaluating the Options by Chris KoppeDeveloping a Modernization Strategy: Evaluating the Options by Chris Koppe
Developing a Modernization Strategy: Evaluating the Options by Chris Koppe
 
Data strategy in a Big Data world
Data strategy in a Big Data worldData strategy in a Big Data world
Data strategy in a Big Data world
 
Mainframe Day 2022 -The NRB Group - the best partner of your z-modernization.pdf
Mainframe Day 2022 -The NRB Group - the best partner of your z-modernization.pdfMainframe Day 2022 -The NRB Group - the best partner of your z-modernization.pdf
Mainframe Day 2022 -The NRB Group - the best partner of your z-modernization.pdf
 
Denodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
Denodo DataFest 2016: Enterprise View of Data with Semantic Data LayerDenodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
Denodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Dernier (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

Big Analytics: Building Lasting Value