SlideShare a Scribd company logo
1 of 16
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
1 
Making Big Data a First-class 
Citizen in the Enterprise 
Tony Baer 
tony.baer@ovum.com 
IT014-002860 
January 24, 2014
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
2 
Contents 
 The premise 
 What we mean 
 Scope – focus on Hadoop, the most popular emerging Big Data 
platform 
 Addressing the enterprise 
 IT organization 
 Data center infrastructure, processes, policies, and practices 
 Creating value; competitive benefits to the business 
 The endgame
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
3 
The premise: Voting Big Data off the island! 
Big Data must become a first-class citizen in the enterprise and 
cannot exist on its own island.
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
4 
Hadoop user base and use cases are changing 
 Users are changing from Internet companies to mainstream 
enterprises. 
 Use cases are changing from Internet search, ad optimization 
to customer churn analysis, sales and promotions, and 
operations.
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
5 
What does it mean for Big Data to become a first-class 
citizen? 
 IT organization 
 No more SWAT teams! 
 Must map to existing people and skills 
 Data center 
 Must map to existing infrastructure, subject to same constraints 
 Enterprise 
 Must address real business problems, not abstract data science research
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
6 
Contents 
 The premise 
 Addressing the enterprise 
 IT organization 
 Data center infrastructure, processes, policies, and practices 
 Creating value; competitive benefits to the business 
 The endgame
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
7 
Mapping Big Data skills to the IT organization 
 Enrich, don’t replace, your existing app developers, DBAs, system 
administrators 
 Huge existing SQL skills base – you’re not going to replace them 
 Large Java developer base, lots of scripting language diversity 
 Popularity of JavaScript/JSON 
 Skills: 
 Technology – the easy part 
 Domain and data science – not so easy 
Don’t forget the people part!
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
8 
Extending the IT organization for Big Data 
 SQL and NoSQL/Hadoop platforms are converging 
 SQL access to Hadoop 
 Hadoop platform SQL support 
 BI tool Hadoop support 
 MapReduce approaches to Advanced SQL platforms 
 MongoDB, CouchDB, Riak 
 Empowering web JavaScript developers with familiar JSON 
 Data science? 
 The apps are coming… 
SQL on Hadoop and Big Data apps are works in progress…
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
9 
Addressing the data center 
 Like most Internet technologies, Hadoop conceived in zone of trust 
 Small, elite band of practitioners 
 Big concern? Getting access to available cluster resources elsewhere 
inside the firewall 
 Enterprise? 
 Security 
 Data stewardship 
 Coping with finite resource 
 Availability and reliability
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
10 
Data center: Big Data must be secure like any 
database management system or data warehouse 
 AAA enforced for access, authentication, authorization 
 Must become more granular by user, data 
 Must become more unified 
 Integrate with LDAP/Active Directory 
 Data privacy mandates 
 This is a policy, not a technology, issue 
 “Don’t be creepy” – don’t blindside your customers based on 
knowledge they didn't know you have 
 Regulation plays driving role for some sectors
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
11 
Data center: Big Data platforms must behave like any 
database management system or data warehouse 
 Data stewardship/lifecycle 
 Data quality, protection, lifecycle management, retention 
 Resource management 
 Capacity utilization critical 
 Availability/reliability 
 Performance management essential for large clusters 
Major change from early Internet adopters
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
12 
Address the business 
Good business cases count! 
 Do: 
 Focus on existing problems (the problems are often more obvious 
than you think…) 
 Identify key points of pain, like any new IT solution 
 Don’t 
 Concoct “interesting” data science problems for the heck of it 
 Get carried away with data (with lots of data, there are lots of chances 
for detecting irrelevant trends) 
 Give up after a few tries …. iterate! 
Don’t get caught up in a data science project
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
13 
Hadoop benefits: Solving familiar problems in new 
ways 
Customer 
holistic view 
Predictive churn 
analysis 
upsell/cross-sell, 
next-best-offer, 
cross-channel ID 
resolution 
Risk mitigation 
Fraud detection, 
counter-party risk 
management, 
credit scoring 
Operational 
efficiency 
Machine data for 
managing smart 
grids, smart urban 
infrastructure, 
supply chain 
logistics 
Not arbitrary data science
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
14 
Contents 
 The premise 
 What we mean 
 Scope – focus on Hadoop, the most popular emerging Big Data 
platform 
 Addressing the enterprise 
 IT organization 
 Data center infrastructure, processes, policies, and practices 
 Creating value; competitive benefits to the business 
 The endgame
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
15 
The endgame: What becoming a first-class citizen 
really means 
 Big Data – and emerging platforms like Hadoop – originated as 
specialized IT systems requiring specially skilled practitioners. 
 This model is not sustainable as Big Data crosses over to the 
enterprise. 
 Big Data must get off its island. 
 Big Data must be accessible to the IT organization, fit into the data 
center, and address real business problems.
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
16 
Big Data: Embrace and extend 
 IT organization 
 Embrace existing SQL, Java, and other programming language skills 
 Extend skills to understand handling of larger volumes and varieties of data and 
new analytic techniques to supplement SQL 
 Data center 
 Embrace existing policies and practices for data stewardship, resource 
management, security, performance management 
 Extend policies and practices to accommodate platform with different workload 
characteristics, and support of active archiving 
 Business 
 Embrace existing competitive problems; don’t look for new problems because the 
data and platform are different 
 Extend approaches to problem solving by incorporating new data types and new 
forms of analyses to deepen understanding and insights

More Related Content

What's hot

Telco Big Data 2012 Highlights
Telco Big Data 2012 HighlightsTelco Big Data 2012 Highlights
Telco Big Data 2012 HighlightsAlan Quayle
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 
Conflict in the Cloud – Issues & Solutions for Big Data
Conflict in the Cloud – Issues & Solutions for Big DataConflict in the Cloud – Issues & Solutions for Big Data
Conflict in the Cloud – Issues & Solutions for Big DataHalo BI
 
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera, Inc.
 
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIReal-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIibi
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemCapgemini
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...Capgemini
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataCloudera, Inc.
 
Optimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analyticsOptimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analyticsCloudera, Inc.
 
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...Edureka!
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 
A #Pink14 Presentation: Optimizing for the #SDDC
A #Pink14 Presentation: Optimizing for the #SDDCA #Pink14 Presentation: Optimizing for the #SDDC
A #Pink14 Presentation: Optimizing for the #SDDCTeamQuest Corporation
 
Customer Case Studies of Self-Service Big Data Analytics
Customer Case Studies of Self-Service Big Data AnalyticsCustomer Case Studies of Self-Service Big Data Analytics
Customer Case Studies of Self-Service Big Data AnalyticsDatameer
 
Succeeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and TechnologySucceeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and Technologyibi
 
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Denodo
 
Managing Growing Transaction Volumes Using Hadoop
Managing Growing Transaction Volumes Using HadoopManaging Growing Transaction Volumes Using Hadoop
Managing Growing Transaction Volumes Using HadoopArvind Purushothaman
 
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017Cloudera, Inc.
 
Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too Barry Devlin
 
Business unIntelligence, Chapter 5
Business unIntelligence, Chapter 5Business unIntelligence, Chapter 5
Business unIntelligence, Chapter 5Barry Devlin
 
Business unIntelligence - a Whistle Stop Tour
Business unIntelligence - a Whistle Stop TourBusiness unIntelligence - a Whistle Stop Tour
Business unIntelligence - a Whistle Stop TourBarry Devlin
 

What's hot (20)

Telco Big Data 2012 Highlights
Telco Big Data 2012 HighlightsTelco Big Data 2012 Highlights
Telco Big Data 2012 Highlights
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 
Conflict in the Cloud – Issues & Solutions for Big Data
Conflict in the Cloud – Issues & Solutions for Big DataConflict in the Cloud – Issues & Solutions for Big Data
Conflict in the Cloud – Issues & Solutions for Big Data
 
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learning
 
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIReal-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BI
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big Data
 
Optimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analyticsOptimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analytics
 
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
A #Pink14 Presentation: Optimizing for the #SDDC
A #Pink14 Presentation: Optimizing for the #SDDCA #Pink14 Presentation: Optimizing for the #SDDC
A #Pink14 Presentation: Optimizing for the #SDDC
 
Customer Case Studies of Self-Service Big Data Analytics
Customer Case Studies of Self-Service Big Data AnalyticsCustomer Case Studies of Self-Service Big Data Analytics
Customer Case Studies of Self-Service Big Data Analytics
 
Succeeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and TechnologySucceeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and Technology
 
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
 
Managing Growing Transaction Volumes Using Hadoop
Managing Growing Transaction Volumes Using HadoopManaging Growing Transaction Volumes Using Hadoop
Managing Growing Transaction Volumes Using Hadoop
 
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
 
Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too
 
Business unIntelligence, Chapter 5
Business unIntelligence, Chapter 5Business unIntelligence, Chapter 5
Business unIntelligence, Chapter 5
 
Business unIntelligence - a Whistle Stop Tour
Business unIntelligence - a Whistle Stop TourBusiness unIntelligence - a Whistle Stop Tour
Business unIntelligence - a Whistle Stop Tour
 

Viewers also liked

The Cassandra Platform - Christos Diou
The Cassandra Platform - Christos Diou The Cassandra Platform - Christos Diou
The Cassandra Platform - Christos Diou Cassandra Project
 
Manual cassandra NoSQL
Manual cassandra NoSQLManual cassandra NoSQL
Manual cassandra NoSQLlignia
 
Pré-processamento em Big Data
Pré-processamento em Big DataPré-processamento em Big Data
Pré-processamento em Big DataJoão Gabriel Lima
 
Apache Cassandra - Base de datos
Apache Cassandra - Base de datosApache Cassandra - Base de datos
Apache Cassandra - Base de datosZteeven Zalinas
 
Fast Data:The Rebirth of Streaming Analytics
Fast Data:The Rebirth of Streaming AnalyticsFast Data:The Rebirth of Streaming Analytics
Fast Data:The Rebirth of Streaming AnalyticsTony Baer
 
Hadoop, SQL & NoSQL: No Longer an Either-or Question
Hadoop, SQL & NoSQL: No Longer an Either-or QuestionHadoop, SQL & NoSQL: No Longer an Either-or Question
Hadoop, SQL & NoSQL: No Longer an Either-or QuestionTony Baer
 
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...Daniel Briian
 
Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data Elo7
 
Great Visualizations and Analytics using Business Intelligence Open Source
Great Visualizations and Analytics using Business Intelligence Open SourceGreat Visualizations and Analytics using Business Intelligence Open Source
Great Visualizations and Analytics using Business Intelligence Open SourceStratebi
 
Hadoop MapReduce Streaming and Pipes
Hadoop MapReduce  Streaming and PipesHadoop MapReduce  Streaming and Pipes
Hadoop MapReduce Streaming and PipesHanborq Inc.
 
Manual apache cassandra y comandos en la shell
Manual apache cassandra y comandos en la shellManual apache cassandra y comandos en la shell
Manual apache cassandra y comandos en la shellKevin López
 
Building a Recommendation Engine Using Diverse Features by Divyanshu Vats
Building a Recommendation Engine Using Diverse Features by Divyanshu VatsBuilding a Recommendation Engine Using Diverse Features by Divyanshu Vats
Building a Recommendation Engine Using Diverse Features by Divyanshu VatsSpark Summit
 
Desenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntosDesenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntosElo7
 
PyCon APAC 2016 Keynote
PyCon APAC 2016 KeynotePyCon APAC 2016 Keynote
PyCon APAC 2016 KeynoteWes McKinney
 
Improving data interoperability in Python and R
Improving data interoperability in Python and RImproving data interoperability in Python and R
Improving data interoperability in Python and RWes McKinney
 

Viewers also liked (20)

Qcon Rio 2015 - Data Lakes Workshop
Qcon Rio 2015 - Data Lakes WorkshopQcon Rio 2015 - Data Lakes Workshop
Qcon Rio 2015 - Data Lakes Workshop
 
The Cassandra Platform - Christos Diou
The Cassandra Platform - Christos Diou The Cassandra Platform - Christos Diou
The Cassandra Platform - Christos Diou
 
Manual cassandra NoSQL
Manual cassandra NoSQLManual cassandra NoSQL
Manual cassandra NoSQL
 
All things py
All things pyAll things py
All things py
 
Nosql y cassandra
Nosql y cassandraNosql y cassandra
Nosql y cassandra
 
Pré-processamento em Big Data
Pré-processamento em Big DataPré-processamento em Big Data
Pré-processamento em Big Data
 
Apache Cassandra - Base de datos
Apache Cassandra - Base de datosApache Cassandra - Base de datos
Apache Cassandra - Base de datos
 
Fast Data:The Rebirth of Streaming Analytics
Fast Data:The Rebirth of Streaming AnalyticsFast Data:The Rebirth of Streaming Analytics
Fast Data:The Rebirth of Streaming Analytics
 
Hadoop, SQL & NoSQL: No Longer an Either-or Question
Hadoop, SQL & NoSQL: No Longer an Either-or QuestionHadoop, SQL & NoSQL: No Longer an Either-or Question
Hadoop, SQL & NoSQL: No Longer an Either-or Question
 
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...
 
Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data
 
Apache cassandra
Apache cassandraApache cassandra
Apache cassandra
 
Great Visualizations and Analytics using Business Intelligence Open Source
Great Visualizations and Analytics using Business Intelligence Open SourceGreat Visualizations and Analytics using Business Intelligence Open Source
Great Visualizations and Analytics using Business Intelligence Open Source
 
Hadoop MapReduce Streaming and Pipes
Hadoop MapReduce  Streaming and PipesHadoop MapReduce  Streaming and Pipes
Hadoop MapReduce Streaming and Pipes
 
Manual apache cassandra y comandos en la shell
Manual apache cassandra y comandos en la shellManual apache cassandra y comandos en la shell
Manual apache cassandra y comandos en la shell
 
Building a Recommendation Engine Using Diverse Features by Divyanshu Vats
Building a Recommendation Engine Using Diverse Features by Divyanshu VatsBuilding a Recommendation Engine Using Diverse Features by Divyanshu Vats
Building a Recommendation Engine Using Diverse Features by Divyanshu Vats
 
Up and running with pyspark
Up and running with pysparkUp and running with pyspark
Up and running with pyspark
 
Desenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntosDesenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntos
 
PyCon APAC 2016 Keynote
PyCon APAC 2016 KeynotePyCon APAC 2016 Keynote
PyCon APAC 2016 Keynote
 
Improving data interoperability in Python and R
Improving data interoperability in Python and RImproving data interoperability in Python and R
Improving data interoperability in Python and R
 

Similar to Making Big Data a First Class citizen in the enterprise

Getting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersDatameer
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyCloudera, Inc.
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data AnalyticsVMware Tanzu
 
A better business case for big data with Hadoop
A better business case for big data with HadoopA better business case for big data with Hadoop
A better business case for big data with HadoopAptitude Software
 
From Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessFrom Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessNG DATA
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
HP Enterprise Software: Making your applications and information work for you
HP Enterprise Software: Making your applications and information work for youHP Enterprise Software: Making your applications and information work for you
HP Enterprise Software: Making your applications and information work for youHP Enterprise Italia
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 
5 big data at work linking discovery and bi to improve business outcomes from...
5 big data at work linking discovery and bi to improve business outcomes from...5 big data at work linking discovery and bi to improve business outcomes from...
5 big data at work linking discovery and bi to improve business outcomes from...Dr. Wilfred Lin (Ph.D.)
 
Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry  Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry Persontyle
 
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014pietvz
 
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REXHadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REXModern Data Stack France
 
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsFight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsDatameer
 
Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Dell World
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyCloudera, Inc.
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopGhassan Al-Yafie
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseJeff Kelly
 

Similar to Making Big Data a First Class citizen in the enterprise (20)

Getting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business Managers
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data Strategy
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data Analytics
 
The value of our data
The value of our dataThe value of our data
The value of our data
 
A better business case for big data with Hadoop
A better business case for big data with HadoopA better business case for big data with Hadoop
A better business case for big data with Hadoop
 
From Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessFrom Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your business
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
HP Enterprise Software: Making your applications and information work for you
HP Enterprise Software: Making your applications and information work for youHP Enterprise Software: Making your applications and information work for you
HP Enterprise Software: Making your applications and information work for you
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 
5 big data at work linking discovery and bi to improve business outcomes from...
5 big data at work linking discovery and bi to improve business outcomes from...5 big data at work linking discovery and bi to improve business outcomes from...
5 big data at work linking discovery and bi to improve business outcomes from...
 
Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry  Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry
 
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
 
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REXHadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
 
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsFight Fraud with Big Data Analytics
Fight Fraud with Big Data Analytics
 
Are you ready for Big Data 2.0? EMA Analyst Research
Are you ready for Big Data 2.0? EMA Analyst ResearchAre you ready for Big Data 2.0? EMA Analyst Research
Are you ready for Big Data 2.0? EMA Analyst Research
 
Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data Journey
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoop
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
 

Recently uploaded

Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 

Recently uploaded (20)

Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 

Making Big Data a First Class citizen in the enterprise

  • 1. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 1 Making Big Data a First-class Citizen in the Enterprise Tony Baer tony.baer@ovum.com IT014-002860 January 24, 2014
  • 2. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 2 Contents  The premise  What we mean  Scope – focus on Hadoop, the most popular emerging Big Data platform  Addressing the enterprise  IT organization  Data center infrastructure, processes, policies, and practices  Creating value; competitive benefits to the business  The endgame
  • 3. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 3 The premise: Voting Big Data off the island! Big Data must become a first-class citizen in the enterprise and cannot exist on its own island.
  • 4. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 4 Hadoop user base and use cases are changing  Users are changing from Internet companies to mainstream enterprises.  Use cases are changing from Internet search, ad optimization to customer churn analysis, sales and promotions, and operations.
  • 5. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 5 What does it mean for Big Data to become a first-class citizen?  IT organization  No more SWAT teams!  Must map to existing people and skills  Data center  Must map to existing infrastructure, subject to same constraints  Enterprise  Must address real business problems, not abstract data science research
  • 6. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 6 Contents  The premise  Addressing the enterprise  IT organization  Data center infrastructure, processes, policies, and practices  Creating value; competitive benefits to the business  The endgame
  • 7. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 7 Mapping Big Data skills to the IT organization  Enrich, don’t replace, your existing app developers, DBAs, system administrators  Huge existing SQL skills base – you’re not going to replace them  Large Java developer base, lots of scripting language diversity  Popularity of JavaScript/JSON  Skills:  Technology – the easy part  Domain and data science – not so easy Don’t forget the people part!
  • 8. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 8 Extending the IT organization for Big Data  SQL and NoSQL/Hadoop platforms are converging  SQL access to Hadoop  Hadoop platform SQL support  BI tool Hadoop support  MapReduce approaches to Advanced SQL platforms  MongoDB, CouchDB, Riak  Empowering web JavaScript developers with familiar JSON  Data science?  The apps are coming… SQL on Hadoop and Big Data apps are works in progress…
  • 9. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 9 Addressing the data center  Like most Internet technologies, Hadoop conceived in zone of trust  Small, elite band of practitioners  Big concern? Getting access to available cluster resources elsewhere inside the firewall  Enterprise?  Security  Data stewardship  Coping with finite resource  Availability and reliability
  • 10. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 10 Data center: Big Data must be secure like any database management system or data warehouse  AAA enforced for access, authentication, authorization  Must become more granular by user, data  Must become more unified  Integrate with LDAP/Active Directory  Data privacy mandates  This is a policy, not a technology, issue  “Don’t be creepy” – don’t blindside your customers based on knowledge they didn't know you have  Regulation plays driving role for some sectors
  • 11. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 11 Data center: Big Data platforms must behave like any database management system or data warehouse  Data stewardship/lifecycle  Data quality, protection, lifecycle management, retention  Resource management  Capacity utilization critical  Availability/reliability  Performance management essential for large clusters Major change from early Internet adopters
  • 12. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 12 Address the business Good business cases count!  Do:  Focus on existing problems (the problems are often more obvious than you think…)  Identify key points of pain, like any new IT solution  Don’t  Concoct “interesting” data science problems for the heck of it  Get carried away with data (with lots of data, there are lots of chances for detecting irrelevant trends)  Give up after a few tries …. iterate! Don’t get caught up in a data science project
  • 13. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 13 Hadoop benefits: Solving familiar problems in new ways Customer holistic view Predictive churn analysis upsell/cross-sell, next-best-offer, cross-channel ID resolution Risk mitigation Fraud detection, counter-party risk management, credit scoring Operational efficiency Machine data for managing smart grids, smart urban infrastructure, supply chain logistics Not arbitrary data science
  • 14. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 14 Contents  The premise  What we mean  Scope – focus on Hadoop, the most popular emerging Big Data platform  Addressing the enterprise  IT organization  Data center infrastructure, processes, policies, and practices  Creating value; competitive benefits to the business  The endgame
  • 15. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 15 The endgame: What becoming a first-class citizen really means  Big Data – and emerging platforms like Hadoop – originated as specialized IT systems requiring specially skilled practitioners.  This model is not sustainable as Big Data crosses over to the enterprise.  Big Data must get off its island.  Big Data must be accessible to the IT organization, fit into the data center, and address real business problems.
  • 16. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 16 Big Data: Embrace and extend  IT organization  Embrace existing SQL, Java, and other programming language skills  Extend skills to understand handling of larger volumes and varieties of data and new analytic techniques to supplement SQL  Data center  Embrace existing policies and practices for data stewardship, resource management, security, performance management  Extend policies and practices to accommodate platform with different workload characteristics, and support of active archiving  Business  Embrace existing competitive problems; don’t look for new problems because the data and platform are different  Extend approaches to problem solving by incorporating new data types and new forms of analyses to deepen understanding and insights