Ideas. Realized. ®
RCG Global Services
Business Intelligence
Big Data Proof of Concept
July 2014
© 2014 RCG. All Rights Reserved. Proprietary and Confidential.
Big Data Architecture Goes Beyond BI
What ‘Traditional’ BI Misses . . .
“More than 80% of all data in an enterprise is unstructured
information. Unfortunately, attempts to leverage this resource
often fail because many businesses lack the technology to utilize
content that resides outside the scope of structured databases.”
Is What Big Data Is Designed to Deliver
“The practices and technology that close the gap between [all
types of] data available and the ability to turn that data into
business insight.”
[http://www.aiim.org/Research-and-Publications/Research/White-Papers/Data-is-Unstructured-Information]
The Big Data Landscape Is Complex
Big Data Challenges
• Understanding and architecting solutions incorporating Big Data technologies (Hadoop, NewSQL, NoSQL, in-memory, and so forth)
• Navigating Hadoop, its ‘projects’/components, and packaged Hadoop options in the market
• Knowing which Big Data solution best meets your needs
• Planning, sizing, installing, and using a Big Data server complex
• Incorporating Big Data into existing data management and governance processes
• Delivering analytic results from Big Data volumes and variety of data types, particularly real-time data stream analysis, ad hoc queries and searches, and inferential analytics
Examples of RCG’s Big Data Experience
IT Cost Reduction
Savings of $3 million and processing time reduction from 4.5 hours to 1.5 hours realized by using Big Data technologies rather than traditional ETL and MPP database options
RCG’s Big Data Lab
Demonstrates Big Data technologies and produces Advanced Analytics Insights with client data
Click Stream Analysis
Real-time click stream analysis and correlation to in-store purchase history
SKU Analysis
Analyze sales, inventories, and delivery logistics by SKU by day over years of history using Big Data technology and architecture recommendations
Business Operations Analysis
Near real-time analysis of business operations to manage inventories, adjust pricing, and manage promotions
ROIC Analysis for Store Renovations
Advanced Analytics doubled ROIC, increased store profitability, and reduced capital allocated for store renovations by $150M
RCG’s Big Data Offerings for Business
Big Data Proof of Concept
‒ Demonstrates the business value of Big Data using your data in RCG’s Big Data Lab with Big Data technologies and analytics
‒ Requires no investment in Big Data hardware, software, or skills in IT or business units
Advanced Analytics Roadmap
‒ Identifies how Advanced Analytics can support your business goals with technologies that fit your IT environment
‒ Provides a Roadmap of projects to deliver business value and add the capabilities needed for successful advanced analytics
Advanced Analytics and Insight
‒ Provides business insights using inferential/predictive, real-time data stream, text, and other advanced analytics
‒ Applies Advanced Analytics techniques, through RCG Data Scientists, to develop business insights and identify business actions
RCG’s Big Data Offerings for IT
IT Cost Reduction with Big Data
‒ Identifies the savings IT can achieve using Big Data and open source technologies in place of further investment in high-cost ETL products and massively parallel processing platforms
Big Data and Advanced Analytics Architecture
‒ Identifies the Big Data technologies best suited to your environment and needs of the business and develops the architecture that fits Big Data into your IT infrastructure
Big Data Technology Installation
‒ Sizes, configures, and sets up the cluster of Big Data storage required to support the needs of the business, installs the Big Data software, and trains the IT staff who will monitor, maintain, and use the Big Data installation
Big Data Proof of Concept (PoC)
The RCG Big Data Proof of Concept
demonstrates the business value of Big Data
using your data in RCG’s Big Data Lab with Big
Data technologies and analytics. This requires
no investment in Big Data hardware, software,
or skills in your IT or business units.
Big Data Proof of Concept Overview
1. Start with a real business problem or opportunity
2. Take real business data to RCG’s Big Data Lab
3. RCG’s Data Scientists apply Advanced Analytics
4. Get statistically significant business insights
5. Apply insights to solve the problem or act on the opportunity
Big Data PoC Objectives
The primary objective for RCG’s Big Data PoC is to demonstrate
the business value of Big Data analytics and business insights
using client data in RCG’s Big Data Lab.
This provides clients with:
‒ Access to RCG’s skills, experience, and facilities to jump start the
learning curve and application of Big Data analytics
‒ The ability to apply a Big Data technology of interest
‒ A low-cost, easy way to demonstrate the value of Big Data analytics
and business insights to enhance business results
The Big Data PoC requires no investment in hardware,
software, or skills in a client’s IT or business departments
RCG’s Big Data Lab
Our Big Data Lab has the capacity
and capability to help you make
sense of this to solve problems and
take advantage of opportunities
We can help you select
a Big Data option that
makes sense
for your company
12 nodes of Hadoop or NoSQL configuration
½ terabyte of memory
144 terabytes of storage
‘R’ and SAS statistical analysis technologies
Apache Hadoop project software
NoSQL and NewSQL options
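To put the lab figures above in perspective, the following is a rough sizing sketch rather than a description of how the lab is actually configured. It assumes the 144 terabytes and ½ terabyte of memory are spread evenly across the 12 nodes, and it assumes the HDFS default replication factor of 3; the function name and the returned keys are hypothetical.

```python
# Rough capacity arithmetic for the lab figures quoted on the slide.
# Assumptions (not stated in the deck): resources are spread evenly
# across nodes, and data is stored in HDFS with its default
# replication factor of 3.

NODES = 12
TOTAL_STORAGE_TB = 144
TOTAL_MEMORY_TB = 0.5
HDFS_REPLICATION = 3  # HDFS default; an assumption for this lab


def lab_capacity(nodes=NODES,
                 storage_tb=TOTAL_STORAGE_TB,
                 memory_tb=TOTAL_MEMORY_TB,
                 replication=HDFS_REPLICATION):
    """Return per-node and replicated-capacity estimates."""
    return {
        "storage_per_node_tb": storage_tb / nodes,
        "memory_per_node_gb": memory_tb * 1024 / nodes,
        # Each block is stored `replication` times, so the space
        # available for distinct data shrinks by that factor.
        "usable_storage_tb": storage_tb / replication,
    }


if __name__ == "__main__":
    for key, value in lab_capacity().items():
        print(f"{key}: {value:.2f}")
```

Under these assumptions, raw capacity overstates how much client data fits, since replicated storage and working space for intermediate results both reduce the usable total.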
Big Data PoC Approach

Identify a Business Problem Area
Activities:
‒ Conduct brief (1-2 hours) problem statement workshop with Business Executives
‒ Identify an objective, such as sales lift through better product recommendations or improving healthcare outcomes through personalized treatments
Client Activities:
‒ Participate in problem definition workshop
‒ Approve problem definition and its objectives
Work Products:
‒ Problem Statement

Collect Data Related to the Problem Area
Activities:
‒ Collect data related to the business problem, such as customer data, purchase history, emails, demographics, product data, product sales history, and so forth
‒ Mask sensitive data such as names, credit card or financial specifics, health information, and so forth
Client Activities:
‒ Collect data related to the problem definition
‒ Ensure data complies with privacy and regulatory policies
Work Products:
‒ File(s) of data to be used for the PoC

Load Data into RCG’s Big Data Lab
Activities:
‒ Transmit data to RCG (depending on the volume of data, this can be done over high speed communications or through physical media)
‒ Load data into RCG’s Big Data Lab
Client Activities:
‒ Provide collected data for analysis
Work Products:
‒ Big Data clusters set up in client’s technology of choice
‒ Client data loaded into the Big Data cluster

Apply Big Data Analytics
Activities:
‒ Perform initial analysis based on client requirements
‒ Develop and refine statistical models to focus on new insights
‒ Review with client and iterate as needed
‒ Present current model and define automation/validation process if needed
Client Activities:
‒ Review data insights and provide feedback
Work Products:
‒ Data insights, with their statistical significance

Produce Results and Insights
Activities:
‒ Develop results and insights
‒ Make recommendations for business actions and for applying Big Data
Client Activities:
‒ Review results and insights
‒ Review business action recommendations
‒ Review Big Data recommendations
Deliverables:
‒ Results and insights
‒ Recommendations for business actions
‒ Recommendations for implementing Big Data
Big Data PoC Approach: Identify a Business Problem Area
RCG Activities:
‒ Conduct brief (1-2 hours) problem statement workshop with Business Executives to identify:
• An objective for the PoC, such as sales lift through better product recommendations or improving healthcare outcomes through personalized treatments
• The data related to the business problem to analyze
• The technical environment to be set up for the PoC in RCG’s Big Data Lab
Client Responsibility:
‒ Participate in facilitated problem definition workshop
‒ Approve PoC Problem Statement
Work Products:
‒ Problem Statement
Big Data PoC Approach: Collect Data Related to the Problem Area
RCG Activities:
‒ None
Client Responsibility:
‒ Collect the data related to the business problem to analyze
‒ Mask sensitive data such as names, credit card or financial specifics, health information, and so forth
Work Products:
‒ Collected files of data to analyze
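The masking step above could look something like the sketch below. It is one possible approach, not RCG's procedure: the field names (`name`, `credit_card`, `diagnosis`), the salt handling, and the choice of a truncated salted SHA-256 hash are all illustrative assumptions, and a real extract would need rules agreed with the client's privacy and compliance teams.

```python
import hashlib

# Hypothetical field names; a real client extract will differ.
DROP_FIELDS = {"diagnosis"}            # health information: removed outright
PSEUDONYMIZE_FIELDS = {"name"}         # replaced by a salted hash token
PARTIAL_MASK_FIELDS = {"credit_card"}  # all but the last four digits hidden


def pseudonymize(value: str, salt: str) -> str:
    """Return a stable token so records can still be joined on it."""
    digest = hashlib.sha256((salt + value).encode("utf-8")).hexdigest()
    return digest[:16]


def mask_card(number: str) -> str:
    """Keep only the last four digits of a card number."""
    digits = [c for c in number if c.isdigit()]
    hidden = max(0, len(digits) - 4)
    return "*" * hidden + "".join(digits[-4:])


def mask_record(record: dict, salt: str) -> dict:
    """Apply the masking rules to one record, leaving other fields as-is."""
    masked = {}
    for key, value in record.items():
        if key in DROP_FIELDS:
            continue
        if key in PSEUDONYMIZE_FIELDS:
            masked[key] = pseudonymize(str(value), salt)
        elif key in PARTIAL_MASK_FIELDS:
            masked[key] = mask_card(str(value))
        else:
            masked[key] = value
    return masked
```

Using the same salt for every file keeps the pseudonymized tokens consistent, so purchase history and customer data can still be correlated in the lab without exposing the underlying names.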
Big Data PoC Approach: Load Data into RCG’s Big Data Lab
RCG Activities:
‒ Set up the technical environment for the PoC
‒ Load the client’s data for analysis
Client Responsibility:
‒ None
Work Products:
‒ Collected files of data loaded into the specified technical environment in RCG’s Big Data Lab
Big Data PoC Approach: Apply Big Data Analytics
RCG Activities:
‒ Perform initial analysis based on client requirements
‒ Review insights with client
‒ Develop and refine statistical models to further focus on new insights
‒ Review with client and iterate as needed
‒ Present current model and define automation/validation process if needed
Client Responsibility:
‒ Provide feedback on interim analysis results
Work Products:
‒ Data analysis and actionable statistical model(s) based on the insights discovered
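As an illustration of what checking an insight for statistical significance can involve, here is a minimal sketch of a two-proportion z-test, for example comparing purchase rates with and without a new product recommendation. The deck does not say which tests RCG's Data Scientists use, so the choice of test, the function name, and the inputs are assumptions made for illustration only.

```python
import math


def two_proportion_z_test(conv_a: int, n_a: int, conv_b: int, n_b: int):
    """Compare two conversion rates; return (z statistic, two-sided p-value).

    conv_a / n_a is the baseline group, conv_b / n_b the group that
    received the change (for example, new recommendations).
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled rate under the null hypothesis that both groups are the same.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / std_err
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


if __name__ == "__main__":
    z, p = two_proportion_z_test(conv_a=100, n_a=1000, conv_b=150, n_b=1000)
    print(f"z = {z:.2f}, p = {p:.4f}")
```

A small p-value suggests the observed lift is unlikely to be chance alone; whether it is large enough to act on remains a business judgment.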
Big Data PoC Approach: Produce Results and Insights
RCG Activities:
‒ Develop results and insights
‒ Make recommendations for business actions and for applying Big Data
Client Responsibility:
‒ Review results and insights
‒ Review business action recommendations
‒ Review Big Data recommendations
Deliverables:
‒ Results and insights
‒ Recommendations for business actions
‒ Recommendations for implementing Big Data and Advanced Analytics
Roles and Responsibilities
RCG Solution Architect
• Overall responsibility for RCG’s delivery of
the PoC
• Develops, with CLIENT, the problem
definition that the PoC will investigate
• Presents the PoC results and recommended
business actions
RCG Technical Specialist
• Implements the Big Data and Advanced
Analytics software in RCG’s Big Data Lab for
the PoC
• Loads the data collected by the CLIENT into
RCG’s Big Data Lab
RCG Data Scientist
• Provides Data Science experience and
knowledge to the project
• Develops Advanced Analytics models and
discovers data insights
• Determines the statistical significance of
these data insights and works to improve it
CLIENT's Project Manager
• Primary contact for RCG,
working to create and review
project schedule, milestones and
deliverables
• Participate in problem definition
workshop
CLIENT's Business Participants
• Provide CLIENT's business
expertise
• Participate in problem definition
workshop
• Identify information related to
the PoC’s problem definition
CLIENT's IT SMEs
• Provide CLIENT's IT expertise
• Participate in problem definition
workshop
• Collect CLIENT’s data related to
the PoC’s problem definition
Big Data PoC Timeline and Fees
• Estimated Duration: 3 to 5 weeks
• Estimated Total Fees: $35K to $55K, plus expenses (this does not include fees for RCG assistance to collect and prepare data, if it is required)
• Typical Resources: Big Data Solution Architect, Data Scientist, Big Data Technical Specialist
Project Timeline
Activity Week 1 Week 2 Week 3 Week 4 Week 5
Identify a Business Problem Area
Collect Data Related to the Problem Area
Load Data into RCG’s Big Data Lab
Apply Big Data Analytics
Produce Results and Insights
Our Brand Promise
Our reputation is built upon the premise that
we are a company that listens.
We bring a creative view to your
business initiative.
We are collaborative and accountable as
we jointly create your solution.
We continuously innovate from concept to result and
help you affect business change.
There will be no surprises.
Ideas. Realized.®

Call Girls in Mukherjee Nagar Delhi 8826158885 Genuine Escorts Service
 
Bhopal Call girl service 6289102337 bhopal escort service
Bhopal Call girl service 6289102337 bhopal escort serviceBhopal Call girl service 6289102337 bhopal escort service
Bhopal Call girl service 6289102337 bhopal escort service
 
Best VIP Call Girls Noida Sector 23 Call Me: 8700611579
Best VIP Call Girls Noida Sector 23 Call Me: 8700611579Best VIP Call Girls Noida Sector 23 Call Me: 8700611579
Best VIP Call Girls Noida Sector 23 Call Me: 8700611579
 
BADDI CALL GIRL 92628/71154 BADDI CALL G
BADDI CALL GIRL 92628/71154 BADDI CALL GBADDI CALL GIRL 92628/71154 BADDI CALL G
BADDI CALL GIRL 92628/71154 BADDI CALL G
 
DIGHA CALL GIRL 92628/1154 DIGHA CALL GI
DIGHA CALL GIRL 92628/1154 DIGHA CALL GIDIGHA CALL GIRL 92628/1154 DIGHA CALL GI
DIGHA CALL GIRL 92628/1154 DIGHA CALL GI
 
9999266834 Call Girls In Noida Sector 37 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 37 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 37 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 37 (Delhi) Call Girl Service
 
Call Girls In Goa For Fun 9316020077 By Goa Call Girls For Pick Up Night
Call Girls In  Goa  For Fun 9316020077 By  Goa  Call Girls For Pick Up NightCall Girls In  Goa  For Fun 9316020077 By  Goa  Call Girls For Pick Up Night
Call Girls In Goa For Fun 9316020077 By Goa Call Girls For Pick Up Night
 
Call Girls In {Connaught Place Delhi} 9667938988 IndianRussian High Profile E...
Call Girls In {Connaught Place Delhi} 9667938988 IndianRussian High Profile E...Call Girls In {Connaught Place Delhi} 9667938988 IndianRussian High Profile E...
Call Girls In {Connaught Place Delhi} 9667938988 IndianRussian High Profile E...
 
➥🔝9953056974 🔝▻ Anand Vihar Call-girl in Women Seeking Men 🔝Delhi🔝 NCR
➥🔝9953056974 🔝▻ Anand Vihar Call-girl in Women Seeking Men 🔝Delhi🔝 NCR➥🔝9953056974 🔝▻ Anand Vihar Call-girl in Women Seeking Men 🔝Delhi🔝 NCR
➥🔝9953056974 🔝▻ Anand Vihar Call-girl in Women Seeking Men 🔝Delhi🔝 NCR
 
RAJKOT CALL GIRLS 92628/71154 RAJKOT CAL
RAJKOT CALL GIRLS 92628/71154 RAJKOT CALRAJKOT CALL GIRLS 92628/71154 RAJKOT CAL
RAJKOT CALL GIRLS 92628/71154 RAJKOT CAL
 
+91-9899900591 Russian Call Girls In New Delhi Independent Russian Call Girls...
+91-9899900591 Russian Call Girls In New Delhi Independent Russian Call Girls...+91-9899900591 Russian Call Girls In New Delhi Independent Russian Call Girls...
+91-9899900591 Russian Call Girls In New Delhi Independent Russian Call Girls...
 
Call Girls in Luxus Grand Hotel | 💋 03274100048
Call Girls in Luxus Grand Hotel | 💋 03274100048Call Girls in Luxus Grand Hotel | 💋 03274100048
Call Girls in Luxus Grand Hotel | 💋 03274100048
 
Call Girls In Lahore || 03274100048 ||Lahore Call Girl Available 24/7
Call Girls In Lahore || 03274100048 ||Lahore Call Girl Available 24/7Call Girls In Lahore || 03274100048 ||Lahore Call Girl Available 24/7
Call Girls In Lahore || 03274100048 ||Lahore Call Girl Available 24/7
 
Call Girls | 😏💦 03274100048 | Call Girls Near Me
Call Girls | 😏💦 03274100048 | Call Girls Near MeCall Girls | 😏💦 03274100048 | Call Girls Near Me
Call Girls | 😏💦 03274100048 | Call Girls Near Me
 
Call Girls In {{Connaught Place Delhi}}96679@38988 Indian Russian High Profil...
Call Girls In {{Connaught Place Delhi}}96679@38988 Indian Russian High Profil...Call Girls In {{Connaught Place Delhi}}96679@38988 Indian Russian High Profil...
Call Girls In {{Connaught Place Delhi}}96679@38988 Indian Russian High Profil...
 
Call Girls in B-18 Islamabad || 🔝 03274100048
Call Girls in B-18 Islamabad || 🔝 03274100048Call Girls in B-18 Islamabad || 🔝 03274100048
Call Girls in B-18 Islamabad || 🔝 03274100048
 
+91-9310611641 Russian Call Girls In New Delhi Independent Russian Call Girls...
+91-9310611641 Russian Call Girls In New Delhi Independent Russian Call Girls...+91-9310611641 Russian Call Girls In New Delhi Independent Russian Call Girls...
+91-9310611641 Russian Call Girls In New Delhi Independent Russian Call Girls...
 
Call Now ☎9870417354|| Call Girls in Dwarka Escort Service Delhi N.C.R.
Call Now ☎9870417354|| Call Girls in Dwarka Escort Service Delhi N.C.R.Call Now ☎9870417354|| Call Girls in Dwarka Escort Service Delhi N.C.R.
Call Now ☎9870417354|| Call Girls in Dwarka Escort Service Delhi N.C.R.
 
Call Girls In {{Green Park Delhi}}9667938988 Indian Russian High Profile Esco...
Call Girls In {{Green Park Delhi}}9667938988 Indian Russian High Profile Esco...Call Girls In {{Green Park Delhi}}9667938988 Indian Russian High Profile Esco...
Call Girls In {{Green Park Delhi}}9667938988 Indian Russian High Profile Esco...
 

Big Data Proof of Concept

  • 1. Ideas. Realized. ® RCG Global Services Business Intelligence Big Data Proof of Concept July 2014 © 2014 RCG. All Rights Reserved. Proprietary and Confidential.
  • 2. Big Data Architecture Goes Beyond BI
      What ‘Traditional’ BI Misses . . . “More than 80% of all data in an enterprise is unstructured information. Unfortunately, attempts to leverage this resource often fail because many businesses lack the technology to utilize content that resides outside the scope of structured databases.”
      Is What Big Data Is Designed to Deliver “The practices and technology that close the gap between [all types of] data available and the ability to turn that data into business insight.”
      [http://www.aiim.org/Research-and-Publications/Research/White-Papers/Data-is-Unstructured-Information]
  • 3. The Big Data Landscape Is Complex
  • 4. Big Data Challenges
      ‒ Understanding and architecting solutions incorporating Big Data technologies (Hadoop, NewSQL, NoSQL, in-memory, and so forth)
      ‒ Navigating Hadoop, its ‘projects’/components, and packaged Hadoop options in the market
      ‒ Knowing which Big Data solution best meets your needs
      ‒ Planning, sizing, installing, and using a Big Data server complex
      ‒ Incorporating Big Data into existing data management and governance processes
      ‒ Delivering analytic results from Big Data volumes and variety of data types, particularly real-time data stream analysis, ad hoc queries and searches, and inferential analytics
  • 5. Examples of RCG’s Big Data Experience
      ‒ IT Cost Reduction: Savings of $3 million and processing time reduction from 4.5 hours to 1.5 hours realized by using Big Data technologies rather than traditional ETL and MPP database options
      ‒ RCG’s Big Data Lab: Demonstrates Big Data technologies and produces Advanced Analytics Insights with client data
      ‒ Click Stream Analysis: Real-time click stream analysis and correlation to in-store purchase history
      ‒ SKU Analysis: Analyze sales, inventories, and delivery logistics by SKU by day over years of history using Big Data technology and architecture recommendations
      ‒ Business Operations Analysis: Near real-time analysis of business operations to manage inventories, adjust pricing, and manage promotions
      ‒ ROIC Analysis for Store Renovations: Advanced Analytics doubled ROIC, increased store profitability, and reduced capital allocated for store renovations by $150M
  • 6. RCG’s Big Data Offerings for Business
      ‒ Big Data Proof of Concept: Demonstrates the business value of Big Data using your data in RCG’s Big Data Lab with Big Data technologies and analytics. Requires no investment in Big Data hardware, software, or skills in IT or business units.
      ‒ Advanced Analytics Roadmap: Identifies how Advanced Analytics can support your business goals with technologies that fit in your IT environment. Provides a Roadmap of projects to deliver business value and add the capabilities needed for successful advanced analytics.
      ‒ Advanced Analytics and Insight: Provides business insights using inferential/predictive, real-time data stream, text, and other advanced analytics. RCG Data Scientists apply Advanced Analytics techniques to develop business insights and identify business actions.
  • 7. RCG’s Big Data Offerings for IT
      ‒ IT Cost Reduction with Big Data: Identifies the savings IT can achieve using Big Data and open source technologies in place of further investment in high-cost ETL products and massively parallel processing platforms.
      ‒ Big Data and Advanced Analytics Architecture: Identifies the Big Data technologies best suited to your environment and the needs of the business, and develops the architecture that fits Big Data into your IT infrastructure.
      ‒ Big Data Technology Installation: Sizes, configures, and sets up the cluster of Big Data storage required to support the needs of the business, installs the Big Data software, and trains the IT staff who will monitor, maintain, and use the Big Data installation.
  • 8. Big Data Proof of Concept (PoC) The RCG Big Data Proof of Concept demonstrates the business value of Big Data using your data in RCG’s Big Data Lab with Big Data technologies and analytics. This requires no investment in Big Data hardware, software, or skills in your IT or business units.
  • 9. Big Data Proof of Concept Overview
      1. Start with a real business problem or opportunity
      2. Take real business data to RCG’s Big Data Lab
      3. RCG’s Data Scientists apply Advanced Analytics
      4. Get statistically significant business insights
      5. Apply insights to solve the problem or act on the opportunity
  • 10. Big Data PoC Objectives The primary objective for RCG’s Big Data PoC is to demonstrate the business value of Big Data analytics and business insights using client data in RCG’s Big Data Lab. This provides clients with:
      ‒ Access to RCG’s skills, experience, and facilities to jump-start the learning curve and application of Big Data analytics
      ‒ The ability to apply a Big Data technology of interest
      ‒ A low-cost, easy way to demonstrate the value of Big Data analytics and business insights to enhance business results
      The Big Data PoC requires no investment in hardware, software, or skills in a client’s IT or business departments.
  • 11. RCG’s Big Data Lab Our Big Data Lab has the capacity and capability to help you make sense of this to solve problems and take advantage of opportunities. We can help you select a Big Data option that makes sense for your company.
      ‒ 12 nodes of Hadoop or NoSQL configuration
      ‒ ½ terabyte of memory
      ‒ 144 terabytes of storage
      ‒ ‘R’ and SAS statistical analysis technologies
      ‒ Apache Hadoop project software
      ‒ NoSQL and NewSQL options
  • 12. Big Data PoC Approach
      Identify a Business Problem Area
        ‒ Activities: Conduct brief (1-2 hours) problem statement workshop with Business Executives. Identify an objective, such as sales lift through better product recommendations or improving healthcare outcomes through personalized treatments.
        ‒ Client Activities: Participate in problem definition workshop. Approve problem definition and its objectives.
        ‒ Work Products: Problem Statement
      Collect Data Related to the Problem Area
        ‒ Activities: Collect data related to the business problem, such as customer data, purchase history, emails, demographics, product data, product sales history, and so forth. Mask sensitive data such as names, credit card or financial specifics, health information, and so forth.
        ‒ Client Activities: Collect data related to the problem definition. Ensure data complies with privacy and regulatory policies.
        ‒ Work Products: File(s) of data to be used for the PoC
      Load Data into RCG’s Big Data Lab
        ‒ Activities: Transmit data to RCG (depending on the volume of data, this can be done over high speed communications or through physical media). Load data into RCG’s Big Data Lab.
        ‒ Client Activities: Provide collected data for analysis
        ‒ Work Products: Big Data clusters set up in client’s technology of choice. Client data loaded into the Big Data cluster.
      Apply Big Data Analytics
        ‒ Activities: Perform initial analysis based on client requirements. Develop and refine statistical models to focus on new insights. Review with client and iterate as needed. Present current model and define automation/validation process if needed.
        ‒ Client Activities: Review data insights and provide feedback
        ‒ Work Products: Data insights, with their statistical significance
      Produce Results and Insights
        ‒ Activities: Develop results and insights. Make recommendations for business actions and for applying Big Data.
        ‒ Client Activities: Review results and insights. Review business action recommendations. Review Big Data recommendations.
        ‒ Deliverables: Results and insights. Recommendations for business actions. Recommendations for implementing Big Data.
  • 13. Big Data PoC Approach: Identify a Business Problem Area
      RCG Activities
        ‒ Conduct brief (1-2 hours) problem statement workshop with Business Executives to identify:
            • An objective for the PoC, such as sales lift through better product recommendations or improving healthcare outcomes through personalized treatments
            • The data related to the business problem to analyze
            • The technical environment to be set up for the PoC in RCG’s Big Data Lab
      Client Responsibility
        ‒ Participate in facilitated problem definition workshop
        ‒ Approve PoC Problem Statement
      Work Products
        ‒ Problem Statement
  • 14. Big Data PoC Approach: Collect Data Related to the Problem Area
      RCG Activities
        ‒ None
      Client Responsibility
        ‒ Collect the data related to the business problem to analyze
        ‒ Mask sensitive data such as names, credit card or financial specifics, health information, and so forth
      Work Products
        ‒ Collected files of data to analyze
  • 15. Big Data PoC Approach: Load Data into RCG’s Big Data Lab
      RCG Activities
        ‒ Set up the technical environment for the PoC
        ‒ Load the CLIENT’s data for analysis
      Client Responsibility
        ‒ None
      Work Products
        ‒ Collected files of data loaded into the specified technical environment in RCG’s Big Data Lab
  • 16. Big Data PoC Approach: Apply Big Data Analytics
      RCG Activities
        ‒ Perform initial analysis based on client requirements
        ‒ Review insights with client
        ‒ Develop and refine statistical models to further focus on new insights
        ‒ Review with client and iterate as needed
        ‒ Present current model and define automation/validation process if needed
      Client Responsibility
        ‒ Provide feedback on interim analysis results
      Work Products
        ‒ Data analysis, actionable statistical model(s) based on insights discovered
  • 17. Big Data PoC Approach: Produce Results and Insights
      RCG Activities
        ‒ Develop results and insights
        ‒ Make recommendations for business actions and for applying Big Data
      Client Responsibility
        ‒ Review results and insights
        ‒ Review business action recommendations
        ‒ Review Big Data recommendations
      Deliverables
        ‒ Results and insights
        ‒ Recommendations for business actions
        ‒ Recommendations for implementing Big Data and Advanced Analytics
  • 18. Roles and Responsibilities
      RCG Solution Architect
        • Overall responsibility for RCG’s delivery of the PoC
        • Develops, with CLIENT, the problem definition that the PoC will investigate
        • Presents the PoC results and recommended business actions
      RCG Technical Specialist
        • Implements the Big Data and Advanced Analytics software in RCG’s Big Data Lab for the PoC
        • Loads the data collected by the CLIENT into RCG’s Big Data Lab
      RCG Data Scientist
        • Provides Data Science experience and knowledge to the project
        • Develops Advanced Analytics models and discovers data insights
        • Determines the statistical significance of these data insights and works to improve it
      CLIENT's Project Manager
        • Primary contact for RCG, working to create and review project schedule, milestones, and deliverables
        • Participates in problem definition workshop
      CLIENT's Business Participants
        • Provide CLIENT's business expertise
        • Participate in problem definition workshop
        • Identify information related to the PoC’s problem definition
      CLIENT's IT SMEs
        • Provide CLIENT's IT expertise
        • Participate in problem definition workshop
        • Collect CLIENT’s data related to the PoC’s problem definition
  • 19. Big Data PoC Timeline and Fees
      ‒ Estimated Duration: 3 to 5 weeks
      ‒ Estimated Total Fees: $35K to $55K, plus expenses (this does not include fees for RCG assistance to collect and prepare data, if it is required)
      ‒ Typical Resources: Big Data Solution Architect, Data Scientist, Big Data Technical Specialist
      ‒ Project Timeline (chart): the five activities (Identify a Business Problem Area, Collect Data Related to the Problem Area, Load Data into RCG’s Big Data Lab, Apply Big Data Analytics, Produce Results and Insights) scheduled across Weeks 1 to 5
  • 20. Our Brand Promise Our reputation is built upon the premise that we are a company that listens. We bring a creative view to your business initiative. We are collaborative and accountable as we jointly create your solution. We continuously innovate from concept to result and help you affect business change. There will be no surprises. Ideas. Realized.®
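
The Collect Data step above asks the client to mask sensitive data such as names and credit card specifics before sending files to the Lab. The slides do not say how masking is done, so the following is only a minimal sketch of one possible approach: replacing chosen fields with a salted hash so records can still be joined on the masked value. The field names, the salt, and the functions `mask_value` and `mask_record` are all hypothetical, and hashing of this kind is pseudonymization rather than full anonymization.

```python
import hashlib

# Hypothetical field names; a real PoC would take these from the client's own data dictionary.
SENSITIVE_FIELDS = {"name", "card_number"}


def mask_value(value: str, salt: str) -> str:
    """Replace a sensitive value with a salted SHA-256 digest, truncated to 16 hex characters."""
    digest = hashlib.sha256((salt + value).encode("utf-8")).hexdigest()
    return digest[:16]


def mask_record(record: dict, salt: str) -> dict:
    """Return a copy of the record with sensitive fields masked and all other fields unchanged."""
    return {
        key: mask_value(val, salt) if key in SENSITIVE_FIELDS else val
        for key, val in record.items()
    }


if __name__ == "__main__":
    # Illustrative record only; not taken from any client data.
    row = {"name": "Jane Doe", "card_number": "4111111111111111", "sku": "A-100", "amount": "19.99"}
    print(mask_record(row, salt="poc-example-salt"))
```

Because the same input and salt always give the same masked value, purchase history can still be grouped per customer after masking. The salt would need to stay with the client, since low-entropy values such as card numbers could otherwise be recovered by brute force.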

Editor's Notes

  1. Forrester’s definition of Big Data: “the practices and technology that close the gap between [all types of] data available and the ability to turn that data into business insight.”
  2. Lab configuration:
     ‒ 12 nodes of Hadoop or NoSQL configuration: this reflects the use of the lab for Proofs of Concept, not necessarily production-level support
     ‒ ½ terabyte of memory
     ‒ 144 terabytes of storage: this provides for a meaningful amount of data to be stored for data science analytics
     ‒ ‘R’ and SAS statistical analysis technologies
     ‒ Apache Hadoop project software, including HDFS, HBase, Hive, Pig, Sqoop, Yarn, Zookeeper, Mahout, Tez, Flume, Ambari, Oozie, Falcon, Knox, Accumulo, Storm, Kafka, add-ons and connectors to Microsoft, Oracle, Teradata, Informatica, and Talend, and Cloudera, Hortonworks, and MapR Hadoop packages
     ‒ NoSQL and NewSQL options, including Cassandra, Couchbase, MongoDB, and HPCC
  3. Here are my thoughts on Big Data PoC proposals. I suggest that:
     ‒ Identify a Business Problem Area be a half-day Solution Build type of activity; it may be helpful if this were "free" (no cost for the activity, with its cost built into the costs of the next steps).
     ‒ Collect Data may require RCG assistance onsite at the rates Rob quoted; this step may take time and should be T&M and not count against a Lab timebox.
     ‒ Load Data into the Lab is when the period of time starts; the Big Data Environment Specialist configures the environment for the PoC, which can be done while Collect Data is happening, and loads client data into the Lab.
     ‒ Apply Analytics is where the work is; three weeks should be a good start, as long as we can coordinate our analytic resources; it may be desirable to include a Manila resource or two to generate more models and insights.
     ‒ Produce Results and Insights should happen in the third week or so, allowing for an iteration or two with the client.
     This is one week of a Big Data Environment Specialist ($7,000), three weeks of a Big Data Scientist ($24,000), and three weeks for two Manila-based Big Data Analysts (around $12,000), totaling about $45,000 if the costs for step 1 are included. But it will depend on the expectations of the client and how sophisticated the statistical models need to be to meet those expectations.
     So, I suggest the proposal to JC Penney should be: We will come in to Identify a Business Problem Area JC Penney wants us to attack and the data needed to analyze it, decide whether we need to help Collect Data, and determine how much Apply Analytics we need to do. We do this for "free" and adjust the price of the PoC depending on what expectations JC Penney has for this.
     We can say that a ballpark price for a PoC is $45,000, but that the price can vary based on how extensive the PoC is.
     Just some thoughts on the matter . . .
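
The fee estimate in note 3 can be checked with a little arithmetic. The sketch below only adds up the three figures quoted in the note; the variable names are illustrative, and the amount attributed to step 1 is an inference from the quoted $45,000 ballpark, not a figure the note states.

```python
# Figures quoted in note 3 (USD).
line_items = {
    "Big Data Environment Specialist, 1 week": 7_000,
    "Big Data Scientist, 3 weeks": 24_000,
    "Two Manila-based Big Data Analysts, 3 weeks": 12_000,
}

# Sum of the three quoted staffing costs.
staffing_total = sum(line_items.values())

# The note's ballpark price, which it says includes the cost of step 1.
ballpark = 45_000

# Inferred remainder attributable to step 1 (an assumption, not stated in the note).
implied_step1 = ballpark - staffing_total

# Implied weekly rates, derived from the quoted totals.
scientist_per_week = line_items["Big Data Scientist, 3 weeks"] // 3
analyst_per_week = line_items["Two Manila-based Big Data Analysts, 3 weeks"] // (3 * 2)

if __name__ == "__main__":
    print(staffing_total, implied_step1, scientist_per_week, analyst_per_week)
```

The three staffing items sum to $43,000, so the quoted ballpark leaves roughly $2,000 for the step 1 workshop. Both figures fall inside the $35K to $55K range given on the Timeline and Fees slide.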