SlideShare une entreprise Scribd logo
1  sur  34
Télécharger pour lire hors ligne
© 2015 IBM Corporation
From Concept to Production:
Nationwide Insurance IBM BigInsights
Journey with Telematics # 2404
Krish Rajaram & Rajesh Nandagiri – 10/26/2015
Big Data and
Analytics Helps
Nationwide
Customers Become
Better Drivers
Agenda
Introduction
Architecture
Data Processing
Data Access
Business Benefits
2
0.67
8.30 8.30
0.00
1.00
2.00
3.00
4.00
5.00
6.00
7.00
8.00
9.00
0.00
100.00
200.00
300.00
400.00
500.00
600.00
700.00
1 Hr Batch 12 Hrs Loop 12 Hrs batch
DataVolumeinGB
ElapseTimeinMins
Cycles
AfterRedesign
First Iteration
Introduction
About Nationwide
SmartRide Program
SmartRide Data
About Nationwide
4
16+
MILLION
POLICIES
25MILLION
CONTRIBUTED
TO NONPROFITS
AND COMMUNITIES
$
1#
INSURER OF
FARMS AND
RANCHES
7LARGEST
HOMEOWNER AND
AUTO INSURANCE
PROVIDER IN THE U.S.
th
GALLUP
GREAT PLACE
TO WORK
AWARD WINNER
3 YEARS RUNNING
LARGEST PET
INSURER IN THE U.S.
9th
LARGEST
COMMERCIAL
INSURER
$23.9 BILLION IN REVENUE FOR 2013
Nationwide has approximately 31,000 associates
serving customers in nearly every state.
1#
PROVIDER OF
PUBLIC-SECTOR
RETIREMENT
PLANS
FOUNDED IN 1926 BY
MEMBERS OF THE
OHIO FARM BUREAU
28th
COMPUTERWORLD
GREAT PLACE TO
WORK IN IT
About SmartRide
• SmartRide is Nationwide's version of Telematics, offered to
customers to help them improve their driving behavior and save
on insurance premiums.
5
• Customers install a small device into their vehicle for 6 months
which measures…
SmartRide Data Characteristics
 Multiple vendors
 Files of different layouts arriving at different frequencies:
 Hourly
 Every 4 hrs
 Four CSV files per vendor
 ~ 30 GB to ~ 60 GB of data per day
 Data challenges
 Late arriving trips
 Partial trips
 Duplicate trips
 Orphan trips
6
Trip Data Characteristics
• Missing Timestamp & Speed Spike
• Acceleration Lag
7
vin_nb trip_nb position_ts Speed engine_rpm
abc 123 2015-07-21 12:31:36.0 54 1600
abc 123 2015-07-21 12:31:39.0 55 1800
abc 123 2015-07-21 12:31:42.0 57 1500
abc 123 2015-07-21 12:31:43.0 82 1600
abc 123 2015-07-21 12:31:44.0 58 1500
vin_nb trip_nb position_ts Speed engine_rpm
abc 123 2015-06-30 21:25:05.0 0 700
abc 123 2015-06-30 21:25:06.0 0 700
abc 123 2015-06-30 21:25:07.0 0 1000
abc 123 2015-06-30 21:25:08.0 8 1800
abc 123 2015-06-30 21:25:09.0 15 2000
Architecture
Logical Data Flow
IBM® BigInsights™ Configuration
Decision Catalog
Job Orchestration
Logical Data Flow
9
IBM® BigInsights™ for Apache™ Hadoop Configuration
• Version 2.1.2
 6 Management Nodes and 16 Data Nodes
 Each with 128 GB RAM and 18 TB of storage
 Hadoop 2.2, BigSQL 1.0, Hive 0.12, Hbase 0.96
• Three environments
 Dev, Test, and Production
All same configuration
• Limitations
 No workload management
 No environment for DR
 Used Test Cluster for Hbase failover
10
Decision Catalog
11
Job Orchestration
12
Data Processing
Design Considerations
Phases of Data Movement
Batch Performance Metrics
Design Considerations
• One hour window for end
to end processing
 Handling data issues
 Summarization
 Multiple cycles per
day
• Predictable run time for
backlog processing when
jobs fail
• Reloading incorrect
batch
• Restart failed batch
14
0.67
8.30 8.30
0.00
1.00
2.00
3.00
4.00
5.00
6.00
7.00
8.00
9.00
0.00
100.00
200.00
300.00
400.00
500.00
600.00
700.00
1 Hr Batch 12 Hrs Loop 12 Hrs batch
ElapseTimeinMins
Cycles
TripScrub TripSecDetails TripSum
State Median Hbase load AuditEn
raw2can Input Size in GB
Acquire Phase
15
 Raw trip files
copied into HDFS
using WebHDFS
protocol
 Folders created by
vendor, file, load
event ID, and batch #
 Used Sqoop to
transfer 4 TB of
historical data from
Data Warehouse
 Hive external tables
for each file
 Partitioned by load
event ID and batch #
 Used both BigSQL
1.0 and HiveQL
 Partitioned external tables helped in
o Processing backlog data
o Reprocessing incorrect batches
Standardize Phase
16
 Select data from
external tables based
on load event ID
 Each load event ID
can include one or
more batches
 More than one load
event ID can be
processed in one
cycle
 Data moved to next stage only from work tables
 Helped in performance
 Dynamic partitions helped in loading multiple batches
 Partitions get overwritten if already exists
 Helped in reprocessing incorrect batch
Work tables contain
data for CURRENT
processing cycle
Canonical tables
partitioned by
source and batch #
Load using
dynamic
partitioning
Data Scrubbing & Event Calculation
17
Trip
summary
Trip point
 Map side join
 Single read
multi write
Orphan
trips
Trip points
(Work table)
 Java M/R program for
o Scrubbing
o Events calculation
 Night time driving
 Hard brake
 Fast acceleration
 Miles driven
Events at
seconds level
(Work table)
 Very good performance gain
 Using Java for complex scrubbing rules
 Single read multiple writes
 Only required data points processed
 No data persisted to corpse tables
Summarization Phase
18
Events at
seconds level
(Work table)
 Gather all trips related to
devices from current trip
and aggregate at various
levels
 Union ALL
 UDF to store data points
for trip graph
 Replace new summary
info into final table
SRE summary
(Work table)
SRE summary
partitioned by
source
SRE summary
in HBase
 Parallelized the Union All operation
 Partitioning by Source enabled both Vendor
data to be processed at same time if overlap
happens
 PUT from Hive to Hbase, WAL disabled
 Shorten column
names
 Changed to epoch
time
 Prefix salting key
 Generate rowkey
 Column family
mapping
Batch Performance Metrics
19
1 Hr
Batch
SLA
0
0.5
1
1.5
2
2.5
0
5
10
15
20
25
30
35
40
DataSizeinGB
RuntimeinMins
Cycle Schedule Time
Avg Run Times for Hourly Cycles
0
2
4
6
8
10
12
0
10
20
30
40
50
60
0000 0400 0800 1200 1600 2000
DataSizeinGB
RuntimeinMins
Cycle Schedule Time
Avg Run Times for 4 hr Cycles
Trip Second Details
Standarize
SRE Trip Summary Hive
SRE Trip Summary Hbase
Audits
Acquire
Size in GB
Data Access
SmartRide Web Page
Application Layer
Column Family and Row Key Design
Performance Metrics
SmartRide Web Page
21
SmartRide Web Page – Daily
22
Application Layer
23
Data
Access
Layer
HBASE
API
Restful
Service
Single Page
Web App
BigSQL
&
Hive
Aggregates
Daily
HDFS
HBase
HRegion
Server
HRegion
HLog
Memstore
HFile
ODS – DB2
ODBC
Column Family & RowKey Design
24
RowKey –
Pfx_pgmId_pdflg_
timestamp
Column Family –
Summary Data
Column Family –
Trip-point Data
12_8798782_Tp_201
5080912000000
SM:miles,1500001245,’15’,
SM:hb,1500001245,’2’,
SM:fa,1500001245,’5’,
SM:nt,1500001245,’Y’
TP:Trip,1500001245,’{JSON
BLOB}’
Sorted
Lexicographically
• Column family (CF) helps in grouping the related columns
depending on access pattern.
• Co-locating the keys related to one customer in one region to
access data using filter from one region server.
Performance Metrics
Scenarios – 1x, 2x, 3x concurrent users, Zookeeper node going
down, Datanode unavailable
Tools used – Initial test using custom program, LoadRunner for
final test, SiteScope for monitoring resource consumption
25
SLA for aggregates – 5 sec
# of concurrent users - 1200
HBase Data Distribution – Using Hannibal
26
SmartRide Data Distribution
Business Benefits
• Deeper Engagement with Members
 Over 2 million website page views since the July
launch. To put in perspective, our vendor-hosted
website would receive 100,000 views in a 12 month
period.
 Over 60K users have accessed the new site and 90%
of those are new users.
• Increase in bind ratios across all channels
• Improvement in loss ratios
• Enterprise first "big data" implementation at Nationwide
27
Future scope – Personal and Commercial Fleet
28
Insights Give
Nationwide
Competitive
Advantage
Weather
Data
GPS
Data
Hourly
Trip Data
from
Device
Claims
Data
Other
Public
Records
© 2015 IBM Corporation
Thank You
We Value Your Feedback!
Don’t forget to submit your Insight session and speaker
feedback! Your feedback is very important to us – we use it
to continually improve the conference.
Access your surveys at insight2015survey.com to quickly
submit your surveys from your smartphone, laptop, or
conference kiosk.
30
31
Notices and Disclaimers
Copyright © 2015 by International Business Machines Corporation (IBM). No part of this document may be reproduced or transmitted in any form
without written permission from IBM.
U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM.
Information in these presentations (including information relating to products that have not yet been announced by IBM) has been reviewed for
accuracy as of the date of initial publication and could include unintentional technical or typographical errors. IBM shall have no responsibility to
update this information. THIS DOCUMENT IS DISTRIBUTED "AS IS" WITHOUT ANY WARRANTY, EITHER EXPRESS OR IMPLIED. IN NO
EVENT SHALL IBM BE LIABLE FOR ANY DAMAGE ARISING FROM THE USE OF THIS INFORMATION, INCLUDING BUT NOT LIMITED TO,
LOSS OF DATA, BUSINESS INTERRUPTION, LOSS OF PROFIT OR LOSS OF OPPORTUNITY. IBM products and services are warranted
according to the terms and conditions of the agreements under which they are provided.
Any statements regarding IBM's future direction, intent or product plans are subject to change or withdrawal without notice.
Performance data contained herein was generally obtained in a controlled, isolated environments. Customer examples are presented as
illustrations of how those customers have used IBM products and the results they may have achieved. Actual performance, cost, savings or other
results in other operating environments may vary.
References in this document to IBM products, programs, or services does not imply that IBM intends to make such products, programs or services
available in all countries in which IBM operates or does business.
Workshops, sessions and associated materials may have been prepared by independent session speakers, and do not necessarily reflect the
views of IBM. All materials and discussions are provided for informational purposes only, and are neither intended to, nor shall constitute legal or
other guidance or advice to any individual participant or their specific situation.
It is the customer’s responsibility to insure its own compliance with legal requirements and to obtain advice of competent legal counsel as to the
identification and interpretation of any relevant laws and regulatory requirements that may affect the customer’s business and any actions the
customer may need to take to comply with such laws. IBM does not provide legal advice or represent or warrant that its services or products will
ensure that the customer is in compliance with any law.
32
Notices and Disclaimers (con’t)
Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements or other publicly
available sources. IBM has not tested those products in connection with this publication and cannot confirm the accuracy of performance,
compatibility or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the
suppliers of those products. IBM does not warrant the quality of any third-party products, or the ability of any such third-party products to
interoperate with IBM’s products. IBM EXPRESSLY DISCLAIMS ALL WARRANTIES, EXPRESSED OR IMPLIED, INCLUDING BUT NOT
LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.
The provision of the information contained herein is not intended to, and does not, grant any right or license under any IBM patents, copyrights,
trademarks or other intellectual property right.
• IBM, the IBM logo, ibm.com, Aspera®, Bluemix, Blueworks Live, CICS, Clearcase, Cognos®, DOORS®, Emptoris®, Enterprise Document
Management System™, FASP®, FileNet®, Global Business Services ®, Global Technology Services ®, IBM ExperienceOne™, IBM
SmartCloud®, IBM Social Business®, Information on Demand, ILOG, Maximo®, MQIntegrator®, MQSeries®, Netcool®, OMEGAMON,
OpenPower, PureAnalytics™, PureApplication®, pureCluster™, PureCoverage®, PureData®, PureExperience®, PureFlex®, pureQuery®,
pureScale®, PureSystems®, QRadar®, Rational®, Rhapsody®, Smarter Commerce®, SoDA, SPSS, Sterling Commerce®, StoredIQ,
Tealeaf®, Tivoli®, Trusteer®, Unica®, urban{code}®, Watson, WebSphere®, Worklight®, X-Force® and System z® Z/OS, are trademarks of
International Business Machines Corporation, registered in many jurisdictions worldwide. Other product and service names might be
trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at:
www.ibm.com/legal/copytrade.shtml.
• IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal
without notice at IBM’s sole discretion.
• Information regarding potential future products is intended to outline our general product direction
and it should not be relied on in making a purchasing decision.
• The information mentioned regarding potential future products is not a commitment, promise, or
legal obligation to deliver any material, code or functionality. Information about potential future
products may not be incorporated into any contract.
• The development, release, and timing of any future features or functionality described for our
products remains at our sole discretion.
Performance is based on measurements and projections using standard IBM benchmarks in a
controlled environment. The actual throughput or performance that any user will experience will vary
depending upon many factors, including considerations such as the amount of multiprogramming in the
user’s job stream, the I/O configuration, the storage configuration, and the workload processed.
Therefore, no assurance can be given that an individual user will achieve results similar to those stated
here.
Please Note:
2

Contenu connexe

Tendances

Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence ArchitecturePhilippe Julio
 
Introduction to Business Intelligence
Introduction to Business IntelligenceIntroduction to Business Intelligence
Introduction to Business IntelligenceAlmog Ramrajkar
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...MapR Technologies
 
MT13 - Keep your business processing operating at peak efficiency with Dell E...
MT13 - Keep your business processing operating at peak efficiency with Dell E...MT13 - Keep your business processing operating at peak efficiency with Dell E...
MT13 - Keep your business processing operating at peak efficiency with Dell E...Dell EMC World
 
IBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big DataIBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big DataIBM Analytics
 
Agile NoSQL With XRX
Agile NoSQL With XRXAgile NoSQL With XRX
Agile NoSQL With XRXDATAVERSITY
 
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data ModelerThe Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data ModelerDATAVERSITY
 
Microsoft Data Warehousing
Microsoft Data Warehousing Microsoft Data Warehousing
Microsoft Data Warehousing Glenture
 
Bi presentation to bkk
Bi presentation to bkkBi presentation to bkk
Bi presentation to bkkguest4e975e2
 
Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714Niu Bai
 
IBM InfoSphere Stewardship Center for iis dqec
IBM InfoSphere Stewardship Center for iis dqecIBM InfoSphere Stewardship Center for iis dqec
IBM InfoSphere Stewardship Center for iis dqecIBMInfoSphereUGFR
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsRyan Gross
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Erik Fransen
 
Ibm leads way with hadoop and spark 2015 may 15
Ibm leads way with hadoop and spark 2015 may 15Ibm leads way with hadoop and spark 2015 may 15
Ibm leads way with hadoop and spark 2015 may 15IBMInfoSphereUGFR
 
Présentation IBM InfoSphere Information Server 11.3
Présentation IBM InfoSphere Information Server 11.3Présentation IBM InfoSphere Information Server 11.3
Présentation IBM InfoSphere Information Server 11.3IBMInfoSphereUGFR
 
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy:  A Simple, Scalable Solution for Getting Started with HadoopBig Data Made Easy:  A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with HadoopPrecisely
 
Big Data & Analytics Architecture
Big Data & Analytics ArchitectureBig Data & Analytics Architecture
Big Data & Analytics ArchitectureArvind Sathi
 
Third Nature - Open Source Data Warehousing
Third Nature - Open Source Data WarehousingThird Nature - Open Source Data Warehousing
Third Nature - Open Source Data Warehousingmark madsen
 

Tendances (20)

Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence Architecture
 
Introduction to Business Intelligence
Introduction to Business IntelligenceIntroduction to Business Intelligence
Introduction to Business Intelligence
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
 
MT13 - Keep your business processing operating at peak efficiency with Dell E...
MT13 - Keep your business processing operating at peak efficiency with Dell E...MT13 - Keep your business processing operating at peak efficiency with Dell E...
MT13 - Keep your business processing operating at peak efficiency with Dell E...
 
Hadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data WarehouseHadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data Warehouse
 
IBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big DataIBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big Data
 
Agile NoSQL With XRX
Agile NoSQL With XRXAgile NoSQL With XRX
Agile NoSQL With XRX
 
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data ModelerThe Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
 
Microsoft Data Warehousing
Microsoft Data Warehousing Microsoft Data Warehousing
Microsoft Data Warehousing
 
Bi presentation to bkk
Bi presentation to bkkBi presentation to bkk
Bi presentation to bkk
 
Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714
 
IBM InfoSphere Stewardship Center for iis dqec
IBM InfoSphere Stewardship Center for iis dqecIBM InfoSphere Stewardship Center for iis dqec
IBM InfoSphere Stewardship Center for iis dqec
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data ops
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
 
Ibm leads way with hadoop and spark 2015 may 15
Ibm leads way with hadoop and spark 2015 may 15Ibm leads way with hadoop and spark 2015 may 15
Ibm leads way with hadoop and spark 2015 may 15
 
Présentation IBM InfoSphere Information Server 11.3
Présentation IBM InfoSphere Information Server 11.3Présentation IBM InfoSphere Information Server 11.3
Présentation IBM InfoSphere Information Server 11.3
 
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy:  A Simple, Scalable Solution for Getting Started with HadoopBig Data Made Easy:  A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
 
Big Data & Analytics Architecture
Big Data & Analytics ArchitectureBig Data & Analytics Architecture
Big Data & Analytics Architecture
 
Datacenter 2014: Raritan - Richard May
Datacenter 2014: Raritan -  Richard MayDatacenter 2014: Raritan -  Richard May
Datacenter 2014: Raritan - Richard May
 
Third Nature - Open Source Data Warehousing
Third Nature - Open Source Data WarehousingThird Nature - Open Source Data Warehousing
Third Nature - Open Source Data Warehousing
 

En vedette

Cloud Based Data Warehousing and Analytics
Cloud Based Data Warehousing and AnalyticsCloud Based Data Warehousing and Analytics
Cloud Based Data Warehousing and AnalyticsSeeling Cheung
 
CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...
CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...
CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...Seeling Cheung
 
Big Fish Games: Democratizing Data Access
Big Fish Games: Democratizing Data AccessBig Fish Games: Democratizing Data Access
Big Fish Games: Democratizing Data AccessSeeling Cheung
 
Medical University of South Carolina: Using Big Data and Predictive Analytics...
Medical University of South Carolina: Using Big Data and Predictive Analytics...Medical University of South Carolina: Using Big Data and Predictive Analytics...
Medical University of South Carolina: Using Big Data and Predictive Analytics...Seeling Cheung
 
Southwest Power Pool big data case study
Southwest Power Pool big data case study Southwest Power Pool big data case study
Southwest Power Pool big data case study Seeling Cheung
 
Constant Contact: An Online Marketing Leader’s Data Lake Journey
Constant Contact: An Online Marketing Leader’s Data Lake JourneyConstant Contact: An Online Marketing Leader’s Data Lake Journey
Constant Contact: An Online Marketing Leader’s Data Lake JourneySeeling Cheung
 
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data Torrent
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data TorrentSeagate: Sensor Overload! Taming The Raging Manufacturing Big Data Torrent
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data TorrentSeeling Cheung
 
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...Seeling Cheung
 
How Nationwide Insurance Transformed and Accelerated its Small_1.3.1
How Nationwide Insurance Transformed and Accelerated its Small_1.3.1How Nationwide Insurance Transformed and Accelerated its Small_1.3.1
How Nationwide Insurance Transformed and Accelerated its Small_1.3.1ptulachan
 
ReDeFine Advertising Campaign for GEICO
ReDeFine Advertising Campaign for GEICOReDeFine Advertising Campaign for GEICO
ReDeFine Advertising Campaign for GEICOillinipr3
 
Geico case study - Master Brand
Geico case study - Master BrandGeico case study - Master Brand
Geico case study - Master BrandElisa Tsang
 
Mobile Engagement: The Right Message at the Right Time with Nationwide
Mobile Engagement: The Right Message at the Right Time with NationwideMobile Engagement: The Right Message at the Right Time with Nationwide
Mobile Engagement: The Right Message at the Right Time with NationwideSalesforce Marketing Cloud
 
The Enabling Force of Technology at Liberty Mutual
The Enabling Force of Technology at Liberty MutualThe Enabling Force of Technology at Liberty Mutual
The Enabling Force of Technology at Liberty MutualVMware Tanzu
 
How Lazada ranks products to improve customer experience and conversion
How Lazada ranks products to improve customer experience and conversionHow Lazada ranks products to improve customer experience and conversion
How Lazada ranks products to improve customer experience and conversionEugene Yan Ziyou
 
Jcpoa what you_need_to_know
Jcpoa what you_need_to_knowJcpoa what you_need_to_know
Jcpoa what you_need_to_knowRoel Palmaers
 
อุปกรณ์เครือข่ายคอมพิวเตอร์
อุปกรณ์เครือข่ายคอมพิวเตอร์อุปกรณ์เครือข่ายคอมพิวเตอร์
อุปกรณ์เครือข่ายคอมพิวเตอร์I'Tay Tanawin
 

En vedette (20)

Cloud Based Data Warehousing and Analytics
Cloud Based Data Warehousing and AnalyticsCloud Based Data Warehousing and Analytics
Cloud Based Data Warehousing and Analytics
 
CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...
CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...
CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...
 
Big Fish Games: Democratizing Data Access
Big Fish Games: Democratizing Data AccessBig Fish Games: Democratizing Data Access
Big Fish Games: Democratizing Data Access
 
Medical University of South Carolina: Using Big Data and Predictive Analytics...
Medical University of South Carolina: Using Big Data and Predictive Analytics...Medical University of South Carolina: Using Big Data and Predictive Analytics...
Medical University of South Carolina: Using Big Data and Predictive Analytics...
 
Southwest Power Pool big data case study
Southwest Power Pool big data case study Southwest Power Pool big data case study
Southwest Power Pool big data case study
 
Constant Contact: An Online Marketing Leader’s Data Lake Journey
Constant Contact: An Online Marketing Leader’s Data Lake JourneyConstant Contact: An Online Marketing Leader’s Data Lake Journey
Constant Contact: An Online Marketing Leader’s Data Lake Journey
 
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data Torrent
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data TorrentSeagate: Sensor Overload! Taming The Raging Manufacturing Big Data Torrent
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data Torrent
 
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...
 
How Nationwide Insurance Transformed and Accelerated its Small_1.3.1
How Nationwide Insurance Transformed and Accelerated its Small_1.3.1How Nationwide Insurance Transformed and Accelerated its Small_1.3.1
How Nationwide Insurance Transformed and Accelerated its Small_1.3.1
 
Nationwide Insurance
Nationwide InsuranceNationwide Insurance
Nationwide Insurance
 
ReDeFine Advertising Campaign for GEICO
ReDeFine Advertising Campaign for GEICOReDeFine Advertising Campaign for GEICO
ReDeFine Advertising Campaign for GEICO
 
Geico case study - Master Brand
Geico case study - Master BrandGeico case study - Master Brand
Geico case study - Master Brand
 
Mobile Engagement: The Right Message at the Right Time with Nationwide
Mobile Engagement: The Right Message at the Right Time with NationwideMobile Engagement: The Right Message at the Right Time with Nationwide
Mobile Engagement: The Right Message at the Right Time with Nationwide
 
The Enabling Force of Technology at Liberty Mutual
The Enabling Force of Technology at Liberty MutualThe Enabling Force of Technology at Liberty Mutual
The Enabling Force of Technology at Liberty Mutual
 
Why Pythian?
Why Pythian?Why Pythian?
Why Pythian?
 
How Lazada ranks products to improve customer experience and conversion
How Lazada ranks products to improve customer experience and conversionHow Lazada ranks products to improve customer experience and conversion
How Lazada ranks products to improve customer experience and conversion
 
Ivan Oplesnin
Ivan OplesninIvan Oplesnin
Ivan Oplesnin
 
WINPOT CASINO
WINPOT CASINOWINPOT CASINO
WINPOT CASINO
 
Jcpoa what you_need_to_know
Jcpoa what you_need_to_knowJcpoa what you_need_to_know
Jcpoa what you_need_to_know
 
อุปกรณ์เครือข่ายคอมพิวเตอร์
อุปกรณ์เครือข่ายคอมพิวเตอร์อุปกรณ์เครือข่ายคอมพิวเตอร์
อุปกรณ์เครือข่ายคอมพิวเตอร์
 

Similaire à Concept to production Nationwide Insurance BigInsights Journey with Telematics

Lessons from handling up to 26 Billion transactions a day - The Weather Compa...
Lessons from handling up to 26 Billion transactions a day - The Weather Compa...Lessons from handling up to 26 Billion transactions a day - The Weather Compa...
Lessons from handling up to 26 Billion transactions a day - The Weather Compa...Derek Baron
 
How a real time platform supports the modern utility
How a real time platform supports the modern utilityHow a real time platform supports the modern utility
How a real time platform supports the modern utilityrobgirvan
 
Interconnect session 3498: Deployment Topologies for Jazz Reporting Service
Interconnect session 3498: Deployment Topologies for Jazz Reporting ServiceInterconnect session 3498: Deployment Topologies for Jazz Reporting Service
Interconnect session 3498: Deployment Topologies for Jazz Reporting ServiceRosa Naranjo
 
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...Prolifics
 
Introducing Elevate Capacity Management
Introducing Elevate Capacity ManagementIntroducing Elevate Capacity Management
Introducing Elevate Capacity ManagementPrecisely
 
IBM Insight 2014 - Advanced Warehouse Analytics in the Cloud
IBM Insight 2014 - Advanced Warehouse Analytics in the CloudIBM Insight 2014 - Advanced Warehouse Analytics in the Cloud
IBM Insight 2014 - Advanced Warehouse Analytics in the CloudTorsten Steinbach
 
MongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB
 
Data Analytics in Digital Transformation
Data Analytics in Digital TransformationData Analytics in Digital Transformation
Data Analytics in Digital TransformationMukund Babbar
 
Introducing Amazon Kinesis: Real-time Processing of Streaming Big Data (BDT10...
Introducing Amazon Kinesis: Real-time Processing of Streaming Big Data (BDT10...Introducing Amazon Kinesis: Real-time Processing of Streaming Big Data (BDT10...
Introducing Amazon Kinesis: Real-time Processing of Streaming Big Data (BDT10...Amazon Web Services
 
CIN-2650 - Cloud adoption! Enforcer to transform your organization around peo...
CIN-2650 - Cloud adoption! Enforcer to transform your organization around peo...CIN-2650 - Cloud adoption! Enforcer to transform your organization around peo...
CIN-2650 - Cloud adoption! Enforcer to transform your organization around peo...Hendrik van Run
 
Jazz for Service Management
Jazz for Service ManagementJazz for Service Management
Jazz for Service ManagementIBM Danmark
 
A Framework to Measure and Maximize Cloud ROI
A Framework to Measure and Maximize Cloud ROIA Framework to Measure and Maximize Cloud ROI
A Framework to Measure and Maximize Cloud ROIRightScale
 
Next generation business automation with the red hat decision manager and red...
Next generation business automation with the red hat decision manager and red...Next generation business automation with the red hat decision manager and red...
Next generation business automation with the red hat decision manager and red...Masahiko Umeno
 
Big Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesBig Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesJames Serra
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database RoundtableEric Kavanagh
 
November 2013 HUG: Real-time analytics with in-memory grid
November 2013 HUG: Real-time analytics with in-memory gridNovember 2013 HUG: Real-time analytics with in-memory grid
November 2013 HUG: Real-time analytics with in-memory gridYahoo Developer Network
 
Predicting When Your Applications Will Go Off the Rails! Managing DB2 Appli...
Predicting When Your Applications Will Go Off the Rails!  Managing DB2 Appli...Predicting When Your Applications Will Go Off the Rails!  Managing DB2 Appli...
Predicting When Your Applications Will Go Off the Rails! Managing DB2 Appli...CA Technologies
 

Similaire à Concept to production Nationwide Insurance BigInsights Journey with Telematics (20)

Lessons from handling up to 26 Billion transactions a day - The Weather Compa...
Lessons from handling up to 26 Billion transactions a day - The Weather Compa...Lessons from handling up to 26 Billion transactions a day - The Weather Compa...
Lessons from handling up to 26 Billion transactions a day - The Weather Compa...
 
How a real time platform supports the modern utility
How a real time platform supports the modern utilityHow a real time platform supports the modern utility
How a real time platform supports the modern utility
 
Interconnect session 3498: Deployment Topologies for Jazz Reporting Service
Interconnect session 3498: Deployment Topologies for Jazz Reporting ServiceInterconnect session 3498: Deployment Topologies for Jazz Reporting Service
Interconnect session 3498: Deployment Topologies for Jazz Reporting Service
 
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
 
Big Data Ready Enterprise
Big Data Ready Enterprise Big Data Ready Enterprise
Big Data Ready Enterprise
 
Introducing Elevate Capacity Management
Introducing Elevate Capacity ManagementIntroducing Elevate Capacity Management
Introducing Elevate Capacity Management
 
IBM Insight 2014 - Advanced Warehouse Analytics in the Cloud
IBM Insight 2014 - Advanced Warehouse Analytics in the CloudIBM Insight 2014 - Advanced Warehouse Analytics in the Cloud
IBM Insight 2014 - Advanced Warehouse Analytics in the Cloud
 
MongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB in a Mainframe World
MongoDB in a Mainframe World
 
Data Analytics in Digital Transformation
Data Analytics in Digital TransformationData Analytics in Digital Transformation
Data Analytics in Digital Transformation
 
Sizing methods
Sizing methodsSizing methods
Sizing methods
 
Introducing Amazon Kinesis: Real-time Processing of Streaming Big Data (BDT10...
Introducing Amazon Kinesis: Real-time Processing of Streaming Big Data (BDT10...Introducing Amazon Kinesis: Real-time Processing of Streaming Big Data (BDT10...
Introducing Amazon Kinesis: Real-time Processing of Streaming Big Data (BDT10...
 
CIN-2650 - Cloud adoption! Enforcer to transform your organization around peo...
CIN-2650 - Cloud adoption! Enforcer to transform your organization around peo...CIN-2650 - Cloud adoption! Enforcer to transform your organization around peo...
CIN-2650 - Cloud adoption! Enforcer to transform your organization around peo...
 
Jazz for Service Management
Jazz for Service ManagementJazz for Service Management
Jazz for Service Management
 
A Framework to Measure and Maximize Cloud ROI
A Framework to Measure and Maximize Cloud ROIA Framework to Measure and Maximize Cloud ROI
A Framework to Measure and Maximize Cloud ROI
 
Next generation business automation with the red hat decision manager and red...
Next generation business automation with the red hat decision manager and red...Next generation business automation with the red hat decision manager and red...
Next generation business automation with the red hat decision manager and red...
 
Big Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesBig Data: It’s all about the Use Cases
Big Data: It’s all about the Use Cases
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
November 2013 HUG: Real-time analytics with in-memory grid
November 2013 HUG: Real-time analytics with in-memory gridNovember 2013 HUG: Real-time analytics with in-memory grid
November 2013 HUG: Real-time analytics with in-memory grid
 
Predicting When Your Applications Will Go Off the Rails! Managing DB2 Appli...
Predicting When Your Applications Will Go Off the Rails!  Managing DB2 Appli...Predicting When Your Applications Will Go Off the Rails!  Managing DB2 Appli...
Predicting When Your Applications Will Go Off the Rails! Managing DB2 Appli...
 

Dernier

Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSINGmarianagonzalez07
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 

Dernier (20)

Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 

Concept to production Nationwide Insurance BigInsights Journey with Telematics

  • 1. © 2015 IBM Corporation From Concept to Production: Nationwide Insurance IBM BigInsights Journey with Telematics # 2404 Krish Rajaram & Rajesh Nandagiri – 10/26/2015
  • 2. Big Data and Analytics Helps Nationwide Customers Become Better Drivers
  • 3. Agenda Introduction Architecture Data Processing Data Access Business Benefits 2 0.67 8.30 8.30 0.00 1.00 2.00 3.00 4.00 5.00 6.00 7.00 8.00 9.00 0.00 100.00 200.00 300.00 400.00 500.00 600.00 700.00 1 Hr Batch 12 Hrs Loop 12 Hrs batch DataVolumeinGB ElapseTimeinMins Cycles AfterRedesign First Iteration
  • 5. About Nationwide 4 16+ MILLION POLICIES 25MILLION CONTRIBUTED TO NONPROFITS AND COMMUNITIES $ 1# INSURER OF FARMS AND RANCHES 7LARGEST HOMEOWNER AND AUTO INSURANCE PROVIDER IN THE U.S. th GALLUP GREAT PLACE TO WORK AWARD WINNER 3 YEARS RUNNING LARGEST PET INSURER IN THE U.S. 9th LARGEST COMMERCIAL INSURER $23.9 BILLION IN REVENUE FOR 2013 Nationwide has approximately 31,000 associates serving customers in nearly every state. 1# PROVIDER OF PUBLIC-SECTOR RETIREMENT PLANS FOUNDED IN 1926 BY MEMBERS OF THE OHIO FARM BUREAU 28th COMPUTERWORLD GREAT PLACE TO WORK IN IT
  • 6. About SmartRide • SmartRide is Nationwide's version of Telematics, offered to customers to help them improve their driving behavior and save on insurance premiums. 5 • Customers install a small device into their vehicle for 6 months which measures…
  • 7. SmartRide Data Characteristics  Multiple vendors  Files of different layouts arriving at different frequencies:  Hourly  Every 4 hrs  Four CSV files per vendor  ~ 30 GB to ~ 60 GB of data per day  Data challenges  Late arriving trips  Partial trips  Duplicate trips  Orphan trips 6
  • 8. Trip Data Characteristics • Missing Timestamp & Speed Spike • Acceleration Lag 7 vin_nb trip_nb position_ts Speed engine_rpm abc 123 2015-07-21 12:31:36.0 54 1600 abc 123 2015-07-21 12:31:39.0 55 1800 abc 123 2015-07-21 12:31:42.0 57 1500 abc 123 2015-07-21 12:31:43.0 82 1600 abc 123 2015-07-21 12:31:44.0 58 1500 vin_nb trip_nb position_ts Speed engine_rpm abc 123 2015-06-30 21:25:05.0 0 700 abc 123 2015-06-30 21:25:06.0 0 700 abc 123 2015-06-30 21:25:07.0 0 1000 abc 123 2015-06-30 21:25:08.0 8 1800 abc 123 2015-06-30 21:25:09.0 15 2000
  • 9. Architecture Logical Data Flow IBM® BigInsights™ Configuration Decision Catalog Job Orchestration
  • 11. IBM® BigInsights™ for Apache™ Hadoop Configuration • Version 2.1.2  6 Management Nodes and 16 Data Nodes  Each with 128 GB RAM and 18 TB of storage  Hadoop 2.2, BigSQL 1.0, Hive 0.12, Hbase 0.96 • Three environments  Dev, Test, and Production All same configuration • Limitations  No workload management  No environment for DR  Used Test Cluster for Hbase failover 10
  • 14. Data Processing Design Considerations Phases of Data Movement Batch Performance Metrics
  • 15. Design Considerations • One hour window for end to end processing  Handling data issues  Summarization  Multiple cycles per day • Predictable run time for backlog processing when jobs fail • Reloading incorrect batch • Restart failed batch 14 0.67 8.30 8.30 0.00 1.00 2.00 3.00 4.00 5.00 6.00 7.00 8.00 9.00 0.00 100.00 200.00 300.00 400.00 500.00 600.00 700.00 1 Hr Batch 12 Hrs Loop 12 Hrs batch ElapseTimeinMins Cycles TripScrub TripSecDetails TripSum State Median Hbase load AuditEn raw2can Input Size in GB
  • 16. Acquire Phase 15  Raw trip files copied into HDFS using WebHDFS protocol  Folders created by vendor, file, load event ID, and batch #  Used Sqoop to transfer 4 TB of historical data from Data Warehouse  Hive external tables for each file  Partitioned by load event ID and batch #  Used both BigSQL 1.0 and HiveQL  Partitioned external tables helped in o Processing backlog data o Reprocessing incorrect batches
  • 17. Standardize Phase 16  Select data from external tables based on load event ID  Each load event ID can include one or more batches  More than one load event ID can be processed in one cycle  Data moved to next stage only from work tables  Helped in performance  Dynamic partitions helped in loading multiple batches  Partitions get overwritten if already exists  Helped in reprocessing incorrect batch Work tables contain data for CURRENT processing cycle Canonical tables partitioned by source and batch # Load using dynamic partitioning
  • 18. Data Scrubbing & Event Calculation 17 Trip summary Trip point  Map side join  Single read multi write Orphan trips Trip points (Work table)  Java M/R program for o Scrubbing o Events calculation  Night time driving  Hard brake  Fast acceleration  Miles driven Events at seconds level (Work table)  Very good performance gain  Using Java for complex scrubbing rules  Single read multiple writes  Only required data points processed  No data persisted to corpse tables
  • 19. Summarization Phase 18 Events at seconds level (Work table)  Gather all trips related to devices from current trip and aggregate at various levels  Union ALL  UDF to store data points for trip graph  Replace new summary info into final table SRE summary (Work table) SRE summary partitioned by source SRE summary in HBase  Parallelized the Union All operation  Partitioning by Source enabled both Vendor data to be processed at same time if overlap happens  PUT from Hive to Hbase, WAL disabled  Shorten column names  Changed to epoch time  Prefix salting key  Generate rowkey  Column family mapping
  • 20. Batch Performance Metrics 19 1 Hr Batch SLA 0 0.5 1 1.5 2 2.5 0 5 10 15 20 25 30 35 40 DataSizeinGB RuntimeinMins Cycle Schedule Time Avg Run Times for Hourly Cycles 0 2 4 6 8 10 12 0 10 20 30 40 50 60 0000 0400 0800 1200 1600 2000 DataSizeinGB RuntimeinMins Cycle Schedule Time Avg Run Times for 4 hr Cycles Trip Second Details Standarize SRE Trip Summary Hive SRE Trip Summary Hbase Audits Acquire Size in GB
  • 21. Data Access SmartRide Web Page Application Layer Column Family and Row Key Design Performance Metrics
  • 23. SmartRide Web Page – Daily 22
  • 24. Application Layer 23 Data Access Layer HBASE API Restful Service Single Page Web App BigSQL & Hive Aggregates Daily HDFS HBase HRegion Server HRegion HLog Memstore HFile ODS – DB2 ODBC
  • 25. Column Family & RowKey Design 24 RowKey – Pfx_pgmId_pdflg_ timestamp Column Family – Summary Data Column Family – Trip-point Data 12_8798782_Tp_201 5080912000000 SM:miles,1500001245,’15’, SM:hb,1500001245,’2’, SM:fa,1500001245,’5’, SM:nt,1500001245,’Y’ TP:Trip,1500001245,’{JSON BLOB}’ Sorted Lexicographically • Column family (CF) helps in grouping the related columns depending on access pattern. • Co-locating the keys related to one customer in one region to access data using filter from one region server.
  • 26. Performance Metrics Scenarios – 1x, 2x, 3x concurrent users, Zookeeper node going down, Datanode unavailable Tools used – Initial test using custom program, LoadRunner for final test, SiteScope for monitoring resource consumption 25 SLA for aggregates – 5 sec # of concurrent users - 1200
  • 27. HBase Data Distribution – Using Hannibal 26 SmartRide Data Distribution
  • 28. Business Benefits • Deeper Engagement with Members  Over 2 million website page views since the July launch. To put in perspective, our vendor-hosted website would receive 100,000 views in a 12 month period.  Over 60K users have accessed the new site and 90% of those are new users. • Increase in bind ratios across all channels • Improvement in loss ratios • Enterprise first "big data" implementation at Nationwide 27
  • 29. Future scope – Personal and Commercial Fleet 28 Insights Give Nationwide Competitive Advantage Weather Data GPS Data Hourly Trip Data from Device Claims Data Other Public Records
  • 30. © 2015 IBM Corporation Thank You
  • 31. We Value Your Feedback! Don’t forget to submit your Insight session and speaker feedback! Your feedback is very important to us – we use it to continually improve the conference. Access your surveys at insight2015survey.com to quickly submit your surveys from your smartphone, laptop, or conference kiosk. 30
  • 32. 31 Notices and Disclaimers Copyright © 2015 by International Business Machines Corporation (IBM). No part of this document may be reproduced or transmitted in any form without written permission from IBM. U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM. Information in these presentations (including information relating to products that have not yet been announced by IBM) has been reviewed for accuracy as of the date of initial publication and could include unintentional technical or typographical errors. IBM shall have no responsibility to update this information. THIS DOCUMENT IS DISTRIBUTED "AS IS" WITHOUT ANY WARRANTY, EITHER EXPRESS OR IMPLIED. IN NO EVENT SHALL IBM BE LIABLE FOR ANY DAMAGE ARISING FROM THE USE OF THIS INFORMATION, INCLUDING BUT NOT LIMITED TO, LOSS OF DATA, BUSINESS INTERRUPTION, LOSS OF PROFIT OR LOSS OF OPPORTUNITY. IBM products and services are warranted according to the terms and conditions of the agreements under which they are provided. Any statements regarding IBM's future direction, intent or product plans are subject to change or withdrawal without notice. Performance data contained herein was generally obtained in a controlled, isolated environments. Customer examples are presented as illustrations of how those customers have used IBM products and the results they may have achieved. Actual performance, cost, savings or other results in other operating environments may vary. References in this document to IBM products, programs, or services does not imply that IBM intends to make such products, programs or services available in all countries in which IBM operates or does business. Workshops, sessions and associated materials may have been prepared by independent session speakers, and do not necessarily reflect the views of IBM. All materials and discussions are provided for informational purposes only, and are neither intended to, nor shall constitute legal or other guidance or advice to any individual participant or their specific situation. It is the customer’s responsibility to insure its own compliance with legal requirements and to obtain advice of competent legal counsel as to the identification and interpretation of any relevant laws and regulatory requirements that may affect the customer’s business and any actions the customer may need to take to comply with such laws. IBM does not provide legal advice or represent or warrant that its services or products will ensure that the customer is in compliance with any law.
  • 33. 32 Notices and Disclaimers (con’t) Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements or other publicly available sources. IBM has not tested those products in connection with this publication and cannot confirm the accuracy of performance, compatibility or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. IBM does not warrant the quality of any third-party products, or the ability of any such third-party products to interoperate with IBM’s products. IBM EXPRESSLY DISCLAIMS ALL WARRANTIES, EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. The provision of the information contained herein is not intended to, and does not, grant any right or license under any IBM patents, copyrights, trademarks or other intellectual property right. • IBM, the IBM logo, ibm.com, Aspera®, Bluemix, Blueworks Live, CICS, Clearcase, Cognos®, DOORS®, Emptoris®, Enterprise Document Management System™, FASP®, FileNet®, Global Business Services ®, Global Technology Services ®, IBM ExperienceOne™, IBM SmartCloud®, IBM Social Business®, Information on Demand, ILOG, Maximo®, MQIntegrator®, MQSeries®, Netcool®, OMEGAMON, OpenPower, PureAnalytics™, PureApplication®, pureCluster™, PureCoverage®, PureData®, PureExperience®, PureFlex®, pureQuery®, pureScale®, PureSystems®, QRadar®, Rational®, Rhapsody®, Smarter Commerce®, SoDA, SPSS, Sterling Commerce®, StoredIQ, Tealeaf®, Tivoli®, Trusteer®, Unica®, urban{code}®, Watson, WebSphere®, Worklight®, X-Force® and System z® Z/OS, are trademarks of International Business Machines Corporation, registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at: www.ibm.com/legal/copytrade.shtml.
  • 34. • IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM’s sole discretion. • Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision. • The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract. • The development, release, and timing of any future features or functionality described for our products remains at our sole discretion. Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon many factors, including considerations such as the amount of multiprogramming in the user’s job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve results similar to those stated here. Please Note: 2