SlideShare une entreprise Scribd logo
1  sur  31
© 2015 Impetus Technologies
1
Building Real-time Streaming Applications in
minutes
Apache Storm made easy
Anand Venugopal
Head of Product for StreamAnalytix
Panelists: Punit Shah (Architect), Syed Bilal (Data Scientist)
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
2
BRIEF INTRO
• Big Data Solutions & Services company
• Unique in depth, expertise – started Big Data implementations in 2008
• Proven with customer success
• IP and Products
• We deliver - Business Impact from Big Data Solutions
• Technology expertise
• Data Science
• BusinessAnalytics
• Serving Fortune 1000 companies since 1996
• Large-scale and mission critical software platforms
• HQ: Los Gatos,CA; 1500 people
• Offshore operations in 3 cities in India
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
3
AGENDA
• Modern Agile Enterprise
• Business Drivers
• Lambda Architecture
• Enterprise “Nervous System”
• StreamAnalytix Demo
• Building real-time data pipelines on Apache Storm – easy and fast
• Blending real-time and batch analytics in Social Media Analytics
• Announcements
• Q&A
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
4
DRIVERS FORTHE MODERN AGILE ENTERPRISE
Business
Operations
Business
Analytics
Real-time
Streaming Analytics
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
5
BUSINESS OPERATIONS INCREASINGLY REAL-TIME IN NATURE
Fleet Operations & Logistics Security
Mobile Devices andApps Energy Industry IT Operations
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
6
DRIVERS FOR A MODERN AGILE ENTERPRISE –THE CUSTOMER
You and II need it when
I need it
Stop sending
me blanket
Ads
My income
went up I can
buy a house
now
I already
bought it and
wont need it
for a few years
Dear Call center agent – I wish
you knew who I am and my
recent issues – I am about to
pull the plug!
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
7
CONTEXT UNAWARE  BAD CUSTOMER EXPERIENCE
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
8
CONTEXT AWARE  POSITIVE CUSTOMER EXPERIENCE
Multi-channel
engagement in
real-time
Context
Sensitive service
Happy customers,
Loyalty, Revenue,
Profits, Growth
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
9
MODERN AGILE ENTERPRISE = IMPACT FROM DATA ANALYTICS
DATA ANALYTICS IMPACT
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
10
MODERN AGILE ENTERPRISE – DATA MANAGEMENT
DATA ANALYTICS IMPACT
• Sense/ Capture/Transport
• Ingest,Transformation,
Pre-process
• Volume,Variety,Velocity –
Big Data +Traditional
merge
• Quality, Governance,
Security, Lineage, Life-
cycle
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
11
MODERN AGILE ENTERPRISE – ANALYTICS MATURITY
DATA ANALYTICS IMPACT
• Descriptive  Predictive 
Prescriptive (ML ?)
• Search and Exploration,
Sandbox – Data Scientists
• Across silos, Includes external
data, Fast blending
• Speed – Latency of access and
Recency of data
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
12
MODERN AGILE ENTERPRISE – REAL IMPACT
DATA ANALYTICS IMPACT
• Converting Insights to
Impact, Decision culture
• Measuring, closed-loop
feedback, Improving
• Customers Experience a
difference
• Financial measures are
clearly impacted
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
13
MODERN AGILE ENTERPRISE – SUMMARY
DATA ANALYTICS IMPACT
• Sense/ Capture/Transport
• Ingest,Transformation,
Pre-process
• Volume,Variety,Velocity –
Big Data +Traditional
merge
• Quality, Governance,
Security, Lineage, Life-
cycle
• Descriptive  Predictive 
Prescriptive (ML ?)
• Search and Exploration,
Sandbox – Data Scientists
• Across silos, Includes external
data, Fast blending
• Speed – Latency of access and
Recency of data
• Converting Insights to
Impact, Decision culture
• Measuring, closed-loop
feedback, Improving
• Customers Experience a
difference
• Financial measures are
clearly impacted
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
14
BATCHVS. REAL-TIME BUSINESS PROCESS
SENSE Days ANALYSE Weeks ACT
SENSE ANALYSE ACT
Sec/ ms
Batch
Real time
Sec/ ms
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
15
BARRIERTO BEING “AGILE” –THE “BATCH GAP”
The batch workflow is too slow
Views are out of date
Not yet
absorbed.Data absorbed into BatchViews
Now
Time
Just a few hours of data.
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
16
t
now
Hadoop works great back
here
RT-Ax works
here
BLENDED VIEW – HISTORICAL AND NOW
Blended viewBlended viewBlended View
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
17
LAMBDA ARCHITECTURE : BIGAND FAST DATA COMBINED
Batch Layer
All data
Pre-computed
information
Batch re-compute
Speed Layer
All data
Pre-computed
information
Real time increment
Batch view
Serving Layer
Batch view
Merge
Real time view
Real time view
All
Incoming
Data Query
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
18
THE FULL PICTURE OF A MODERN AND AGILE
ARCHITECTURE
Landing and
ingestion
Structured
Unstructured
External
Social
Machine
Geospatial
Time Series
Streaming
Provisioning,Workflow, Monitoring and Security
Enterprise
Data Lake
Predictive
applications
Exploration &
discovery
Enterprise
applications
Real-Time applications
Traditional
data
repositories
RDBMS MPP
Compliance, Governance, Information Lifecycle, Data
Lineage, Enterprise Meta Data Management
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
19
SMART ENTERPRISE BIG DATA BUS
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
20
Enterprise Class Real time Streaming Analytics Platform
A Product developed and offered by
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
21
AT A GLANCE
StreamAnalytix is a software platform that enables enterprises to analyze and
respond to events in real-time at Big Data scale. It is designed to rapidly build and
deploy streaming analytics applications for any industry vertical, any data format,
and any use-case
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
22
STREAMANALYTIX BLOCK DIAGRAM
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
23
DEMO – SOCIAL MEDIA ANALYTICS
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
24
TEXT ANALYTICS ALGORITHMS
TEXT CLASSIFICATION TOPIC MODELINGSENTIMENTANALYSIS
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
25
SENTIMENT ANALYSIS
• Lexicon based approach; Unsupervised model
• Lexicon prepared offline with Matrix
decomposition based approach.
• Custom language rules added:
• Surrounding words could negate, amplify etc.
• Dealing with Slang, abbreviations.
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
26
TEXT CLASSIFICATION
• 20 categories; Multiple labels if applicable
• Semantic similarity approach based on
Matrix Decomposition
• Classification Model trained offline. Can be
extended to more domains.
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
27
BATCH PROCESS –TOPIC MODELING
• Identification of main topics of conversation
within text.
• Model based on exploiting co-occurrence of
words across documents.
• Current model is specifically tuned for
informal text like twitter to deal with issues
like misspellings, slangs.
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
28
DEMO – SOCIAL MEDIA ANALYTICS
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
29
REAL-TIME MODEL BUILDING
read Kafka
Channel (Storm
Spout)
eventsKafka
Topic
Custom
Processor
(Model scoring
in real-time)
Kafka
Channel
(Storm Spout)
Custom
Processor
(Model Building
in real-time)
read
events
Update model for scoring in real-time
using the platform orchestration layer
(functionality of StreamAnalytix)
Scored
event sent
downstream
Two copies of the event are read into two
different topologies.
One Topology is responsible for
building/enhancing the model in real-time. This
component will also access historical data to
ensure model creation uses “all” the data.
The second topology is responsible for applying
the model in real-time for scoring.
At a certain threshold, the ‘newer’ version of the
model can be applied to the scoring component
by using the platform orchestration layer
without any down-time
Another possible scenario:
The real-time scoring and learning can be done by the same
component. In such situations, a fork of the incoming
message is not required. The system design/ configuration of
doing learning and scoring in the same component is a
matter of scale and performance tuning
Historical/
training
data
read
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
30
POLL
Functionality Cores/ Capacity Duration
Free 1 Limited Unlimited Forever
Free 2 ALL Limited Forever
Trial ALL Unlimited 60 days
Which of these options for trying StreamAnalytix would you download ?
Recorded version available at http://bit.ly/1PwhobK
© 2015 Impetus Technologies
31
Q&A
(Use the chat/Q&A panel)
Email us at inquiry@streamanalytix.com
www.StreamAnalytix.com
?
Request: On-premise and Cloud based trial and/or Proof of
concept

Contenu connexe

En vedette

Misty Gilbert Final Portfolio 2013 Information Technology King University
Misty Gilbert Final Portfolio 2013 Information Technology King UniversityMisty Gilbert Final Portfolio 2013 Information Technology King University
Misty Gilbert Final Portfolio 2013 Information Technology King UniversityMisty Gilbert
 
Is The Question “For Whom Did You Vote?” Relevant?
Is The Question “For Whom Did You Vote?” Relevant?Is The Question “For Whom Did You Vote?” Relevant?
Is The Question “For Whom Did You Vote?” Relevant?Alexandre A. Rocha
 
Hunter Business Group Overview
Hunter Business Group OverviewHunter Business Group Overview
Hunter Business Group OverviewRon Chandler
 
Open Source at scale: the Apache Software Foundation
Open Source at scale: the Apache Software FoundationOpen Source at scale: the Apache Software Foundation
Open Source at scale: the Apache Software FoundationBertrand Delacretaz
 
Abdome agudo clínica e imagem
Abdome agudo   clínica e imagemAbdome agudo   clínica e imagem
Abdome agudo clínica e imagemdiego2703
 
How Eastern Bank Uses Big Data to Better Serve and Protect its Customers
How Eastern Bank Uses Big Data to Better Serve and Protect its CustomersHow Eastern Bank Uses Big Data to Better Serve and Protect its Customers
How Eastern Bank Uses Big Data to Better Serve and Protect its CustomersBrian Griffith
 
Jangan: Imperatives in Indonesian
Jangan: Imperatives in IndonesianJangan: Imperatives in Indonesian
Jangan: Imperatives in IndonesianLearn Indo
 
15 acta sesion ordinaria n° 15 de 28 05 2013
15 acta sesion ordinaria n° 15 de 28 05 201315 acta sesion ordinaria n° 15 de 28 05 2013
15 acta sesion ordinaria n° 15 de 28 05 2013Tamara Salinas
 
23 Sports Analysis Apps
23 Sports Analysis Apps23 Sports Analysis Apps
23 Sports Analysis AppsRob Carroll
 

En vedette (18)

Misty Gilbert Final Portfolio 2013 Information Technology King University
Misty Gilbert Final Portfolio 2013 Information Technology King UniversityMisty Gilbert Final Portfolio 2013 Information Technology King University
Misty Gilbert Final Portfolio 2013 Information Technology King University
 
Is The Question “For Whom Did You Vote?” Relevant?
Is The Question “For Whom Did You Vote?” Relevant?Is The Question “For Whom Did You Vote?” Relevant?
Is The Question “For Whom Did You Vote?” Relevant?
 
Spanish Inheritance Tax
Spanish Inheritance TaxSpanish Inheritance Tax
Spanish Inheritance Tax
 
Presentacion cetm2011
Presentacion cetm2011Presentacion cetm2011
Presentacion cetm2011
 
Hunter Business Group Overview
Hunter Business Group OverviewHunter Business Group Overview
Hunter Business Group Overview
 
Portafolio
 Portafolio  Portafolio
Portafolio
 
Hostel SS - Karina Rosario
Hostel SS - Karina RosarioHostel SS - Karina Rosario
Hostel SS - Karina Rosario
 
Preguntas de investigación universitaria
Preguntas de investigación universitariaPreguntas de investigación universitaria
Preguntas de investigación universitaria
 
Open Source at scale: the Apache Software Foundation
Open Source at scale: the Apache Software FoundationOpen Source at scale: the Apache Software Foundation
Open Source at scale: the Apache Software Foundation
 
Rescatados 2004 12
Rescatados 2004 12Rescatados 2004 12
Rescatados 2004 12
 
Abdome agudo clínica e imagem
Abdome agudo   clínica e imagemAbdome agudo   clínica e imagem
Abdome agudo clínica e imagem
 
How Eastern Bank Uses Big Data to Better Serve and Protect its Customers
How Eastern Bank Uses Big Data to Better Serve and Protect its CustomersHow Eastern Bank Uses Big Data to Better Serve and Protect its Customers
How Eastern Bank Uses Big Data to Better Serve and Protect its Customers
 
Jangan: Imperatives in Indonesian
Jangan: Imperatives in IndonesianJangan: Imperatives in Indonesian
Jangan: Imperatives in Indonesian
 
15 acta sesion ordinaria n° 15 de 28 05 2013
15 acta sesion ordinaria n° 15 de 28 05 201315 acta sesion ordinaria n° 15 de 28 05 2013
15 acta sesion ordinaria n° 15 de 28 05 2013
 
American Imperialism
American ImperialismAmerican Imperialism
American Imperialism
 
Infecciones intrauterinas
Infecciones intrauterinasInfecciones intrauterinas
Infecciones intrauterinas
 
23 Sports Analysis Apps
23 Sports Analysis Apps23 Sports Analysis Apps
23 Sports Analysis Apps
 
Lengua 9 3
Lengua 9 3Lengua 9 3
Lengua 9 3
 

Plus de Impetus Technologies

Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...Impetus Technologies
 
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarFuture-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarImpetus Technologies
 
Building Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus WebinarBuilding Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus WebinarImpetus Technologies
 
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...Impetus Technologies
 
Impetus White Paper- Handling Data Corruption in Elasticsearch
Impetus White Paper- Handling  Data Corruption  in ElasticsearchImpetus White Paper- Handling  Data Corruption  in Elasticsearch
Impetus White Paper- Handling Data Corruption in ElasticsearchImpetus Technologies
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarImpetus Technologies
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarImpetus Technologies
 
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...Impetus Technologies
 
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...Impetus Technologies
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Impetus Technologies
 
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...Impetus Technologies
 
Enterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus WebcastEnterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus WebcastImpetus Technologies
 
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...Impetus Technologies
 
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...Impetus Technologies
 
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...Impetus Technologies
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabImpetus Technologies
 
Webinar maturity of mobile test automation- approaches and future trends
Webinar  maturity of mobile test automation- approaches and future trendsWebinar  maturity of mobile test automation- approaches and future trends
Webinar maturity of mobile test automation- approaches and future trendsImpetus Technologies
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labImpetus Technologies
 
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...Impetus Technologies
 
Performance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus WebcastPerformance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus WebcastImpetus Technologies
 

Plus de Impetus Technologies (20)

Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
 
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarFuture-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
 
Building Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus WebinarBuilding Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus Webinar
 
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
 
Impetus White Paper- Handling Data Corruption in Elasticsearch
Impetus White Paper- Handling  Data Corruption  in ElasticsearchImpetus White Paper- Handling  Data Corruption  in Elasticsearch
Impetus White Paper- Handling Data Corruption in Elasticsearch
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
 
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
 
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
 
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
 
Enterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus WebcastEnterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus Webcast
 
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
 
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
 
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLab
 
Webinar maturity of mobile test automation- approaches and future trends
Webinar  maturity of mobile test automation- approaches and future trendsWebinar  maturity of mobile test automation- approaches and future trends
Webinar maturity of mobile test automation- approaches and future trends
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph lab
 
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
 
Performance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus WebcastPerformance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus Webcast
 

Dernier

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 

Dernier (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

Building Real-time Streaming Apps in Minutes- StreamAnalytix Webinar

  • 1. © 2015 Impetus Technologies 1 Building Real-time Streaming Applications in minutes Apache Storm made easy Anand Venugopal Head of Product for StreamAnalytix Panelists: Punit Shah (Architect), Syed Bilal (Data Scientist) Recorded version available at http://bit.ly/1PwhobK
  • 2. © 2015 Impetus Technologies 2 BRIEF INTRO • Big Data Solutions & Services company • Unique in depth, expertise – started Big Data implementations in 2008 • Proven with customer success • IP and Products • We deliver - Business Impact from Big Data Solutions • Technology expertise • Data Science • BusinessAnalytics • Serving Fortune 1000 companies since 1996 • Large-scale and mission critical software platforms • HQ: Los Gatos,CA; 1500 people • Offshore operations in 3 cities in India Recorded version available at http://bit.ly/1PwhobK
  • 3. © 2015 Impetus Technologies 3 AGENDA • Modern Agile Enterprise • Business Drivers • Lambda Architecture • Enterprise “Nervous System” • StreamAnalytix Demo • Building real-time data pipelines on Apache Storm – easy and fast • Blending real-time and batch analytics in Social Media Analytics • Announcements • Q&A Recorded version available at http://bit.ly/1PwhobK
  • 4. © 2015 Impetus Technologies 4 DRIVERS FORTHE MODERN AGILE ENTERPRISE Business Operations Business Analytics Real-time Streaming Analytics Recorded version available at http://bit.ly/1PwhobK
  • 5. © 2015 Impetus Technologies 5 BUSINESS OPERATIONS INCREASINGLY REAL-TIME IN NATURE Fleet Operations & Logistics Security Mobile Devices andApps Energy Industry IT Operations Recorded version available at http://bit.ly/1PwhobK
  • 6. © 2015 Impetus Technologies 6 DRIVERS FOR A MODERN AGILE ENTERPRISE –THE CUSTOMER You and II need it when I need it Stop sending me blanket Ads My income went up I can buy a house now I already bought it and wont need it for a few years Dear Call center agent – I wish you knew who I am and my recent issues – I am about to pull the plug! Recorded version available at http://bit.ly/1PwhobK
  • 7. © 2015 Impetus Technologies 7 CONTEXT UNAWARE  BAD CUSTOMER EXPERIENCE Recorded version available at http://bit.ly/1PwhobK
  • 8. © 2015 Impetus Technologies 8 CONTEXT AWARE  POSITIVE CUSTOMER EXPERIENCE Multi-channel engagement in real-time Context Sensitive service Happy customers, Loyalty, Revenue, Profits, Growth Recorded version available at http://bit.ly/1PwhobK
  • 9. © 2015 Impetus Technologies 9 MODERN AGILE ENTERPRISE = IMPACT FROM DATA ANALYTICS DATA ANALYTICS IMPACT Recorded version available at http://bit.ly/1PwhobK
  • 10. © 2015 Impetus Technologies 10 MODERN AGILE ENTERPRISE – DATA MANAGEMENT DATA ANALYTICS IMPACT • Sense/ Capture/Transport • Ingest,Transformation, Pre-process • Volume,Variety,Velocity – Big Data +Traditional merge • Quality, Governance, Security, Lineage, Life- cycle Recorded version available at http://bit.ly/1PwhobK
  • 11. © 2015 Impetus Technologies 11 MODERN AGILE ENTERPRISE – ANALYTICS MATURITY DATA ANALYTICS IMPACT • Descriptive  Predictive  Prescriptive (ML ?) • Search and Exploration, Sandbox – Data Scientists • Across silos, Includes external data, Fast blending • Speed – Latency of access and Recency of data Recorded version available at http://bit.ly/1PwhobK
  • 12. © 2015 Impetus Technologies 12 MODERN AGILE ENTERPRISE – REAL IMPACT DATA ANALYTICS IMPACT • Converting Insights to Impact, Decision culture • Measuring, closed-loop feedback, Improving • Customers Experience a difference • Financial measures are clearly impacted Recorded version available at http://bit.ly/1PwhobK
  • 13. © 2015 Impetus Technologies 13 MODERN AGILE ENTERPRISE – SUMMARY DATA ANALYTICS IMPACT • Sense/ Capture/Transport • Ingest,Transformation, Pre-process • Volume,Variety,Velocity – Big Data +Traditional merge • Quality, Governance, Security, Lineage, Life- cycle • Descriptive  Predictive  Prescriptive (ML ?) • Search and Exploration, Sandbox – Data Scientists • Across silos, Includes external data, Fast blending • Speed – Latency of access and Recency of data • Converting Insights to Impact, Decision culture • Measuring, closed-loop feedback, Improving • Customers Experience a difference • Financial measures are clearly impacted Recorded version available at http://bit.ly/1PwhobK
  • 14. © 2015 Impetus Technologies 14 BATCHVS. REAL-TIME BUSINESS PROCESS SENSE Days ANALYSE Weeks ACT SENSE ANALYSE ACT Sec/ ms Batch Real time Sec/ ms Recorded version available at http://bit.ly/1PwhobK
  • 15. © 2015 Impetus Technologies 15 BARRIERTO BEING “AGILE” –THE “BATCH GAP” The batch workflow is too slow Views are out of date Not yet absorbed.Data absorbed into BatchViews Now Time Just a few hours of data. Recorded version available at http://bit.ly/1PwhobK
  • 16. © 2015 Impetus Technologies 16 t now Hadoop works great back here RT-Ax works here BLENDED VIEW – HISTORICAL AND NOW Blended viewBlended viewBlended View Recorded version available at http://bit.ly/1PwhobK
  • 17. © 2015 Impetus Technologies 17 LAMBDA ARCHITECTURE : BIGAND FAST DATA COMBINED Batch Layer All data Pre-computed information Batch re-compute Speed Layer All data Pre-computed information Real time increment Batch view Serving Layer Batch view Merge Real time view Real time view All Incoming Data Query Recorded version available at http://bit.ly/1PwhobK
  • 18. © 2015 Impetus Technologies 18 THE FULL PICTURE OF A MODERN AND AGILE ARCHITECTURE Landing and ingestion Structured Unstructured External Social Machine Geospatial Time Series Streaming Provisioning,Workflow, Monitoring and Security Enterprise Data Lake Predictive applications Exploration & discovery Enterprise applications Real-Time applications Traditional data repositories RDBMS MPP Compliance, Governance, Information Lifecycle, Data Lineage, Enterprise Meta Data Management Recorded version available at http://bit.ly/1PwhobK
  • 19. © 2015 Impetus Technologies 19 SMART ENTERPRISE BIG DATA BUS Recorded version available at http://bit.ly/1PwhobK
  • 20. © 2015 Impetus Technologies 20 Enterprise Class Real time Streaming Analytics Platform A Product developed and offered by Recorded version available at http://bit.ly/1PwhobK
  • 21. © 2015 Impetus Technologies 21 AT A GLANCE StreamAnalytix is a software platform that enables enterprises to analyze and respond to events in real-time at Big Data scale. It is designed to rapidly build and deploy streaming analytics applications for any industry vertical, any data format, and any use-case Recorded version available at http://bit.ly/1PwhobK
  • 22. © 2015 Impetus Technologies 22 STREAMANALYTIX BLOCK DIAGRAM Recorded version available at http://bit.ly/1PwhobK
  • 23. © 2015 Impetus Technologies 23 DEMO – SOCIAL MEDIA ANALYTICS Recorded version available at http://bit.ly/1PwhobK
  • 24. © 2015 Impetus Technologies 24 TEXT ANALYTICS ALGORITHMS TEXT CLASSIFICATION TOPIC MODELINGSENTIMENTANALYSIS Recorded version available at http://bit.ly/1PwhobK
  • 25. © 2015 Impetus Technologies 25 SENTIMENT ANALYSIS • Lexicon based approach; Unsupervised model • Lexicon prepared offline with Matrix decomposition based approach. • Custom language rules added: • Surrounding words could negate, amplify etc. • Dealing with Slang, abbreviations. Recorded version available at http://bit.ly/1PwhobK
  • 26. © 2015 Impetus Technologies 26 TEXT CLASSIFICATION • 20 categories; Multiple labels if applicable • Semantic similarity approach based on Matrix Decomposition • Classification Model trained offline. Can be extended to more domains. Recorded version available at http://bit.ly/1PwhobK
  • 27. © 2015 Impetus Technologies 27 BATCH PROCESS –TOPIC MODELING • Identification of main topics of conversation within text. • Model based on exploiting co-occurrence of words across documents. • Current model is specifically tuned for informal text like twitter to deal with issues like misspellings, slangs. Recorded version available at http://bit.ly/1PwhobK
  • 28. © 2015 Impetus Technologies 28 DEMO – SOCIAL MEDIA ANALYTICS Recorded version available at http://bit.ly/1PwhobK
  • 29. © 2015 Impetus Technologies 29 REAL-TIME MODEL BUILDING read Kafka Channel (Storm Spout) eventsKafka Topic Custom Processor (Model scoring in real-time) Kafka Channel (Storm Spout) Custom Processor (Model Building in real-time) read events Update model for scoring in real-time using the platform orchestration layer (functionality of StreamAnalytix) Scored event sent downstream Two copies of the event are read into two different topologies. One Topology is responsible for building/enhancing the model in real-time. This component will also access historical data to ensure model creation uses “all” the data. The second topology is responsible for applying the model in real-time for scoring. At a certain threshold, the ‘newer’ version of the model can be applied to the scoring component by using the platform orchestration layer without any down-time Another possible scenario: The real-time scoring and learning can be done by the same component. In such situations, a fork of the incoming message is not required. The system design/ configuration of doing learning and scoring in the same component is a matter of scale and performance tuning Historical/ training data read Recorded version available at http://bit.ly/1PwhobK
  • 30. © 2015 Impetus Technologies 30 POLL Functionality Cores/ Capacity Duration Free 1 Limited Unlimited Forever Free 2 ALL Limited Forever Trial ALL Unlimited 60 days Which of these options for trying StreamAnalytix would you download ? Recorded version available at http://bit.ly/1PwhobK
  • 31. © 2015 Impetus Technologies 31 Q&A (Use the chat/Q&A panel) Email us at inquiry@streamanalytix.com www.StreamAnalytix.com ? Request: On-premise and Cloud based trial and/or Proof of concept

Notes de l'éditeur

  1. Data source – listener for Active MQ Secure data streaming from remote servers Alert for call drop events in the main pipeline
  2. Amex specific recommendations Meta data store- Discoverable Leaner data mart/BI delivery Data driven Business Analytics
  3. Data source – listener for Active MQ Secure data streaming from remote servers Alert for call drop events in the main pipeline
  4. Data source – listener for Active MQ Secure data streaming from remote servers Alert for call drop events in the main pipeline
  5. Data source – listener for Active MQ Secure data streaming from remote servers Alert for call drop events in the main pipeline
  6. Data source – listener for Active MQ Secure data streaming from remote servers Alert for call drop events in the main pipeline
  7. Data source – listener for Active MQ Secure data streaming from remote servers Alert for call drop events in the main pipeline
  8. Amex specific recommendations Meta data store- Discoverable Leaner data mart/BI delivery Data driven Business Analytics
  9. Amex specific recommendations Meta data store- Discoverable Leaner data mart/BI delivery Data driven Business Analytics
  10. Amex specific recommendations Meta data store- Discoverable Leaner data mart/BI delivery Data driven Business Analytics
  11. Amex specific recommendations Meta data store- Discoverable Leaner data mart/BI delivery Data driven Business Analytics
  12. Amex specific recommendations Meta data store- Discoverable Leaner data mart/BI delivery Data driven Business Analytics
  13. Data source – listener for Active MQ Secure data streaming from remote servers Alert for call drop events in the main pipeline
  14. Eventual Accuracy: Can compute exact answer in batch layer and approximate answer in real-time layer
  15. Essentially, the Lambda Architecture comprises the following components, processes, and responsibilities: New Data: All data entering the system is dispatched to both the batch layer and the speed layer for processing. Batch layer: The batch layer has two functions: (i) managing the master dataset (an immutable, append-only set of raw data), and (ii) to pre-compute (arbitrary query functions) the batch views. Hadoop's HDFS is typically used to store the master dataset and perform the computation of the batch views using MapReduce. Serving layer: this layer indexes the batch views so that they can be queried in ad hoc with low latency. To implement the serving layer, usually technologies such as Apache HBase or ElephantDB are utilized. The Apache Drill project provides the capability to execute full ANSI SQL 2003 queries against batch views. Speed layer: This layer compensates for the high latency of updates to the serving layer, due to the batch layer. Using fast and incremental algorithms, the speed layer deals with recent data only. Storm is often used to implement this layer. Queries: Last but not least, Any incoming query can be answered by merging results from batch views and real-time views.
  16. Broad adoption of unified Data Lake architectures, will require information governance, meta data management and information lifecycle management capabilities.
  17. [Punit] – lose the text at the bottom and increase the size of the diagram Also make the label beneath the image to be large and bold StreamAnalytix is a software platform that enables enterprises to analyze and respond to events in real-time at Big Data scale. It is designed to rapidly build and deploy streaming analytics applications for any industry vertical, any data format, and any use-case