SlideShare une entreprise Scribd logo
1  sur  14
Hadoop’s Most Powerful Platform for Real-Time Stream
Computations
Prepared for Big Data Gurus
February 27th, 2014
DataTorrent Big Data Platform
• Vision: Ubiquitize Real-Time Big Data Computations
– Enterprise quality: Highly Available, Linearly Scalable, Operable and
Easy to Use
– Big data dimensional computations in real time with linear scalability
• Real-Time ETL: De-dup, Staging, Cleanup, Transformations, Load …
• Real-Time Computation Apps and Feed Ingestion (Games, Mobile, Set-top
Boxes, Devices, …)
– Multi-Feed Sources
– Run business logic in real-time with HA
• Real-Time Monitoring, and Security: Capacity, DDOS, …
• Real-Time Predictive Analytics: Web Analytics, Business Analytics, …
DataTorrent in Hadoop Ecosystem
© DataTorrent, 2014 - Confidential
DataTorrent in the Modern Data Architecture
APPLICATIONSDATASYSTEMSOURCES
RDBMS EDW
Emerging Sources
(Sensor, Sentiment, Geo, Unstructured)
HANA
BusinessObjects BI
OPERATIONAL TOOLS
DEV & DATA TOOLS
Existing Sources
(CRM, ERP, Clickstream, Logs)
INFRASTRUCTURE
Business Analytics Business Intelligence Tools OLAP Clients
Real-time Stream Analytics
DATAIN
MOTION
StrAM (Stream Application Master)
Security
SLA
Scalability
Alerts
Fault
Tolerance
Tools
Partitioning
Web Services
Dynamic
Modifications
State
Snapshot
Malhar – Open Source Operators and Apps Library
(Apache v2 License)
DataTorrent Technology Stack
© DataTorrent Inc. 2014 - Confidential
DataTorrent in Hadoop Reference Architecture
DATA IN MOTION
REAL TIME STREAMING APPLICATIONS
SOURCE DATA
MS Queue’s
Events
Files
Databases
Sensor data
Social
APPLICATIONS
BusinessObjects BI
Query/Visualization/ Reporting/Analytical Tools and Apps
Enterprise Repositories
RDBMS
EDW
NoSQL
Real Time Ingestion
DATA AT REST
BATCH APPLICATIONS
Hive
Pig
HBase
Custom
Message QueueData In Motion
YARN
HDFS
YARN
Map
Reduce
HDFS
OPERATIONAL INTELLEGENCE
BUSINESS ACTIONS
PREDICTIVE ANALYTICS
STREAM ETL
REALTIME ALERTS
© DataTorrent, 2014 - Confidential
Stream Processing
• A Stream is a sequence of data events with schema
• An Operator takes input streams and compute output streams
• An Application is a Directed Acyclic Graph (DAG)
• In-memory asynchronous distributed computations
• A Streaming Window is an atomic batch of sequential data events
DataTorrent Hadoop GRID
1
2
4
3
6
NM NM NM NM
Resource
Manager
StrAM
3
5
5
64
2
1
DT
Gateway
dtCLI
DT
Console
MapReduce
MapReduce
MapReduce
MapReduceMapReduce
MapReduce
Malhar Open Source Project
• Apache 2.0
• Operators (over 400 operators)
– Algorithms
– Ingestion, ETL
– Input and Output Adapters
• UI Widgets (over 50 widgets)
– Console widgets for stats
– Application widgets for app data
• Application Templates
– LogStream
– Map Reduce Debugger
– Shuffle less MapReduce
• Demo Apps (15 demo apps)
Malhar Open Source Project
• Continuous Integration: Unit tests
• Performance tests for operators
• Daily tests of Demos and Apps for memory usage
• More operators and UI widgets added as per new use cases/user requests
• Fully supported: Documentation, Certification
• Input and Output Adapters
– HBase, MongoDB, CouchDB, Redis, Memcache
– Flume, Kakfa, RabbitMQ, ActiveMQ, ZeroMQ
– JBDC, MySql, DerbyDB, TimesTen
– MQTT, Twitter, RSS, HTTP, WebSockets, Socket
– Logs: Apache, SMTP
– DFS, Local cache (Guava)
• Languages: Java, Python, JavaScript, Script, R
DataTorrent’s Platform Differentiators
.
Extreme Scalability Mission Critical Hadoop-Native
• Automatically scales to
changing loads. Massive
performance per node.
Billions of events/sec
• Sub-second latency with
linear scalability.
• Complex monitoring
applications with massive
computations.
• Built-in Stateful Fault-
tolerance. 24/7 uptime -
Highly Available.
• Predictive Analysis, Root
cause. Real-time ETL.
• Update your application
while it's running! A/B
testing (2h2014).
• Develop faster and
implement any business
logic with our open-source
framework.
• Runs on your existing
Apache Hadoop cluster.
• Integrate seamlessly with
your existing data flow and
monitoring stack.
Live Demonstration
Thank You!
• DataTorrent
• Try Sandbox (https://datatorrent.com)
• Free for
• Startup Program (Contact us for more details)
• Up to 25GB memory usage in production
• Non-production clusters
• Malhar Open Source (Apache 2.0) project
• https://github.com/DataTorrent/Malhar
• malhar-users@googlegroups.com
• Applications available Jan 2014
• LogStream Application
• Map-Reduce Monitor
DataTorrent Inc.
3200 Partrick Henry, 2nd Fl
Santa Clara, CA 95054
info@datatorrent.com
www.datatorrent.com
Twitter.com/DataTorrent
Facebook.com/DataTorrent

Contenu connexe

Dernier

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 

Dernier (20)

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 

En vedette

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

En vedette (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Malhar data torrent (Big Data Guru meetup 2014-02-27)

  • 1. Hadoop’s Most Powerful Platform for Real-Time Stream Computations Prepared for Big Data Gurus February 27th, 2014
  • 2.
  • 3. DataTorrent Big Data Platform • Vision: Ubiquitize Real-Time Big Data Computations – Enterprise quality: Highly Available, Linearly Scalable, Operable and Easy to Use – Big data dimensional computations in real time with linear scalability • Real-Time ETL: De-dup, Staging, Cleanup, Transformations, Load … • Real-Time Computation Apps and Feed Ingestion (Games, Mobile, Set-top Boxes, Devices, …) – Multi-Feed Sources – Run business logic in real-time with HA • Real-Time Monitoring, and Security: Capacity, DDOS, … • Real-Time Predictive Analytics: Web Analytics, Business Analytics, …
  • 5. © DataTorrent, 2014 - Confidential DataTorrent in the Modern Data Architecture APPLICATIONSDATASYSTEMSOURCES RDBMS EDW Emerging Sources (Sensor, Sentiment, Geo, Unstructured) HANA BusinessObjects BI OPERATIONAL TOOLS DEV & DATA TOOLS Existing Sources (CRM, ERP, Clickstream, Logs) INFRASTRUCTURE Business Analytics Business Intelligence Tools OLAP Clients Real-time Stream Analytics DATAIN MOTION
  • 6. StrAM (Stream Application Master) Security SLA Scalability Alerts Fault Tolerance Tools Partitioning Web Services Dynamic Modifications State Snapshot Malhar – Open Source Operators and Apps Library (Apache v2 License) DataTorrent Technology Stack
  • 7. © DataTorrent Inc. 2014 - Confidential DataTorrent in Hadoop Reference Architecture DATA IN MOTION REAL TIME STREAMING APPLICATIONS SOURCE DATA MS Queue’s Events Files Databases Sensor data Social APPLICATIONS BusinessObjects BI Query/Visualization/ Reporting/Analytical Tools and Apps Enterprise Repositories RDBMS EDW NoSQL Real Time Ingestion DATA AT REST BATCH APPLICATIONS Hive Pig HBase Custom Message QueueData In Motion YARN HDFS YARN Map Reduce HDFS OPERATIONAL INTELLEGENCE BUSINESS ACTIONS PREDICTIVE ANALYTICS STREAM ETL REALTIME ALERTS
  • 8. © DataTorrent, 2014 - Confidential Stream Processing • A Stream is a sequence of data events with schema • An Operator takes input streams and compute output streams • An Application is a Directed Acyclic Graph (DAG) • In-memory asynchronous distributed computations • A Streaming Window is an atomic batch of sequential data events
  • 9. DataTorrent Hadoop GRID 1 2 4 3 6 NM NM NM NM Resource Manager StrAM 3 5 5 64 2 1 DT Gateway dtCLI DT Console MapReduce MapReduce MapReduce MapReduceMapReduce MapReduce
  • 10. Malhar Open Source Project • Apache 2.0 • Operators (over 400 operators) – Algorithms – Ingestion, ETL – Input and Output Adapters • UI Widgets (over 50 widgets) – Console widgets for stats – Application widgets for app data • Application Templates – LogStream – Map Reduce Debugger – Shuffle less MapReduce • Demo Apps (15 demo apps)
  • 11. Malhar Open Source Project • Continuous Integration: Unit tests • Performance tests for operators • Daily tests of Demos and Apps for memory usage • More operators and UI widgets added as per new use cases/user requests • Fully supported: Documentation, Certification • Input and Output Adapters – HBase, MongoDB, CouchDB, Redis, Memcache – Flume, Kakfa, RabbitMQ, ActiveMQ, ZeroMQ – JBDC, MySql, DerbyDB, TimesTen – MQTT, Twitter, RSS, HTTP, WebSockets, Socket – Logs: Apache, SMTP – DFS, Local cache (Guava) • Languages: Java, Python, JavaScript, Script, R
  • 12. DataTorrent’s Platform Differentiators . Extreme Scalability Mission Critical Hadoop-Native • Automatically scales to changing loads. Massive performance per node. Billions of events/sec • Sub-second latency with linear scalability. • Complex monitoring applications with massive computations. • Built-in Stateful Fault- tolerance. 24/7 uptime - Highly Available. • Predictive Analysis, Root cause. Real-time ETL. • Update your application while it's running! A/B testing (2h2014). • Develop faster and implement any business logic with our open-source framework. • Runs on your existing Apache Hadoop cluster. • Integrate seamlessly with your existing data flow and monitoring stack.
  • 14. Thank You! • DataTorrent • Try Sandbox (https://datatorrent.com) • Free for • Startup Program (Contact us for more details) • Up to 25GB memory usage in production • Non-production clusters • Malhar Open Source (Apache 2.0) project • https://github.com/DataTorrent/Malhar • malhar-users@googlegroups.com • Applications available Jan 2014 • LogStream Application • Map-Reduce Monitor DataTorrent Inc. 3200 Partrick Henry, 2nd Fl Santa Clara, CA 95054 info@datatorrent.com www.datatorrent.com Twitter.com/DataTorrent Facebook.com/DataTorrent

Notes de l'éditeur

  1. Ex