SlideShare une entreprise Scribd logo
Big Data & Messaging

with Artem Bilan
by Pivotal
© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Artem Bilan
Spring Integration Team

abilan@gopivotal.com

Spring Framework
Spring AMQP
Spring XD
Reactor

https://spring.io/team/artembilan

http://stackoverflow.com/users/2756547

http://www.linkedin.com/in/cleric

https://github.com/artembilan

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Messaging and why do I care?

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Messaging is very simple

Headers

Payload

public static void main(String[] args) {
...
}
© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Messaging in Spring IO

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Spring Integration
Written
Uses

Meet IoC!

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
The heart of Spring Integration
–
–
–

Endpoints (Filters) connected through
Channels (Pipes) exchanging
Message

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Spring Integration Message Flow

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
How does it work?

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
What is a Big Data application?
 Not just writing a few MR or Hive/Pig jobs.
 The full lifecycle involves
• Ingestion
• Stream Processing
• Workflow Orchestration
• Enterprise Integration
• Export
• Horizontal scalable deployment
 How do you write one of these?
• Status quo is to combine different projects – not ideal
© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Spring XD
• Unified Platform
• Developer Productivity
• Modular Extensibility
• Distributed Architecture
• Portable Runtime
• Hadoop Distribution Agnostic
• Proven Foundation
• XD = ‘eXtreme Data’
© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Spring XD
Files

Sensors

Mobile

Social

Spring XD Shell
Spring XD Runtime

Taps

Jobs

Compute

Workflow

Redis

Ingest

Streams

Export

Export

RDBMS

Gemfire

HDFS

NoSQL

R, SAS

Predictive modeling
© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Stream Processing Model

How can
http |we make this|easier?
filter
file

Non-linear stream definitions also supported
© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Runtimes
XD Admin

http | filter | file

CLUSTERED NODE

CLUSTERED NODE

CLUSTERED NODE

SINGLE
NODE

HTTP
Module

Filter
Module

File
Module

All
Modules

Rabbit, Redis, (Pluggable)
© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.

In Memory
Transport

http | filter | file
Streams
MessageStore

HTTP
Tail
File
Mail
Twitter
Gemfire
Syslog
TCP
JMS
RabbitMQ
MQTT

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.

Filter
Transformer
Splitter
Aggregator
Groovy Script
Counters
HTTP
JSON
Java Code

File
HDFS
JDBC
TCP
Mail
RabbitMQ
Gemfire
Splunk
MQTT
Dynamic Router
Taps
 “Listen” to data from another stream
• Other stream is unaffected by the tap and unaware of its presence
• EAI ‘Wiretap’

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Analytics
• Simple Counter
• Field Value Counter
– Count occurrences of named
fields

• Aggregate Counter
– Pre-aggregate counts in time
buckets

• Gauge
– last value

• Rich Gauge
– Last value, running
average, min/max
© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.

• Abstract API
• Implementations for
– Memory
– Redis

• Looking at
– Pattern
– JPMML
Jobs
 Jobs are a directed graph of steps
 Steps
• Copy or Process data
• Files, Databases, MR, Pig, Hive, Cascading

 Step executions are persisted
• Checkpointing with restart
• Rich error handling capabilities
 Single node or distributed with data partitioning
 Jobs can be Triggered from streams
 Executing jobs generate a stream of event data

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
SPRING XD

Demo

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
Learn More. Stay Connected.
•
•
•
•
•
•
•
•

Spring IO: https://spring.io/platform
GitHub: https://github.com/spring-projects
Spring Integration: http://projects.spring.io/spring-integration
Spring XD: http://projects.spring.io/spring-xd
Reactor: https://github.com/reactor
EIP: http://www.eaipatterns.com
Spring Batch: http://projects.spring.io/spring-batch
Spring for Hadoop: http://projects.spring.io/spring-hadoop

© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
ありがとう
© 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.

Contenu connexe

En vedette

Presentation a pivotal overview
Presentation   a pivotal overviewPresentation   a pivotal overview
Presentation a pivotal overview
xKinAnx
 
Guest Lecture on Big Data in Business,
Guest Lecture on Big Data in Business, Guest Lecture on Big Data in Business,
Guest Lecture on Big Data in Business,
saravana krishnamurthy
 
Pivotal OSS meetup - MADlib and PivotalR
Pivotal OSS meetup - MADlib and PivotalRPivotal OSS meetup - MADlib and PivotalR
Pivotal OSS meetup - MADlib and PivotalR
go-pivotal
 
Scale Out Your Big Data Apps: The Latest on Pivotal GemFire and GemFire XD
Scale Out Your Big Data Apps: The Latest on Pivotal GemFire and GemFire XDScale Out Your Big Data Apps: The Latest on Pivotal GemFire and GemFire XD
Scale Out Your Big Data Apps: The Latest on Pivotal GemFire and GemFire XD
VMware Tanzu
 
Accelerating Operational Excellence in 2015: Calculating the ROI of Real-Time...
Accelerating Operational Excellence in 2015: Calculating the ROI of Real-Time...Accelerating Operational Excellence in 2015: Calculating the ROI of Real-Time...
Accelerating Operational Excellence in 2015: Calculating the ROI of Real-Time...
Catavolt, Inc.
 
Pivotal Digital Transformation Forum: Data Science Technical Overview
Pivotal Digital Transformation Forum: Data Science Technical OverviewPivotal Digital Transformation Forum: Data Science Technical Overview
Pivotal Digital Transformation Forum: Data Science Technical Overview
VMware Tanzu
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data Analytics
VMware Tanzu
 
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
EMC
 
Why Are Digital Disruptors Successful And How Can You Become One?
Why Are Digital Disruptors Successful And How Can You Become One? Why Are Digital Disruptors Successful And How Can You Become One?
Why Are Digital Disruptors Successful And How Can You Become One?
VMware Tanzu
 
Data as the New Oil: Producing Value in the Oil and Gas Industry
 Data as the New Oil: Producing Value in the Oil and Gas Industry Data as the New Oil: Producing Value in the Oil and Gas Industry
Data as the New Oil: Producing Value in the Oil and Gas Industry
VMware Tanzu
 
Modern Big Data Analytics Tools: An Overview
Modern Big Data Analytics Tools: An OverviewModern Big Data Analytics Tools: An Overview
Modern Big Data Analytics Tools: An Overview
Great Wide Open
 
Women who wrote the analytics book final
Women who wrote the analytics book finalWomen who wrote the analytics book final
Women who wrote the analytics book final
metabrown
 
Pivotal Data Lake Architecture & its role in security analytics
Pivotal Data Lake Architecture & its role in security analyticsPivotal Data Lake Architecture & its role in security analytics
Pivotal Data Lake Architecture & its role in security analytics
EMC
 
Pivotal Digital Transformation Forum: Data Science Bridging the Gap
Pivotal Digital Transformation Forum: Data Science Bridging the GapPivotal Digital Transformation Forum: Data Science Bridging the Gap
Pivotal Digital Transformation Forum: Data Science Bridging the Gap
VMware Tanzu
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
Cloudera, Inc.
 
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Cloudera, Inc.
 
2014 Big_Data_Forum_Pivotal
2014 Big_Data_Forum_Pivotal2014 Big_Data_Forum_Pivotal
2014 Big_Data_Forum_Pivotal
COMPUTEX TAIPEI
 
Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
VMware Tanzu
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data Strategy
Cloudera, Inc.
 
Becoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeBecoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural Change
Cloudera, Inc.
 

En vedette (20)

Presentation a pivotal overview
Presentation   a pivotal overviewPresentation   a pivotal overview
Presentation a pivotal overview
 
Guest Lecture on Big Data in Business,
Guest Lecture on Big Data in Business, Guest Lecture on Big Data in Business,
Guest Lecture on Big Data in Business,
 
Pivotal OSS meetup - MADlib and PivotalR
Pivotal OSS meetup - MADlib and PivotalRPivotal OSS meetup - MADlib and PivotalR
Pivotal OSS meetup - MADlib and PivotalR
 
Scale Out Your Big Data Apps: The Latest on Pivotal GemFire and GemFire XD
Scale Out Your Big Data Apps: The Latest on Pivotal GemFire and GemFire XDScale Out Your Big Data Apps: The Latest on Pivotal GemFire and GemFire XD
Scale Out Your Big Data Apps: The Latest on Pivotal GemFire and GemFire XD
 
Accelerating Operational Excellence in 2015: Calculating the ROI of Real-Time...
Accelerating Operational Excellence in 2015: Calculating the ROI of Real-Time...Accelerating Operational Excellence in 2015: Calculating the ROI of Real-Time...
Accelerating Operational Excellence in 2015: Calculating the ROI of Real-Time...
 
Pivotal Digital Transformation Forum: Data Science Technical Overview
Pivotal Digital Transformation Forum: Data Science Technical OverviewPivotal Digital Transformation Forum: Data Science Technical Overview
Pivotal Digital Transformation Forum: Data Science Technical Overview
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data Analytics
 
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
 
Why Are Digital Disruptors Successful And How Can You Become One?
Why Are Digital Disruptors Successful And How Can You Become One? Why Are Digital Disruptors Successful And How Can You Become One?
Why Are Digital Disruptors Successful And How Can You Become One?
 
Data as the New Oil: Producing Value in the Oil and Gas Industry
 Data as the New Oil: Producing Value in the Oil and Gas Industry Data as the New Oil: Producing Value in the Oil and Gas Industry
Data as the New Oil: Producing Value in the Oil and Gas Industry
 
Modern Big Data Analytics Tools: An Overview
Modern Big Data Analytics Tools: An OverviewModern Big Data Analytics Tools: An Overview
Modern Big Data Analytics Tools: An Overview
 
Women who wrote the analytics book final
Women who wrote the analytics book finalWomen who wrote the analytics book final
Women who wrote the analytics book final
 
Pivotal Data Lake Architecture & its role in security analytics
Pivotal Data Lake Architecture & its role in security analyticsPivotal Data Lake Architecture & its role in security analytics
Pivotal Data Lake Architecture & its role in security analytics
 
Pivotal Digital Transformation Forum: Data Science Bridging the Gap
Pivotal Digital Transformation Forum: Data Science Bridging the GapPivotal Digital Transformation Forum: Data Science Bridging the Gap
Pivotal Digital Transformation Forum: Data Science Bridging the Gap
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
 
2014 Big_Data_Forum_Pivotal
2014 Big_Data_Forum_Pivotal2014 Big_Data_Forum_Pivotal
2014 Big_Data_Forum_Pivotal
 
Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data Strategy
 
Becoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeBecoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural Change
 

Similaire à Big data and messaging with Spring

Institutionalizing Open Source - Puneet Sachdev - Nasscom Tech Series - June ...
Institutionalizing Open Source - Puneet Sachdev - Nasscom Tech Series - June ...Institutionalizing Open Source - Puneet Sachdev - Nasscom Tech Series - June ...
Institutionalizing Open Source - Puneet Sachdev - Nasscom Tech Series - June ...
Puneet Sachdev
 
Building Highly Scalable Spring Applications using In-Memory Data Grids
Building Highly Scalable Spring Applications using In-Memory Data GridsBuilding Highly Scalable Spring Applications using In-Memory Data Grids
Building Highly Scalable Spring Applications using In-Memory Data Grids
John Blum
 
Asynchronous Event Streams – when java.util.stream met org.osgi.util.promise!...
Asynchronous Event Streams – when java.util.stream met org.osgi.util.promise!...Asynchronous Event Streams – when java.util.stream met org.osgi.util.promise!...
Asynchronous Event Streams – when java.util.stream met org.osgi.util.promise!...
mfrancis
 
IoT architecture
IoT architectureIoT architecture
IoT architecture
Sumit Sharma
 
Get to know the browser better and write faster web apps
Get to know the browser better   and write faster web appsGet to know the browser better   and write faster web apps
Get to know the browser better and write faster web apps
Lior Bar-On
 
S2DS London 2015 - Hadoop Real World
S2DS London 2015 - Hadoop Real WorldS2DS London 2015 - Hadoop Real World
S2DS London 2015 - Hadoop Real World
Sean Roberts
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
Open APIs - Risks and Rewards (Øredev 2013)
Open APIs - Risks and Rewards (Øredev 2013)Open APIs - Risks and Rewards (Øredev 2013)
Open APIs - Risks and Rewards (Øredev 2013)
Nordic APIs
 
Node summit workshop
Node summit workshopNode summit workshop
Node summit workshop
Shubhra Kar
 
HTTP/2 and a Faster Web
HTTP/2 and a Faster WebHTTP/2 and a Faster Web
HTTP/2 and a Faster Web
C4Media
 
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
Srivatsan Ramanujam
 
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
Khalid Salama
 
Web Architecture - Mechanism and Threats
Web Architecture - Mechanism and ThreatsWeb Architecture - Mechanism and Threats
Web Architecture - Mechanism and Threats
Sumedt Jitpukdebodin
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Rittman Analytics
 
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerceWhat We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
VMware Tanzu
 
gRPC, GraphQL, REST - Which API Tech to use - API Conference Berlin oct 20
gRPC, GraphQL, REST - Which API Tech to use - API Conference Berlin oct 20gRPC, GraphQL, REST - Which API Tech to use - API Conference Berlin oct 20
gRPC, GraphQL, REST - Which API Tech to use - API Conference Berlin oct 20
Phil Wilkins
 
Building a Stock Prediction system with Machine Learning using Geode, SpringX...
Building a Stock Prediction system with Machine Learning using Geode, SpringX...Building a Stock Prediction system with Machine Learning using Geode, SpringX...
Building a Stock Prediction system with Machine Learning using Geode, SpringX...
William Markito Oliveira
 
Automation + dev ops summit hail hydrate! from stream to lake
Automation + dev ops summit   hail hydrate! from stream to lakeAutomation + dev ops summit   hail hydrate! from stream to lake
Automation + dev ops summit hail hydrate! from stream to lake
Timothy Spann
 
Modernizing an Existing SOA-based Architecture with APIs
Modernizing an Existing SOA-based Architecture with APIsModernizing an Existing SOA-based Architecture with APIs
Modernizing an Existing SOA-based Architecture with APIs
Apigee | Google Cloud
 
How to build unified Batch & Streaming Pipelines with Apache Beam and Dataflow
How to build unified Batch & Streaming Pipelines with Apache Beam and DataflowHow to build unified Batch & Streaming Pipelines with Apache Beam and Dataflow
How to build unified Batch & Streaming Pipelines with Apache Beam and Dataflow
Daniel Zivkovic
 

Similaire à Big data and messaging with Spring (20)

Institutionalizing Open Source - Puneet Sachdev - Nasscom Tech Series - June ...
Institutionalizing Open Source - Puneet Sachdev - Nasscom Tech Series - June ...Institutionalizing Open Source - Puneet Sachdev - Nasscom Tech Series - June ...
Institutionalizing Open Source - Puneet Sachdev - Nasscom Tech Series - June ...
 
Building Highly Scalable Spring Applications using In-Memory Data Grids
Building Highly Scalable Spring Applications using In-Memory Data GridsBuilding Highly Scalable Spring Applications using In-Memory Data Grids
Building Highly Scalable Spring Applications using In-Memory Data Grids
 
Asynchronous Event Streams – when java.util.stream met org.osgi.util.promise!...
Asynchronous Event Streams – when java.util.stream met org.osgi.util.promise!...Asynchronous Event Streams – when java.util.stream met org.osgi.util.promise!...
Asynchronous Event Streams – when java.util.stream met org.osgi.util.promise!...
 
IoT architecture
IoT architectureIoT architecture
IoT architecture
 
Get to know the browser better and write faster web apps
Get to know the browser better   and write faster web appsGet to know the browser better   and write faster web apps
Get to know the browser better and write faster web apps
 
S2DS London 2015 - Hadoop Real World
S2DS London 2015 - Hadoop Real WorldS2DS London 2015 - Hadoop Real World
S2DS London 2015 - Hadoop Real World
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Open APIs - Risks and Rewards (Øredev 2013)
Open APIs - Risks and Rewards (Øredev 2013)Open APIs - Risks and Rewards (Øredev 2013)
Open APIs - Risks and Rewards (Øredev 2013)
 
Node summit workshop
Node summit workshopNode summit workshop
Node summit workshop
 
HTTP/2 and a Faster Web
HTTP/2 and a Faster WebHTTP/2 and a Faster Web
HTTP/2 and a Faster Web
 
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
 
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
 
Web Architecture - Mechanism and Threats
Web Architecture - Mechanism and ThreatsWeb Architecture - Mechanism and Threats
Web Architecture - Mechanism and Threats
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
 
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerceWhat We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
 
gRPC, GraphQL, REST - Which API Tech to use - API Conference Berlin oct 20
gRPC, GraphQL, REST - Which API Tech to use - API Conference Berlin oct 20gRPC, GraphQL, REST - Which API Tech to use - API Conference Berlin oct 20
gRPC, GraphQL, REST - Which API Tech to use - API Conference Berlin oct 20
 
Building a Stock Prediction system with Machine Learning using Geode, SpringX...
Building a Stock Prediction system with Machine Learning using Geode, SpringX...Building a Stock Prediction system with Machine Learning using Geode, SpringX...
Building a Stock Prediction system with Machine Learning using Geode, SpringX...
 
Automation + dev ops summit hail hydrate! from stream to lake
Automation + dev ops summit   hail hydrate! from stream to lakeAutomation + dev ops summit   hail hydrate! from stream to lake
Automation + dev ops summit hail hydrate! from stream to lake
 
Modernizing an Existing SOA-based Architecture with APIs
Modernizing an Existing SOA-based Architecture with APIsModernizing an Existing SOA-based Architecture with APIs
Modernizing an Existing SOA-based Architecture with APIs
 
How to build unified Batch & Streaming Pipelines with Apache Beam and Dataflow
How to build unified Batch & Streaming Pipelines with Apache Beam and DataflowHow to build unified Batch & Streaming Pipelines with Apache Beam and Dataflow
How to build unified Batch & Streaming Pipelines with Apache Beam and Dataflow
 

Dernier

HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
jpupo2018
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 

Dernier (20)

HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 

Big data and messaging with Spring

  • 1. Big Data & Messaging with Artem Bilan by Pivotal © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 2. Artem Bilan Spring Integration Team abilan@gopivotal.com Spring Framework Spring AMQP Spring XD Reactor https://spring.io/team/artembilan http://stackoverflow.com/users/2756547 http://www.linkedin.com/in/cleric https://github.com/artembilan © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 3. Messaging and why do I care? © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 4. Messaging is very simple Headers Payload public static void main(String[] args) { ... } © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 5. Messaging in Spring IO © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 6. Spring Integration Written Uses Meet IoC! © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 7. The heart of Spring Integration – – – Endpoints (Filters) connected through Channels (Pipes) exchanging Message © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 8. Spring Integration Message Flow © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 9. How does it work? © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 10. What is a Big Data application?  Not just writing a few MR or Hive/Pig jobs.  The full lifecycle involves • Ingestion • Stream Processing • Workflow Orchestration • Enterprise Integration • Export • Horizontal scalable deployment  How do you write one of these? • Status quo is to combine different projects – not ideal © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 11. Spring XD • Unified Platform • Developer Productivity • Modular Extensibility • Distributed Architecture • Portable Runtime • Hadoop Distribution Agnostic • Proven Foundation • XD = ‘eXtreme Data’ © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 12. Spring XD Files Sensors Mobile Social Spring XD Shell Spring XD Runtime Taps Jobs Compute Workflow Redis Ingest Streams Export Export RDBMS Gemfire HDFS NoSQL R, SAS Predictive modeling © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 13. Stream Processing Model How can http |we make this|easier? filter file Non-linear stream definitions also supported © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 14. Runtimes XD Admin http | filter | file CLUSTERED NODE CLUSTERED NODE CLUSTERED NODE SINGLE NODE HTTP Module Filter Module File Module All Modules Rabbit, Redis, (Pluggable) © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission. In Memory Transport http | filter | file
  • 15. Streams MessageStore HTTP Tail File Mail Twitter Gemfire Syslog TCP JMS RabbitMQ MQTT © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission. Filter Transformer Splitter Aggregator Groovy Script Counters HTTP JSON Java Code File HDFS JDBC TCP Mail RabbitMQ Gemfire Splunk MQTT Dynamic Router
  • 16. Taps  “Listen” to data from another stream • Other stream is unaffected by the tap and unaware of its presence • EAI ‘Wiretap’ © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 17. Analytics • Simple Counter • Field Value Counter – Count occurrences of named fields • Aggregate Counter – Pre-aggregate counts in time buckets • Gauge – last value • Rich Gauge – Last value, running average, min/max © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission. • Abstract API • Implementations for – Memory – Redis • Looking at – Pattern – JPMML
  • 18. Jobs  Jobs are a directed graph of steps  Steps • Copy or Process data • Files, Databases, MR, Pig, Hive, Cascading  Step executions are persisted • Checkpointing with restart • Rich error handling capabilities  Single node or distributed with data partitioning  Jobs can be Triggered from streams  Executing jobs generate a stream of event data © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 19. SPRING XD Demo © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 20. Learn More. Stay Connected. • • • • • • • • Spring IO: https://spring.io/platform GitHub: https://github.com/spring-projects Spring Integration: http://projects.spring.io/spring-integration Spring XD: http://projects.spring.io/spring-xd Reactor: https://github.com/reactor EIP: http://www.eaipatterns.com Spring Batch: http://projects.spring.io/spring-batch Spring for Hadoop: http://projects.spring.io/spring-hadoop © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.
  • 21. ありがとう © 2013 Pivotal Inc. All rights reserved. Do not distribute without permission.

Notes de l'éditeur

  1. Unified platform across big data domainsStream processing: ingestion & analyticsBatch processing workflow orchestration & exportProductivityHigh level DSL for managing streams and jobsProven foundation Built on existing assets: Spring Batch, Integration, DataExtensibleDI, Test friendly…DistributedA | B | C - Pluggable transports: Rabbit, Redis, …Portable Runtime Standalone – Simplicity and testabilityYARN –Fault Tolerance and ScalabilityIn-memory data grids – co-location of reference data.PAASHadoop Distribution Agnostic
  2. Now have a unified platform that handle the whole thing.TODO – get rid of mobile?....XD runtime address both stream and batch processing in a unfired manner.XD runtime address both stream and batch processing in a unfired manner.
  3. Distributed mode, process boundaries
  4. Now have a unified platform that handle the whole thing.TODO – get rid of mobile?....XD runtime address both stream and batch processing in a unfired manner.XD runtime address both stream and batch processing in a unfired manner.
  5. Now have a unified platform that handle the whole thing.TODO – get rid of mobile?....XD runtime address both stream and batch processing in a unfired manner.XD runtime address both stream and batch processing in a unfired manner.
  6. Now have a unified platform that handle the whole thing.TODO – get rid of mobile?....XD runtime address both stream and batch processing in a unfired manner.XD runtime address both stream and batch processing in a unfired manner.
  7. http | filehttp | hdfstwittersearch | hdfsaggregatecounter on hashtags