Fault Tolerance and Job Recovery in Apache Flink @ FlinkForward 2015

•

9 j'aime•3,472 vues

The talk explains how Apache Flink checkpoints stateful jobs using the asynchronous barrier snapshotting algorithm to give exactly once semantics in streaming. Furthermore, Flink's approach to master high availability (HA) is described which solves the problem of the JobManager being the single point of failure. Job checkpointing in combination with HA is the basis for Flink's fault tolerance mechanism to recover from occurring failures.

Technologie

Fault Tolerance and Job
Recovery in Apache Flink™
Till Rohrmann
trohrmann@apache.org
@stsffap

Better be safe than sorry
§  Failures will happen
§  EMC estimated $1.7 billion costs due to
data loss and system downtime
§  Recovery will save you time and costs
§  Switch between algorithms
§  Live upgrade of your system
3

Fault tolerance guarantees
§  At most once
•  No guarantees at all
§  At least once
•  For many applications
sufﬁcient
§  Exactly once
§  Flink provides all guarantees
5

Checkpoints
§  Consistent snapshots of distributed data
stream and operator state
6

Barriers
§  Markers for checkpoints
§  Injected in the data ﬂow
7

8
§  Alignment for multi-input operators

$Operator State §  Stateless operators §  System state §  User deﬁned state 9 ds.filter(_ != 0) ds.keyBy(0).window(TumblingTimeWindows.of(5, TimeUnit.SECONDS)) public class CounterSum implements RichReduceFunction<Long> { private OperatorState<Long> counter; @Override public Long reduce(Long v1, Long v2) throws Exception { counter.update(counter.value() + 1); return v1 + v2; } @Override public void open(Configuration config) { counter = getRuntimeContext().getOperatorState(“counter”, 0L, false); } }$

Advantages
§  Separation of app logic from recovery
•  Checkpointing interval is just a conﬁg
parameter
§  High throughput
•  Controllable checkpointing overhead
§  Low impact on latency
14

Without high availability
17
JobManager
TaskManager

With high availability
18
JobManager
TaskManager
Stand-by
JobManager
Apache Zookeeper™
KEEP GOING

Persisting jobs
19
JobManager
Client
TaskManagers
Apache Zookeeper™
Job
1.  Submit job

Persisting jobs
20
JobManager
Client
TaskManagers
Apache Zookeeper™
1.  Submit job
2.  Persist execuAon graph

Persisting jobs
21
JobManager
Client
TaskManagers
Apache Zookeeper™
1.  Submit job
2.  Persist execuAon graph
3.  Write handle to ZooKeeper

Persisting jobs
22
JobManager
Client
TaskManagers
Apache Zookeeper™
1.  Submit job
2.  Persist execuAon graph
3.  Write handle to ZooKeeper
4.  Deploy tasks

Handling checkpoints
23
JobManager
Client
TaskManagers
Apache Zookeeper™
1.  Take snapshots

Handling checkpoints
24
JobManager
Client
TaskManagers
Apache Zookeeper™
1.  Take snapshots
2.  Persist snapshots
3.  Send handles to JM

Handling checkpoints
25
JobManager
Client
TaskManagers
Apache Zookeeper™
1.  Take snapshots
2.  Persist snapshots
3.  Send handles to JM
4.  Create global checkpoint

Handling checkpoints
26
JobManager
Client
TaskManagers
Apache Zookeeper™
1.  Take snapshots
2.  Persist snapshots
3.  Send handles to JM
4.  Create global checkpoint
5.  Persist global checkpoint

Handling checkpoints
27
JobManager
Client
TaskManagers
Apache Zookeeper™
1.  Take snapshots
2.  Persist snapshots
3.  Send handles to JM
4.  Create global checkpoint
5.  Persist global checkpoint
6.  Write handle to ZooKeeper

TL;DL
§  Job recovery mechanism with low latency
and high throughput
§  Exactly one processing semantics
§  No single point of failure
è Flink will always keep processing
your data
31

Recommandé

Aljoscha Krettek - The Future of Apache FlinkFlink Forward

Till Rohrmann – Fault Tolerance and Job Recovery in Apache FlinkFlink Forward

Apache Flink at Strata San Jose 2016Kostas Tzoumas

Tech Talk @ Google on Flink Fault Tolerance and HAParis Carbone

Apache Flink: Streaming Done Right @ FOSDEM 2016Till Rohrmann

Tran Nam-Luc – Stale Synchronous Parallel Iterations on FlinkFlink Forward

Apache Flink Berlin Meetup May 2016Stephan Ewen

Computing recommendations at extreme scale with Apache Flink @Buzzwords 2015Till Rohrmann

Recommandé

Aljoscha Krettek - The Future of Apache FlinkFlink Forward

Till Rohrmann – Fault Tolerance and Job Recovery in Apache FlinkFlink Forward

Apache Flink at Strata San Jose 2016Kostas Tzoumas

Tech Talk @ Google on Flink Fault Tolerance and HAParis Carbone

Apache Flink: Streaming Done Right @ FOSDEM 2016Till Rohrmann

Tran Nam-Luc – Stale Synchronous Parallel Iterations on FlinkFlink Forward

Apache Flink Berlin Meetup May 2016Stephan Ewen

Computing recommendations at extreme scale with Apache Flink @Buzzwords 2015Till Rohrmann

Stephan Ewen - Experiences running Flink at Very Large ScaleVerverica

Unified Stream and Batch Processing with Apache FlinkDataWorks Summit/Hadoop Summit

Flink Forward SF 2017: Cliff Resnick & Seth Wiesman - From Zero to Streami...Flink Forward

Pulsar connector on flink 1.14宇帆盛

Matthias J. Sax – A Tale of Squirrels and StormsFlink Forward

Flink Forward SF 2017: Stephan Ewen - Convergence of real-time analytics and ...Flink Forward

Keynote: Building and Operating A Serverless Streaming Runtime for Apache Bea...Flink Forward

Big Data WarsawMaximilian Michels

Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry confluent

Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...Flink Forward

Stream Processing with Apache FlinkC4Media

Flink Forward SF 2017: Till Rohrmann - Redesigning Apache Flink’s Distributed...Flink Forward

Flink Forward Berlin 2017: Steffen Hausmann - Build a Real-time Stream Proces...Flink Forward

Flink forward SF 2017: Elizabeth K. Joseph and Ravi Yadav - Flink meet DC/OS ...Flink Forward

Flink Streaming @BudapestDataGyula Fóra

A look at Flink 1.2Stefan Richter

Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...Flink Forward

Flink Forward Berlin 2017: Maciek Próchniak - TouK Nussknacker - creating Fli...Flink Forward

Flink Forward SF 2017: Joe Olson - Using Flink and Queryable State to Buffer ...Flink Forward

Apache flink 1.0.0 overviewMapR Technologies

Dynamic Scaling: How Apache Flink Adapts to Changing Workloads (at FlinkForwa...Till Rohrmann

Click-Through Example for Flink’s KafkaConsumer CheckpointingRobert Metzger

Contenu connexe

Tendances

Stephan Ewen - Experiences running Flink at Very Large ScaleVerverica

Unified Stream and Batch Processing with Apache FlinkDataWorks Summit/Hadoop Summit

Flink Forward SF 2017: Cliff Resnick & Seth Wiesman - From Zero to Streami...Flink Forward

Pulsar connector on flink 1.14宇帆盛

Matthias J. Sax – A Tale of Squirrels and StormsFlink Forward

Flink Forward SF 2017: Stephan Ewen - Convergence of real-time analytics and ...Flink Forward

Keynote: Building and Operating A Serverless Streaming Runtime for Apache Bea...Flink Forward

Big Data WarsawMaximilian Michels

Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry confluent

Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...Flink Forward

Stream Processing with Apache FlinkC4Media

Flink Forward SF 2017: Till Rohrmann - Redesigning Apache Flink’s Distributed...Flink Forward

Flink Forward Berlin 2017: Steffen Hausmann - Build a Real-time Stream Proces...Flink Forward

Flink forward SF 2017: Elizabeth K. Joseph and Ravi Yadav - Flink meet DC/OS ...Flink Forward

Flink Streaming @BudapestDataGyula Fóra

A look at Flink 1.2Stefan Richter

Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...Flink Forward

Flink Forward Berlin 2017: Maciek Próchniak - TouK Nussknacker - creating Fli...Flink Forward

Flink Forward SF 2017: Joe Olson - Using Flink and Queryable State to Buffer ...Flink Forward

Apache flink 1.0.0 overviewMapR Technologies

Tendances (20)

Stephan Ewen - Experiences running Flink at Very Large Scale

Unified Stream and Batch Processing with Apache Flink

Flink Forward SF 2017: Cliff Resnick & Seth Wiesman - From Zero to Streami...

Pulsar connector on flink 1.14

Matthias J. Sax – A Tale of Squirrels and Storms

Flink Forward SF 2017: Stephan Ewen - Convergence of real-time analytics and ...

Keynote: Building and Operating A Serverless Streaming Runtime for Apache Bea...

Big Data Warsaw

Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry

Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...

Stream Processing with Apache Flink

Flink Forward SF 2017: Till Rohrmann - Redesigning Apache Flink’s Distributed...

Flink Forward Berlin 2017: Steffen Hausmann - Build a Real-time Stream Proces...

Flink forward SF 2017: Elizabeth K. Joseph and Ravi Yadav - Flink meet DC/OS ...

Flink Streaming @BudapestData

A look at Flink 1.2

Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...

Flink Forward Berlin 2017: Maciek Próchniak - TouK Nussknacker - creating Fli...

Flink Forward SF 2017: Joe Olson - Using Flink and Queryable State to Buffer ...

Apache flink 1.0.0 overview

En vedette

Dynamic Scaling: How Apache Flink Adapts to Changing Workloads (at FlinkForwa...Till Rohrmann

Click-Through Example for Flink’s KafkaConsumer CheckpointingRobert Metzger

Gilbert: Declarative Sparse Linear Algebra on Massively Parallel Dataflow Sys...Till Rohrmann

Streaming Data Flow with Apache Flink @ Paris Flink Meetup 2015Till Rohrmann

Interactive Data Analysis with Apache Flink @ Flink Meetup in BerlinTill Rohrmann

Introduction to Apache Flink - Fast and reliable big data processingTill Rohrmann

Streaming Analytics & CEP - Two sides of the same coin?Till Rohrmann

Juggling with Bits and Bytes - How Apache Flink operates on binary dataFabian Hueske

High availability and fault tolerance of openstackDeepak Mane

Apache Flink Hands OnRobert Metzger

Machine Learning with Apache Flink at Stockholm Machine Learning GroupTill Rohrmann

Eron Wright - Flink Security EnhancementsFlink Forward

Real-Time Streaming Data on AWSAmazon Web Services

Step-by-Step Introduction to Apache Flink Slim Baltagi

En vedette (14)

Dynamic Scaling: How Apache Flink Adapts to Changing Workloads (at FlinkForwa...

Click-Through Example for Flink’s KafkaConsumer Checkpointing

Gilbert: Declarative Sparse Linear Algebra on Massively Parallel Dataflow Sys...

Streaming Data Flow with Apache Flink @ Paris Flink Meetup 2015

Interactive Data Analysis with Apache Flink @ Flink Meetup in Berlin

Introduction to Apache Flink - Fast and reliable big data processing

Streaming Analytics & CEP - Two sides of the same coin?

Juggling with Bits and Bytes - How Apache Flink operates on binary data

High availability and fault tolerance of openstack

Apache Flink Hands On

Machine Learning with Apache Flink at Stockholm Machine Learning Group

Eron Wright - Flink Security Enhancements

Real-Time Streaming Data on AWS

Step-by-Step Introduction to Apache Flink

Similaire à Fault Tolerance and Job Recovery in Apache Flink @ FlinkForward 2015

(SDD423) Elastic Load Balancing Deep Dive and Best Practices | AWS re:Invent ...Amazon Web Services

Fault toleranceMichał Waleszczuk

When it Absolutely, Positively, Has to be There: Reliability Guarantees in Ka...confluent

Mario Fusco - Reactive programming in Java - Codemotion Milan 2017Codemotion

Exposing and Fixing Common App Performance ProblemsRiverbed Technology

Strata Singapore: GearpumpReal time DAG-Processing with Akka at ScaleSean Zhong

Flexible and Real-Time Stream Processing with Apache FlinkDataWorks Summit

Analitica de datos en tiempo real con Apache Flink y Apache BEAMjavier ramirez

Flink 0.10 - Upcoming FeaturesAljoscha Krettek

An introduction to_rac_system_test_planning_methodsAjith Narayanan

When Web Services Go BadSteve Loughran

ETSI NFV#13 NFV resiliency presentation - ali kafel - stratusAli Kafel

(CMP401) Elastic Load Balancing Deep Dive and Best PracticesAmazon Web Services

(CMP402) Amazon EC2 Instances Deep DiveAmazon Web Services

Network and distributed systemsSri Prasanna

Deep Dive on Elastic Load BalancingAmazon Web Services

Solve the colocation conundrum: Performance and density at scale with KubernetesNiklas Quarfot Nielsen

Deep Dive on Elastic Load BalancingAmazon Web Services

Software architecture for data applicationsDing Li

Oracle appsloadtestbestpracticessonusaini69

Similaire à Fault Tolerance and Job Recovery in Apache Flink @ FlinkForward 2015 (20)

(SDD423) Elastic Load Balancing Deep Dive and Best Practices | AWS re:Invent ...

Fault tolerance

When it Absolutely, Positively, Has to be There: Reliability Guarantees in Ka...

Mario Fusco - Reactive programming in Java - Codemotion Milan 2017

Exposing and Fixing Common App Performance Problems

Strata Singapore: GearpumpReal time DAG-Processing with Akka at Scale

Flexible and Real-Time Stream Processing with Apache Flink

Analitica de datos en tiempo real con Apache Flink y Apache BEAM

Flink 0.10 - Upcoming Features

An introduction to_rac_system_test_planning_methods

When Web Services Go Bad

ETSI NFV#13 NFV resiliency presentation - ali kafel - stratus

(CMP401) Elastic Load Balancing Deep Dive and Best Practices

(CMP402) Amazon EC2 Instances Deep Dive

Network and distributed systems

Deep Dive on Elastic Load Balancing

Solve the colocation conundrum: Performance and density at scale with Kubernetes

Deep Dive on Elastic Load Balancing

Software architecture for data applications

Oracle appsloadtestbestpractices

Plus de Till Rohrmann

Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink F...Till Rohrmann

Apache flink 1.7 and BeyondTill Rohrmann

Elastic Streams at Scale @ Flink Forward 2018 BerlinTill Rohrmann

Scaling stream data pipelines with Pravega and Apache FlinkTill Rohrmann

Modern Stream Processing With Apache Flink @ GOTO Berlin 2017Till Rohrmann

Apache Flink Meets Apache Mesos And DC/OS @ Mesos Meetup BerlinTill Rohrmann

Apache Flink® Meets Apache Mesos® and DC/OSTill Rohrmann

From Apache Flink® 1.3 to 1.4Till Rohrmann

Apache Flink and More @ MesosCon Asia 2017Till Rohrmann

Redesigning Apache Flink's Distributed Architecture @ Flink Forward 2017Till Rohrmann

Plus de Till Rohrmann (10)

Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink F...

Apache flink 1.7 and Beyond

Elastic Streams at Scale @ Flink Forward 2018 Berlin

Scaling stream data pipelines with Pravega and Apache Flink

Modern Stream Processing With Apache Flink @ GOTO Berlin 2017

Apache Flink Meets Apache Mesos And DC/OS @ Mesos Meetup Berlin

Apache Flink® Meets Apache Mesos® and DC/OS

From Apache Flink® 1.3 to 1.4

Apache Flink and More @ MesosCon Asia 2017

Redesigning Apache Flink's Distributed Architecture @ Flink Forward 2017

Dernier

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge

Automating Google Workspace (GWS) & more with Apps Scriptwesley chun

presentation ICT roal in 21st century educationjfdjdjcjdnsjd

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

Artificial Intelligence: Facts and MythsJoaquim Jorge

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

Histor y of HAM Radio presentation slidevu2urc

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

🐬 The future of MySQL is Postgres 🐘RTylerCroy

Partners Life - Insurer Innovation Award 2024The Digital Insurer

IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

How to convert PDF to text with Nanonetsnaman860154

Dernier (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

Automating Google Workspace (GWS) & more with Apps Script

presentation ICT roal in 21st century education

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

Artificial Intelligence: Facts and Myths

Boost PC performance: How more available memory can improve productivity

Histor y of HAM Radio presentation slide

How to Troubleshoot Apps for the Modern Connected Worker

🐬 The future of MySQL is Postgres 🐘

Partners Life - Insurer Innovation Award 2024

IAC 2024 - IA Fast Track to Search Focused AI Solutions

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Boost Fertility New Invention Ups Success Rates.pdf

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

08448380779 Call Girls In Civil Lines Women Seeking Men

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

Data Cloud, More than a CDP by Matt Robison

How to convert PDF to text with Nanonets

Fault Tolerance and Job Recovery in Apache Flink @ FlinkForward 2015

1. Fault Tolerance and Job Recovery in Apache Flink™ Till Rohrmann trohrmann@apache.org @stsffap

2. 2

3. Better be safe than sorry §  Failures will happen §  EMC estimated $1.7 billion costs due to data loss and system downtime §  Recovery will save you time and costs §  Switch between algorithms §  Live upgrade of your system 3

4. Fault Tolerance 4

5. Fault tolerance guarantees §  At most once •  No guarantees at all §  At least once •  For many applications sufﬁcient §  Exactly once §  Flink provides all guarantees 5

6. Checkpoints §  Consistent snapshots of distributed data stream and operator state 6

7. Barriers §  Markers for checkpoints §  Injected in the data ﬂow 7

8. 8 §  Alignment for multi-input operators

9. Operator State §  Stateless operators §  System state §  User deﬁned state 9 ds.filter(_ != 0) ds.keyBy(0).window(TumblingTimeWindows.of(5, TimeUnit.SECONDS)) public class CounterSum implements RichReduceFunction<Long> { private OperatorState<Long> counter; @Override public Long reduce(Long v1, Long v2) throws Exception { counter.update(counter.value() + 1); return v1 + v2; } @Override public void open(Configuration config) { counter = getRuntimeContext().getOperatorState(“counter”, 0L, false); } }

10. 10

11. 11

12. 12

13. 13

14. Advantages §  Separation of app logic from recovery •  Checkpointing interval is just a conﬁg parameter §  High throughput •  Controllable checkpointing overhead §  Low impact on latency 14

15. 15

16. Cluster High Availability 16

17. Without high availability 17 JobManager TaskManager

18. With high availability 18 JobManager TaskManager Stand-by JobManager Apache Zookeeper™ KEEP GOING

19. Persisting jobs 19 JobManager Client TaskManagers Apache Zookeeper™ Job 1.  Submit job

20. Persisting jobs 20 JobManager Client TaskManagers Apache Zookeeper™ 1.  Submit job 2.  Persist execuAon graph

21. Persisting jobs 21 JobManager Client TaskManagers Apache Zookeeper™ 1.  Submit job 2.  Persist execuAon graph 3.  Write handle to ZooKeeper

22. Persisting jobs 22 JobManager Client TaskManagers Apache Zookeeper™ 1.  Submit job 2.  Persist execuAon graph 3.  Write handle to ZooKeeper 4.  Deploy tasks

23. Handling checkpoints 23 JobManager Client TaskManagers Apache Zookeeper™ 1.  Take snapshots

24. Handling checkpoints 24 JobManager Client TaskManagers Apache Zookeeper™ 1.  Take snapshots 2.  Persist snapshots 3.  Send handles to JM

25. Handling checkpoints 25 JobManager Client TaskManagers Apache Zookeeper™ 1.  Take snapshots 2.  Persist snapshots 3.  Send handles to JM 4.  Create global checkpoint

26. Handling checkpoints 26 JobManager Client TaskManagers Apache Zookeeper™ 1.  Take snapshots 2.  Persist snapshots 3.  Send handles to JM 4.  Create global checkpoint 5.  Persist global checkpoint

27. Handling checkpoints 27 JobManager Client TaskManagers Apache Zookeeper™ 1.  Take snapshots 2.  Persist snapshots 3.  Send handles to JM 4.  Create global checkpoint 5.  Persist global checkpoint 6.  Write handle to ZooKeeper

28. Conclusion 28

29. 29

30. 30

31. TL;DL §  Job recovery mechanism with low latency and high throughput §  Exactly one processing semantics §  No single point of failure è Flink will always keep processing your data 31

32. ﬂink.apache.org @ApacheFlink