SlideShare une entreprise Scribd logo
1  sur  32
Télécharger pour lire hors ligne
Meet the Committers Lab
Preparation
3-May-2023
© 2023 Cloudera, Inc. All rights reserved.
IMPORTANT NOTES
Guidance
Documentation
Daily Zoom Sessions
Examples
Ready Flows
Slack Channel
Flow Proctors
Shared Environment
Since it is shared environment,
each user has access to every
other users’ flow.
No production data should be
used.
We will stop your design
sessions after 4 hours of
inactivity.
© 2023 Cloudera, Inc. All rights reserved.
SANDBOX FROM MAY 3, 2023 to MAY 9 MIDNIGHT, 2023
Sandbox will be destroyed at midnight EST May 9, 2023 before
May 10, 2023.
You must complete your item, Save and Download Your Flows
Before Then.
All data and code will be destroyed on the end of the trial
Submit your flow (CRN), video and text via this form.
© 2023 Cloudera, Inc. All rights reserved.
NAVIGATE IN CHROME TO THE SHARED SANDBOX
https://login.cdpworkshops.cloudera.com/auth/realms/se-workshop-5
/protocol/saml/clients/cdp-sso
© 2023 Cloudera, Inc. All rights reserved.
REGISTRATION
Click Register
Must use Recent
Chrome Browser
© 2023 Cloudera, Inc. All rights reserved.
GETTING STARTED - GUIDED USE CASES
● Syslog to Kafka topic
● Reading and Filtering a Syslog Stream
● Writing Critical Syslog Events to Apache Iceberg
● Must use Recent Chrome Browser
© 2023 Cloudera, Inc. All rights reserved.
BEST IN FLOW COMPETITION - BUILD & DOCUMENT A FLOW
A chance to win a $2,000 Amazon gift card.
A great way to get recognition.
Cloudera public award social media post.
© 2023 Cloudera, Inc. All rights reserved.
BEST IN FLOW COMPETITION - BUILD & DOCUMENT A FLOW
A chance to win a $2,000 Amazon gift card.
A great way to get recognition.
Cloudera public award social media post.
© 2023 Cloudera, Inc. All rights reserved.
WHAT TO BUILD?
You can extend or try one of our tutorials
You can extend or use one of our Ready Flows
You can connect to external resources (passwords are visible,
only use public data or examples)
© 2023 Cloudera, Inc. All rights reserved.
FLOW REQUIREMENTS
The following are the requirements for the flow to be considered eligible for the competition:
1. The flow must be developed using the new DataFlow Designer in the DataFlow Service sandbox.
2. The flow must have at least one “source” data.
3. The flow must have at least one “destination” where the data is delivered
4. The flow must be functional, tested, and working using the Test session feature of the DataFlow Service. The
Data viewer should be used to inspect the data payload within the different flow steps.
5. The flow must be checked into the DataFlow Catalog, deployed using the deployment wizard, and validated that
it is correctly running.
6. Each submitted Flow must include the following additional details:
○ The CRN of the flow was checked into the flow catalog with a detailed description of the flow and use
case.
○ Link to a short blog describing the use case and the flow that was built and deployed using DataFlow
Designer
○ Link to a short video showing the flow running in the Flow Designer with the test session and data
traversing through flow. The Data viewer should be used to inspect the data payload within the different
flow steps.
○ Product feedback on the DataFlow Service.
© 2023 Cloudera, Inc. All rights reserved.
Criteria Description
Complete Flow Artifacts The submitted flow entry contains all the required artifacts, including Flow CRN in the Catalog, a link to
the blog describing the use case and the flow, and a short video link showing the flow running with
data traversing through the flow.
Adheres to NiFi flow best
practices
Follows NiFi flow design best practices like record-oriented processors, controller services, and
parameters.
Showcases NiFi processing
capabilities
Showcases NiFi processing capabilities including protocol bridging, schema transformation, routing,
filtering, enrichment, compression, etc.
Universal Data Distribution The flow showcases multiple data sources and delivers data to multiple destinations.
Uses the latest NiFi processors
and controllers services
Showcases the latest NiFi processors in the latest Apache NiFi release: 1.20, 1.19, 1.18, 1.17,
including PutSnowflakeInternalStage, PutIceberg, UpdateDeltaLakeTable, Amazon ML Processors:
Amazon Web Services Polly, Textract, Translate, and Transcribe services, etc.
ReadyFlow The flow addresses a common data pipeline use case and can be reused by other users hence a good
candidate to be added to the ReadyFlow gallery.
Deployable The flow should be able to be deployed with minimum effort with the appropriate documentation (e.g.:
description of parameters in the parameter context, the blog details, etc..)
© 2023 Cloudera, Inc. All rights reserved.
SANDBOX FLOW DEVELOPMENT BEST PRACTICES
Uniquely Name your
processors/ connections
with yourid_
Parameterize connection
information
Don’t use sensitive data
in sandbox
Don’t use or change other
people’s assets, only your
own
© 2023 Cloudera, Inc. All rights reserved.
Don’t use or change other
people’s assets, only your
own
Reuse components via
Copy and Process Groups
We are here to help reach
out via Slack or Zoom.
SANDBOX FLOW DEVELOPMENT BEST PRACTICES
© 2023 Cloudera, Inc. All rights reserved.
DAILY ZOOM
https://cloudera.zoom.us/j/964
60893376?pwd=eWZEVDhpZm
pFSDNRejFzMXkvcHpOdz09
© 2023 Cloudera, Inc. All rights reserved.
SLACK CHANNEL
https://bestinflow.slack.com/join
/shared_invite/zt-1uj1ti8hc-8mnh
mbr_AbOCD7f~A68P0w#/shared
-invite/email
© 2023 Cloudera, Inc. All rights reserved.
SOURCE CODE AND EXAMPLES
https://github.com/tspannhw/FLaN
K-DataFlows
© 2023 Cloudera, Inc. All rights reserved.
Submit Your Flow
https://docs.google.com/forms/d/1Ku2KSDFoxJy45jiOWuLRDi9Trpgm-42aaxeAVwy-fpo
© 2023 Cloudera, Inc. All rights reserved.
ADDITIONAL RESOURCES
Cloudera DataFlow Designer: The Key to Agile Data Pipeline Development
Streaming Data Ingestion into an Open Data Lakehouse Made Easy with
DataFlow Example
Cloudera DataFlow Designer: Kafka to Iceberg in Cloudera Data Warehouse
Serverless NiFi Flows with DataFlow Functions
DataFlow Functions Technical Demo
DataFlow Documentation
23
© 2023 Cloudera, Inc. All rights reserved.
Marketing
Carolyn
Duby
Field CTO
Field
Meet the Data-In-Motion Team
Tim
Spann
Developer Advocate
Richard Walden
DIM SME Lead
Engineering Product
Chris
Joynt
Product Marketing
Joe
Witt
Engineering Leader
George
Vetticaden
Product Leader
Michael
Kohs
Product Owner for
DataFlow
Pierre
Villard
Product Owner for
DataFlow
Andre
Araujo
Product Owner for
Stream Processing
John Kuchmek
DIM SME Expert
© 2023 Cloudera, Inc. All rights reserved.
WARNING
“Notwithstanding any contrary terms in the Agreement, Customer
acknowledges that information shared using the Trial Product is
in a shared environment with similarly situated customers. All
information in the shared environment is accessible by all other
customers participating in the trial and such information will not
be deemed Confidential Information.”
https://www.cloudera.com/legal/commercial-terms-and-conditio
ns/cdp-public-cloud-trial-agreement.html
© 2023 Cloudera, Inc. All rights reserved.
CONTAINER BASED DATAFLOW
Flow Deployment Flow Monitoring
Allows easy flow deployment based
on NiFi 1.20 across CDP
environments (Dev, QA, Prod)
Define and assign KPIs to your
flows
Easy NiFi version upgrades
Update/Add KPIs, Update
Parameters, Change sizing
configuration
Automatic infrastructure scaling
based on CPU utilization
Central monitoring console for all
your flows across environments
Monitor flow metrics and
infrastructure usage
Define alerts for flows breaching
assigned KPIs
Flow Catalog
Keep track of your flow definitions
and versions in a central catalog
Reuse your existing NiFi flows by
uploading them to the catalog
Discover, search and reuse existing
flows easily
26
© 2023 Cloudera, Inc. All rights reserved.
FLOW CATALOG
• Central repository for flow
definitions
• Import existing NiFi flows
• Manage flow definitions
• Initiate flow deployments
27
© 2023 Cloudera, Inc. All rights reserved.
TURNS FLOW
DEFINITIONS
INTO FLOW
DEPLOYMENTS
2.) NiFi Config
4.) Configure Sizing & Scaling 5.) Define KPIs
1.) Start Deployment Wizard
3.) Provide Parameters for NiFi
28
© 2023 Cloudera, Inc. All rights reserved.
KEY
PERFORMANCE
INDICATORS
• Visibility into flow deployments
• Track high level flow
performance
• Track in-depth NiFi component
metrics
• Defined in Deployment Wizard
• Monitoring & Alerts in
Deployment Details
KPI Definition in Deployment Wizard KPI Monitoring
29
© 2023 Cloudera, Inc. All rights reserved.
DASHBOARD
• Central Monitoring View
• Monitors flow deployments
across CDP environments
• Monitors flow deployment
health & performance
• Drill into flow deployment to
monitor system metrics and
deployment events
30
© 2023 Cloudera, Inc. All rights reserved.
DEPLOYMENT
MANAGER
• Manage flow deployment
lifecycle
(Suspend/Start/Terminate)
• Add/Edit KPIs
• Change sizing configuration
• Update parameters
• Change NiFi version of the
deployment
• Gateway to NiFi canvas
32
© 2023 Cloudera, Inc. All rights reserved.
TH N Y U

Contenu connexe

Similaire à Meet the Committers Webinar_ Lab Preparation

ITPC Building Modern Data Streaming Apps
ITPC Building Modern Data Streaming AppsITPC Building Modern Data Streaming Apps
ITPC Building Modern Data Streaming AppsTimothy Spann
 
Azure Stack Overview (Dec/2018)
Azure Stack Overview (Dec/2018)Azure Stack Overview (Dec/2018)
Azure Stack Overview (Dec/2018)Cenk Ersoy
 
Meetup Streaming Data Pipeline Development
Meetup Streaming Data Pipeline DevelopmentMeetup Streaming Data Pipeline Development
Meetup Streaming Data Pipeline DevelopmentTimothy Spann
 
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023ssuser73434e
 
Unconference Round Table Notes
Unconference Round Table NotesUnconference Round Table Notes
Unconference Round Table NotesTimothy Spann
 
Cloudera-Connector-for-Teradata.pdf
Cloudera-Connector-for-Teradata.pdfCloudera-Connector-for-Teradata.pdf
Cloudera-Connector-for-Teradata.pdfychacha
 
Working with CloudStack Usage Data - CCCEU13
Working with CloudStack Usage Data - CCCEU13Working with CloudStack Usage Data - CCCEU13
Working with CloudStack Usage Data - CCCEU13ShapeBlue
 
Working with CloudStack Usage Data
Working with CloudStack Usage DataWorking with CloudStack Usage Data
Working with CloudStack Usage DataTariq Iqbal
 
002 Introducing Neo4j 5 for Administrators - NODES2022 AMERICAS Beginner 2 - ...
002 Introducing Neo4j 5 for Administrators - NODES2022 AMERICAS Beginner 2 - ...002 Introducing Neo4j 5 for Administrators - NODES2022 AMERICAS Beginner 2 - ...
002 Introducing Neo4j 5 for Administrators - NODES2022 AMERICAS Beginner 2 - ...Neo4j
 
Edge to ai analytics from edge to cloud with efficient movement of machine data
Edge to ai  analytics from edge to cloud with efficient movement of machine dataEdge to ai  analytics from edge to cloud with efficient movement of machine data
Edge to ai analytics from edge to cloud with efficient movement of machine dataTimothy Spann
 
Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24
Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24
Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24Neo4j
 
Cloudfoundry Introduction
Cloudfoundry IntroductionCloudfoundry Introduction
Cloudfoundry IntroductionYitao Jiang
 
Azure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish KalamatiAzure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish KalamatiGirish Kalamati
 
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlue
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlueCloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlue
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlueShapeBlue
 
Zure Azure PaaS Zero to Hero - DevOps training day
Zure Azure PaaS Zero to Hero - DevOps training dayZure Azure PaaS Zero to Hero - DevOps training day
Zure Azure PaaS Zero to Hero - DevOps training dayOkko Oulasvirta
 
Edge to AI: Analytics from Edge to Cloud with Efficient Movement of Machine ...
Edge to AI:  Analytics from Edge to Cloud with Efficient Movement of Machine ...Edge to AI:  Analytics from Edge to Cloud with Efficient Movement of Machine ...
Edge to AI: Analytics from Edge to Cloud with Efficient Movement of Machine ...Timothy Spann
 
New ThousandEyes Product Features and Release Highlights: October 2023
New ThousandEyes Product Features and Release Highlights: October 2023New ThousandEyes Product Features and Release Highlights: October 2023
New ThousandEyes Product Features and Release Highlights: October 2023ThousandEyes
 
PDF_Slide__Memodernisasi_aplikasi_Microsoft_Anda_dengan_cepat_di_AWS.pdf
PDF_Slide__Memodernisasi_aplikasi_Microsoft_Anda_dengan_cepat_di_AWS.pdfPDF_Slide__Memodernisasi_aplikasi_Microsoft_Anda_dengan_cepat_di_AWS.pdf
PDF_Slide__Memodernisasi_aplikasi_Microsoft_Anda_dengan_cepat_di_AWS.pdfRopiudin5
 
BRKSEC-3771 - WSA with wccp.pdf
BRKSEC-3771 - WSA with wccp.pdfBRKSEC-3771 - WSA with wccp.pdf
BRKSEC-3771 - WSA with wccp.pdfMenakaDevi14
 
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC MeetupTimothy Spann
 

Similaire à Meet the Committers Webinar_ Lab Preparation (20)

ITPC Building Modern Data Streaming Apps
ITPC Building Modern Data Streaming AppsITPC Building Modern Data Streaming Apps
ITPC Building Modern Data Streaming Apps
 
Azure Stack Overview (Dec/2018)
Azure Stack Overview (Dec/2018)Azure Stack Overview (Dec/2018)
Azure Stack Overview (Dec/2018)
 
Meetup Streaming Data Pipeline Development
Meetup Streaming Data Pipeline DevelopmentMeetup Streaming Data Pipeline Development
Meetup Streaming Data Pipeline Development
 
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
 
Unconference Round Table Notes
Unconference Round Table NotesUnconference Round Table Notes
Unconference Round Table Notes
 
Cloudera-Connector-for-Teradata.pdf
Cloudera-Connector-for-Teradata.pdfCloudera-Connector-for-Teradata.pdf
Cloudera-Connector-for-Teradata.pdf
 
Working with CloudStack Usage Data - CCCEU13
Working with CloudStack Usage Data - CCCEU13Working with CloudStack Usage Data - CCCEU13
Working with CloudStack Usage Data - CCCEU13
 
Working with CloudStack Usage Data
Working with CloudStack Usage DataWorking with CloudStack Usage Data
Working with CloudStack Usage Data
 
002 Introducing Neo4j 5 for Administrators - NODES2022 AMERICAS Beginner 2 - ...
002 Introducing Neo4j 5 for Administrators - NODES2022 AMERICAS Beginner 2 - ...002 Introducing Neo4j 5 for Administrators - NODES2022 AMERICAS Beginner 2 - ...
002 Introducing Neo4j 5 for Administrators - NODES2022 AMERICAS Beginner 2 - ...
 
Edge to ai analytics from edge to cloud with efficient movement of machine data
Edge to ai  analytics from edge to cloud with efficient movement of machine dataEdge to ai  analytics from edge to cloud with efficient movement of machine data
Edge to ai analytics from edge to cloud with efficient movement of machine data
 
Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24
Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24
Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24
 
Cloudfoundry Introduction
Cloudfoundry IntroductionCloudfoundry Introduction
Cloudfoundry Introduction
 
Azure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish KalamatiAzure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish Kalamati
 
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlue
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlueCloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlue
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlue
 
Zure Azure PaaS Zero to Hero - DevOps training day
Zure Azure PaaS Zero to Hero - DevOps training dayZure Azure PaaS Zero to Hero - DevOps training day
Zure Azure PaaS Zero to Hero - DevOps training day
 
Edge to AI: Analytics from Edge to Cloud with Efficient Movement of Machine ...
Edge to AI:  Analytics from Edge to Cloud with Efficient Movement of Machine ...Edge to AI:  Analytics from Edge to Cloud with Efficient Movement of Machine ...
Edge to AI: Analytics from Edge to Cloud with Efficient Movement of Machine ...
 
New ThousandEyes Product Features and Release Highlights: October 2023
New ThousandEyes Product Features and Release Highlights: October 2023New ThousandEyes Product Features and Release Highlights: October 2023
New ThousandEyes Product Features and Release Highlights: October 2023
 
PDF_Slide__Memodernisasi_aplikasi_Microsoft_Anda_dengan_cepat_di_AWS.pdf
PDF_Slide__Memodernisasi_aplikasi_Microsoft_Anda_dengan_cepat_di_AWS.pdfPDF_Slide__Memodernisasi_aplikasi_Microsoft_Anda_dengan_cepat_di_AWS.pdf
PDF_Slide__Memodernisasi_aplikasi_Microsoft_Anda_dengan_cepat_di_AWS.pdf
 
BRKSEC-3771 - WSA with wccp.pdf
BRKSEC-3771 - WSA with wccp.pdfBRKSEC-3771 - WSA with wccp.pdf
BRKSEC-3771 - WSA with wccp.pdf
 
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
 

Plus de Timothy Spann

Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024Timothy Spann
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
2024 XTREMEJ_ Building Real-time Pipelines with FLaNK_ A Case Study with Tra...
2024 XTREMEJ_  Building Real-time Pipelines with FLaNK_ A Case Study with Tra...2024 XTREMEJ_  Building Real-time Pipelines with FLaNK_ A Case Study with Tra...
2024 XTREMEJ_ Building Real-time Pipelines with FLaNK_ A Case Study with Tra...Timothy Spann
 
28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-Pipelines28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-PipelinesTimothy Spann
 
TCFPro24 Building Real-Time Generative AI Pipelines
TCFPro24 Building Real-Time Generative AI PipelinesTCFPro24 Building Real-Time Generative AI Pipelines
TCFPro24 Building Real-Time Generative AI PipelinesTimothy Spann
 
2024 Build Generative AI for Non-Profits
2024 Build Generative AI for Non-Profits2024 Build Generative AI for Non-Profits
2024 Build Generative AI for Non-ProfitsTimothy Spann
 
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...Timothy Spann
 
Conf42-Python-Building Apache NiFi 2.0 Python Processors
Conf42-Python-Building Apache NiFi 2.0 Python ProcessorsConf42-Python-Building Apache NiFi 2.0 Python Processors
Conf42-Python-Building Apache NiFi 2.0 Python ProcessorsTimothy Spann
 
Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...
Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...
Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...Timothy Spann
 
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI PipelinesTimothy Spann
 
DBA Fundamentals Group: Continuous SQL with Kafka and Flink
DBA Fundamentals Group: Continuous SQL with Kafka and FlinkDBA Fundamentals Group: Continuous SQL with Kafka and Flink
DBA Fundamentals Group: Continuous SQL with Kafka and FlinkTimothy Spann
 
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...Timothy Spann
 
OSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
OSACon 2023_ Unlocking Financial Data with Real-Time PipelinesOSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
OSACon 2023_ Unlocking Financial Data with Real-Time PipelinesTimothy Spann
 
Building Real-Time Travel Alerts
Building Real-Time Travel AlertsBuilding Real-Time Travel Alerts
Building Real-Time Travel AlertsTimothy Spann
 
JConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and FlinkJConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and FlinkTimothy Spann
 
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
[EN]DSS23_tspann_Integrating LLM with Streaming Data PipelinesTimothy Spann
 
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines DemoEvolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines DemoTimothy Spann
 
AIDevWorldApacheNiFi101
AIDevWorldApacheNiFi101AIDevWorldApacheNiFi101
AIDevWorldApacheNiFi101Timothy Spann
 

Plus de Timothy Spann (20)

Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
2024 XTREMEJ_ Building Real-time Pipelines with FLaNK_ A Case Study with Tra...
2024 XTREMEJ_  Building Real-time Pipelines with FLaNK_ A Case Study with Tra...2024 XTREMEJ_  Building Real-time Pipelines with FLaNK_ A Case Study with Tra...
2024 XTREMEJ_ Building Real-time Pipelines with FLaNK_ A Case Study with Tra...
 
28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-Pipelines28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-Pipelines
 
TCFPro24 Building Real-Time Generative AI Pipelines
TCFPro24 Building Real-Time Generative AI PipelinesTCFPro24 Building Real-Time Generative AI Pipelines
TCFPro24 Building Real-Time Generative AI Pipelines
 
2024 Build Generative AI for Non-Profits
2024 Build Generative AI for Non-Profits2024 Build Generative AI for Non-Profits
2024 Build Generative AI for Non-Profits
 
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
 
Conf42-Python-Building Apache NiFi 2.0 Python Processors
Conf42-Python-Building Apache NiFi 2.0 Python ProcessorsConf42-Python-Building Apache NiFi 2.0 Python Processors
Conf42-Python-Building Apache NiFi 2.0 Python Processors
 
Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...
Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...
Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...
 
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
 
DBA Fundamentals Group: Continuous SQL with Kafka and Flink
DBA Fundamentals Group: Continuous SQL with Kafka and FlinkDBA Fundamentals Group: Continuous SQL with Kafka and Flink
DBA Fundamentals Group: Continuous SQL with Kafka and Flink
 
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...
 
OSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
OSACon 2023_ Unlocking Financial Data with Real-Time PipelinesOSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
OSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
 
Building Real-Time Travel Alerts
Building Real-Time Travel AlertsBuilding Real-Time Travel Alerts
Building Real-Time Travel Alerts
 
JConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and FlinkJConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and Flink
 
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
 
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines DemoEvolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
 
AIDevWorldApacheNiFi101
AIDevWorldApacheNiFi101AIDevWorldApacheNiFi101
AIDevWorldApacheNiFi101
 

Dernier

Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 

Dernier (20)

Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 

Meet the Committers Webinar_ Lab Preparation

  • 1. Meet the Committers Lab Preparation 3-May-2023
  • 2. © 2023 Cloudera, Inc. All rights reserved. IMPORTANT NOTES Guidance Documentation Daily Zoom Sessions Examples Ready Flows Slack Channel Flow Proctors Shared Environment Since it is shared environment, each user has access to every other users’ flow. No production data should be used. We will stop your design sessions after 4 hours of inactivity.
  • 3. © 2023 Cloudera, Inc. All rights reserved. SANDBOX FROM MAY 3, 2023 to MAY 9 MIDNIGHT, 2023 Sandbox will be destroyed at midnight EST May 9, 2023 before May 10, 2023. You must complete your item, Save and Download Your Flows Before Then. All data and code will be destroyed on the end of the trial Submit your flow (CRN), video and text via this form.
  • 4. © 2023 Cloudera, Inc. All rights reserved. NAVIGATE IN CHROME TO THE SHARED SANDBOX https://login.cdpworkshops.cloudera.com/auth/realms/se-workshop-5 /protocol/saml/clients/cdp-sso
  • 5. © 2023 Cloudera, Inc. All rights reserved. REGISTRATION Click Register Must use Recent Chrome Browser
  • 6.
  • 7.
  • 8.
  • 9.
  • 10. © 2023 Cloudera, Inc. All rights reserved. GETTING STARTED - GUIDED USE CASES ● Syslog to Kafka topic ● Reading and Filtering a Syslog Stream ● Writing Critical Syslog Events to Apache Iceberg ● Must use Recent Chrome Browser
  • 11. © 2023 Cloudera, Inc. All rights reserved. BEST IN FLOW COMPETITION - BUILD & DOCUMENT A FLOW A chance to win a $2,000 Amazon gift card. A great way to get recognition. Cloudera public award social media post.
  • 12. © 2023 Cloudera, Inc. All rights reserved. BEST IN FLOW COMPETITION - BUILD & DOCUMENT A FLOW A chance to win a $2,000 Amazon gift card. A great way to get recognition. Cloudera public award social media post.
  • 13. © 2023 Cloudera, Inc. All rights reserved. WHAT TO BUILD? You can extend or try one of our tutorials You can extend or use one of our Ready Flows You can connect to external resources (passwords are visible, only use public data or examples)
  • 14. © 2023 Cloudera, Inc. All rights reserved. FLOW REQUIREMENTS The following are the requirements for the flow to be considered eligible for the competition: 1. The flow must be developed using the new DataFlow Designer in the DataFlow Service sandbox. 2. The flow must have at least one “source” data. 3. The flow must have at least one “destination” where the data is delivered 4. The flow must be functional, tested, and working using the Test session feature of the DataFlow Service. The Data viewer should be used to inspect the data payload within the different flow steps. 5. The flow must be checked into the DataFlow Catalog, deployed using the deployment wizard, and validated that it is correctly running. 6. Each submitted Flow must include the following additional details: ○ The CRN of the flow was checked into the flow catalog with a detailed description of the flow and use case. ○ Link to a short blog describing the use case and the flow that was built and deployed using DataFlow Designer ○ Link to a short video showing the flow running in the Flow Designer with the test session and data traversing through flow. The Data viewer should be used to inspect the data payload within the different flow steps. ○ Product feedback on the DataFlow Service.
  • 15. © 2023 Cloudera, Inc. All rights reserved. Criteria Description Complete Flow Artifacts The submitted flow entry contains all the required artifacts, including Flow CRN in the Catalog, a link to the blog describing the use case and the flow, and a short video link showing the flow running with data traversing through the flow. Adheres to NiFi flow best practices Follows NiFi flow design best practices like record-oriented processors, controller services, and parameters. Showcases NiFi processing capabilities Showcases NiFi processing capabilities including protocol bridging, schema transformation, routing, filtering, enrichment, compression, etc. Universal Data Distribution The flow showcases multiple data sources and delivers data to multiple destinations. Uses the latest NiFi processors and controllers services Showcases the latest NiFi processors in the latest Apache NiFi release: 1.20, 1.19, 1.18, 1.17, including PutSnowflakeInternalStage, PutIceberg, UpdateDeltaLakeTable, Amazon ML Processors: Amazon Web Services Polly, Textract, Translate, and Transcribe services, etc. ReadyFlow The flow addresses a common data pipeline use case and can be reused by other users hence a good candidate to be added to the ReadyFlow gallery. Deployable The flow should be able to be deployed with minimum effort with the appropriate documentation (e.g.: description of parameters in the parameter context, the blog details, etc..)
  • 16. © 2023 Cloudera, Inc. All rights reserved. SANDBOX FLOW DEVELOPMENT BEST PRACTICES Uniquely Name your processors/ connections with yourid_ Parameterize connection information Don’t use sensitive data in sandbox Don’t use or change other people’s assets, only your own
  • 17. © 2023 Cloudera, Inc. All rights reserved. Don’t use or change other people’s assets, only your own Reuse components via Copy and Process Groups We are here to help reach out via Slack or Zoom. SANDBOX FLOW DEVELOPMENT BEST PRACTICES
  • 18. © 2023 Cloudera, Inc. All rights reserved. DAILY ZOOM https://cloudera.zoom.us/j/964 60893376?pwd=eWZEVDhpZm pFSDNRejFzMXkvcHpOdz09
  • 19. © 2023 Cloudera, Inc. All rights reserved. SLACK CHANNEL https://bestinflow.slack.com/join /shared_invite/zt-1uj1ti8hc-8mnh mbr_AbOCD7f~A68P0w#/shared -invite/email
  • 20. © 2023 Cloudera, Inc. All rights reserved. SOURCE CODE AND EXAMPLES https://github.com/tspannhw/FLaN K-DataFlows
  • 21. © 2023 Cloudera, Inc. All rights reserved. Submit Your Flow https://docs.google.com/forms/d/1Ku2KSDFoxJy45jiOWuLRDi9Trpgm-42aaxeAVwy-fpo
  • 22. © 2023 Cloudera, Inc. All rights reserved. ADDITIONAL RESOURCES Cloudera DataFlow Designer: The Key to Agile Data Pipeline Development Streaming Data Ingestion into an Open Data Lakehouse Made Easy with DataFlow Example Cloudera DataFlow Designer: Kafka to Iceberg in Cloudera Data Warehouse Serverless NiFi Flows with DataFlow Functions DataFlow Functions Technical Demo DataFlow Documentation
  • 23. 23 © 2023 Cloudera, Inc. All rights reserved. Marketing Carolyn Duby Field CTO Field Meet the Data-In-Motion Team Tim Spann Developer Advocate Richard Walden DIM SME Lead Engineering Product Chris Joynt Product Marketing Joe Witt Engineering Leader George Vetticaden Product Leader Michael Kohs Product Owner for DataFlow Pierre Villard Product Owner for DataFlow Andre Araujo Product Owner for Stream Processing John Kuchmek DIM SME Expert
  • 24. © 2023 Cloudera, Inc. All rights reserved. WARNING “Notwithstanding any contrary terms in the Agreement, Customer acknowledges that information shared using the Trial Product is in a shared environment with similarly situated customers. All information in the shared environment is accessible by all other customers participating in the trial and such information will not be deemed Confidential Information.” https://www.cloudera.com/legal/commercial-terms-and-conditio ns/cdp-public-cloud-trial-agreement.html
  • 25. © 2023 Cloudera, Inc. All rights reserved. CONTAINER BASED DATAFLOW Flow Deployment Flow Monitoring Allows easy flow deployment based on NiFi 1.20 across CDP environments (Dev, QA, Prod) Define and assign KPIs to your flows Easy NiFi version upgrades Update/Add KPIs, Update Parameters, Change sizing configuration Automatic infrastructure scaling based on CPU utilization Central monitoring console for all your flows across environments Monitor flow metrics and infrastructure usage Define alerts for flows breaching assigned KPIs Flow Catalog Keep track of your flow definitions and versions in a central catalog Reuse your existing NiFi flows by uploading them to the catalog Discover, search and reuse existing flows easily
  • 26. 26 © 2023 Cloudera, Inc. All rights reserved. FLOW CATALOG • Central repository for flow definitions • Import existing NiFi flows • Manage flow definitions • Initiate flow deployments
  • 27. 27 © 2023 Cloudera, Inc. All rights reserved. TURNS FLOW DEFINITIONS INTO FLOW DEPLOYMENTS 2.) NiFi Config 4.) Configure Sizing & Scaling 5.) Define KPIs 1.) Start Deployment Wizard 3.) Provide Parameters for NiFi
  • 28. 28 © 2023 Cloudera, Inc. All rights reserved. KEY PERFORMANCE INDICATORS • Visibility into flow deployments • Track high level flow performance • Track in-depth NiFi component metrics • Defined in Deployment Wizard • Monitoring & Alerts in Deployment Details KPI Definition in Deployment Wizard KPI Monitoring
  • 29. 29 © 2023 Cloudera, Inc. All rights reserved. DASHBOARD • Central Monitoring View • Monitors flow deployments across CDP environments • Monitors flow deployment health & performance • Drill into flow deployment to monitor system metrics and deployment events
  • 30. 30 © 2023 Cloudera, Inc. All rights reserved. DEPLOYMENT MANAGER • Manage flow deployment lifecycle (Suspend/Start/Terminate) • Add/Edit KPIs • Change sizing configuration • Update parameters • Change NiFi version of the deployment • Gateway to NiFi canvas
  • 31.
  • 32. 32 © 2023 Cloudera, Inc. All rights reserved. TH N Y U