SlideShare a Scribd company logo
1 of 22
Download to read offline
(Leeg, niet verwijderen s.v.p.)
What Does an Event mean?
Manage the Meaning of Your Data
KAFKA SUMMIT AMERICAS
15 September 2021
(Leeg, niet verwijderen s.v.p.)
Introduction Video:
Look around - Van Oord has changed the world around you
(
L
e
e
g
,
n
i
e
t
v
e
r
w
i
j
d
e
r
e
n
s
.
v
.
p
.
)
2
(Leeg, niet verwijderen s.v.p.)
Speaker Introduction
(Leeg, niet verwijderen s.v.p.)
Marlon Hiralal
Enterprise Architect
20+ years of experience
Role in the Project: Overall Architect
• Extensive experience with (real-time) data management
• Led several Industrial Digital Transformation initiatives, such as
Smart Factory/End2End Digital Supply Chain, IT-OT
Architecture and Industrial Internet of Things/Connected
Machines Things
• Worked on several streaming platforms as product manager
and system/enterprise architect
(Leeg, niet verwijderen s.v.p.)
Andreas Wombacher
CTO and Co-Founder at Aurelius Enterprise B.V.
20+ years of experience
Role in the Project: Data Architect
• Expertise in workflow and data management, ranging
from data integration, sensor data fusion, data mining,
event-based systems, and data analysis
• Worked with data on different scales and abstraction
levels from time series sensor data to information system
or human event data
(Leeg, niet verwijderen s.v.p.)
Pre-Project State at Van Oord
(Leeg, niet verwijderen s.v.p.)
We have over 250 applications of which:
8 core business IT applications
+ Critical OT (Operational Technology) applications
• No Data Integration
• No Application Integration
• No Process Integration
• No OT/IT Integration
IT Applications
- HR
- Finance
- Fleet Logistics
- Procurement
- …
OT Applications
- Vessel Data Logger
- Bathymetric Survey
- GIS
- AIS
- …
(Leeg, niet verwijderen s.v.p.)
Insights on Operation, Production and Cost
are needed daily!
The challenges are:
• Ownership of data
• Quality
• Availability
• Combining batch and
streaming data
• Different data formats
(Leeg, niet verwijderen s.v.p.)
The Data Management
Platform
(Leeg, niet verwijderen s.v.p.)
Process
Integration
Application
Integration
Data
Integration
Data Governance
Data
In
Motion
Data Management Platform – Goals
(Leeg, niet verwijderen s.v.p.)
Functional Solution
Stream
Processing
Data Storage &
Search
Analysis &
Visualization
Data
Governance Tool
Enterprise
Architecture
Platform
Data Producer
Data Producer
Data
Producer
Data Producer
Data Producer
Data
Consumer
(Leeg, niet verwijderen s.v.p.)
Technology Selection
(Leeg, niet verwijderen s.v.p.)
Digital Enterprise Architecture
(Models4Insight)
§ Architecture model is like a spider web
• All concepts are related with each other
• You are interested in one concept and all its
dependencies
§ Architecture model is built partly by:
• Team of architects in a collaboration
• Scripts in an automated way to reduce
maintenance effort and increase speed
(Leeg, niet verwijderen s.v.p.)
Model extraction reduces the maintenance
effort
Conceptual data
definitions and
ownership
Deployment of actual
infrastructure
Data governance
IaC
Micro service
Data architecture
Sync
(Leeg, niet verwijderen s.v.p.)
Change is constant
Consumer
Producer
Proof of Concept Enterprise Solution
- Complexity
- Number of changes
• Which data is available on the platform?
• Where did this data come from?
• What shape does this data have?
• What does this data mean?
(Leeg, niet verwijderen s.v.p.)
Digital Data Governance (a.k.a. DG 4.0) for
data in motion
Characteristics:
• Data as strategic asset
• Data in Rest vs Data in Motion
• Empower data users
• Trustable data & analytics
• Lineage
• Proactive
DG
1.0
No Meta data
Ownership
IT Driven:
Meta data Mgmt for IT
DG
3.0
DG
4.0
Process Driven:
Meta data dictionaries
for data stewards
Value Driven:
Contextualized meta data
for the diverse data users
Nineties Early 2000s
DG
2.0
2020 Today
Characteristics:
• Process focused
• Manual
• Data in Rest
• Descriptive
Characteristics:
• IT focused
• Manual
• Data in Rest
• Corrective
Characteristics:
• Manual
• Ad hoc
(Leeg, niet verwijderen s.v.p.)
Demos
(Leeg, niet verwijderen s.v.p.)
§ A conceptual attributes is marked as PII and therefore all technical implementations of this
attribute are also PII relevant information
§ A classification of the attribute enables the propagation of the attribute to all related fields
Scenario 1 GDPR & PII – Data steward
18
Manually adding
conceptual information
(Excel file)
Publish governance and
classify PII attributes
(micro service)
PII classification is
propagated to technical
concepts (Apache Atlas)
(Leeg, niet verwijderen s.v.p.)
§ Data quality rules are specified for a technical field
§ Data quality is assessed automatically on a regular basis
§ A dataset is marked to have bad data quality
§ This classification is propagated along the data linage/data flow
§ Data quality results are summarized in a data quality dashboard
Scenario 2 Data Quality – Data steward
19
Data quality rules are
specified per technical
field (Apache Atlas)
Data quality is assessed
and documented
(micro service)
Bad quality classification
propagates along data
linage (Apache Atlas)
Bad quality results are
visualized in a dashboard
(Leeg, niet verwijderen s.v.p.)
§ Find information about values saying something about a "full-time equivalent"
§ It is found in the description of the FTE attribute, where the actual data can be found in
the FTE field
Scenario 3 Search – Data scientist
20
Search for a key phrase
(Apache Atlas)
Find the related
conceptual attribute
(Apache Atlas)
Find the related technical
fields
(Apache Atlas)
(Leeg, niet verwijderen s.v.p.)
Wrap up
(Leeg, niet verwijderen s.v.p.)
Wrap up
• DMP captures meaning of data in motion
• Data Governance provides conceptual meaning
• Digital Enterprise Architecture provides contextual meaning
• HR: Van Oord is seen as an interesting employer for technical
talents within the Netherlands

More Related Content

What's hot

Introducing Events and Stream Processing into Nationwide Building Society
Introducing Events and Stream Processing into Nationwide Building SocietyIntroducing Events and Stream Processing into Nationwide Building Society
Introducing Events and Stream Processing into Nationwide Building Society
confluent
 
Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...
Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...
Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...
confluent
 
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
Kai Wähner
 
Digital Transformation in Healthcare with Kafka—Building a Low Latency Data P...
Digital Transformation in Healthcare with Kafka—Building a Low Latency Data P...Digital Transformation in Healthcare with Kafka—Building a Low Latency Data P...
Digital Transformation in Healthcare with Kafka—Building a Low Latency Data P...
confluent
 
USA Information Security Compliance Market Overview
USA Information Security Compliance Market OverviewUSA Information Security Compliance Market Overview
USA Information Security Compliance Market Overview
Niraj Singhvi
 

What's hot (20)

Introducing Events and Stream Processing into Nationwide Building Society
Introducing Events and Stream Processing into Nationwide Building SocietyIntroducing Events and Stream Processing into Nationwide Building Society
Introducing Events and Stream Processing into Nationwide Building Society
 
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
 
Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...
Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...
Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...
 
Why Kafka Works the Way It Does (And Not Some Other Way) | Tim Berglund, Conf...
Why Kafka Works the Way It Does (And Not Some Other Way) | Tim Berglund, Conf...Why Kafka Works the Way It Does (And Not Some Other Way) | Tim Berglund, Conf...
Why Kafka Works the Way It Does (And Not Some Other Way) | Tim Berglund, Conf...
 
From Legacy SQL Server to High Powered Confluent & Kafka Monitoring System at...
From Legacy SQL Server to High Powered Confluent & Kafka Monitoring System at...From Legacy SQL Server to High Powered Confluent & Kafka Monitoring System at...
From Legacy SQL Server to High Powered Confluent & Kafka Monitoring System at...
 
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
 
IoT meets AI in the Clouds
IoT meets AI in the CloudsIoT meets AI in the Clouds
IoT meets AI in the Clouds
 
Scalable Data Management for Kafka and Beyond | Dan Rice, BigID
Scalable Data Management for Kafka and Beyond | Dan Rice, BigIDScalable Data Management for Kafka and Beyond | Dan Rice, BigID
Scalable Data Management for Kafka and Beyond | Dan Rice, BigID
 
Stream me to the Cloud (and back) with Confluent & MongoDB
Stream me to the Cloud (and back) with Confluent & MongoDBStream me to the Cloud (and back) with Confluent & MongoDB
Stream me to the Cloud (and back) with Confluent & MongoDB
 
Digital transformation: Highly resilient streaming architecture and strategie...
Digital transformation: Highly resilient streaming architecture and strategie...Digital transformation: Highly resilient streaming architecture and strategie...
Digital transformation: Highly resilient streaming architecture and strategie...
 
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...
 
Digital Transformation in Healthcare with Kafka—Building a Low Latency Data P...
Digital Transformation in Healthcare with Kafka—Building a Low Latency Data P...Digital Transformation in Healthcare with Kafka—Building a Low Latency Data P...
Digital Transformation in Healthcare with Kafka—Building a Low Latency Data P...
 
Death of the dumb pipes: Using Apache Kafka® for Integration projects
Death of the dumb pipes: Using Apache Kafka® for Integration projectsDeath of the dumb pipes: Using Apache Kafka® for Integration projects
Death of the dumb pipes: Using Apache Kafka® for Integration projects
 
Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...
Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...
Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...
 
Kafka Summit NYC 2017 - The Rise of the Streaming Platform
Kafka Summit NYC 2017 - The Rise of the Streaming PlatformKafka Summit NYC 2017 - The Rise of the Streaming Platform
Kafka Summit NYC 2017 - The Rise of the Streaming Platform
 
Money Heist - A Stream Processing Original! | Meha Pandey and Shengze Yu, Net...
Money Heist - A Stream Processing Original! | Meha Pandey and Shengze Yu, Net...Money Heist - A Stream Processing Original! | Meha Pandey and Shengze Yu, Net...
Money Heist - A Stream Processing Original! | Meha Pandey and Shengze Yu, Net...
 
Kafka Vienna Meetup 020719
Kafka Vienna Meetup 020719Kafka Vienna Meetup 020719
Kafka Vienna Meetup 020719
 
USA Information Security Compliance Market Overview
USA Information Security Compliance Market OverviewUSA Information Security Compliance Market Overview
USA Information Security Compliance Market Overview
 
Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...
 
Transform Your Mainframe and IBM i Data for the Cloud with Precisely and Apac...
Transform Your Mainframe and IBM i Data for the Cloud with Precisely and Apac...Transform Your Mainframe and IBM i Data for the Cloud with Precisely and Apac...
Transform Your Mainframe and IBM i Data for the Cloud with Precisely and Apac...
 

Similar to What does an event mean? Manage the meaning of your data! | Andreas Wombacher, Aurelius Enterprise B.V and Van Oord, Marlon Hiralal

Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningData Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Kai Wähner
 
Architecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystemArchitecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystem
Yael Garten
 
Mapping Manager Product Overview
Mapping Manager Product OverviewMapping Manager Product Overview
Mapping Manager Product Overview
Rakesh Kumar
 

Similar to What does an event mean? Manage the meaning of your data! | Andreas Wombacher, Aurelius Enterprise B.V and Van Oord, Marlon Hiralal (20)

Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An Introduction
 
Data virtualization an introduction
Data virtualization an introductionData virtualization an introduction
Data virtualization an introduction
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
 
Dell Digital Transformation Through AI and Data Analytics Webinar
Dell Digital Transformation Through AI and  Data Analytics WebinarDell Digital Transformation Through AI and  Data Analytics Webinar
Dell Digital Transformation Through AI and Data Analytics Webinar
 
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningData Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
 
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
 
Architecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystemArchitecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystem
 
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystemStrata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
 
DATA @ NFLX (Tableau Conference 2014 Presentation)
DATA @ NFLX (Tableau Conference 2014 Presentation)DATA @ NFLX (Tableau Conference 2014 Presentation)
DATA @ NFLX (Tableau Conference 2014 Presentation)
 
SPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDSSPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDS
 
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
 
Build Answer-generating Apps that Users Love: Development best practices for ...
Build Answer-generating Apps that Users Love: Development best practices for ...Build Answer-generating Apps that Users Love: Development best practices for ...
Build Answer-generating Apps that Users Love: Development best practices for ...
 
Continuous Intelligence - Intersecting Event-Based Business Logic and ML
Continuous Intelligence - Intersecting Event-Based Business Logic and MLContinuous Intelligence - Intersecting Event-Based Business Logic and ML
Continuous Intelligence - Intersecting Event-Based Business Logic and ML
 
AppliFire Blue Print Design Guidelines
AppliFire Blue Print Design GuidelinesAppliFire Blue Print Design Guidelines
AppliFire Blue Print Design Guidelines
 
Mapping Manager Product Overview
Mapping Manager Product OverviewMapping Manager Product Overview
Mapping Manager Product Overview
 
Ms net work-sharepoint 2013-applied architecture from the field v4
Ms net work-sharepoint 2013-applied architecture from the field v4Ms net work-sharepoint 2013-applied architecture from the field v4
Ms net work-sharepoint 2013-applied architecture from the field v4
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)
 
ALT-F1 Techtalk 3 - Google AppEngine
ALT-F1 Techtalk 3 - Google AppEngineALT-F1 Techtalk 3 - Google AppEngine
ALT-F1 Techtalk 3 - Google AppEngine
 
Agile Testing Days 2017 Introducing AgileBI Sustainably
Agile Testing Days 2017 Introducing AgileBI SustainablyAgile Testing Days 2017 Introducing AgileBI Sustainably
Agile Testing Days 2017 Introducing AgileBI Sustainably
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 

More from HostedbyConfluent

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
HostedbyConfluent
 
Evolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at TrendyolEvolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at Trendyol
HostedbyConfluent
 

More from HostedbyConfluent (20)

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Renaming a Kafka Topic | Kafka Summit London
Renaming a Kafka Topic | Kafka Summit LondonRenaming a Kafka Topic | Kafka Summit London
Renaming a Kafka Topic | Kafka Summit London
 
Evolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at TrendyolEvolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at Trendyol
 
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
Ensuring Kafka Service Resilience: A Dive into Health-Checking TechniquesEnsuring Kafka Service Resilience: A Dive into Health-Checking Techniques
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
 
Exactly-once Stream Processing with Arroyo and Kafka
Exactly-once Stream Processing with Arroyo and KafkaExactly-once Stream Processing with Arroyo and Kafka
Exactly-once Stream Processing with Arroyo and Kafka
 
Fish Plays Pokemon | Kafka Summit London
Fish Plays Pokemon | Kafka Summit LondonFish Plays Pokemon | Kafka Summit London
Fish Plays Pokemon | Kafka Summit London
 
Tiered Storage 101 | Kafla Summit London
Tiered Storage 101 | Kafla Summit LondonTiered Storage 101 | Kafla Summit London
Tiered Storage 101 | Kafla Summit London
 
Building a Self-Service Stream Processing Portal: How And Why
Building a Self-Service Stream Processing Portal: How And WhyBuilding a Self-Service Stream Processing Portal: How And Why
Building a Self-Service Stream Processing Portal: How And Why
 
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
 
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
 
Navigating Private Network Connectivity Options for Kafka Clusters
Navigating Private Network Connectivity Options for Kafka ClustersNavigating Private Network Connectivity Options for Kafka Clusters
Navigating Private Network Connectivity Options for Kafka Clusters
 
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
Apache Flink: Building a Company-wide Self-service Streaming Data PlatformApache Flink: Building a Company-wide Self-service Streaming Data Platform
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
 
Explaining How Real-Time GenAI Works in a Noisy Pub
Explaining How Real-Time GenAI Works in a Noisy PubExplaining How Real-Time GenAI Works in a Noisy Pub
Explaining How Real-Time GenAI Works in a Noisy Pub
 
TL;DR Kafka Metrics | Kafka Summit London
TL;DR Kafka Metrics | Kafka Summit LondonTL;DR Kafka Metrics | Kafka Summit London
TL;DR Kafka Metrics | Kafka Summit London
 
A Window Into Your Kafka Streams Tasks | KSL
A Window Into Your Kafka Streams Tasks | KSLA Window Into Your Kafka Streams Tasks | KSL
A Window Into Your Kafka Streams Tasks | KSL
 
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
Mastering Kafka Producer Configs: A Guide to Optimizing PerformanceMastering Kafka Producer Configs: A Guide to Optimizing Performance
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
 
Data Contracts Management: Schema Registry and Beyond
Data Contracts Management: Schema Registry and BeyondData Contracts Management: Schema Registry and Beyond
Data Contracts Management: Schema Registry and Beyond
 
Code-First Approach: Crafting Efficient Flink Apps
Code-First Approach: Crafting Efficient Flink AppsCode-First Approach: Crafting Efficient Flink Apps
Code-First Approach: Crafting Efficient Flink Apps
 
Debezium vs. the World: An Overview of the CDC Ecosystem
Debezium vs. the World: An Overview of the CDC EcosystemDebezium vs. the World: An Overview of the CDC Ecosystem
Debezium vs. the World: An Overview of the CDC Ecosystem
 
Beyond Tiered Storage: Serverless Kafka with No Local Disks
Beyond Tiered Storage: Serverless Kafka with No Local DisksBeyond Tiered Storage: Serverless Kafka with No Local Disks
Beyond Tiered Storage: Serverless Kafka with No Local Disks
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Recently uploaded (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

What does an event mean? Manage the meaning of your data! | Andreas Wombacher, Aurelius Enterprise B.V and Van Oord, Marlon Hiralal

  • 1. (Leeg, niet verwijderen s.v.p.) What Does an Event mean? Manage the Meaning of Your Data KAFKA SUMMIT AMERICAS 15 September 2021
  • 2. (Leeg, niet verwijderen s.v.p.) Introduction Video: Look around - Van Oord has changed the world around you ( L e e g , n i e t v e r w i j d e r e n s . v . p . ) 2
  • 3. (Leeg, niet verwijderen s.v.p.) Speaker Introduction
  • 4. (Leeg, niet verwijderen s.v.p.) Marlon Hiralal Enterprise Architect 20+ years of experience Role in the Project: Overall Architect • Extensive experience with (real-time) data management • Led several Industrial Digital Transformation initiatives, such as Smart Factory/End2End Digital Supply Chain, IT-OT Architecture and Industrial Internet of Things/Connected Machines Things • Worked on several streaming platforms as product manager and system/enterprise architect
  • 5. (Leeg, niet verwijderen s.v.p.) Andreas Wombacher CTO and Co-Founder at Aurelius Enterprise B.V. 20+ years of experience Role in the Project: Data Architect • Expertise in workflow and data management, ranging from data integration, sensor data fusion, data mining, event-based systems, and data analysis • Worked with data on different scales and abstraction levels from time series sensor data to information system or human event data
  • 6. (Leeg, niet verwijderen s.v.p.) Pre-Project State at Van Oord
  • 7. (Leeg, niet verwijderen s.v.p.) We have over 250 applications of which: 8 core business IT applications + Critical OT (Operational Technology) applications • No Data Integration • No Application Integration • No Process Integration • No OT/IT Integration IT Applications - HR - Finance - Fleet Logistics - Procurement - … OT Applications - Vessel Data Logger - Bathymetric Survey - GIS - AIS - …
  • 8. (Leeg, niet verwijderen s.v.p.) Insights on Operation, Production and Cost are needed daily! The challenges are: • Ownership of data • Quality • Availability • Combining batch and streaming data • Different data formats
  • 9. (Leeg, niet verwijderen s.v.p.) The Data Management Platform
  • 10. (Leeg, niet verwijderen s.v.p.) Process Integration Application Integration Data Integration Data Governance Data In Motion Data Management Platform – Goals
  • 11. (Leeg, niet verwijderen s.v.p.) Functional Solution Stream Processing Data Storage & Search Analysis & Visualization Data Governance Tool Enterprise Architecture Platform Data Producer Data Producer Data Producer Data Producer Data Producer Data Consumer
  • 12. (Leeg, niet verwijderen s.v.p.) Technology Selection
  • 13. (Leeg, niet verwijderen s.v.p.) Digital Enterprise Architecture (Models4Insight) § Architecture model is like a spider web • All concepts are related with each other • You are interested in one concept and all its dependencies § Architecture model is built partly by: • Team of architects in a collaboration • Scripts in an automated way to reduce maintenance effort and increase speed
  • 14. (Leeg, niet verwijderen s.v.p.) Model extraction reduces the maintenance effort Conceptual data definitions and ownership Deployment of actual infrastructure Data governance IaC Micro service Data architecture Sync
  • 15. (Leeg, niet verwijderen s.v.p.) Change is constant Consumer Producer Proof of Concept Enterprise Solution - Complexity - Number of changes • Which data is available on the platform? • Where did this data come from? • What shape does this data have? • What does this data mean?
  • 16. (Leeg, niet verwijderen s.v.p.) Digital Data Governance (a.k.a. DG 4.0) for data in motion Characteristics: • Data as strategic asset • Data in Rest vs Data in Motion • Empower data users • Trustable data & analytics • Lineage • Proactive DG 1.0 No Meta data Ownership IT Driven: Meta data Mgmt for IT DG 3.0 DG 4.0 Process Driven: Meta data dictionaries for data stewards Value Driven: Contextualized meta data for the diverse data users Nineties Early 2000s DG 2.0 2020 Today Characteristics: • Process focused • Manual • Data in Rest • Descriptive Characteristics: • IT focused • Manual • Data in Rest • Corrective Characteristics: • Manual • Ad hoc
  • 17. (Leeg, niet verwijderen s.v.p.) Demos
  • 18. (Leeg, niet verwijderen s.v.p.) § A conceptual attributes is marked as PII and therefore all technical implementations of this attribute are also PII relevant information § A classification of the attribute enables the propagation of the attribute to all related fields Scenario 1 GDPR & PII – Data steward 18 Manually adding conceptual information (Excel file) Publish governance and classify PII attributes (micro service) PII classification is propagated to technical concepts (Apache Atlas)
  • 19. (Leeg, niet verwijderen s.v.p.) § Data quality rules are specified for a technical field § Data quality is assessed automatically on a regular basis § A dataset is marked to have bad data quality § This classification is propagated along the data linage/data flow § Data quality results are summarized in a data quality dashboard Scenario 2 Data Quality – Data steward 19 Data quality rules are specified per technical field (Apache Atlas) Data quality is assessed and documented (micro service) Bad quality classification propagates along data linage (Apache Atlas) Bad quality results are visualized in a dashboard
  • 20. (Leeg, niet verwijderen s.v.p.) § Find information about values saying something about a "full-time equivalent" § It is found in the description of the FTE attribute, where the actual data can be found in the FTE field Scenario 3 Search – Data scientist 20 Search for a key phrase (Apache Atlas) Find the related conceptual attribute (Apache Atlas) Find the related technical fields (Apache Atlas)
  • 21. (Leeg, niet verwijderen s.v.p.) Wrap up
  • 22. (Leeg, niet verwijderen s.v.p.) Wrap up • DMP captures meaning of data in motion • Data Governance provides conceptual meaning • Digital Enterprise Architecture provides contextual meaning • HR: Van Oord is seen as an interesting employer for technical talents within the Netherlands