Azure Synapse data lakehouse
Customer presentation
Intro
OQuila helps organisations transform into
data-driven organisations.
About OQuila
• Data & Analytics, Internet of Things and Application
Innovation solutions
• Joining forces with established IT company
• Innovation & transformation with trusted technologies
Evolution of data platforms
© OQuila 2021 | 5
Data Lake vs Data Warehouse
Data Lake
Schema on read; also answers the
questions of tomorrow
Scales without limits
Can hold any type of data
Data Warehouse
Schema on write; answers the
questions of today
Mainly for relational data (tables
and rows)
Can be part of an Enterprise data
lake or lakehouse
≠
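The contrast between schema on read and schema on write can be illustrated with a minimal plain-Python sketch. It is not part of the original slides: the sample records, the `WAREHOUSE_SCHEMA` mapping and both function names are assumptions made up for this illustration.

```python
import json

# Hypothetical raw events as they might land in a data lake: stored as-is.
RAW_LINES = [
    '{"id": 1, "amount": "12.50", "channel": "web"}',
    '{"id": 2, "amount": "7", "note": "field unknown when the lake was set up"}',
]

# Schema on write: a fixed set of typed columns is enforced before storing.
WAREHOUSE_SCHEMA = {"id": int, "amount": float}


def write_to_warehouse(line):
    """Shape a record at load time; fields outside the schema are dropped."""
    record = json.loads(line)
    return {column: cast(record[column]) for column, cast in WAREHOUSE_SCHEMA.items()}


def read_from_lake(lines, fields):
    """Apply a schema only when reading: the caller picks the fields it needs."""
    for line in lines:
        record = json.loads(line)
        yield {name: record.get(name) for name in fields}


warehouse_rows = [write_to_warehouse(line) for line in RAW_LINES]

# A later question ("which records carry a note?") can still be answered
# from the lake, because the raw lines were never reduced to fixed columns.
notes = [row for row in read_from_lake(RAW_LINES, ("id", "note")) if row["note"]]
```

The warehouse rows only contain the columns chosen up front, while the lake can still serve a field that nobody planned for.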
Overview
General principles OQuila Architecture
1. Use of standard components
2. 100% Cloud Services: PaaS or SaaS. No installations or Virtual Machines
3. No custom development
4. Use of components within the same ecosystem: e.g. Microsoft Azure Synapse
5. Minimize maintenance by using Services (maintained by Microsoft)
6. Dynamic and scalable
Agile Data Model
• No traditional schema or fixed model
• RAW, STAGED, CURATED:
• No rework when adding additional sources
• RAW and CURATED store data separately
• Preparations/calculations are done in the STAGED environment and are reusable
• Supports changes to business rules with ease
• Schema on read; also answers the questions of tomorrow
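One possible way to picture the RAW, STAGED and CURATED separation is a folder convention in which every source gets its own path per zone, so that adding a source does not touch existing ones. The sketch below is an assumption for illustration only: the lowercase folder names, the `zone/source/dataset/date` layout and the functions `zone_path` and `add_source` do not come from the presentation.

```python
from datetime import date
from pathlib import PurePosixPath

# Zone names follow the slide; the lowercase folder spelling is assumed.
ZONES = ("raw", "staged", "curated")


def zone_path(zone, source, dataset, load_date):
    """Build the folder for one dataset in one zone (hypothetical convention)."""
    if zone not in ZONES:
        raise ValueError(f"unknown zone: {zone}")
    return PurePosixPath(zone) / source / dataset / load_date.isoformat()


def add_source(catalog, source, datasets):
    """Register a new source; entries for existing sources are left untouched."""
    updated = dict(catalog)
    updated[source] = list(datasets)
    return updated


catalog = add_source({}, "crm", ["customers"])
catalog = add_source(catalog, "erp", ["invoices"])  # the "crm" entry is unchanged
path = zone_path("raw", "erp", "invoices", date(2021, 1, 31))
```

Because each source lives under its own folders, a new source only adds paths and leaves earlier loads as they were.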
Architecture overview (diagram)
• Data Sources
• Azure Synapse Analytics:
• Synapse Pipelines
• Data Lake Gen 2: RAW, STAGE, CURATED
• Cleansing and transformations via Spark clusters
• Synapse Data Flow: monitoring quality of data (Validated, Anomaly)
• On-demand SQL pool
• Power BI, Excel, Power Apps, Automation Flows, Azure Machine Learning
Synapse components
• Data pipelines:
• A lot of standard connectors (SQL, Oracle, CSV, API, …)
• Data extraction from online and on-prem systems
• Add new systems easily
• Data Lake:
• RAW, STAGE and CURATED folders (by level of maturity and correctness of the data)
• Parquet files for working efficiently with large amounts of data
• Spark Cluster:
• Performant transformation and cleansing actions via notebooks
• Transfers “edited” data to the next stage (RAW, STAGE, CURATED)
• Synapse Data flows:
• Define business rules via a graphical designer (missing values, inconsistencies, …)
• Puts anomalies in a separate STAGE environment
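The idea of routing records that break a business rule to a separate anomaly set can be sketched in plain Python. This is only an illustration of the principle, not how Synapse Data flows are built (those are defined in a graphical designer). The two rules, the field names `customer_id` and `amount`, and the function `split_by_quality` are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Record = Dict[str, object]
Rule = Callable[[Record], bool]  # a rule returns True when the record passes

# Hypothetical rules for the two examples named on the slide:
# a missing value and an inconsistent value.
RULES: Dict[str, Rule] = {
    "missing_customer_id": lambda r: r.get("customer_id") not in (None, ""),
    "negative_amount": lambda r: isinstance(r.get("amount"), (int, float))
    and r["amount"] >= 0,
}


def split_by_quality(records: List[Record]) -> Tuple[List[Record], List[Record]]:
    """Return (validated, anomalies); anomalies list the rules they failed."""
    validated, anomalies = [], []
    for record in records:
        failed = [name for name, rule in RULES.items() if not rule(record)]
        if failed:
            anomalies.append({**record, "failed_rules": failed})
        else:
            validated.append(record)
    return validated, anomalies


rows = [
    {"customer_id": "C1", "amount": 10.0},
    {"customer_id": "", "amount": 5.0},
    {"customer_id": "C3", "amount": -2.0},
]
validated, anomalies = split_by_quality(rows)
```

Keeping the failed rule names on each anomaly record makes it possible to report why a record was set aside.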
Synapse components
• On demand SQL Pool:
• Built into Azure Synapse
• Links directly to Parquet files in CURATED zone (without having to copy data to tables).
• Row level security
• Allows access to data via:
• Queries
• Power BI
• Excel
• Automation tools
• …
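Querying Parquet files in place, without copying them into tables, is typically done in a Synapse on-demand (serverless) SQL pool with `OPENROWSET`. The sketch below only builds such a query as text. The storage account URL, the container name and the folder `curated/sales` are placeholders invented for this example, and the helper `curated_query` is hypothetical.

```python
def curated_query(storage_url, folder, top=10):
    """Build a T-SQL query that reads Parquet files directly from the lake."""
    # Refuse quotes so the path cannot break out of the SQL string literal.
    if "'" in storage_url or "'" in folder:
        raise ValueError("single quotes are not allowed in the path")
    bulk_path = f"{storage_url.rstrip('/')}/{folder.strip('/')}/*.parquet"
    return (
        f"SELECT TOP {int(top)} *\n"
        "FROM OPENROWSET(\n"
        f"    BULK '{bulk_path}',\n"
        "    FORMAT = 'PARQUET'\n"
        ") AS [result];"
    )


# Placeholder account and container; replace with real values before use.
query = curated_query(
    "https://examplelake.dfs.core.windows.net/datalake", "curated/sales"
)
print(query)
```

The resulting text could be run from any of the access routes listed on the slide that accept a SQL query.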
Synapse Data Flow
Our PoV/PoC approach
Dream Big, Start Small, Grow Fast
Synapse-based Data Platform
Proof of Value
Rollout 2
Rollout 3
Rollout 4
...
Proof of Concept Project approach
• Make smart choices about the scope
• Define the ‘low hanging fruit’ data sources eligible for the PoC
• Define a quick-win report
• Define a lean & mean project team
• After kick-off – OQuila will
• Set up the Azure environment
• Set up OQuila’s Synapse data lakehouse framework
• Set up and deploy the selected data pipeline(s)
• Build the report
• Document the solution
• Present the solution
• Ready to use and grow!
Thank you!
