Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Sean Martin, CTO
Cambridge Semantics Inc.
DBTA, 21 April 2020
• Big data volume
• Ad hoc queries
• Unstructured
• Semi-structured
• Exploratory
• Raw
• Self service
• On-demand
• Clean
• Consistent
• Integrated
• Accessible
• Searchable
• Secure
• Governed
• Privacy (PII)
• Cataloged
• Linked
• Modeled
• Persisted
• Virtualized
• Collaborative
Data Fabrics are the modern successor to warehouses and lakes.
Data Warehouse: quality and control
Data Lake: flexibility and scale
Data Fabric: fully connected and integrated
[Figure: RDBMS/OLTP, Big Data / Hadoop, Document Repositories, Traditional BI, and Cloud sources, overlaid with the entities CLAIM, CUSTOMER, PRODUCTS, and POLICY]
Semantics and graph allow the data fabric to be an overlay spanning and encompassing the existing data and analytics landscape.
How it works: Business Friendly Data Models
Semantic graph data models to capture and navigate data relationships
• Patients
• Encounters
• Providers
• Medications
• Costs
• Care Plans
• Claims
• Etc.
[Figure: linked data models labeled canonical, claims, care plans, and electronic medical records. Entity labels: Providers, Care Plans, Patients, Costs, Inpatient Claims, Carrier Claims, Outpatient Claims, Prescription Drug_Events, Beneficiary Summary, BestPractiseLinks, careprog2, careprog1, Medications, Patient, Encounters, Observations, Conditions, Allergies, Patients, Procedures, Imaging Studies, Immunizations, Care Plans]
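As a rough illustration of how a graph data model captures and navigates relationships, the sketch below stores a few facts as subject-predicate-object triples in plain Python. The entity types come from the slide; the predicates (`hasEncounter`, `treatedBy`, `hasClaim`), the identifiers, and the helper functions are hypothetical and are not Anzo's model or API.

```python
from collections import defaultdict

# Hypothetical facts expressed as (subject, predicate, object) triples.
TRIPLES = [
    ("patient:1", "type", "Patient"),
    ("encounter:7", "type", "Encounter"),
    ("provider:3", "type", "Provider"),
    ("claim:9", "type", "Claim"),
    ("patient:1", "hasEncounter", "encounter:7"),
    ("encounter:7", "treatedBy", "provider:3"),
    ("encounter:7", "hasClaim", "claim:9"),
]


def build_index(triples):
    """Index triples by (subject, predicate) for quick navigation."""
    index = defaultdict(list)
    for s, p, o in triples:
        index[(s, p)].append(o)
    return index


def follow(index, start, path):
    """Follow a chain of predicates from a start node; return the nodes reached."""
    frontier = [start]
    for predicate in path:
        frontier = [o for node in frontier for o in index.get((node, predicate), [])]
    return frontier


index = build_index(TRIPLES)
# Which providers treated this patient, and which claims resulted?
providers = follow(index, "patient:1", ["hasEncounter", "treatedBy"])
claims = follow(index, "patient:1", ["hasEncounter", "hasClaim"])
print(providers, claims)
```

Because relationships are data rather than fixed table joins, a new question is just a new predicate path over the same triples.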
Real World Graphs Get Big Fast
• Vast: hundreds of sources, representing thousands of entity types
• Siloed: different technologies, schemas, formats
• Complex: sprawling disconnected schemas, wide flat tables, and cryptic names
• Unstructured: documents, emails, logs
• Valuable: hidden connections and common business definitions
Graph Data Models & Semantics
• Simplifies access to complex data to address unanticipated questions
• Quickly profiles, connects and harmonizes data from multiple sources, including unstructured
• Presents tailored views and experiences to different personas with conceptual models
• Flexibly accommodates new data sources and use cases on the fly, with minimal impact
• Scales horizontally to accommodate enterprise data fabric scale
What it is
● An Enterprise Data Fabric Platform
○ Metadata Hub
○ Data Catalog
○ GraphMarts
○ Data Layers (where graph data blending happens)
○ REST Query Service Endpoints
○ Hi-Res Graph Aware Dashboarding Tool
What it does
● Accelerated Data Integration as Services
○ Creates and stores a vast metadata description of the enterprise's data landscape
○ Creates and stores metadata describing the transformations required to turn all raw data sources into a well-described Enterprise Knowledge Graph
○ Automates the on-demand creation of portions of the Knowledge Graph and brokers query access to them
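The metadata-driven approach described above can be pictured with a minimal sketch: a mapping, held as plain data, says how the columns of a raw source become graph triples, and the graph portion is generated only when asked for. The mapping format, column names, and `materialize` function are assumptions made for illustration, not the platform's actual metadata schema.

```python
# Hypothetical mapping metadata: how columns of a raw source table become triples.
MAPPING = {
    "source": "claims_table",
    "entity_type": "Claim",
    "id_column": "claim_id",
    "properties": {"amount": "claimAmount", "patient_id": "forPatient"},
}

# Hypothetical raw rows as they might arrive from the source system.
RAW_ROWS = [
    {"claim_id": "c1", "amount": 120.0, "patient_id": "p1"},
    {"claim_id": "c2", "amount": 75.5, "patient_id": "p2"},
]


def materialize(mapping, rows):
    """Apply the mapping metadata to raw rows and return the resulting triples."""
    triples = []
    for row in rows:
        subject = f"{mapping['entity_type']}:{row[mapping['id_column']]}"
        triples.append((subject, "type", mapping["entity_type"]))
        for column, predicate in mapping["properties"].items():
            triples.append((subject, predicate, row[column]))
    return triples


# The graph portion is created on demand from metadata plus raw data.
graph = materialize(MAPPING, RAW_ROWS)
for triple in graph:
    print(triple)
```

Keeping the transformation as data rather than code is what lets the same description be replayed later or in a different environment.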
What it is
● The first Graph Data Warehouse
○ GOLAP (Graph Online Analytics Processing)
○ In-Memory Massively Parallel Processing (MPP)
○ Linear Scale (largest cluster: 200x64 CPU servers)
○ Like Snowflake or AWS Redshift, but for Graph
○ Enterprise Scale Knowledge Graphs
What it does
● Accelerated Data Integration & Analytics
○ ELT & Data Virtualization (VKG)
○ Knowledge Graph ingested from data sources using more than 200 data source connectors
○ Reporting and BI analytics & aggregates
○ Graph Algorithms, e.g. PageRank, Shortest Path
○ Data Science libraries & Feature Engineering Transformations, e.g. Matrices, PCA, SVD
○ Labeled property graph (LPG)
○ Inferencing, windowed aggregates & views
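To make the "Shortest Path" item in the list above concrete, here is a generic breadth-first search over a small adjacency list. It is a textbook sketch on a made-up graph, assuming unweighted edges; it says nothing about how AnzoGraph implements or parallelizes the algorithm.

```python
from collections import deque

# Hypothetical undirected graph as an adjacency list.
GRAPH = {
    "A": ["B", "C"],
    "B": ["A", "D"],
    "C": ["A", "D"],
    "D": ["B", "C", "E"],
    "E": ["D"],
}


def shortest_path(graph, start, goal):
    """Return one shortest path (fewest edges) from start to goal, or None."""
    previous = {start: None}  # also serves as the visited set
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            # Walk the predecessor links back to the start.
            path = []
            while node is not None:
                path.append(node)
                node = previous[node]
            return path[::-1]
        for neighbor in graph.get(node, []):
            if neighbor not in previous:
                previous[neighbor] = node
                queue.append(neighbor)
    return None


path = shortest_path(GRAPH, "A", "E")
print(path)
```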
Standards
• Supports Open Standards: RDF* and SPARQL 1.1
Where you can deploy
• Fully Automated Deployment on premises or on cloud with Kubernetes Operator
Automation with Kubernetes
[Diagram sequence: Kubernetes / K8s; Kubernetes API & AnzoGraph Operator; Kubernetes Cluster; Kubernetes Container]
Who supports Kubernetes?
The Kubernetes API provides a common automation abstraction across all cloud providers as well as on-premises implementations, which allows us to deliver a hybrid multi-cloud deployment model for Anzo Enterprise Data Fabric with very low switching costs.
Because all data transformation mappings, graph linking and blending instructions, and all computing configurations are held as metadata in Anzo, customers can decide both when and where to deploy their data integration and analytics computing at the most granular level.
This allows them to take advantage of the best available pricing and to more easily keep some workloads (and their data) behind their firewalls.
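The placement decision described here can be sketched as a small selection function: each workload carries a constraint (must it stay behind the firewall?) and each deployment target carries a price. Everything in the sketch, including the `Target` and `Workload` classes, the target names, and the prices, is hypothetical; it does not call the Kubernetes API or reflect Anzo's configuration model.

```python
from dataclasses import dataclass


@dataclass
class Target:
    name: str
    on_premises: bool
    price_per_hour: float  # hypothetical cost figure


@dataclass
class Workload:
    name: str
    must_stay_on_premises: bool


def choose_target(workload, targets):
    """Pick the cheapest target that satisfies the workload's placement constraint."""
    eligible = [
        t for t in targets
        if t.on_premises or not workload.must_stay_on_premises
    ]
    if not eligible:
        raise ValueError(f"no eligible target for {workload.name}")
    return min(eligible, key=lambda t: t.price_per_hour)


TARGETS = [
    Target("cloud-a", on_premises=False, price_per_hour=1.10),
    Target("cloud-b", on_premises=False, price_per_hour=0.95),
    Target("datacenter", on_premises=True, price_per_hour=1.40),
]

# An unconstrained workload goes to the cheapest target overall;
# a firewall-bound workload is limited to on-premises targets.
analytics = choose_target(Workload("analytics", must_stay_on_premises=False), TARGETS)
pii_blend = choose_target(Workload("pii-blend", must_stay_on_premises=True), TARGETS)
print(analytics.name, pii_blend.name)
```

Since the workload definition is independent of the target, moving it only means choosing again, which is the low switching cost the slide refers to.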
Thank You
