SlideShare une entreprise Scribd logo
1  sur  40
Télécharger pour lire hors ligne
Vertica Open Source Relations Manager
Python + MPP Database = In Production Faster
Paige Roberts
3
https://www.brighttalk.com/webcast/8913/351928
Mauro Barbieri, Senior Scientist at Philips
SQL Server
Philips Remote
Service
Network
Teradata
(Salesforce,
SAP data)
Visualization /
Reporting /
Application
Distributed
Pub/Sub
System
Data
Sources
Large-Scale
Storage
ETL – Extract,
Transform,
Load
MPP Analytics
/ Machine
Learning
Batch
Low
Latency
Production Machine Learning Needs
Speed
Fast data processing
without heavy
operations cost
Ease of Use
High level of
abstraction
functions
Features
A wide panel of
functionalities
Flexibility
Open
Architecture
Being able to connect
with a lot of different
technologies
Change is constant –
code, deployment,
data sources,
algorithms, …
Advantages of Python
Broad Utility
Many functionalities - one
of the most broadly useful
programming languages.
Flexibility
It Many right paths to do
things, a lot of freedom,
works on many platforms.
Ease of Use
High level of abstraction
makes Python one of the
easiest programming
languages.
Strong Community
Most data scientists master
Python. Many useful
packages (pandas, scikit, …)
Python Uses & Challenges
Python is great for …
 Predictive Maintenance
 Ensuring Quality of Service
 Proactive Sales
 New Products & Markets
 Differentiation
 A/B Testing
 Marketing behaviors and click analysis
… Data Science
Python has challenges with:
 Performance with big data
- Global interpreter lock
- CPU Thread management
- Access to data in multiple nodes
- Methods for efficiently accessing data (indexing
and data optimization)
- Concurrency
End-to-End Machine Learning Process
8
Business
Understanding Data Analysis Data
Preparation Modeling Evaluation Deployment
End-to-End Machine Learning Process
9
Business
Understanding Data Analysis Data
Preparation Modeling DeploymentEvaluation
Challenges of Machine Learning at Scale
The need for speed at
reasonable cost
Not easy to move
big data around
Sub-sampling can
compromise accuracy
Challenges of Machine Learning at Scale
Sub-sampling can
compromise accuracy
Work with all of
your data
Sampling vs. Full Dataset
13
Source: https://towardsdatascience.com/breaking-the-curse-of-small-
datasets-in-machine-learning-part-1-36f28b0c044d
 Data usually matters more than algorithms for complex problems
 Small data sets usually lack generalization and are prone to over-fitting
Large datasets result in better model generalization
Challenges of Machine Learning at Scale
Not easy to move
big data around
Bring models to
the data
Bring Data to the Model
Slow
 Data transfer is bottleneck – fighting inertia
 Need to downsample reduces accuracy
 Results are not where you need them to
interact with production systems
15
Data Has Gravity
Bring the Model to the Data
Fast!
 Ease of integration with production systems
 Parallelized
 Data stays where it is – security, provenance
 Model management in the database
16
Data Has Gravity
Challenges of Machine Learning at Scale
The need for speed at
reasonable cost
Pick (the right) scaling
architecture
RDBMS
MySQL,
PostgreSQL …
Cassandra,
Key/Value
DB
Schema
Enforced
ETL
(Flattened,
Modeled
Tables)
Hive, Spark,
Presto,
Notebooks
Recent
Data
Applications:
• ETL/Modeling
• CityOps
• Machine
Learning
• Experiments
Ad Hoc Analytics:
• CityOps
• Data Scientists
Batch
Low
Latency Ingestion EL
(Extract,
Load)
Visualization /
Reporting /
Application
Distributed
Pub/Sub
System
Data
Sources
Large-Scale
Storage
ETL – Extract,
Transform,
Load
MPP Analytics
/ Machine
Learning
Advantages of MPP Analytical Database
MPP Scale
Clusters with no name
node or other single point
of failure allow unlimited
scale
Speed and
Concurrency
Query optimization and
resource management
across multiple nodes
Features
ML algorithm
parallelization, moving
windows, geospatial
analysis, time series joins,
fast data prep...
Open Architecture
Integration with many
other applications - BI, ETL,
Kafka, Spark, Data Science
Labs …
High Performance + High Concurrency
20
Get data quickly enough to act upon it, explore your data interactively,
and enable everyone to make their own data-driven decisions
Enable everyone to make their own data-driven decisions.
Get data quickly enough to act on it.
Explore data interactively.
Scale Data Volumes Scale Users
SQL Database
++
Analytics & ML Query Engine
Advantages of Python + MPP Analytical Database
MPP Scale
Clusters with no name
node or other single point
of failure allow unlimited
scale
Speed and
Concurrency
Query optimization and
resource management
across multiple nodes
Features
ML algorithm
parallelization, Moving
Windows, Geospatial,
Time Series, fast data
prep...
Open Architecture
Integration with many
other applications - BI, ETL,
Kafka, Spark, Data Science
Labs …
Broad Utility
Many functionalities - one
of the most broadly useful
programming languages.
Flexibility
It Many right paths to do
things, a lot of freedom,
works on many platforms.
Ease of Use
High level of abstraction
makes Python one of the
easiest programming
languages.
Strong Community
Most data scientists master
Python. Many useful
packages (pandas, scikit, …)
Parallelization
22
Predicting and scoring on multiple nodes
 Python models get copied to all
nodes
 Different portions of data are
processed simultaneously
 Result: Fast response
Node 3
Data
Node 2Node 1
DataData
Built-In Statistical and Quality Functions
Business
Understanding
Data
Exploration
Data
Preparation Modeling Evaluation Deployment
Parallel Machine Learning
Algorithms
Speed
ANSI SQL
Scalability
Parallel Data Preparation
Deploy Anywhere
Outlier
Detection
Normalization
Imbalanced
Data Processing
Sampling
Missing Value
Imputation
And More…
Pattern
Matching
Date/
Time Algebra
Window/
Partition
Date Type
Handling
Sequences
And More…
Sessionize
Time Series
Statistical
Summary
SQL SQLSQL SQLSQL
Automate Model Training and Validation
Business
Understanding
Data
Exploration
Data
Preparation Modeling Evaluation Deployment
Parallel Machine Learning
Algorithms
Speed
ANSI SQL
Scalability
Parallel Data Preparation
Deploy Anywhere
Outlier
Detection
Normalization
Imbalanced
Data Processing
Sampling
Missing Value
Imputation
And More…
Pattern
Matching
Date/
Time Algebra
Window/
Partition
Date Type
Handling
Sequences
And More…
Sessionize
Time Series
Statistical
Summary
SQL SQLSQL SQLSQL
SVM
Random
Forests
Logistic
Regression
Linear
Regression
Ridge
Regression
Naive Bayes
Cross
Validation
And More…
Model-level
Stats
ROC Tables
Error Rate
Lift Table
Confusion
Matrix
R-Squared
MSE
Manage Model Life Cycle
Business
Understanding
Data
Exploration
Data
Preparation Modeling Evaluation Deployment
Parallel Machine Learning
Algorithms
Speed
ANSI SQL
Scalability
Parallel Data Preparation
Deploy Anywhere
Outlier
Detection
Normalization
Imbalanced
Data Processing
Sampling
Missing Value
Imputation
And More…
Pattern
Matching
Date/
Time Algebra
Window/
Partition
Date Type
Handling
Sequences
And More…
Sessionize
Time Series
Statistical
Summary
SQL SQLSQL SQLSQL
SVM
Random
Forests
Logistic
Regression
Linear
Regression
Ridge
Regression
Naive Bayes
Cross
Validation
And More…
Model-level
Stats
ROC Tables
Error Rate
Lift Table
Confusion
Matrix
R-Squared
MSE
In-Database
Scoring
Speed
Scale
Security
26
Bring your R,
TensorFlow, and Python
code inside the database
– analyze the data in
place.
https://github.com/vertica/vertica-python
https://github.com/vertica/Vertica-ML-Python
 Huge improvements in stability and
performance after moving to Vertica
 24 mins on Spark, 3 mins in Vertica
 Can incorporate other data like weather to
optimize predictive thermostat efficiency
after moving to Vertica ML
 Citing speed of analytics, ease of use when
coding in SQL, and improvements in the
accuracy of models after moving workloads
to Vertica ML
 Solving issues that were previously unsolvable
 Minimal hardware, software, and personnel
investments when differentiating with
data science.
27
Thank you!
Learn More: academy.vertica.com
Try it Free: vertica.com/try
Paige Roberts
Open Source Relations Manager
E: Paige.Roberts@microfocus.com
Advantages of In-Database Machine Learning
• Eliminate overhead of data transfer
• Keep data secure with clear provenance
• Store and manage models and data together
• Serve hundreds of concurrent users
• Use highly scalable, high performance
machine learning functionalities
• Avoid maintenance cost of a separate
analytical system
• Increase productivity with simple SQL calls
instead of coding everything
• Prep data faster
30
Node 1 Node 2 Node 3
Schema
Tables
Models
Schema
Tables
Models
Schema
Tables
Models
Network
Benefits of In-database Machine Learning
31
Scale Speed Accuracy
Empower more users within
your organization to leverage
machine learning and increase
data scientist productivity with a
simple SQL interface
You need massively parallel
processing power to build and
train models at the speed of
business
Run machine learning models
based on all your historical
data, not just a subset of
down-sampled data
Democratized predictive
analytics applications
Faster time to market for
machine learning projects
Deploy predictive use
cases and stay ahead
In-database machine learning transforms the way data scientists and analysts interact with data
Simple SQL Execution
32
Put the power of predictive analytics in the hands of more analysts and database users
With Vertica, users can create, train and deploy machine learning models
using simple SQL calls, at massive scale
Linear
Regression
Logistic
Regression
K-Means
Clustering
Random
Forrest
Naive
Bayes
Support Vector
Machines
SQL
An Open Architecture with a Rich Ecosystem
Python
SQL
C++
Geospatial
TimeSeries
EventSeries
Real-time
User-DefinedStorage
Security
External Tables:Analyze inPlace
MachineLearning
TextAnalytics
Regression
PatternMatching
User-DefinedFunctions
DataTransformation
Streaming
ETL
User-Defined
Loads
BI &Visualization
ODBC
JDBC
OLEDB
S3
R Java
The Vertica Analytics Platform
34
Native High
Availability
Standard SQL
Interface
Column
Orientation
Machine
Learning
Advanced
Compression
MPP Massive
Parallel
Processing
Leverages BI, ETL,
Hadoop/MapReduce and
OLTP investments
No disk I/O bottleneck
simultaneously load &
query
Native DB-aware
clustering on low-cost x86
Linux nodes
Built-in redundancy that
also speeds up queries
In-database machine
learning functions for
predictive analytics at
scale
Up to 90% space
reduction using 10+
algorithms
 10-50x faster than legacy
databases
 Scales from TB to PB with
industry-standard
hardware
 Simple integration with
existing ETL and BI
solutions
 SQL-99+ compliant
 Ultimate deployment
flexibility
 Extended analytics
 In-database machine
learning
 24/7 Load & Query
Online Examples
Predictive Maintenance Demo
36
Analyze sensor data
from cooling towers
across the US ,
enabling equipment
manufacturers to
predict and prevent
equipment failure
Flight Tracker Demo
37
Vertica operates
at the “edge”
with flight track
detail. Sensor
data is collected
using a Raspberry
pi with radio
receiver and
antenna. Data is
loaded into
Vertica as
thousands of
records per
second and
builds to billions
of flight data
points collected
within a 250-mile
radius.
https://www.vertica.com/blog/blog-post-series-using-vertica-track-
commercial-aircraft-near-real-time/
Customer Examples
Moving data science workloads from Spark on Hadoop to in-database
Improvements in stability and performance
Creating customer segmentation via clustering algorithms on a
15 million customer dataset took 24 mins on Spark - 3 mins in database
Concurrently running other algorithms without performance impact
Cardlytics partners with more than 1,500
financial institutions to run their online and
mobile banking rewards programs, which
gives us a robust view into where and when
consumers are spending their money.
Fidelis Cybersecurity protects the
world's most sensitive data by
identifying and removing attackers
no matter where they're hiding on
your network and endpoints.
40
Data science team was experiencing
challenges with performance while
using Spark ML
Moving workloads from Spark ML to
in-database ML provided:
Speed of analytics
Ease of use when coding in SQL
Increased accuracy of models
Some Vertica IoT Customer Resources
Case Studies
 Anritsu ROI case study: https://www.vertica.com/wp-
content/uploads/2017/01/r24-HPE-Vertica-ROI-case-study-Anritsu.pdf
 Infographic of ROI: https://www.vertica.com/wp-
content/uploads/2017/03/Anritsu-v2.pdf
 Nimble Storage ROI case study: https://www.vertica.com/wp-
content/uploads/2017/08/Nimble-Storage-ROI.pdf
 Optimal+ case study: https://www.vertica.com/wp-
content/uploads/2017/06/Optimal-MF-rebrand-FINAL-lo-res.pdf
 *Climate Corp case study: https://www.vertica.com/wp-
content/uploads/2019/01/Climate-Corp_Success-Story-FINAL.pdf
Webcasts – Data Disruptors
 Philips: https://www.brighttalk.com/webcast/10477/277693
 Climate Corp: https://www.brighttalk.com/webcast/8913/336201
 Nimble Storage (HPE InfoBright):
https://www.brighttalk.com/webcast/8913/330769
 Zebrium: https://www.brighttalk.com/webcast/8913/332838
 Simpli.fi: https://www.brighttalk.com/webcast/8913/354325/simpli-fi-
delivers-advertising-insights-on-billions-of-streaming-bid-messages
Videos
 Optimal+:
https://www.youtube.com/watch?v=IZkkoy5ZT1M&feature=youtu.be
 Anritsu:
https://www.youtube.com/watch?v=QZ5vWqblVXU&feature=youtu.be
41
42
Try Vertica
• 3 Easy ways to try Vertica (https://www.vertica.com/try/)
o Get Started in Minutes with Vertica by the Hour from AWS Marketplace,
Google Cloud or Microsoft Azure
o Free Community Edition (for up to 1TB and 3-node cluster)
o Vertica Start-Up Accelerator Program (Free 1-year term, 25 TB license)
vertica.com/try

Contenu connexe

Tendances

DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code DeploysDevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
Andreas Grabner
 

Tendances (20)

(R)evolutionize APM
(R)evolutionize APM(R)evolutionize APM
(R)evolutionize APM
 
HSPS 2015 - SharePoint Performance Santiy Checks
HSPS 2015 - SharePoint Performance Santiy ChecksHSPS 2015 - SharePoint Performance Santiy Checks
HSPS 2015 - SharePoint Performance Santiy Checks
 
Docker/DevOps Meetup: Metrics-Driven Continuous Performance and Scalabilty
Docker/DevOps Meetup: Metrics-Driven Continuous Performance and ScalabiltyDocker/DevOps Meetup: Metrics-Driven Continuous Performance and Scalabilty
Docker/DevOps Meetup: Metrics-Driven Continuous Performance and Scalabilty
 
Four Practices to Fix Your Top .NET Performance Problems
Four Practices to Fix Your Top .NET Performance ProblemsFour Practices to Fix Your Top .NET Performance Problems
Four Practices to Fix Your Top .NET Performance Problems
 
Java Performance Mistakes
Java Performance MistakesJava Performance Mistakes
Java Performance Mistakes
 
Hugs instead of Bugs: Dreaming of Quality Tools for Devs and Testers
Hugs instead of Bugs: Dreaming of Quality Tools for Devs and TestersHugs instead of Bugs: Dreaming of Quality Tools for Devs and Testers
Hugs instead of Bugs: Dreaming of Quality Tools for Devs and Testers
 
From Zero to Performance Hero in Minutes - Agile Testing Days 2014 Potsdam
From Zero to Performance Hero in Minutes - Agile Testing Days 2014 PotsdamFrom Zero to Performance Hero in Minutes - Agile Testing Days 2014 Potsdam
From Zero to Performance Hero in Minutes - Agile Testing Days 2014 Potsdam
 
Splunk for Developers
Splunk for DevelopersSplunk for Developers
Splunk for Developers
 
Spring 5.0 meets reactive programming
Spring 5.0 meets reactive programmingSpring 5.0 meets reactive programming
Spring 5.0 meets reactive programming
 
London WebPerf Meetup: End-To-End Performance Problems
London WebPerf Meetup: End-To-End Performance ProblemsLondon WebPerf Meetup: End-To-End Performance Problems
London WebPerf Meetup: End-To-End Performance Problems
 
Careful - APIs Inside: Testing and Monitoring for App Development
Careful - APIs Inside: Testing and Monitoring for App DevelopmentCareful - APIs Inside: Testing and Monitoring for App Development
Careful - APIs Inside: Testing and Monitoring for App Development
 
Deploy Faster Without Failing Faster - Metrics-Driven - Dynatrace User Groups...
Deploy Faster Without Failing Faster - Metrics-Driven - Dynatrace User Groups...Deploy Faster Without Failing Faster - Metrics-Driven - Dynatrace User Groups...
Deploy Faster Without Failing Faster - Metrics-Driven - Dynatrace User Groups...
 
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code DeploysDevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
 
Metrics Driven DevOps - Automate Scalability and Performance Into your Pipeline
Metrics Driven DevOps - Automate Scalability and Performance Into your PipelineMetrics Driven DevOps - Automate Scalability and Performance Into your Pipeline
Metrics Driven DevOps - Automate Scalability and Performance Into your Pipeline
 
How to keep you out of the News: Web and End-to-End Performance Tips
How to keep you out of the News: Web and End-to-End Performance TipsHow to keep you out of the News: Web and End-to-End Performance Tips
How to keep you out of the News: Web and End-to-End Performance Tips
 
Top .NET, Java & Web Performance Mistakes - Meetup Jan 2015
Top .NET, Java & Web Performance Mistakes - Meetup Jan 2015Top .NET, Java & Web Performance Mistakes - Meetup Jan 2015
Top .NET, Java & Web Performance Mistakes - Meetup Jan 2015
 
Splunk for ITOps
Splunk for ITOpsSplunk for ITOps
Splunk for ITOps
 
Sydney Continuous Delivery Meetup May 2014
Sydney Continuous Delivery Meetup May 2014Sydney Continuous Delivery Meetup May 2014
Sydney Continuous Delivery Meetup May 2014
 
Security guidelines
Security guidelinesSecurity guidelines
Security guidelines
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
 

Similaire à Python + MPP Database = Large Scale AI/ML Projects in Production Faster

Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Precisely
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
Provectus
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Perficient, Inc.
 

Similaire à Python + MPP Database = Large Scale AI/ML Projects in Production Faster (20)

Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningPaige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
 
Building ML Pipelines with DCOS
Building ML Pipelines with DCOSBuilding ML Pipelines with DCOS
Building ML Pipelines with DCOS
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
 
Lessons Learned from Modernizing USCIS Data Analytics Platform
Lessons Learned from Modernizing USCIS Data Analytics PlatformLessons Learned from Modernizing USCIS Data Analytics Platform
Lessons Learned from Modernizing USCIS Data Analytics Platform
 
The Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the MassesThe Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the Masses
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
 
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsApache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
 
Challenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in ProductionChallenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in Production
 
Big Data Meetup #7
Big Data Meetup #7Big Data Meetup #7
Big Data Meetup #7
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 edition
 
DevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-OracleDevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-Oracle
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
 
How to Radically Simplify Your Business Data Management
How to Radically Simplify Your Business Data ManagementHow to Radically Simplify Your Business Data Management
How to Radically Simplify Your Business Data Management
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
 
Above the cloud joarder kamal
Above the cloud   joarder kamalAbove the cloud   joarder kamal
Above the cloud joarder kamal
 
Building the Next-gen Digital Meter Platform for Fluvius
Building the Next-gen Digital Meter Platform for FluviusBuilding the Next-gen Digital Meter Platform for Fluvius
Building the Next-gen Digital Meter Platform for Fluvius
 

Dernier

FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 

Dernier (20)

Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 

Python + MPP Database = Large Scale AI/ML Projects in Production Faster

  • 1. Vertica Open Source Relations Manager Python + MPP Database = In Production Faster Paige Roberts
  • 3. SQL Server Philips Remote Service Network Teradata (Salesforce, SAP data) Visualization / Reporting / Application Distributed Pub/Sub System Data Sources Large-Scale Storage ETL – Extract, Transform, Load MPP Analytics / Machine Learning Batch Low Latency
  • 4. Production Machine Learning Needs Speed Fast data processing without heavy operations cost Ease of Use High level of abstraction functions Features A wide panel of functionalities Flexibility Open Architecture Being able to connect with a lot of different technologies Change is constant – code, deployment, data sources, algorithms, …
  • 5. Advantages of Python Broad Utility Many functionalities - one of the most broadly useful programming languages. Flexibility It Many right paths to do things, a lot of freedom, works on many platforms. Ease of Use High level of abstraction makes Python one of the easiest programming languages. Strong Community Most data scientists master Python. Many useful packages (pandas, scikit, …)
  • 6. Python Uses & Challenges Python is great for …  Predictive Maintenance  Ensuring Quality of Service  Proactive Sales  New Products & Markets  Differentiation  A/B Testing  Marketing behaviors and click analysis … Data Science Python has challenges with:  Performance with big data - Global interpreter lock - CPU Thread management - Access to data in multiple nodes - Methods for efficiently accessing data (indexing and data optimization) - Concurrency
  • 7. End-to-End Machine Learning Process 8 Business Understanding Data Analysis Data Preparation Modeling Evaluation Deployment
  • 8. End-to-End Machine Learning Process 9 Business Understanding Data Analysis Data Preparation Modeling DeploymentEvaluation
  • 9. Challenges of Machine Learning at Scale The need for speed at reasonable cost Not easy to move big data around Sub-sampling can compromise accuracy
  • 10. Challenges of Machine Learning at Scale Sub-sampling can compromise accuracy Work with all of your data
  • 11. Sampling vs. Full Dataset 13 Source: https://towardsdatascience.com/breaking-the-curse-of-small- datasets-in-machine-learning-part-1-36f28b0c044d  Data usually matters more than algorithms for complex problems  Small data sets usually lack generalization and are prone to over-fitting Large datasets result in better model generalization
  • 12. Challenges of Machine Learning at Scale Not easy to move big data around Bring models to the data
  • 13. Bring Data to the Model Slow  Data transfer is bottleneck – fighting inertia  Need to downsample reduces accuracy  Results are not where you need them to interact with production systems 15 Data Has Gravity
  • 14. Bring the Model to the Data Fast!  Ease of integration with production systems  Parallelized  Data stays where it is – security, provenance  Model management in the database 16 Data Has Gravity
  • 15. Challenges of Machine Learning at Scale The need for speed at reasonable cost Pick (the right) scaling architecture
  • 16. RDBMS MySQL, PostgreSQL … Cassandra, Key/Value DB Schema Enforced ETL (Flattened, Modeled Tables) Hive, Spark, Presto, Notebooks Recent Data Applications: • ETL/Modeling • CityOps • Machine Learning • Experiments Ad Hoc Analytics: • CityOps • Data Scientists Batch Low Latency Ingestion EL (Extract, Load) Visualization / Reporting / Application Distributed Pub/Sub System Data Sources Large-Scale Storage ETL – Extract, Transform, Load MPP Analytics / Machine Learning
  • 17. Advantages of MPP Analytical Database MPP Scale Clusters with no name node or other single point of failure allow unlimited scale Speed and Concurrency Query optimization and resource management across multiple nodes Features ML algorithm parallelization, moving windows, geospatial analysis, time series joins, fast data prep... Open Architecture Integration with many other applications - BI, ETL, Kafka, Spark, Data Science Labs …
  • 18. High Performance + High Concurrency 20 Get data quickly enough to act upon it, explore your data interactively, and enable everyone to make their own data-driven decisions Enable everyone to make their own data-driven decisions. Get data quickly enough to act on it. Explore data interactively. Scale Data Volumes Scale Users SQL Database ++ Analytics & ML Query Engine
  • 19. Advantages of Python + MPP Analytical Database MPP Scale Clusters with no name node or other single point of failure allow unlimited scale Speed and Concurrency Query optimization and resource management across multiple nodes Features ML algorithm parallelization, Moving Windows, Geospatial, Time Series, fast data prep... Open Architecture Integration with many other applications - BI, ETL, Kafka, Spark, Data Science Labs … Broad Utility Many functionalities - one of the most broadly useful programming languages. Flexibility It Many right paths to do things, a lot of freedom, works on many platforms. Ease of Use High level of abstraction makes Python one of the easiest programming languages. Strong Community Most data scientists master Python. Many useful packages (pandas, scikit, …)
  • 20. Parallelization 22 Predicting and scoring on multiple nodes  Python models get copied to all nodes  Different portions of data are processed simultaneously  Result: Fast response Node 3 Data Node 2Node 1 DataData
  • 21. Built-In Statistical and Quality Functions Business Understanding Data Exploration Data Preparation Modeling Evaluation Deployment Parallel Machine Learning Algorithms Speed ANSI SQL Scalability Parallel Data Preparation Deploy Anywhere Outlier Detection Normalization Imbalanced Data Processing Sampling Missing Value Imputation And More… Pattern Matching Date/ Time Algebra Window/ Partition Date Type Handling Sequences And More… Sessionize Time Series Statistical Summary SQL SQLSQL SQLSQL
  • 22. Automate Model Training and Validation Business Understanding Data Exploration Data Preparation Modeling Evaluation Deployment Parallel Machine Learning Algorithms Speed ANSI SQL Scalability Parallel Data Preparation Deploy Anywhere Outlier Detection Normalization Imbalanced Data Processing Sampling Missing Value Imputation And More… Pattern Matching Date/ Time Algebra Window/ Partition Date Type Handling Sequences And More… Sessionize Time Series Statistical Summary SQL SQLSQL SQLSQL SVM Random Forests Logistic Regression Linear Regression Ridge Regression Naive Bayes Cross Validation And More… Model-level Stats ROC Tables Error Rate Lift Table Confusion Matrix R-Squared MSE
  • 23. Manage Model Life Cycle Business Understanding Data Exploration Data Preparation Modeling Evaluation Deployment Parallel Machine Learning Algorithms Speed ANSI SQL Scalability Parallel Data Preparation Deploy Anywhere Outlier Detection Normalization Imbalanced Data Processing Sampling Missing Value Imputation And More… Pattern Matching Date/ Time Algebra Window/ Partition Date Type Handling Sequences And More… Sessionize Time Series Statistical Summary SQL SQLSQL SQLSQL SVM Random Forests Logistic Regression Linear Regression Ridge Regression Naive Bayes Cross Validation And More… Model-level Stats ROC Tables Error Rate Lift Table Confusion Matrix R-Squared MSE In-Database Scoring Speed Scale Security
  • 24. 26 Bring your R, TensorFlow, and Python code inside the database – analyze the data in place. https://github.com/vertica/vertica-python https://github.com/vertica/Vertica-ML-Python
  • 25.  Huge improvements in stability and performance after moving to Vertica  24 mins on Spark, 3 mins in Vertica  Can incorporate other data like weather to optimize predictive thermostat efficiency after moving to Vertica ML  Citing speed of analytics, ease of use when coding in SQL, and improvements in the accuracy of models after moving workloads to Vertica ML  Solving issues that were previously unsolvable  Minimal hardware, software, and personnel investments when differentiating with data science. 27
  • 26. Thank you! Learn More: academy.vertica.com Try it Free: vertica.com/try Paige Roberts Open Source Relations Manager E: Paige.Roberts@microfocus.com
  • 27.
  • 28. Advantages of In-Database Machine Learning • Eliminate overhead of data transfer • Keep data secure with clear provenance • Store and manage models and data together • Serve hundreds of concurrent users • Use highly scalable, high performance machine learning functionalities • Avoid maintenance cost of a separate analytical system • Increase productivity with simple SQL calls instead of coding everything • Prep data faster 30 Node 1 Node 2 Node 3 Schema Tables Models Schema Tables Models Schema Tables Models Network
  • 29. Benefits of In-database Machine Learning 31 Scale Speed Accuracy Empower more users within your organization to leverage machine learning and increase data scientist productivity with a simple SQL interface You need massively parallel processing power to build and train models at the speed of business Run machine learning models based on all your historical data, not just a subset of down-sampled data Democratized predictive analytics applications Faster time to market for machine learning projects Deploy predictive use cases and stay ahead In-database machine learning transforms the way data scientists and analysts interact with data
  • 30. Simple SQL Execution 32 Put the power of predictive analytics in the hands of more analysts and database users With Vertica, users can create, train and deploy machine learning models using simple SQL calls, at massive scale Linear Regression Logistic Regression K-Means Clustering Random Forrest Naive Bayes Support Vector Machines SQL
  • 31. An Open Architecture with a Rich Ecosystem Python SQL C++ Geospatial TimeSeries EventSeries Real-time User-DefinedStorage Security External Tables:Analyze inPlace MachineLearning TextAnalytics Regression PatternMatching User-DefinedFunctions DataTransformation Streaming ETL User-Defined Loads BI &Visualization ODBC JDBC OLEDB S3 R Java
  • 32. The Vertica Analytics Platform 34 Native High Availability Standard SQL Interface Column Orientation Machine Learning Advanced Compression MPP Massive Parallel Processing Leverages BI, ETL, Hadoop/MapReduce and OLTP investments No disk I/O bottleneck simultaneously load & query Native DB-aware clustering on low-cost x86 Linux nodes Built-in redundancy that also speeds up queries In-database machine learning functions for predictive analytics at scale Up to 90% space reduction using 10+ algorithms  10-50x faster than legacy databases  Scales from TB to PB with industry-standard hardware  Simple integration with existing ETL and BI solutions  SQL-99+ compliant  Ultimate deployment flexibility  Extended analytics  In-database machine learning  24/7 Load & Query
  • 34. Predictive Maintenance Demo 36 Analyze sensor data from cooling towers across the US , enabling equipment manufacturers to predict and prevent equipment failure
  • 35. Flight Tracker Demo 37 Vertica operates at the “edge” with flight track detail. Sensor data is collected using a Raspberry pi with radio receiver and antenna. Data is loaded into Vertica as thousands of records per second and builds to billions of flight data points collected within a 250-mile radius. https://www.vertica.com/blog/blog-post-series-using-vertica-track- commercial-aircraft-near-real-time/
  • 37. Moving data science workloads from Spark on Hadoop to in-database Improvements in stability and performance Creating customer segmentation via clustering algorithms on a 15 million customer dataset took 24 mins on Spark - 3 mins in database Concurrently running other algorithms without performance impact Cardlytics partners with more than 1,500 financial institutions to run their online and mobile banking rewards programs, which gives us a robust view into where and when consumers are spending their money.
  • 38. Fidelis Cybersecurity protects the world's most sensitive data by identifying and removing attackers no matter where they're hiding on your network and endpoints. 40 Data science team was experiencing challenges with performance while using Spark ML Moving workloads from Spark ML to in-database ML provided: Speed of analytics Ease of use when coding in SQL Increased accuracy of models
  • 39. Some Vertica IoT Customer Resources Case Studies  Anritsu ROI case study: https://www.vertica.com/wp- content/uploads/2017/01/r24-HPE-Vertica-ROI-case-study-Anritsu.pdf  Infographic of ROI: https://www.vertica.com/wp- content/uploads/2017/03/Anritsu-v2.pdf  Nimble Storage ROI case study: https://www.vertica.com/wp- content/uploads/2017/08/Nimble-Storage-ROI.pdf  Optimal+ case study: https://www.vertica.com/wp- content/uploads/2017/06/Optimal-MF-rebrand-FINAL-lo-res.pdf  *Climate Corp case study: https://www.vertica.com/wp- content/uploads/2019/01/Climate-Corp_Success-Story-FINAL.pdf Webcasts – Data Disruptors  Philips: https://www.brighttalk.com/webcast/10477/277693  Climate Corp: https://www.brighttalk.com/webcast/8913/336201  Nimble Storage (HPE InfoBright): https://www.brighttalk.com/webcast/8913/330769  Zebrium: https://www.brighttalk.com/webcast/8913/332838  Simpli.fi: https://www.brighttalk.com/webcast/8913/354325/simpli-fi- delivers-advertising-insights-on-billions-of-streaming-bid-messages Videos  Optimal+: https://www.youtube.com/watch?v=IZkkoy5ZT1M&feature=youtu.be  Anritsu: https://www.youtube.com/watch?v=QZ5vWqblVXU&feature=youtu.be 41
  • 40. 42 Try Vertica • 3 Easy ways to try Vertica (https://www.vertica.com/try/) o Get Started in Minutes with Vertica by the Hour from AWS Marketplace, Google Cloud or Microsoft Azure o Free Community Edition (for up to 1TB and 3-node cluster) o Vertica Start-Up Accelerator Program (Free 1-year term, 25 TB license) vertica.com/try