SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
Suqiang Song, Director, Chapter Leader of Data Engineering & AI
Mastercard
AI as a Service
Build Shared AI Service Platforms Based on Deep
Learning Technologies
#AI1SAIS
Differentiation starts with consumer insights from a
massive worldwide payments network and our
experience in data cleansing, analytics and modeling
Mastercard Big Data & AI Expertise
WAREHOUSED
• 10 petabytes
• 5+ year historic global view
• Rapid retrieval
• Above-and-beyond privacy protection and security
MULTI-SOURCED
• 38MM+ merchant locations
• 22,000 issuers CLEANSED, AGGREGATD, ANONYMOUS, AUGMENTED
• 1.5MM automated rules
• Continuously tested
TRANSFORMED INTO ACTIONABLE INSIGHTS
• Reports, indexes, benchmarks
• Behavioral variables
• Models, scores, forecasting
• Econometrics
What can
2.4 BILLION
Global Cards and
56 BILLION
Transactions/
Year mean
to you?
Mastercard Enhanced Artificial Intelligence Capability
with the Acquisitions of Applied Predictive
Technologies(2015) and Brighterion (2017)
What is the AI as a Service ?
©2018Mastercard.ProprietaryandConfidential.
AI Applications
Machine learning frameworks
• Machine learning frameworks:
Provide stable and secure
environments and consolidate
integrated wrappers on top of
variable technologies for regular
machine learning works
• Applications build silos from scratch
Three modes of AI as a Services
• Fully managed machine learning servic
es use templates, pre-built models and
drag-and-drop development tools to si
mplify and expedite the process of usin
g a machine learning framework
• Applications share templates and pre-
built models , assembly and infer them
into pipelines or business context
• Automation Services, tasks like explora
tory data analysis, pre-processing of da
ta, hyper-parameter tuning, model sele
ction and putting models into producti
on can be automated
• “God's Return to God, Satan's Return
to Satan , Math’s Return AI, Business’s
Return Biz”
Machine learning frameworks
AI Applications
Machine learning frameworks
Fully managed
machine learning services
AI Applications
On / Off Premise Advanced
Infrastructure
Fully managed
machine learning services
On / Off Premise Advanced
Infrastructure
On / Off Premise Advanced
Infrastructure
Automation
Services
©2018Mastercard.ProprietaryandConfidential.
5
Time
Cost
Data Exploration &
Harmonization
Features
Engineering
0,0
Regular Mode :Machine learning frameworks
Evaluation
& Benchmarking
Model Deployment
&Serving
$100,000
6 weeks
Modeling
Example : Machine Learning Sandbox
©2018Mastercard.ProprietaryandConfidential.
6
Time
Cost
Features
Engineering
0,0
Plus Mode : Fully managed machine learning services
Evaluation
& Benchmarking
$50,000
2 weeks
Model Deployment
&Serving
Modeling
Data Exploration &
Harmonization
Example : Data Science Workbench
©2018Mastercard.ProprietaryandConfidential.
7
Time
Cost
Features
Engineering
0,0
Premium Mode: Automation Services
Evaluation
& Benchmarking
$10,000
2 days
Model Deployment
&ServingData Exploration &
Harmonization
Modeling
Example : Amazon SageMaker ?
©2018Mastercard.ProprietaryandConfidential.
8
Feature engineering bottlenecks
Pre-calculate hundreds or thousands Long
Term Variables take lots of resources and times
Model scalability limitations
Trade-off between automation in parallel and
scaling machine learning to ever larger datasets
and ever more complicated models
Model Serving to multiple contexts
Gap to connect to existing business
pipelines , offline ,streaming and real-time
Heavily relies on human machine learning
experts
Relies on human to perform the most of tasks
API Enablement and automate deployment
Low productivity to create more models with
low level raw APIs
Isolated promotions and operation readness
with automate deployment
Less integration with end to end data
pipelines, fill in the loop
Gap to bring machine learning process into
the existing enterprise data pipelines ,
including batch , streaming and real-time
1
2
3
4
5
6
Challenges to achieve Premium Automation AI Service
Learning Automation Serving Automation
©2018Mastercard.ProprietaryandConfidential.
What Deep Learning can help ?
©2018Mastercard.ProprietaryandConfidential.
10
Bottlenecks
 Need to pre-calculate hundreds or thousands Long Term Variables for each user, such as total
spends /visits for merchants list, category list divided by week, months and years
 The computation time for LTV features took > 70% of the data processing time for the whole
lifecycle and occupied lots of resources which had huge impact to other critical workloads.
 Miss the feature selection optimizations which could save the data engineering efforts a lot
AUTH DETAIL from last weekLTV DATA from last week MERCHANT
AGED LTV DATA
GEO
CATEGORY
ITEM LEVEL DATA
FILTERED TRANSACTIONS
SUMMED BY USER
AGED BY USERAGED LTV DATA
LTV DATA FOR THIS WEEK
Challenges with Traditional ML : Feature engineering bottlenecks
©2018Mastercard.ProprietaryandConfidential.
11
Improvements
 When build model , only focus on few
pre-defined sliding features and custom
overlap features ( Users only need to
identify the columns names from data
source)
 Remove most of the LTV pre-calculations
works, saved hours time and lots of
resources
 Deep learning algorithm generates
exponential growth of hidden embedding
features ,do the internal features selections
and optimization automatically when it
does cross validation at training stage
With Deep Learning : Remove lots of LTV workloads and simply the feature engineering
©2018Mastercard.ProprietaryandConfidential.
12
…
Item 1 * Users
Item 2* Users
Item n* Users
Feature
Engineering
Training 1
Training 2
Training n
Model 1
Model 2
Model n
Merge
2
2
2
3
3
3
4
1
Prebuilt correlation
Model
Merge all the
prediction results
Evaluation 1
Evaluation 2
Evaluation 3
Limitations
 All the pipelines separated by items and
generate one model for each item
 Have to pre-calculate the correlation
matrix between items
 Lots of redundant duplications and
computations at feature engineering
,training and testing process
 Run items in parallel and occupied
most of cluster resources when executed
 Bad metrics for items with few
transactions
 It is very hard to scale more items , from
hundreds to millions ?
Challenges with Traditional ML : Model scalability
©2018Mastercard.ProprietaryandConfidential.
13
•NCF
• Scenario:Neural Collaborative
Filtering ,recommend products to
customers (priority is to
recommend to active users)
according to customers’ past
history activities.
• https://www.comp.nus.edu.sg/~xia
ngnan/papers/ncf.pdf
•Wide & Deep learning
• Scenario: jointly trained wide linear
models and deep neural networks-
--to combine the benefits of
memorization and generalization
for recommender systems.
• https://pdfs.semanticscholar.org/aa
9d/39e938c84a867ddf2a8cabc575f
fba27b721.pdf
Linear 2
ReLU
Linear 1
ReLU
Concat
CMul
LookupTable
(MF User)
LookupTable
(MLP User)
LookupTable
(MF Item)
Linear 3
Sigmoid
Select
LookupTable
(MLP Item)
ConcatTable
Conca
SelectSelect Select
User index User indexItem Index
User Item Pair
MLP
MF
Embedding
Layers
Item Index
MLP User Embedding MLP Item EmbeddingMF Item EmbeddingMF User Embedding
With Deep Learning : Scale models in deeper and wider without decreasing metrics
©2018Mastercard.ProprietaryandConfidential.
14
Relies on human to perform the following tasks:
Select and construct appropriate features.
Select an appropriate model family.
Optimize model hyper parameters.
Post process machine learning models.
Critically analyze the results obtained.
Challenges with Traditional ML : Heavily relies on human machine learning experts
Training Data Sets
Data Source
Partitioning
Model 2
Model 1
Model n
Testing Data Sets
Validation Data Sets
Choose Best Model
Validate Model Metrics
©2018Mastercard.ProprietaryandConfidential.
15
Improvements
 Common neural network
"tricks", including initialization, L2
and dropout regularization, Batch
normalization, gradient checking
 A variety of optimization
algorithms, such as mini-batch
gradient descent, Momentum,
RMSprop and Adam
 Provides optimization-as-a-
service using an ensemble of
optimization strategies, allowing
practitioners to efficiently
optimize models faster and
cheaper than standard
approaches.
With Deep Learning : Gives more options for finding an optimally performing robust
configuration
Our Explore & Evaluation Journey
©2016Mastercard.ProprietaryandConfidential.
Enterprise requirements for Deep Learning
Seamless integration with
Products Internal & External
• Add deep learning capabilities to existing
Analytic Applications and/or machine learning
workflows rather than rebuild all of them
Collocated with mass data
storage
• Analyze a large amount of data on the
same Big Data clusters where the data
are stored (HDFS, HBase, Hive, etc.) rather
than move or duplicate data
Shared infrastructure with Multi-
tenant isolated resources
• Leverage existing Big Data clusters and deep
learning workloads should be managed and
monitored with other workloads (ETL, data
warehouse, traditional ML etc..) rather than
run DL workloads standalone in separate
clusters
Data governance with
restricted Processing
• Follow data privacy, regulation and
compliance ( such as PCI/PII compliance
and GDPR rather than operate data in
unsecured zones
©2016Mastercard.ProprietaryandConfidential.
• Claimed that the GPU computing are better than CPU which requires new hardware
infrastructure (very long timeline normally )
• Success requires many engineer-hours ( Impossible to Install a Tensor Flow Cluster at
STAGE ...)
• Low level APIs with steep learning curve ( Where is your PHD degree ? )
• Not well integrated with other enterprise tools and need data movements (couldn't
leverage the existing ETL, data warehousing and other analytic relevant data pipelines,
technologies and tool sets. And it is also a big challenge to make duplicate data
pipelines and data copy to the capacity and performance.)
• Tedious and fragile to distribute computations ( less monitoring )
• The concerns of Enterprise Maturity and InfoSec ( use GPU cluster with Tensor Flow from
Google Cloud )
…………..
Maybe not your story , but we have ....
Challenges and limitations to Production considering some “Super Stars”….
©2016Mastercard.ProprietaryandConfidential.
Integrations with existing DL
libraries
• Deep Learning Pipelines (from Databricks)
• Caffe (CaffeOnSpark)
• Keras (Elephas)
• mxnet
• Paddle
• TensorFlow (TensorFlow on Spark,
TensorFrames)
• CNTK (mmlspark)
Implementations of DL on Spark
• BigDL
• DeepDist
• DeepLearning4J
• SparkCL
• SparkNet
What does Spark offer?
©2016Mastercard.ProprietaryandConfidential.
Tensor Flow-on-Spark (or Caffe-on-Spark) uses Spark executors (tasks) to launch Tensor Flow/Caffe
instances in the cluster; however, the distributed deep learning (e.g., training, tuning and prediction) are
performed outside of Spark (across multiple Tensor Flow or Caffe instances).
(1) As a results, Tensor Flow/Caffe still runs on specialized HW (such as GPU servers interconnected by
InfiniBand), and the Open MP implementations in Tensor Flow/Caffe conflicts with the JVM threading in
Spark (resulting in lower performance).
(2) In addition, in this case Tensor Flow/Caffe can only interact the rest of the analytics pipelines in a
very coarse-grained fashion (running as standalone jobs outside of the pipeline, and using HDFS files as
job input and output).
Programming
interface
Contributors commits
BigDL Scala & Python 50 2221
TensorflowOnSpark Python 9 257
Databricks/tensor Python 9 185
Databricks/spark-deep-
learning
Python 8 51
StatisticscollectedonMar5th
, 2018
Need more break down …..
©2016Mastercard.ProprietaryandConfidential.
21
Train Wide and Deep Model ( BigDL)
features Models
model
candidatesampled
partition
Training Data
…
10~12
Months
Raw
Txns
+
Negative
samples
Load Parquet
Train Multiple Models
Train AIS Model ( Mlib)
sampled
partition
sampled
partition
Post
Processing
Simple
Feature
Engineering
models
models
Spark ML Pipeline Stages
Test Data
Predictions
Test
Spark Data FramesParquet Files
Pre-processing
1~2
Months
Feature
Selections
Feature
Selection
Model
Ensemble
Inference
SparkPipeline
Neural Recommender
Using BigDL NCF/ Wide And Deep
Transformer Model
Evaluation
& Fine
Tune
Estimator
Spark Mllib
Train NCF Model ( BigDL)
models
…
Benchmark
User-Merchant
User-Category
User-Geo
User-Merchant-Geo
….
POC: Benchmark BigDL & Spark Mllib
©2016Mastercard.ProprietaryandConfidential.
22
AUROC: A
AUPRCs: B
recall: C
precision: D
20 precision: E
Mllib AIS
Parameters :
MaxIter(100)
RegParam(0.01)
Rank(200)
Alpha(0.01)
BigDL NCF
AUROC: A+23%
AUPRCs: B+31%
recall: C+18%
precision: D+47%
20 precision: E+51%
Parameters :
MaxEpoch(10)
learningRate(3e-2)
learningRateDecay(3e-7)
uOutput(100)
mOutput(200)
batchSize(1.6 M)
BigDL WAD
Parameters :
MaxEpoch(10)
learningRate(1e-2)
learningRateDecay(1e-7)
uOutput(100)
mOutput(200)
batchSize(0.6 M)
AUROC: A+20% (3 % down)
AUPRCs: B+30% (1% down)
recall: C+12% (4 % down)
precision: D+49% (2 % up)
20 precision: E+54% (3% up)
Benchmark results ( > 100 rounds)
©2016Mastercard.ProprietaryandConfidential.
Beyond Deep Learning library , we
need more automated platform
capabilities to fit PROD adoption gaps
©2016Mastercard.ProprietaryandConfidential.
24
Incremental Tuning ( only re-run the
whole pipeline with incremental changed
datasets such as daily changed transactions and
benchmark the models )
 Refresh the dimensional datasets ( such
as adding new users , items …)
 Load the history model to the context
and update incremental parts of model
based on the incremental data sets
 Periodic Re-training with a batch
algorithm and time-series prediction
 Benchmark the history model and update
model and on-board the better ones.
…
Incremental
Fact
Incremental
Dimensional
History Model
Incremental Set
Ingest
Model
Fine Tuner
Lookups Refresher
Model Loader
Models
Benchmark
Ingest
Periodic Incremental Tuning
Incremental Fine Tuning &
Benchmark
Gap 1 : Incremental Tuning
©2016Mastercard.ProprietaryandConfidential.
25
Model Serving (Connect to existing business pipelines , offline ,streaming and real-time )
 Build the model serving capability by exporting model to scoring/prediction/recommendation
services and integration points
 Integrate the model serving services inside the business pipelines , such as embed them into
Spark jobs for offline, Spark Streaming jobs for streaming , the real-time “dialogue” with Kafka
messaging …
Gap 2 : Model Serving to multiple contexts
©2016Mastercard.ProprietaryandConfidential.
26
Gap 3 : Build user friendly high level pipeline APIs
High level pipeline APIs
 Abstract and purify high level data and learning pipeline APIs on top of BigDL lib to simply the
deep learning model assembly process and increase productivity
©2016Mastercard.ProprietaryandConfidential.
27
Gap 4 : Integrated with end to end data pipelines, fill in the loop
Embedded the deep learning process into existing enterprise data pipelines
 Build pre-defined templates and customized processors to bring deep learning process
into the existing enterprise data pipelines , including batch , streaming and real-time
©2016Mastercard.ProprietaryandConfidential.
28
Design and Implement pipelines at Visualized workbench
Pipelines Promotion
Biz. A
Biz. B
Biz. C
Biz. D
Biz. E
Biz. F
Pipeline Designer
AI Pipelines and Flows
Local Dev
Dev
Sandbox
Prod(s)
Stage
Configuration
Management
(Tag /
Branches)
Pipeline
Registry
Generate AI Pipelines
 Deployment sequences
Continuous
integration
(Parameter,
template)
Automate deployment with CI/CD pipelines
Gap 5 : AI Pipelines promotion with automated CI/CD deployment
©2016Mastercard.ProprietaryandConfidential.
Easier to build end-to-end analytics + AI applications
• Reference use cases
• Anomaly detection, sentiment analysis, fraud detection, chatbot, sequence prediction, etc.
• Predefined models
• Object detection, image classification, text classification, recommendations, GAN, etc.
• Feature engineering & transformations
• Image, text, speech, 3D imaging, time-series, etc.
• High level pipeline APIs
• Dataframes, ML Pipelines, autograd, transfer learning, Keras/Keras2, etc.
https://github.com/intel-analytics/analytics-zoo
Community improvements : Analytics Zoo -> Unified Analytics + AI Platform for Spark
and BigDL
©2016Mastercard.ProprietaryandConfidential.
Thanks
Q & A

Contenu connexe

Tendances

Understanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application QualityUnderstanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application QualityDevOps.com
 
Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...
Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...
Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...Denodo
 
The Data Platform for Today’s Intelligent Applications
The Data Platform for Today’s Intelligent ApplicationsThe Data Platform for Today’s Intelligent Applications
The Data Platform for Today’s Intelligent ApplicationsNeo4j
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
 
Data Centric Transformation in Telecom
Data Centric Transformation in TelecomData Centric Transformation in Telecom
Data Centric Transformation in TelecomDataWorks Summit
 
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Mihai Criveti
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Tristan Baker
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks DeltaDatabricks
 
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...HostedbyConfluent
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceDenodo
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyNeo4j
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Knowledge Graphs and Generative AI
Knowledge Graphs and Generative AIKnowledge Graphs and Generative AI
Knowledge Graphs and Generative AINeo4j
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of MetadataDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Building Modern Data Platform with Microsoft Azure
Building Modern Data Platform with Microsoft AzureBuilding Modern Data Platform with Microsoft Azure
Building Modern Data Platform with Microsoft AzureDmitry Anoshin
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise AnalyticsDATAVERSITY
 

Tendances (20)

Understanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application QualityUnderstanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application Quality
 
Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...
Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...
Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...
 
The Data Platform for Today’s Intelligent Applications
The Data Platform for Today’s Intelligent ApplicationsThe Data Platform for Today’s Intelligent Applications
The Data Platform for Today’s Intelligent Applications
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Data Centric Transformation in Telecom
Data Centric Transformation in TelecomData Centric Transformation in Telecom
Data Centric Transformation in Telecom
 
Data mesh
Data meshData mesh
Data mesh
 
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks Delta
 
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and Governance
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph Technology
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Knowledge Graphs and Generative AI
Knowledge Graphs and Generative AIKnowledge Graphs and Generative AI
Knowledge Graphs and Generative AI
 
Enterprise Knowledge Graph
Enterprise Knowledge GraphEnterprise Knowledge Graph
Enterprise Knowledge Graph
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of Metadata
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Building Modern Data Platform with Microsoft Azure
Building Modern Data Platform with Microsoft AzureBuilding Modern Data Platform with Microsoft Azure
Building Modern Data Platform with Microsoft Azure
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 

Similaire à AI as a Service, Build Shared AI Service Platforms Based on Deep Learning Technologies with Suqiang Song

Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?SnapLogic
 
C19013010 the tutorial to build shared ai services session 1
C19013010  the tutorial to build shared ai services session 1C19013010  the tutorial to build shared ai services session 1
C19013010 the tutorial to build shared ai services session 1Bill Liu
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsEnabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsStreamsets Inc.
 
Webinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDBWebinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDBMongoDB
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data productsVikas Sardana
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Denodo
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Platforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern EngineeringPlatforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern EngineeringDATAVERSITY
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduCloudera, Inc.
 
Productionizing Hadoop: 7 Architectural Best Practices
Productionizing Hadoop: 7 Architectural Best PracticesProductionizing Hadoop: 7 Architectural Best Practices
Productionizing Hadoop: 7 Architectural Best PracticesMapR Technologies
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
From Data to Services at the Speed of Business
From Data to Services at the Speed of BusinessFrom Data to Services at the Speed of Business
From Data to Services at the Speed of BusinessAli Hodroj
 
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demandsMongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demandsMongoDB
 
Achieve New Heights with Modern Analytics
Achieve New Heights with Modern AnalyticsAchieve New Heights with Modern Analytics
Achieve New Heights with Modern AnalyticsSense Corp
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...DATAVERSITY
 
Paris FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant PresentationParis FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant PresentationAbdelkrim Hadjidj
 
Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon Web Services
 
Veritas + MongoDB
Veritas + MongoDBVeritas + MongoDB
Veritas + MongoDBMongoDB
 

Similaire à AI as a Service, Build Shared AI Service Platforms Based on Deep Learning Technologies with Suqiang Song (20)

Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
 
C19013010 the tutorial to build shared ai services session 1
C19013010  the tutorial to build shared ai services session 1C19013010  the tutorial to build shared ai services session 1
C19013010 the tutorial to build shared ai services session 1
 
Smartscale Executive Summary
Smartscale Executive SummarySmartscale Executive Summary
Smartscale Executive Summary
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsEnabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
 
Webinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDBWebinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDB
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data products
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Platforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern EngineeringPlatforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern Engineering
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache Kudu
 
Productionizing Hadoop: 7 Architectural Best Practices
Productionizing Hadoop: 7 Architectural Best PracticesProductionizing Hadoop: 7 Architectural Best Practices
Productionizing Hadoop: 7 Architectural Best Practices
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
From Data to Services at the Speed of Business
From Data to Services at the Speed of BusinessFrom Data to Services at the Speed of Business
From Data to Services at the Speed of Business
 
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demandsMongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
 
Achieve New Heights with Modern Analytics
Achieve New Heights with Modern AnalyticsAchieve New Heights with Modern Analytics
Achieve New Heights with Modern Analytics
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
Paris FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant PresentationParis FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant Presentation
 
Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)
 
Analytics&IoT
Analytics&IoTAnalytics&IoT
Analytics&IoT
 
Veritas + MongoDB
Veritas + MongoDBVeritas + MongoDB
Veritas + MongoDB
 

Plus de Databricks

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDatabricks
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Databricks
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Databricks
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Databricks
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of HadoopDatabricks
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDatabricks
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceDatabricks
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringDatabricks
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixDatabricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationDatabricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchDatabricks
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesDatabricks
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesDatabricks
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsDatabricks
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkDatabricks
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkDatabricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesDatabricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkDatabricks
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeDatabricks
 
Machine Learning CI/CD for Email Attack Detection
Machine Learning CI/CD for Email Attack DetectionMachine Learning CI/CD for Email Attack Detection
Machine Learning CI/CD for Email Attack DetectionDatabricks
 

Plus de Databricks (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 
Machine Learning CI/CD for Email Attack Detection
Machine Learning CI/CD for Email Attack DetectionMachine Learning CI/CD for Email Attack Detection
Machine Learning CI/CD for Email Attack Detection
 

Dernier

Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 

Dernier (20)

Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 

AI as a Service, Build Shared AI Service Platforms Based on Deep Learning Technologies with Suqiang Song

  • 1. Suqiang Song, Director, Chapter Leader of Data Engineering & AI Mastercard AI as a Service Build Shared AI Service Platforms Based on Deep Learning Technologies #AI1SAIS
  • 2. Differentiation starts with consumer insights from a massive worldwide payments network and our experience in data cleansing, analytics and modeling Mastercard Big Data & AI Expertise WAREHOUSED • 10 petabytes • 5+ year historic global view • Rapid retrieval • Above-and-beyond privacy protection and security MULTI-SOURCED • 38MM+ merchant locations • 22,000 issuers CLEANSED, AGGREGATD, ANONYMOUS, AUGMENTED • 1.5MM automated rules • Continuously tested TRANSFORMED INTO ACTIONABLE INSIGHTS • Reports, indexes, benchmarks • Behavioral variables • Models, scores, forecasting • Econometrics What can 2.4 BILLION Global Cards and 56 BILLION Transactions/ Year mean to you? Mastercard Enhanced Artificial Intelligence Capability with the Acquisitions of Applied Predictive Technologies(2015) and Brighterion (2017)
  • 3. What is the AI as a Service ?
  • 4. ©2018Mastercard.ProprietaryandConfidential. AI Applications Machine learning frameworks • Machine learning frameworks: Provide stable and secure environments and consolidate integrated wrappers on top of variable technologies for regular machine learning works • Applications build silos from scratch Three modes of AI as a Services • Fully managed machine learning servic es use templates, pre-built models and drag-and-drop development tools to si mplify and expedite the process of usin g a machine learning framework • Applications share templates and pre- built models , assembly and infer them into pipelines or business context • Automation Services, tasks like explora tory data analysis, pre-processing of da ta, hyper-parameter tuning, model sele ction and putting models into producti on can be automated • “God's Return to God, Satan's Return to Satan , Math’s Return AI, Business’s Return Biz” Machine learning frameworks AI Applications Machine learning frameworks Fully managed machine learning services AI Applications On / Off Premise Advanced Infrastructure Fully managed machine learning services On / Off Premise Advanced Infrastructure On / Off Premise Advanced Infrastructure Automation Services
  • 5. ©2018Mastercard.ProprietaryandConfidential. 5 Time Cost Data Exploration & Harmonization Features Engineering 0,0 Regular Mode :Machine learning frameworks Evaluation & Benchmarking Model Deployment &Serving $100,000 6 weeks Modeling Example : Machine Learning Sandbox
  • 6. ©2018Mastercard.ProprietaryandConfidential. 6 Time Cost Features Engineering 0,0 Plus Mode : Fully managed machine learning services Evaluation & Benchmarking $50,000 2 weeks Model Deployment &Serving Modeling Data Exploration & Harmonization Example : Data Science Workbench
  • 7. ©2018Mastercard.ProprietaryandConfidential. 7 Time Cost Features Engineering 0,0 Premium Mode: Automation Services Evaluation & Benchmarking $10,000 2 days Model Deployment &ServingData Exploration & Harmonization Modeling Example : Amazon SageMaker ?
  • 8. ©2018Mastercard.ProprietaryandConfidential. 8 Feature engineering bottlenecks Pre-calculate hundreds or thousands Long Term Variables take lots of resources and times Model scalability limitations Trade-off between automation in parallel and scaling machine learning to ever larger datasets and ever more complicated models Model Serving to multiple contexts Gap to connect to existing business pipelines , offline ,streaming and real-time Heavily relies on human machine learning experts Relies on human to perform the most of tasks API Enablement and automate deployment Low productivity to create more models with low level raw APIs Isolated promotions and operation readness with automate deployment Less integration with end to end data pipelines, fill in the loop Gap to bring machine learning process into the existing enterprise data pipelines , including batch , streaming and real-time 1 2 3 4 5 6 Challenges to achieve Premium Automation AI Service Learning Automation Serving Automation
  • 10. ©2018Mastercard.ProprietaryandConfidential. 10 Bottlenecks  Need to pre-calculate hundreds or thousands Long Term Variables for each user, such as total spends /visits for merchants list, category list divided by week, months and years  The computation time for LTV features took > 70% of the data processing time for the whole lifecycle and occupied lots of resources which had huge impact to other critical workloads.  Miss the feature selection optimizations which could save the data engineering efforts a lot AUTH DETAIL from last weekLTV DATA from last week MERCHANT AGED LTV DATA GEO CATEGORY ITEM LEVEL DATA FILTERED TRANSACTIONS SUMMED BY USER AGED BY USERAGED LTV DATA LTV DATA FOR THIS WEEK Challenges with Traditional ML : Feature engineering bottlenecks
  • 11. ©2018Mastercard.ProprietaryandConfidential. 11 Improvements  When build model , only focus on few pre-defined sliding features and custom overlap features ( Users only need to identify the columns names from data source)  Remove most of the LTV pre-calculations works, saved hours time and lots of resources  Deep learning algorithm generates exponential growth of hidden embedding features ,do the internal features selections and optimization automatically when it does cross validation at training stage With Deep Learning : Remove lots of LTV workloads and simply the feature engineering
  • 12. ©2018Mastercard.ProprietaryandConfidential. 12 … Item 1 * Users Item 2* Users Item n* Users Feature Engineering Training 1 Training 2 Training n Model 1 Model 2 Model n Merge 2 2 2 3 3 3 4 1 Prebuilt correlation Model Merge all the prediction results Evaluation 1 Evaluation 2 Evaluation 3 Limitations  All the pipelines separated by items and generate one model for each item  Have to pre-calculate the correlation matrix between items  Lots of redundant duplications and computations at feature engineering ,training and testing process  Run items in parallel and occupied most of cluster resources when executed  Bad metrics for items with few transactions  It is very hard to scale more items , from hundreds to millions ? Challenges with Traditional ML : Model scalability
  • 13. ©2018Mastercard.ProprietaryandConfidential. 13 •NCF • Scenario:Neural Collaborative Filtering ,recommend products to customers (priority is to recommend to active users) according to customers’ past history activities. • https://www.comp.nus.edu.sg/~xia ngnan/papers/ncf.pdf •Wide & Deep learning • Scenario: jointly trained wide linear models and deep neural networks- --to combine the benefits of memorization and generalization for recommender systems. • https://pdfs.semanticscholar.org/aa 9d/39e938c84a867ddf2a8cabc575f fba27b721.pdf Linear 2 ReLU Linear 1 ReLU Concat CMul LookupTable (MF User) LookupTable (MLP User) LookupTable (MF Item) Linear 3 Sigmoid Select LookupTable (MLP Item) ConcatTable Conca SelectSelect Select User index User indexItem Index User Item Pair MLP MF Embedding Layers Item Index MLP User Embedding MLP Item EmbeddingMF Item EmbeddingMF User Embedding With Deep Learning : Scale models in deeper and wider without decreasing metrics
  • 14. ©2018Mastercard.ProprietaryandConfidential. 14 Relies on human to perform the following tasks: Select and construct appropriate features. Select an appropriate model family. Optimize model hyper parameters. Post process machine learning models. Critically analyze the results obtained. Challenges with Traditional ML : Heavily relies on human machine learning experts Training Data Sets Data Source Partitioning Model 2 Model 1 Model n Testing Data Sets Validation Data Sets Choose Best Model Validate Model Metrics
  • 15. ©2018Mastercard.ProprietaryandConfidential. 15 Improvements  Common neural network "tricks", including initialization, L2 and dropout regularization, Batch normalization, gradient checking  A variety of optimization algorithms, such as mini-batch gradient descent, Momentum, RMSprop and Adam  Provides optimization-as-a- service using an ensemble of optimization strategies, allowing practitioners to efficiently optimize models faster and cheaper than standard approaches. With Deep Learning : Gives more options for finding an optimally performing robust configuration
  • 16. Our Explore & Evaluation Journey
  • 17. ©2016Mastercard.ProprietaryandConfidential. Enterprise requirements for Deep Learning Seamless integration with Products Internal & External • Add deep learning capabilities to existing Analytic Applications and/or machine learning workflows rather than rebuild all of them Collocated with mass data storage • Analyze a large amount of data on the same Big Data clusters where the data are stored (HDFS, HBase, Hive, etc.) rather than move or duplicate data Shared infrastructure with Multi- tenant isolated resources • Leverage existing Big Data clusters and deep learning workloads should be managed and monitored with other workloads (ETL, data warehouse, traditional ML etc..) rather than run DL workloads standalone in separate clusters Data governance with restricted Processing • Follow data privacy, regulation and compliance ( such as PCI/PII compliance and GDPR rather than operate data in unsecured zones
  • 18. ©2016Mastercard.ProprietaryandConfidential. • Claimed that the GPU computing are better than CPU which requires new hardware infrastructure (very long timeline normally ) • Success requires many engineer-hours ( Impossible to Install a Tensor Flow Cluster at STAGE ...) • Low level APIs with steep learning curve ( Where is your PHD degree ? ) • Not well integrated with other enterprise tools and need data movements (couldn't leverage the existing ETL, data warehousing and other analytic relevant data pipelines, technologies and tool sets. And it is also a big challenge to make duplicate data pipelines and data copy to the capacity and performance.) • Tedious and fragile to distribute computations ( less monitoring ) • The concerns of Enterprise Maturity and InfoSec ( use GPU cluster with Tensor Flow from Google Cloud ) ………….. Maybe not your story , but we have .... Challenges and limitations to Production considering some “Super Stars”….
  • 19. ©2016Mastercard.ProprietaryandConfidential. Integrations with existing DL libraries • Deep Learning Pipelines (from Databricks) • Caffe (CaffeOnSpark) • Keras (Elephas) • mxnet • Paddle • TensorFlow (TensorFlow on Spark, TensorFrames) • CNTK (mmlspark) Implementations of DL on Spark • BigDL • DeepDist • DeepLearning4J • SparkCL • SparkNet What does Spark offer?
  • 20. ©2016Mastercard.ProprietaryandConfidential. Tensor Flow-on-Spark (or Caffe-on-Spark) uses Spark executors (tasks) to launch Tensor Flow/Caffe instances in the cluster; however, the distributed deep learning (e.g., training, tuning and prediction) are performed outside of Spark (across multiple Tensor Flow or Caffe instances). (1) As a results, Tensor Flow/Caffe still runs on specialized HW (such as GPU servers interconnected by InfiniBand), and the Open MP implementations in Tensor Flow/Caffe conflicts with the JVM threading in Spark (resulting in lower performance). (2) In addition, in this case Tensor Flow/Caffe can only interact the rest of the analytics pipelines in a very coarse-grained fashion (running as standalone jobs outside of the pipeline, and using HDFS files as job input and output). Programming interface Contributors commits BigDL Scala & Python 50 2221 TensorflowOnSpark Python 9 257 Databricks/tensor Python 9 185 Databricks/spark-deep- learning Python 8 51 StatisticscollectedonMar5th , 2018 Need more break down …..
  • 21. ©2016Mastercard.ProprietaryandConfidential. 21 Train Wide and Deep Model ( BigDL) features Models model candidatesampled partition Training Data … 10~12 Months Raw Txns + Negative samples Load Parquet Train Multiple Models Train AIS Model ( Mlib) sampled partition sampled partition Post Processing Simple Feature Engineering models models Spark ML Pipeline Stages Test Data Predictions Test Spark Data FramesParquet Files Pre-processing 1~2 Months Feature Selections Feature Selection Model Ensemble Inference SparkPipeline Neural Recommender Using BigDL NCF/ Wide And Deep Transformer Model Evaluation & Fine Tune Estimator Spark Mllib Train NCF Model ( BigDL) models … Benchmark User-Merchant User-Category User-Geo User-Merchant-Geo …. POC: Benchmark BigDL & Spark Mllib
  • 22. ©2016Mastercard.ProprietaryandConfidential. 22 AUROC: A AUPRCs: B recall: C precision: D 20 precision: E Mllib AIS Parameters : MaxIter(100) RegParam(0.01) Rank(200) Alpha(0.01) BigDL NCF AUROC: A+23% AUPRCs: B+31% recall: C+18% precision: D+47% 20 precision: E+51% Parameters : MaxEpoch(10) learningRate(3e-2) learningRateDecay(3e-7) uOutput(100) mOutput(200) batchSize(1.6 M) BigDL WAD Parameters : MaxEpoch(10) learningRate(1e-2) learningRateDecay(1e-7) uOutput(100) mOutput(200) batchSize(0.6 M) AUROC: A+20% (3 % down) AUPRCs: B+30% (1% down) recall: C+12% (4 % down) precision: D+49% (2 % up) 20 precision: E+54% (3% up) Benchmark results ( > 100 rounds)
  • 23. ©2016Mastercard.ProprietaryandConfidential. Beyond Deep Learning library , we need more automated platform capabilities to fit PROD adoption gaps
  • 24. ©2016Mastercard.ProprietaryandConfidential. 24 Incremental Tuning ( only re-run the whole pipeline with incremental changed datasets such as daily changed transactions and benchmark the models )  Refresh the dimensional datasets ( such as adding new users , items …)  Load the history model to the context and update incremental parts of model based on the incremental data sets  Periodic Re-training with a batch algorithm and time-series prediction  Benchmark the history model and update model and on-board the better ones. … Incremental Fact Incremental Dimensional History Model Incremental Set Ingest Model Fine Tuner Lookups Refresher Model Loader Models Benchmark Ingest Periodic Incremental Tuning Incremental Fine Tuning & Benchmark Gap 1 : Incremental Tuning
  • 25. ©2016Mastercard.ProprietaryandConfidential. 25 Model Serving (Connect to existing business pipelines , offline ,streaming and real-time )  Build the model serving capability by exporting model to scoring/prediction/recommendation services and integration points  Integrate the model serving services inside the business pipelines , such as embed them into Spark jobs for offline, Spark Streaming jobs for streaming , the real-time “dialogue” with Kafka messaging … Gap 2 : Model Serving to multiple contexts
  • 26. ©2016Mastercard.ProprietaryandConfidential. 26 Gap 3 : Build user friendly high level pipeline APIs High level pipeline APIs  Abstract and purify high level data and learning pipeline APIs on top of BigDL lib to simply the deep learning model assembly process and increase productivity
  • 27. ©2016Mastercard.ProprietaryandConfidential. 27 Gap 4 : Integrated with end to end data pipelines, fill in the loop Embedded the deep learning process into existing enterprise data pipelines  Build pre-defined templates and customized processors to bring deep learning process into the existing enterprise data pipelines , including batch , streaming and real-time
  • 28. ©2016Mastercard.ProprietaryandConfidential. 28 Design and Implement pipelines at Visualized workbench Pipelines Promotion Biz. A Biz. B Biz. C Biz. D Biz. E Biz. F Pipeline Designer AI Pipelines and Flows Local Dev Dev Sandbox Prod(s) Stage Configuration Management (Tag / Branches) Pipeline Registry Generate AI Pipelines  Deployment sequences Continuous integration (Parameter, template) Automate deployment with CI/CD pipelines Gap 5 : AI Pipelines promotion with automated CI/CD deployment
  • 29. ©2016Mastercard.ProprietaryandConfidential. Easier to build end-to-end analytics + AI applications • Reference use cases • Anomaly detection, sentiment analysis, fraud detection, chatbot, sequence prediction, etc. • Predefined models • Object detection, image classification, text classification, recommendations, GAN, etc. • Feature engineering & transformations • Image, text, speech, 3D imaging, time-series, etc. • High level pipeline APIs • Dataframes, ML Pipelines, autograd, transfer learning, Keras/Keras2, etc. https://github.com/intel-analytics/analytics-zoo Community improvements : Analytics Zoo -> Unified Analytics + AI Platform for Spark and BigDL