EUROPEAN BIG DATA VALUE FORUM 2020
Session 1. The current landscape of Big Data benchmarks
Panelists
• Axel Ngonga: University of Paderborn, Lead of BDVA TF6 Benchmarking group
• Wanling Gao: Assistant professor, Institute of Computing Technology, Chinese Academy of Sciences
• Rekha Singhal: Senior scientist and Head of the Computing Systems-Software Research area in TCS
• Arne Berre: Chief Scientist, SINTEF
• Todor Ivanov: Senior consultant, LeadConsult
10:15-11:15 Session 1.
The current landscape of Big Data and AI benchmarks
• 10:15-10:25: DataBench Framework for Benchmarks, Arne J. Berre, SINTEF
• 10:25-10:40: Benchmarking platforms and AI, Axel Ngonga, BDVA TF6 Benchmark
Lead, University of Paderborn
• 10:40-10:55: BenchCouncil Big Data and AI Benchmarks, Wanling Gao, Chinese
Academy of Sciences
• 10:55-11:10: MLPerf AI and ABench, Rekha Singhal, Senior Scientist, TCS, India
• 11:10-11:15: Conclusion on Big Data and AI Benchmarks, Todor Ivanov, LeadConsult
BDV – Big Data and Analytics/Machine Learning Reference Model
[Figure: <Data sources> feeding a generic data pipeline, with the model steps Setup Model, Train Model, Verify Model, Validate Model, Exercise Model]
Top-level Generic Big Data and AI Pipeline pattern
(For all benchmarks and project pipelines to be related to)
• Data Acquisition/Collection (including data ingestion processing, streaming, data extraction, ingestion storage, different data types)
• Data Storage/Preparation (including storage retrieval/access/queries, data protection, curation, integration, publication)
• Data Analytics/ML (including data processing for analysis, AI and machine learning)
• Data Visualisation, Action/Interaction (including data presentation environment/boundary/user action and interaction)
[Matrix: benchmarks by year (1999, 2002, 2004, 2007-2014) against the pipeline steps PI-DI, PI-DS, PI-DA and PI-DV; per-benchmark marks not reproduced here]
Benchmarks: TPC-H, TPC-DS v1, LinearRoad, Hadoop Workload Examples, GridMix, PigMix, MRBench, CALDA, HiBench, Liquid, YCSB, SWIM, CloudRank-D, PUMA Benchmark Suite, CloudSuite, MRBS, AMPLab Big Data Benchmark, BigBench, BigDataBench, LinkBench, BigFrame, PRIMEBALL, Semantic Publishing Benchmark (SPB), Social Network Benchmark, ALOJA, gMark, Convnet, WatDiv, StreamBench, TPCx-HS
[Matrix: benchmarks by year (2015-2019) against the pipeline steps PI-DI, PI-DS, PI-DA and PI-DV; per-benchmark marks not reproduced here]
Benchmarks: SparkBench, TPCx-V, IoTAbench, BigFUN, TPC-DS v2, TPCx-BB, CityBench, Graphalytics, Yahoo Streaming Benchmark (YSB), ShenZhen Transportation System (SZTS), DeepBench, DeepMark, TensorFlow Benchmarks, Fathom, AdBench, RIoTBench, Hobbit Benchmark, TPCx-HS v2, BigBench V2, Sanzu, AIM Benchmark, GARDENIA, Penn Machine Learning Benchmark (PMLB), OpenML benchmark suites, Benchip, Deep Learning Benchmarking Suite (DLBS), TPCx-IoT, Senska, DAWNBench, BlockBench, IDEBench, ABench, StreamWatDiv, TERMinator Suite, HERMIT, MLBench Services, MLBench Distributed, MLPerf, Training Benchmark for DNNs (TBD), PolyBench, NNBench-X, GDPRbench, BenchIoT, IoTBench, VisualRoad, AdaBench, MiDBench, CBench-Dynamo, Edge AIBench, AIBench, HPC AI500, SparkAIBench, AIMatrix
Mapping between the Pipeline Steps and the Benchmark Ecosystem
→ matrix available in the DataBench ToolBox
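A mapping of this kind can be held as a small lookup structure. The sketch below is only an illustration and is not taken from the DataBench ToolBox: the pairing of the row codes (PI-DI, PI-DS, PI-DA, PI-DV) with the four pipeline step names is an assumption, and the benchmark names and their marks are hypothetical placeholders rather than entries from the real matrix.

```python
from enum import Enum


class PipelineStep(Enum):
    # Assumption: each row code is paired with one of the four pipeline steps.
    PI_DI = "Data Acquisition/Collection"
    PI_DS = "Data Storage/Preparation"
    PI_DA = "Data Analytics/ML"
    PI_DV = "Data Visualisation, Action/Interaction"


# Hypothetical placeholder benchmarks; the marks are NOT from the real matrix.
COVERAGE = {
    "ExampleQueryBench": {PipelineStep.PI_DI, PipelineStep.PI_DS},
    "ExampleStreamBench": {PipelineStep.PI_DI, PipelineStep.PI_DA},
    "ExampleMLBench": {PipelineStep.PI_DA},
}


def benchmarks_for(step, coverage=COVERAGE):
    """Return the benchmarks marked for a given pipeline step, sorted by name."""
    return sorted(name for name, steps in coverage.items() if step in steps)


def uncovered_steps(coverage=COVERAGE):
    """Return the pipeline steps that no benchmark in the mapping covers."""
    covered = set().union(*coverage.values()) if coverage else set()
    return [step for step in PipelineStep if step not in covered]
```

Calling `benchmarks_for(PipelineStep.PI_DA)` lists the benchmarks for the analytics step, and `uncovered_steps()` points at steps that still lack a benchmark.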
Framework for Benchmarking ML/DL Workloads
Rekha Singhal
Senior Scientist & Head Computing Systems, Tata Consultancy Services
Rekha.Singhal@tcs.com
ML/DL Pipeline
• Data Exploration: IDE benchmarks
• Data Preprocessing: ADABench
• Data Extraction (SQL): BigBench, Polystore Benchmarks
• Feature generation
• Model Building/Tuning: MLPerf
• ML pipeline Deployment Architecture (ABench)
New challenges for ML/DL workloads
• Heterogeneous hardware: CPU, GPU, TPU, ASIC
• Many frameworks: PyTorch, TensorFlow
• Heterogeneous big data technology in the pipeline
• Many kinds of NN models: RNN, CNN, LSTM, RL, …
iPrescribe: Recommendation Benchmark
[Figure: iPrescribe pipeline from Data Sources to MODEL]
• Data Sources and Data Lake: data, e.g. users' interaction datasets
• Domain Expert, Domain Meta Model, Data to Meta Model Mapping
• Data Preprocessing: aggregation, windowing average, min/max, hash map, one-hot/matrix encoding, etc.
• Feature Store
• ML/DL Model Building: ML (KNN, XGBoost, etc.), DL (LSTM, GNN, etc.)
• Model Post Processing, MODEL
• Technologies shown: SparkScala, SparkPython, Ignite; MongoDB, Ignite; Python (training), Tornado (inference)
Ref: Fast Online 'Next Best Offers' using Deep Learning, ACM COMAD 2019
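The preprocessing operations named for iPrescribe (aggregation, windowing average, min/max, one-hot encoding) can be illustrated in a few lines. This is a plain-Python sketch for readability only, not the iPrescribe implementation, which the slide places on Spark and Ignite; the function names and the `(user_id, value)` event shape are assumptions.

```python
from collections import deque


def aggregate_by_user(events):
    """Aggregate (user_id, value) events into count, mean, min and max per user."""
    stats = {}
    for user, value in events:
        s = stats.setdefault(user, {"count": 0, "sum": 0.0, "min": value, "max": value})
        s["count"] += 1
        s["sum"] += value
        s["min"] = min(s["min"], value)
        s["max"] = max(s["max"], value)
    return {
        user: {"count": s["count"], "mean": s["sum"] / s["count"],
               "min": s["min"], "max": s["max"]}
        for user, s in stats.items()
    }


def windowed_average(values, window):
    """Trailing moving average over the last `window` values."""
    if window <= 0:
        raise ValueError("window must be positive")
    buf = deque(maxlen=window)  # old values fall out automatically
    averages = []
    for value in values:
        buf.append(value)
        averages.append(sum(buf) / len(buf))
    return averages


def one_hot(categories):
    """One-hot encode a list of labels; columns follow sorted label order."""
    index = {label: i for i, label in enumerate(sorted(set(categories)))}
    return [[1 if index[label] == j else 0 for j in range(len(index))]
            for label in categories]
```

A feature vector for one user could then combine the per-user aggregates with the encoded categorical fields before it is written to the feature store.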
ABench
ABench: Big Data Architecture Stack Benchmark *
Todor Ivanov and Rekha Singhal
* Published in Proceedings of the 9th ACM/SPEC International Conference on Performance
Engineering (ICPE 2018), April 9-13, Berlin, Germany, 2018
Motivation
• Growing number of new Big Data technologies and connectors in the Big Data Stacks
▪ Challenges for Solution Architects, Data Engineers, Data Scientists, Developers, etc.
• Missing benchmarks for each technology, connector or a combination of them
• Consequence: Increasing complexity in the Big Data Architecture Stacks
[Figure: ABench architecture]
• DRIVER
• Benchmark Repository: BigDataBench; BigBench V2 (WG: Q1…Q30, Q1’..Q5’; DG: (Un, Semi)Structured, Stream); StreamBenchmark; YCSB B’
• Benchmark Synthesizer
• System/Architecture Under Benchmark: Data, layers L1, L2, … LN
• Workload; Benchmark Validation (Functional, Performance); Results
• Configuration Catalog:
  ▪ Workload Configuration File (use case; concurrency: number of users/sec; stream velocity & window)
  ▪ Architecture Configuration File (technology, parameters, data flow)
  ▪ Data Generator Configuration File (data model, types, size)
  ▪ Benchmark Metric Configuration File (throughput, latency, points of measurement, utils)
  ▪ Deployment Configuration File (cluster or cloud details: IP, ports, cores, RAM)
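The five configuration files in the catalog can be pictured as typed records that are checked before a run. The sketch below is a hypothetical illustration, not ABench's actual file format: every class name, field name and validation rule is an assumption derived from the labels on the slide.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class WorkloadConfig:
    use_case: str
    users_per_sec: int        # concurrency
    stream_velocity: int      # events per second (assumed unit)
    window_sec: int           # stream window length (assumed unit)


@dataclass
class ArchitectureConfig:
    technologies: List[str]
    parameters: Dict[str, str] = field(default_factory=dict)
    data_flow: List[str] = field(default_factory=list)  # ordered layers, e.g. L1, L2


@dataclass
class DataGeneratorConfig:
    data_model: str
    data_types: List[str]
    size_gb: float


@dataclass
class MetricConfig:
    metrics: List[str]              # e.g. throughput, latency
    measurement_points: List[str]   # layers at which to measure


@dataclass
class DeploymentConfig:
    hosts: Dict[str, int]  # IP address -> port
    cores: int
    ram_gb: int


@dataclass
class BenchmarkRun:
    workload: WorkloadConfig
    architecture: ArchitectureConfig
    data_generator: DataGeneratorConfig
    metric: MetricConfig
    deployment: DeploymentConfig

    def validate(self) -> List[str]:
        """Return a list of human-readable problems; empty means consistent."""
        problems = []
        for point in self.metric.measurement_points:
            if point not in self.architecture.data_flow:
                problems.append(
                    f"measurement point {point!r} is not a layer in the data flow")
        if self.workload.users_per_sec <= 0:
            problems.append("concurrency must be positive")
        if self.data_generator.size_gb <= 0:
            problems.append("data size must be positive")
        return problems
```

Keeping the five concerns in separate records mirrors the catalog's separation, so that, for example, the same workload could be replayed against a different architecture or deployment.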
MLPerf
https://mlperf.org/
MLPerf Inference Benchmark - Vijay J. Reddi et al. http://arxiv.org/abs/1911.02549
MLPerf Training Benchmark - Peter Mattson et al. http://arxiv.org/abs/1910.01500
MLPerf Goals
● Accelerate progress in ML via fair and useful measurement
● Serve both the commercial and research communities
● Enable fair comparison of competing systems yet encourage innovation to improve the state-of-the-art
of ML
● Enforce replicability to ensure reliable results
● Keep benchmarking effort affordable (so all can play)
Benchmarks Considered for MLPerf
Area: Vision; Language; Audio; Commerce; Action / RL; Other
Problem: Image Classification, Object Detection / Segmentation, Face ID, HealthCare (Radiology), Video Detection, Self-Driving, Translation, Language Model, Word Embedding, Speech Recognition, Text-to-Speech, Question Answering, Keyword Spotting, Language Modeling, Chatbots, Speaker ID, Graph embeddings, Content ID, Rating, Recommendations, Sentiment Analysis, Next-action, Healthcare (EHR), Fraud detection, Anomaly detection, Time series prediction, Large scale regression, Games, Go, Robotics, Health Care, Bioinformatics, GANs, 3D point clouds, Word embeddings
Datasets: ImageNet, COCO, WMT English-German, LibriSpeech, SQuAD, LM-Benchmark, MovieLens-20M, Amazon, IMDB, Atari, Go, Chess, Grasping
Models: ResNet-50, TF Object Detection, Detectron, Transformer, OpenNMT, Deep Speech 2, SQuAD Explorer, Neural Collaborative Filtering, CNNs, DQN, PPO
Metrics: COCO mAP, Prediction accuracy, BLEU, WER, Perplexity, Prediction accuracy, Prediction accuracy, Win/Loss
MLPerf metric: Training time to reach quality target
+ cost or power
● Quality target is specific for each benchmark and close to state-of-the-art
○ Updated with each release to keep up with the state-of-the-art
● Time includes preprocessing and validation; reported as the median over 5 runs
● Available: reference implementations that achieve quality target
In addition, either:
● Cost of public cloud resources (no spot/preemptible instances)
● Power utilization for on-premise hardware
Important for benchmark to capture
both performance and quality
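The time-to-quality idea can be sketched as a small measurement loop. This is a minimal illustration under stated assumptions, not the MLPerf reference harness: `train_one_epoch` and `evaluate` are hypothetical callables supplied by the system under test, and the median over 5 runs follows the wording of the slide, so the official MLPerf rules may aggregate runs differently.

```python
import statistics
import time


def time_to_quality(train_one_epoch, evaluate, target, max_epochs,
                    clock=time.perf_counter):
    """Train until `evaluate()` reaches `target`; return elapsed seconds.

    Returns None if the target is not reached within `max_epochs`.
    The timed region covers both training and evaluation.
    """
    start = clock()
    for _ in range(max_epochs):
        train_one_epoch()
        if evaluate() >= target:
            return clock() - start
    return None


def median_time_to_quality(run_once, runs=5):
    """Call `run_once()` `runs` times and return the median elapsed time.

    `run_once` is a hypothetical callable that performs one full
    time_to_quality measurement from a fresh model.
    """
    times = [run_once() for _ in range(runs)]
    if any(t is None for t in times):
        raise RuntimeError("at least one run did not reach the quality target")
    return statistics.median(times)
```

Reporting a time only for runs that reach the target is what ties performance to quality: a faster system that stops short of the target gets no score.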
Conclusions
[Figure labels: Recommendation – ML/DL; Recommendation – RL; Recommendation – RL (distributed training); Supply Chain – RL; Supply Chain – RL (distributed training)]
Thank You for your attention!
Any questions?
Conclusion on Big Data
and AI Benchmarks
Todor Ivanov (LeadConsult)
[Figure: the BDVA Reference Model areas set against the DataBench Pipeline Methodology, relating the Big Data and AI Landscape, the Big Data and AI Benchmark Ecosystem, and the ICT Big Data PPP projects]
DataBench Pipeline Methodology: Data Acquisition/Collection; Data Storage/Preparation; Data Analytics/ML; Data Visualisation, Action/Interaction
BDVA Reference Model: Streaming/Realtime Processing; Interactive Processing; Batch Processing; Data Privacy/Security; Data Governance/Mgmt; Data Storage; Industrial Analytics (Descriptive, Diagnostic, Predictive, Prescriptive); Machine Learning, AI, Data Science; Visual Analytics
ICT Big Data PPP ICT-13-14 Projects
▪ I-BiDaaS
▪ TheyBuyForYou (TBFY)
▪ Track&Know
▪ DataBio
▪ DeepHealth
BenchCouncil Benchmarks
[Figure: benchmarks placed against the four steps of the DataBench Pipeline Methodology (Data Acquisition/Collection; Data Storage/Preparation; Data Analytics/ML; Data Visualisation, Action/Interaction). The number in parentheses is how many times each label appears on the slide.]
• BenchCouncil: AIBench (4), BigDataBench (4), HPC AI500 (3), Edge AIBench (2), AIoTBench (2)
• MLPerf (2), ABench (3)
• Hobbit Benchmark Platform (4)
• Linked Data Benchmark Council (LDBC): Graphalytics (2), Semantic Publishing Benchmark (SPB) (3)
• TPC-H (2), HiBench (3)
• Project labels shown: TheyBuyForYou (TBFY), I-BiDaaS
Mapping between the DataBench Pipeline Steps and the Benchmark Ecosystem
→ matrix available in the DataBench ToolBox
This project has received funding from the European Horizon 2020 Programme for research,
technological development and demonstration under grant agreement n° 780966
Contacts
info@databench.eu
@DataBench_eu
DataBench
www.databench.eu
DataBench Project

Contenu connexe

Tendances

ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
 ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens... ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...Databricks
 
Real time applications using the R Language
Real time applications using the R LanguageReal time applications using the R Language
Real time applications using the R LanguageLou Bajuk
 
SECURE & EFFICIENT AUDIT SERVICE OUTSOURCING FOR DATA INTEGRITY IN CLOUDS
SECURE & EFFICIENT AUDIT SERVICE OUTSOURCING FOR DATA INTEGRITY IN CLOUDSSECURE & EFFICIENT AUDIT SERVICE OUTSOURCING FOR DATA INTEGRITY IN CLOUDS
SECURE & EFFICIENT AUDIT SERVICE OUTSOURCING FOR DATA INTEGRITY IN CLOUDSGyan Prakash
 
Using Graph Algorithms for Advanced Analytics - Part 5 Classification
Using Graph Algorithms for Advanced Analytics - Part 5 ClassificationUsing Graph Algorithms for Advanced Analytics - Part 5 Classification
Using Graph Algorithms for Advanced Analytics - Part 5 ClassificationTigerGraph
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...Bill Liu
 
Future is private intel dev fest
Future is private   intel dev festFuture is private   intel dev fest
Future is private intel dev festgeetachauhan
 
Decentralized AI Draper
Decentralized AI   DraperDecentralized AI   Draper
Decentralized AI Drapergeetachauhan
 
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world ApplicationsHighly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world ApplicationsBill Liu
 
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016MLconf
 
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AIGraph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AITigerGraph
 
Graph Gurus Episode 26: Using Graph Algorithms for Advanced Analytics Part 1
Graph Gurus Episode 26: Using Graph Algorithms for Advanced Analytics Part 1Graph Gurus Episode 26: Using Graph Algorithms for Advanced Analytics Part 1
Graph Gurus Episode 26: Using Graph Algorithms for Advanced Analytics Part 1TigerGraph
 
TensorFlow 16: Building a Data Science Platform
TensorFlow 16: Building a Data Science Platform TensorFlow 16: Building a Data Science Platform
TensorFlow 16: Building a Data Science Platform Seldon
 

Tendances (13)

Monitoring AI with AI
Monitoring AI with AIMonitoring AI with AI
Monitoring AI with AI
 
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
 ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens... ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
 
Real time applications using the R Language
Real time applications using the R LanguageReal time applications using the R Language
Real time applications using the R Language
 
SECURE & EFFICIENT AUDIT SERVICE OUTSOURCING FOR DATA INTEGRITY IN CLOUDS
SECURE & EFFICIENT AUDIT SERVICE OUTSOURCING FOR DATA INTEGRITY IN CLOUDSSECURE & EFFICIENT AUDIT SERVICE OUTSOURCING FOR DATA INTEGRITY IN CLOUDS
SECURE & EFFICIENT AUDIT SERVICE OUTSOURCING FOR DATA INTEGRITY IN CLOUDS
 
Using Graph Algorithms for Advanced Analytics - Part 5 Classification
Using Graph Algorithms for Advanced Analytics - Part 5 ClassificationUsing Graph Algorithms for Advanced Analytics - Part 5 Classification
Using Graph Algorithms for Advanced Analytics - Part 5 Classification
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
 
Future is private intel dev fest
Future is private   intel dev festFuture is private   intel dev fest
Future is private intel dev fest
 
Decentralized AI Draper
Decentralized AI   DraperDecentralized AI   Draper
Decentralized AI Draper
 
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world ApplicationsHighly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
 
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
 
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AIGraph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
 
Graph Gurus Episode 26: Using Graph Algorithms for Advanced Analytics Part 1
Graph Gurus Episode 26: Using Graph Algorithms for Advanced Analytics Part 1Graph Gurus Episode 26: Using Graph Algorithms for Advanced Analytics Part 1
Graph Gurus Episode 26: Using Graph Algorithms for Advanced Analytics Part 1
 
TensorFlow 16: Building a Data Science Platform
TensorFlow 16: Building a Data Science Platform TensorFlow 16: Building a Data Science Platform
TensorFlow 16: Building a Data Science Platform
 

Similaire à Session 1 - The Current Landscape of Big Data Benchmarks

Technology and AI sharing - From 2016 to Y2017 and Beyond
Technology and AI sharing - From 2016 to Y2017 and BeyondTechnology and AI sharing - From 2016 to Y2017 and Beyond
Technology and AI sharing - From 2016 to Y2017 and BeyondJames Huang
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 editionDavid Talby
 
Deploying Data Science Engines to Production
Deploying Data Science Engines to ProductionDeploying Data Science Engines to Production
Deploying Data Science Engines to ProductionMostafa Majidpour
 
Saving Human Lives with the IoT
Saving Human Lives with the IoTSaving Human Lives with the IoT
Saving Human Lives with the IoTDat Tran
 
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & KubeflowMLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & KubeflowJan Kirenz
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooJason Dai
 
Introduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI PlatformIntroduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI PlatformIndrajit Poddar
 
Facebook ML Infrastructure - 2018 slides
Facebook ML Infrastructure - 2018 slidesFacebook ML Infrastructure - 2018 slides
Facebook ML Infrastructure - 2018 slidesKarthik Murugesan
 
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning Ian Gomez
 
Overview Of Parallel Development - Ericnel
Overview Of Parallel Development -  EricnelOverview Of Parallel Development -  Ericnel
Overview Of Parallel Development - Ericnelukdpe
 
Automatic generation of hardware memory architectures for HPC
Automatic generation of hardware memory architectures for HPCAutomatic generation of hardware memory architectures for HPC
Automatic generation of hardware memory architectures for HPCFacultad de Informática UCM
 
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...Codemotion Tel Aviv
 
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習 Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習 Herman Wu
 
Data Agility—A Journey to Advanced Analytics and Machine Learning at Scale
Data Agility—A Journey to Advanced Analytics and Machine Learning at ScaleData Agility—A Journey to Advanced Analytics and Machine Learning at Scale
Data Agility—A Journey to Advanced Analytics and Machine Learning at ScaleDatabricks
 
Big Data for Testing - Heading for Post Process and Analytics
Big Data for Testing - Heading for Post Process and AnalyticsBig Data for Testing - Heading for Post Process and Analytics
Big Data for Testing - Heading for Post Process and AnalyticsOPNFV
 
Monitoring of GPU Usage with Tensorflow Models Using Prometheus
Monitoring of GPU Usage with Tensorflow Models Using PrometheusMonitoring of GPU Usage with Tensorflow Models Using Prometheus
Monitoring of GPU Usage with Tensorflow Models Using PrometheusDatabricks
 
Austin,TX Meetup presentation tensorflow final oct 26 2017
Austin,TX Meetup presentation tensorflow final oct 26 2017Austin,TX Meetup presentation tensorflow final oct 26 2017
Austin,TX Meetup presentation tensorflow final oct 26 2017Clarisse Hedglin
 
CPaaS.io Y1 Review Meeting - Use Cases
CPaaS.io Y1 Review Meeting - Use CasesCPaaS.io Y1 Review Meeting - Use Cases
CPaaS.io Y1 Review Meeting - Use CasesStephan Haller
 

Similaire à Session 1 - The Current Landscape of Big Data Benchmarks (20)

Technology and AI sharing - From 2016 to Y2017 and Beyond
Technology and AI sharing - From 2016 to Y2017 and BeyondTechnology and AI sharing - From 2016 to Y2017 and Beyond
Technology and AI sharing - From 2016 to Y2017 and Beyond
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 edition
 
Deploying Data Science Engines to Production
Deploying Data Science Engines to ProductionDeploying Data Science Engines to Production
Deploying Data Science Engines to Production
 
Saving Human Lives with the IoT
Saving Human Lives with the IoTSaving Human Lives with the IoT
Saving Human Lives with the IoT
 
Build 2019 Recap
Build 2019 RecapBuild 2019 Recap
Build 2019 Recap
 
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & KubeflowMLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics Zoo
 
Introduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI PlatformIntroduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI Platform
 
Facebook ML Infrastructure - 2018 slides
Facebook ML Infrastructure - 2018 slidesFacebook ML Infrastructure - 2018 slides
Facebook ML Infrastructure - 2018 slides
 
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
 
Overview Of Parallel Development - Ericnel
Overview Of Parallel Development -  EricnelOverview Of Parallel Development -  Ericnel
Overview Of Parallel Development - Ericnel
 
Automatic generation of hardware memory architectures for HPC
Automatic generation of hardware memory architectures for HPCAutomatic generation of hardware memory architectures for HPC
Automatic generation of hardware memory architectures for HPC
 
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
 
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習 Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
 
Data Agility—A Journey to Advanced Analytics and Machine Learning at Scale
Data Agility—A Journey to Advanced Analytics and Machine Learning at ScaleData Agility—A Journey to Advanced Analytics and Machine Learning at Scale
Data Agility—A Journey to Advanced Analytics and Machine Learning at Scale
 
Big Data for Testing - Heading for Post Process and Analytics
Big Data for Testing - Heading for Post Process and AnalyticsBig Data for Testing - Heading for Post Process and Analytics
Big Data for Testing - Heading for Post Process and Analytics
 
Monitoring of GPU Usage with Tensorflow Models Using Prometheus
Monitoring of GPU Usage with Tensorflow Models Using PrometheusMonitoring of GPU Usage with Tensorflow Models Using Prometheus
Monitoring of GPU Usage with Tensorflow Models Using Prometheus
 
Decision trees in hadoop
Decision trees in hadoopDecision trees in hadoop
Decision trees in hadoop
 
Austin,TX Meetup presentation tensorflow final oct 26 2017
Austin,TX Meetup presentation tensorflow final oct 26 2017Austin,TX Meetup presentation tensorflow final oct 26 2017
Austin,TX Meetup presentation tensorflow final oct 26 2017
 
CPaaS.io Y1 Review Meeting - Use Cases
CPaaS.io Y1 Review Meeting - Use CasesCPaaS.io Y1 Review Meeting - Use Cases
CPaaS.io Y1 Review Meeting - Use Cases
 

Plus de DataBench

Welcome to DataBench
Welcome to DataBenchWelcome to DataBench
Welcome to DataBenchDataBench
 
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...DataBench
 
Session 3 - The DataBench Framework: A compelling offering to measure the Imp...
Session 3 - The DataBench Framework: A compelling offering to measure the Imp...Session 3 - The DataBench Framework: A compelling offering to measure the Imp...
Session 3 - The DataBench Framework: A compelling offering to measure the Imp...DataBench
 
Session 4 - A practical journey on how to use the DataBench Toolbox
Session 4 - A practical journey on how to use the DataBench ToolboxSession 4 - A practical journey on how to use the DataBench Toolbox
Session 4 - A practical journey on how to use the DataBench ToolboxDataBench
 
CoreBigBench: Benchmarking Big Data Core Operations
CoreBigBench: Benchmarking Big Data Core OperationsCoreBigBench: Benchmarking Big Data Core Operations
CoreBigBench: Benchmarking Big Data Core OperationsDataBench
 
DataBench Toolbox in a Nutshell
DataBench Toolbox in a NutshellDataBench Toolbox in a Nutshell
DataBench Toolbox in a NutshellDataBench
 
Success Stories on Big Data & Analytics
Success Stories on Big Data & AnalyticsSuccess Stories on Big Data & Analytics
Success Stories on Big Data & AnalyticsDataBench
 
DataBench Virtual BenchLearning "Success storie on Big Data & Analytics use c...
DataBench Virtual BenchLearning "Success storie on Big Data & Analytics use c...DataBench Virtual BenchLearning "Success storie on Big Data & Analytics use c...
DataBench Virtual BenchLearning "Success storie on Big Data & Analytics use c...DataBench
 
DataBench Virtual BenchLearning "Big Data - Benchmark your way to Excellent B...
DataBench Virtual BenchLearning "Big Data - Benchmark your way to Excellent B...DataBench Virtual BenchLearning "Big Data - Benchmark your way to Excellent B...
DataBench Virtual BenchLearning "Big Data - Benchmark your way to Excellent B...DataBench
 
DataBench Virtual BenchLearning "Big Data - Benchmark your way to Excellent B...
DataBench Virtual BenchLearning "Big Data - Benchmark your way to Excellent B...DataBench Virtual BenchLearning "Big Data - Benchmark your way to Excellent B...
DataBench Virtual BenchLearning "Big Data - Benchmark your way to Excellent B...DataBench
 
Building the DataBench Workflow and Architecture, Todor Ivanov, Bench 2019 - ...



Session 1 - The Current Landscape of Big Data Benchmarks

  • 1. EUROPEAN BIG DATA VALUE FORUM 2020 Session 1. The current landscape of Big Data benchmarks
  • 2. Panelists
      ▪ Axel Ngonga, University of Paderborn, Lead of BDVA TF6 Benchmarking group
      ▪ Wanling Gao, Assistant Professor, Institute of Computing Technology, Chinese Academy of Sciences
      ▪ Rekha Singhal, Senior Scientist and Head of the Computing Systems-Software Research area at TCS
      ▪ Arne Berre, Chief Scientist, SINTEF
      ▪ Todor Ivanov, Senior Consultant, LeadConsult
  • 3. 10:15-11:15 Session 1. The current landscape of Big Data and AI benchmarks
      ▪ 10:15-10:25: DataBench Framework for Benchmarks, Arne J. Berre, SINTEF
      ▪ 10:25-10:40: Benchmarking platforms and AI, Axel Ngonga, BDVA TF6 Benchmark Lead, University of Paderborn
      ▪ 10:40-10:55: BenchCouncil Big Data and AI Benchmarks, Wanling Gao, Chinese Academy of Sciences
      ▪ 10:55-11:10: MLPerf AI and ABench, Rekha Singhal, Senior Scientist, TCS, India
      ▪ 11:10-11:15: Conclusion on Big Data and AI Benchmarks, Todor Ivanov, LeadConsult
  • 4. BDV – Big Data and Analytics/Machine Learning Reference Model <Data sources> Generic Data pipeline
  • 5. <Data sources> Generic Data pipeline; Setup Model, Train Model, Verify Model, Validate Model, Exercise Model
  • 6. Top level Generic Big Data and AI Pipeline pattern (for all Benchmarks and Project pipelines to be related to)
      ▪ Data Acquisition/Collection (including Data Ingestion processing, Streaming, Data Extraction, Ingestion Storage, Different data types)
      ▪ Data Storage/Preparation (including Storage Retrieval/Access/Queries, Data Protection, Curation, Integration, Publication)
      ▪ Data Analytics/ML (including data processing for analysis, AI and Machine Learning)
      ▪ Data Visualisation, Action/Interaction (including data presentation environment/boundary/user action and interaction)
  • 7.
  • 8.
  • 9.
  • 10. Mapping between the Pipeline Steps and the Benchmark Ecosystem → matrix available in the DataBench ToolBox [Figure: matrix of benchmarks, ordered by year, against the pipeline steps PI-DV, PI-DA, PI-DS, PI-DI]
      ▪ Benchmarks 1999-2014: TPC-H, TPC-DS v1, Linear Road, Hadoop Workload Examples, GridMix, PigMix, MRBench, CALDA, HiBench, Liquid, YCSB, SWIM, CloudRank-D, PUMA Benchmark Suite, CloudSuite, MRBS, AMPLab Big Data Benchmark, BigBench, BigDataBench, LinkBench, BigFrame, PRIMEBALL, Semantic Publishing Benchmark (SPB), Social Network Benchmark, ALOJA, gmark, Convnet, WatDiv, StreamBench, TPCx-HS
      ▪ Benchmarks 2015-2019: SparkBench, TPCx-V, IoTAbench, BigFUN, TPC-DS v2, TPCx-BB, CityBench, Graphalytics, Yahoo Streaming Benchmark (YSB), ShenZhen Transportation System (SZTS), DeepBench, DeepMark, TensorFlow Benchmarks, Fathom, AdBench, RIoTBench, Hobbit Benchmark, TPCx-HS v2, BigBench V2, Sanzu, AIM Benchmark, GARDENIA, Penn Machine Learning Benchmark (PMLB), OpenML benchmark suites, Benchip, Deep Learning Benchmarking Suite (DLBS), TPCx-IoT, Senska, DAWNBench, BlockBench, IDEBench, ABench, StreamWatDiv, TERMinator Suite, HERMIT, MLBench Services, MLBench Distributed, MLPerf, Training Benchmark for DNNs (TBD), PolyBench, NNBench-X, GDPRbench, BenchIoT, IoTBench, VisualRoad, AdaBench, MiDBench, CBench-Dynamo, Edge AIBench, AIBench, HPC AI500, SparkAIBench, AIMatrix
  • 11. 10:15-11:15 Session 1. The current landscape of Big Data and AI benchmarks
      ▪ 10:25-10:40: Benchmarking platforms and AI, Axel Ngonga, BDVA TF6 Benchmark Lead, University of Paderborn
      ▪ 10:40-10:55: BenchCouncil Big Data and AI Benchmarks, Wanling Gao, Chinese Academy of Sciences
      ▪ 10:55-11:10: MLPerf AI and ABench, Rekha Singhal, Senior Scientist, TCS, India
      ▪ 11:10-11:15: Conclusion on Big Data and AI Benchmarks, Todor Ivanov, LeadConsult
  • 12.
  • 13.
  • 14.
  • 15. Framework for Benchmarking ML/DL Workloads. Rekha Singhal, Senior Scientist & Head, Computing Systems, Tata Consultancy Services, Rekha.Singhal@tcs.com
  • 16. ML/DL Pipeline (stage: benchmarks)
      ▪ Data Exploration: IDE benchmarks
      ▪ Data Preprocessing: ADABench
      ▪ Data Extraction (SQL): BigBench, Polystore Benchmarks
      ▪ Feature generation
      ▪ Model Building/Tuning: MLPerf
      ▪ ML pipeline Deployment Architecture: ABench
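The stage-to-benchmark pairing on this slide can be held in a small lookup table. The sketch below is only an illustration: the pairing is inferred from the order of the labels on the slide, and the names `PIPELINE_BENCHMARKS`, `benchmarks_for` and `stages_without_benchmark` are hypothetical.

```python
# Stage -> benchmarks, as read from the slide. The pairing is inferred from
# the order of the slide labels and should be treated as an assumption.
PIPELINE_BENCHMARKS = {
    "Data Exploration": ["IDE benchmarks"],
    "Data Preprocessing": ["ADABench"],
    "Data Extraction (SQL)": ["BigBench", "Polystore benchmarks"],
    "Feature Generation": [],  # no benchmark is named next to this stage
    "Model Building/Tuning": ["MLPerf"],
    "ML Pipeline Deployment Architecture": ["ABench"],
}


def benchmarks_for(mapping, stage):
    """Return the benchmarks listed for a stage (empty list if none)."""
    return list(mapping.get(stage, []))


def stages_without_benchmark(mapping):
    """Return the stages that have no benchmark assigned, in pipeline order."""
    return [stage for stage, benchmarks in mapping.items() if not benchmarks]


uncovered = stages_without_benchmark(PIPELINE_BENCHMARKS)
```

A lookup like this makes coverage gaps explicit: here the only stage without a named benchmark is feature generation.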
  • 17. New challenges for ML/DL workloads
      ▪ Heterogeneous hardware: CPU, GPU, TPU, ASIC
      ▪ Many frameworks: PyTorch, TensorFlow
      ▪ Heterogeneous big data technology in the pipeline
      ▪ Many kinds of NN models: RNN, CNN, LSTM, RL, ...
  • 18. iPrescribe: Recommendation Benchmark [Figure: architecture diagram]
      ▪ Components: Data Sources (e.g. users' interaction datasets), Data Lake, Domain Expert, Domain Meta Model, Data to Meta Model Mapping, Data Preprocessing, Feature Store, ML/DL Model Building, Model Post Processing, Model
      ▪ Data Preprocessing: aggregation, windowing average, min/max, hash map, one-hot/matrix encoding, etc.
      ▪ ML/DL Model Building: ML (KNN, XGBoost, etc.), DL (LSTM, GNN, etc.)
      ▪ Technologies: SparkScala, SparkPython, Ignite; MongoDB, Ignite; Python (training), Tornado (inference)
      ▪ Ref: Fast Online 'Next Best Offers' using Deep Learning, ACM COMAD 2019
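Three of the preprocessing operations named on this slide (windowing average, min/max aggregation, one-hot encoding) can be sketched in plain Python. This is a minimal illustration, not the iPrescribe implementation, which according to the slide runs on Spark and Ignite; all function names are hypothetical.

```python
from collections import deque


def windowed_average(values, window):
    """Average of the last `window` values at each position (sliding window)."""
    buf = deque(maxlen=window)
    averages = []
    for value in values:
        buf.append(value)
        averages.append(sum(buf) / len(buf))
    return averages


def min_max(values):
    """Min/max aggregation over a non-empty sequence."""
    return min(values), max(values)


def one_hot(categories):
    """One-hot encode categories; columns follow the sorted distinct values."""
    vocab = sorted(set(categories))
    column = {category: i for i, category in enumerate(vocab)}
    return [
        [1 if column[category] == i else 0 for i in range(len(vocab))]
        for category in categories
    ]


clicks_per_minute = [1, 2, 3, 4]
smoothed = windowed_average(clicks_per_minute, window=2)
encoded = one_hot(["web", "app", "web"])
```

In a benchmark setting these steps would run as distributed jobs; the single-machine version only shows what each operation computes.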
  • 19. ABench: Big Data Architecture Stack Benchmark, Todor Ivanov and Rekha Singhal. Published in Proceedings of the 9th ACM/SPEC International Conference on Performance Engineering (ICPE 2018), April 9-13, 2018, Berlin, Germany
  • 20. Motivation
      ▪ Growing number of new Big Data technologies and connectors in the Big Data stacks, creating challenges for Solution Architects, Data Engineers, Data Scientists, Developers, etc.
      ▪ Missing benchmarks for each technology, connector, or combination of them
      ▪ Consequence: increasing complexity in the Big Data architecture stacks
  • 21. [Figure: ABench driver diagram]
      ▪ Driver
      ▪ Benchmark Repository: BigDataBench; BigBench V2 (WG: Q1…Q30, Q1'..Q5'; DG: (un, semi)structured, stream); StreamBenchmark; YCSB
      ▪ Benchmark Synthesizer (B')
      ▪ System/Architecture Under Benchmark: layers L1, L2, ..., LN; data; workload
      ▪ Benchmark Validation: functional, performance
      ▪ Results
      ▪ Configuration Catalog:
          Workload Configuration File (use case, concurrency: number of users/sec, stream velocity & window)
          Architecture Configuration File (technology, parameters, data flow)
          Data Generator Configuration File (data model, types, size)
          Benchmark Metric Configuration File (throughput, latency, points of measurement, utils)
          Deployment Configuration File (cluster or cloud details: IP, ports, cores, RAM)
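The five configuration files of the Configuration Catalog can be pictured as typed records. The sketch below is an assumption-laden illustration: the slide gives only the kinds of settings, so every class name, field name, unit and example value here is hypothetical, and the consistency check is an invented example of what a driver might verify before a run.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class WorkloadConfig:
    use_case: str
    users_per_second: int          # concurrency
    stream_velocity: int           # events per second (assumed unit)
    stream_window_seconds: int


@dataclass
class ArchitectureConfig:
    technologies: List[str]
    parameters: Dict[str, str] = field(default_factory=dict)
    data_flow: List[str] = field(default_factory=list)  # ordered technology names


@dataclass
class DataGeneratorConfig:
    data_model: str
    data_types: List[str]
    size_gb: float


@dataclass
class MetricConfig:
    metrics: List[str]             # e.g. throughput, latency
    measurement_points: List[str]  # technologies at which to measure


@dataclass
class DeploymentConfig:
    hosts: List[str]               # "ip:port" strings
    cores: int
    ram_gb: int


@dataclass
class BenchmarkRun:
    workload: WorkloadConfig
    architecture: ArchitectureConfig
    data_generator: DataGeneratorConfig
    metrics: MetricConfig
    deployment: DeploymentConfig

    def problems(self) -> List[str]:
        """Return human-readable inconsistencies between the five configs."""
        found = []
        known = set(self.architecture.technologies)
        for step in self.architecture.data_flow:
            if step not in known:
                found.append(f"data flow uses undeclared technology: {step}")
        for point in self.metrics.measurement_points:
            if point not in known:
                found.append(f"measurement point is not a declared technology: {point}")
        if not self.metrics.metrics:
            found.append("no metric selected")
        if not self.deployment.hosts:
            found.append("no deployment host given")
        return found


# Illustrative values only; none of them come from the slides.
run = BenchmarkRun(
    workload=WorkloadConfig("recommendation", 100, 1000, 10),
    architecture=ArchitectureConfig(
        technologies=["Spark", "Ignite", "MongoDB"],
        data_flow=["Spark", "Ignite", "MongoDB"],
    ),
    data_generator=DataGeneratorConfig("relational", ["structured", "stream"], 10.0),
    metrics=MetricConfig(["throughput", "latency"], ["Spark", "MongoDB"]),
    deployment=DeploymentConfig(["10.0.0.1:7077"], cores=16, ram_gb=64),
)
issues = run.problems()
```

Keeping the five concerns in separate records mirrors the slide's separation of workload, architecture, data generation, metrics and deployment, so one of them can be varied while the others stay fixed.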
  • 22. MLPerf: https://mlperf.org/
      ▪ MLPerf Inference Benchmark, Vijay J. Reddi et al., http://arxiv.org/abs/1911.02549
      ▪ MLPerf Training Benchmark, Peter Mattson et al., http://arxiv.org/abs/1910.01500
  • 23. MLPerf Goals
      ▪ Accelerate progress in ML via fair and useful measurement
      ▪ Serve both the commercial and research communities
      ▪ Enable fair comparison of competing systems yet encourage innovation to improve the state-of-the-art of ML
      ▪ Enforce replicability to ensure reliable results
      ▪ Keep benchmarking effort affordable (so all can play)
  • 24. Benchmarks Considered for MLPerf
      ▪ Area: Vision; Language; Audio; Commerce; Action / RL; Other
      ▪ Problem: Image Classification, Object Detection / Segmentation, Face ID, HealthCare (Radiology), Video Detection, Self-Driving, Translation, Language Model, Word Embedding, Speech Recognition, Text-to-Speech, Question Answering, Keyword Spotting, Language Modeling, Chatbots, Speaker ID, Graph embeddings, Content ID, Rating, Recommendations, Sentiment Analysis, Next-action, Healthcare (EHR), Fraud detection, Anomaly detection, Time series prediction, Large scale regression, Games, Go, Robotics, Health Care, Bioinformatics, GANs, 3D point clouds, Word embeddings
      ▪ Datasets: ImageNet, COCO, WMT English-German, LibriSpeech, SQuAD, LM-Benchmark, MovieLens-20M, Amazon, IMDB, Atari, Go, Chess, Grasping
      ▪ Models: ResNet-50, TF Object Detection, Detectron, Transformer, OpenNMT, Deep Speech 2, SQuAD Explorer, Neural Collaborative Filtering, CNNs, DQN, PPO
      ▪ Metrics: COCO mAP, Prediction accuracy, BLEU, WER, Perplexity, Prediction accuracy, Prediction accuracy, Win/Loss
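Of the metrics listed, WER (word error rate) is easy to state exactly: the word-level edit distance between a reference transcript and a hypothesis, divided by the number of reference words. The sketch below uses the standard dynamic-programming edit distance; it is a generic illustration, not code from MLPerf, and `word_error_rate` is a hypothetical name.

```python
def word_error_rate(reference, hypothesis):
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the processed reference prefix
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = cur[j - 1] + 1
            cur.append(min(substitution, deletion, insertion))
        prev = cur
    return prev[-1] / len(ref)


wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
```

In this example one word is substituted and one is dropped, so two errors against six reference words give a WER of one third.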
  • 25. MLPerf metric: Training time to reach quality target + cost or power
      ▪ Quality target is specific for each benchmark and close to state-of-the-art; updated w/ each release to keep up with the state-of-the-art
      ▪ Time includes preprocessing, validation over median of 5 runs
      ▪ Available: reference implementations that achieve quality target
      ▪ In addition, either: cost of public cloud resources (no spot/preemptible instances), or power utilization for on-premise hardware
      ▪ Important for benchmark to capture both performance and quality
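The time-to-quality metric described on this slide can be sketched as follows: each run records (elapsed seconds, quality) checkpoints, the run's time is the first checkpoint that meets the target, and the reported score is the median over the runs. This follows the slide's wording only; the official MLPerf rules may aggregate runs differently, and the function names and sample numbers are hypothetical.

```python
import statistics


def time_to_target(checkpoints, target):
    """First elapsed time at which quality >= target, or None if never reached.

    `checkpoints` is a list of (elapsed_seconds, quality) pairs ordered by time.
    """
    for elapsed, quality in checkpoints:
        if quality >= target:
            return elapsed
    return None


def median_time_to_target(runs, target):
    """Median time-to-target over all runs; every run must reach the target."""
    times = [time_to_target(run, target) for run in runs]
    if any(t is None for t in times):
        raise ValueError("every run must reach the quality target")
    return statistics.median(times)


# Five made-up runs with a quality target of 0.75.
runs = [
    [(10, 0.50), (20, 0.76)],
    [(10, 0.60), (18, 0.75)],
    [(25, 0.80)],
    [(22, 0.75)],
    [(30, 0.90)],
]
score = median_time_to_target(runs, target=0.75)
```

Tying the clock to a quality target is what lets the metric capture performance and quality together: a faster system that never reaches the target gets no score.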
  • 26. Conclusions [Figure: panels labelled Recommendation (ML/DL), Recommendation (RL), Recommendation (RL, distributed training), Supply Chain (RL), Supply Chain (RL, distributed training)]
  • 27. Thank you for your attention! Any questions?
  • 28.
  • 29. Conclusion on Big Data and AI Benchmarks, Todor Ivanov (LeadConsult)
  • 30. Big Data and AI Landscape [Figure: diagram relating the BDVA Reference Model, the DataBench Pipeline Methodology, the Big Data and AI Benchmark Ecosystem, and the ICT Big Data PPP ICT-13-14 Projects]
      ▪ DataBench Pipeline Methodology: Data Acquisition/Collection (including Data Ingestion processing, Streaming, Data Extraction, Ingestion Storage, Different data types); Data Storage/Preparation (including Storage Retrieval/Access/Queries, Data Protection, Curation, Integration, Publication); Data Analytics/ML (including data processing for analysis, AI and Machine Learning); Data Visualisation, Action/Interaction (including data presentation environment/boundary/user action and interaction)
      ▪ BDVA Reference Model: Streaming/Realtime Processing; Interactive Processing; Batch Processing; Data Privacy/Security; Data Governance/Mgmt; Data Storage; Industrial Analytics (Descriptive, Diagnostic, Predictive, Prescriptive); Machine Learning, AI, Data Science; Visual Analytics
      ▪ ICT Big Data PPP ICT-13-14 Projects: I-BiDaaS, TheyBuyForYou (TBFY), Track&Know, DataBio, DeepHealth
  • 31. [Figure: benchmarks placed on the steps of the DataBench Pipeline Methodology; the number in parentheses is how often a label appears in the diagram]
      ▪ Steps: Data Acquisition/Collection; Data Storage/Preparation; Data Analytics/ML; Data Visualisation, Action/Interaction
      ▪ BenchCouncil Benchmarks: AIBench (4), BigDataBench (4), HPC AI500 (3), Edge AIBench (2), AIoTBench (2)
      ▪ MLPerf (2), ABench (3)
      ▪ Hobbit Benchmark Platform (4)
      ▪ Linked Data Benchmark Council (LDBC): Graphalytics (2), Semantic Publishing Benchmark (SPB) (3)
      ▪ TPC-H (2), HiBench (3)
      ▪ Projects: TheyBuyForYou (TBFY), I-BiDaaS
  • 32. Mapping between the DataBench Pipeline Steps and the Benchmark Ecosystem → matrix available in the DataBench ToolBox [Figure: matrix of benchmarks, ordered by year, against the pipeline steps PI-DV, PI-DA, PI-DS, PI-DI]
      ▪ DataBench Pipeline Methodology steps: Data Acquisition/Collection; Data Storage/Preparation; Data Analytics/ML; Data Visualisation, Action/Interaction
      ▪ Benchmarks 2015-2019: SparkBench, TPCx-V, IoTAbench, BigFUN, TPC-DS v2, TPCx-BB, CityBench, Graphalytics, Yahoo Streaming Benchmark (YSB), ShenZhen Transportation System (SZTS), DeepBench, DeepMark, TensorFlow Benchmarks, Fathom, AdBench, RIoTBench, Hobbit Benchmark, TPCx-HS v2, BigBench V2, Sanzu, AIM Benchmark, GARDENIA, Penn Machine Learning Benchmark (PMLB), OpenML benchmark suites, Benchip, Deep Learning Benchmarking Suite (DLBS), TPCx-IoT, Senska, DAWNBench, BlockBench, IDEBench, ABench, StreamWatDiv, TERMinator Suite, HERMIT, MLBench Services, MLBench Distributed, MLPerf, Training Benchmark for DNNs (TBD), PolyBench, NNBench-X, GDPRbench, BenchIoT, IoTBench, VisualRoad, AdaBench, MiDBench, CBench-Dynamo, Edge AIBench, AIBench, HPC AI500, SparkAIBench, AIMatrix
  • 33. This project has received funding from the European Horizon 2020 Programme for research, technological development and demonstration under grant agreement n° 780966. Contacts: info@databench.eu, @DataBench_eu, www.databench.eu