SlideShare une entreprise Scribd logo
1  sur  34
Télécharger pour lire hors ligne
Paris, Sophia Antipolis, London, San Jose USA
A Key Technology Provider and Actor
in the Cloud Migration
within all Big Compute verticals and at the heart of IA/Machine Learning
revolution
Paris, Sophia Antipolis, London, San Jose USA, Montreal CA
1. Company / Team
400 man / year of R&D
2 patents
30 highly qualified engineers out of which 17 are PhD’s
References in all Industries in the US and EMEA
Global Locations
Partnerships
Key information
Management
Denis
Caromel,
CEO
François
Tournesac,
CSO
Fabien
Viale,
CTO
Marco
Castigliego,
CAO
Company
ISV Founded in 2007 by Denis Caromel in Sophia-Antipolis, Spin-off of INRIA
Addressing $80 Billion Hybrid Cloud Market with 27% CAGR
Disruptive Patented Technology w/ Exceptional Business Outcomes
60% of the revenue from international
Sophia-Antipolis (France)
Paris (France)
London (United Kingdom)
San-Jose (United States)
Montreal (Canada)
Fribourg (Switzerland)
Dakar (Senegal)
ProActive Solution
Job Scheduling, Workload Automation
Orchestration & Meta-Scheduling
On-premises and on all clouds
Open Source
2005 An R&D Team of 45 persons headed by Denis Caromel developing a
Core Kernel for Distributed, Parallel & Cloud at INRIA (largest EU Computer
Science Research Institute, 6 000 persons).
Foundation of ActiveEon
Co-development between INRIA Team & ActiveEon
IP Technology Transfer from INRIA to ActiveEon
2007
2009 Scheduler added to the Core
2011 Resource Manager added
2013
2014
2016
2017
Orchestration with Powerful Workflows added
First very large customer references in Production
International Expansions in UK, USA, Africa
ActiveEon Story
R&D Investment
250 M/Y at INRIA + 150 M/Y at AE = 400 Man/Year
2018 Machine Learning Open Studio added to ProActive
International Expansions in Canada
Express Business Needs with Granular
Workflows
• Distributed & Parallel Computing
• On-premises & Cloud
Architectures
A 30+ PhD/Engineer team with focused fields of high expertise:
• Big Data
• IA, ML
• IoT
• Finance
• Gov.
• HPC
• ...
• Data Science, Machine Learning, IA, Matlab,
R
• Big Compute and HPC
Visdom
ActiveEon Technology
R&D Investment: 250 M/Y at INRIA + 150 M/Y at AE = 400 Man/Year
Process Flow & Operation
Execute &
Monitor
Design
Schedule, Monitor &
Connect the
resources
IT department, Data
Scientist, Business Lines,
Activeeon Services IT department,
Business owner
Operational team
1
2
3
PROACTIVE
STUDIO
PROACTIVE
RESOURCES MANAGER
PROACTIVE SCHEDULER
PROACTIVE
AUTOMATION
PORTAL
Next Generation
Scheduler/Orchestration
Scheduler and Orchestration
Priority
& Planning
Parallel
Executions
Error
Management
Multi Users
</>OpenRESTAPI
Resource Management and Monitoring
Slurm
SGE
PBS
LSF
Multi-
platform
Local
Machine
Network
Resource
Batch
Scheduler
Cloud
Processing and Automation Workflows
Any
language
Secured
Data
Transfers
Meta-
scheduler
ETL, ERP,
ELT, …
Full
integration
Translator
Open Workflow Studio
Machine Learning Open Studio
https://www.youtube.com/watch?v=mbrQxCf4lqM
Automation Dashboard - Catalog
Workflows stored in buckets in the Catalog
RBAC support for each bucket / Users can share workflows and templates
Keep track of the revisions with a versioning feature integrated
Job Planner
DefineCalendars AssociateWorkflowstoCalendars VisualizeExecutionPlanning
Manage recurring Jobs
Forecast and check future
Executions
Control recurring jobs from one
endpoint
Schedule Exceptions through
Exclusion Calendars &
Inclusion Calendars
Cloud Automation: On-demand PaaS
On-Demand PaaS Services with full Life-Cycle Management
Scalable & Elastic resources
Incremental resource
deployment
100% resources
usage, no waste
Smart scale down
Provides cloud computing power according to your needs.
Minimize costs by deploying VMs only when needed (configurable
load factor). Never exceed your budget (min/max VMs threshold).
Smart and fully configurable elastic policy. Shutdown unused
VMs whenever it's possible. Prevent time-consuming re-
deployments by adjusting idle nodes’ release delay (avoid scale
up/down cycles).
Global Locations
Some Supported Languages and Connectors
AWSAzure GCP Docker OpenShiftLinux Windows Solaris VMware Openstack
Infrastructure
Slurm PBS LSF
LSF
HPC Schedulers
Google
Cloud Platform
PBS
Works
Cmd Java Scala Javascript Groovy Ruby Jython Python Perle PHP R Cron LDAPPowerShell
Languages and Predefined Tasks
FTPURL SFTP MySQL Oracle
Data Connectors
Linux Bash
MongoDB Cassandra AWS-S3
Clouds
PostgreSQL Greenplum SQL Server Azure Storage
Azure
Data Lake
KafkaZookeeper Spark Hadoop
Big Data
Azure
Databricks Hadoop HDFS Twitter
LogstashSAP Elasticsearch
SGE
SGE
CNTK Keras PyTorch YOLO H2OTensoFlow
Machine Learning & Artificial Intelligence
Caffe Spark MLlib Pandas JupyterLab
Visdom
Visualization
KibanaSwarm Storm Clearwater
Cognitive
Services
Grafana
Scikit-Learn MXNet
AIX AS/400
C++/C#
Cuda
Specialized
Open/CL FPGA
DLib BigDL
DeepLearning
G4J
Kubernetes
20 000 Cores Azure Benchmarks
With ActiveEon Workflows & Scheduler:
15 mn to trigger and acquire 20 000 Azure
Cores and to schedule 20 000 Tasks
99% of requests having less than 90ms
response-time
On 20K Cores, with 19K running Tasks,
only 5 sec. to detect a software failure and
redeploy the Task
With 19K running tasks, only 30.8 sec. to
execute a Job with 10 Tasks of each 30 sec.,
97.4% efficiency.
2. Some Typical Customer Cases:
Capabilities & Portfolio Revue
Large Worldwide International Companies
Early Adopters
Using ActiveEon for Critical Business Applications
Finance
IoT
Gov.
Manufacturing
Automotive
Aerospace
Nuclear
RedHat OpenShift
Some Major Customers
Telco & IT Bio Tech
& Health
FinanceEngineering Aeronautics Energy
& Space
Some Partners:
Media
Distribution
Government
IoTCosmetics
L&G a leading multinational finance and insurance company with headquarters in London
Situation
Comply with new European regulations: Solvency II, Basel III, etc.
Transform legacy system and embrace cloud computing
Solution
Activeeon ProActive and migration to the Cloud have enabled
faster and more reliable execution:
• Cloud bursting
• Error management
• Prioritization
Benefits
From 18 hours to 2 hours for priority reports
Agile development with an objective of 4,000 cores
 $1.2m / year committed spent on Cloud
Finance
Time
64VMs,eachwith16vCPUs
Home Hoffice is the UK Ministry of Interior. They are using ActiveEon for 2 critical
applications:
• Visa Delivery Process, and
• Big Data & Analytics platform for Crime Reduction (HODAC).
Situation
In need to integrate 25 different sources of Data in order to build a consolidated
Data Lake and analytics platform to be used for many Home Land security
applications.
Solution
ActiveEon used as the central Orchestrator to Schedule and Meta-Schedule all the
Big Data, ETL, Analytics, Machine Learnigs software appliance of the platform
(Hadoop, SAS, TIBCO Spotfire, Python, Anaconda, GreenPlum, ElasticSearch, …).
Benefits
• Central Orchestration Tool
• Workflow Expressiveness: universal & comprehensive
• Management of Security for highly sensitive environments
• Management of Resources for all appliances (SAS, TIBCO,… ).
« ActiveEon is the only solution capable
to Schedule any Big Data Analytics,
mono-threaded, multi-threaded, multi-
core, parallel and distributed »
Cap Gemini Lead Engineer for Home
Office
Gov.: UK Ministry of Interior
Komatsu is a Japanese multinational corporation
It manufactures construction, mining, industrial and military equipment.
Situation
ActiveEon Orchestrates on Cloud execution over hot and cold storage for streaming and batch analytics
> 1,200 tasks executed per hour
Solution
Activeeon ProActive has enabled control over and scheduling over execution:
• Error Management – Notification, Automated Recovery
• Job Planner
• Distribution & Parallelization
Benefits
• Reliable execution to orchestrate multiple services and resources
• Provide consistent results and KPIs to end users and BI Tools
IoT
PEPs is the French platform that offers access to the products of the Sentinel satellites (S1A and S1B, S2A and S2B, S3A
and S3B) of the European Union Program for Earth observation and monitoring Copernicus
Situation
Make Sentinel data available to the greatest number and
encourage the development of applications using them (agriculture, maritime field...)
1 petabyte (1015 bytes) in 20 years and 7 petabytes in 2 years!
Solution
Proactive Solution provided by ActiveEon to execute on Azure in hybrid mode
allows enhancing PEPS data and making them available to API providers :
• Multi-Cloud Ecosystem Platform
• Remove complexity for Data Scientists
• Provide Cloud performance
Benefits
• Faster execution, Optimisation of On-Prem ressources & Clouds,
• Easier to use by end-users
Space & Image Processing
Platform for Cosmetic Formulation for 2000 persons around the world and
for Innovation Team. (Statistic, Machine Learning, Use of Language R)
2 000 persons
around the World
Innovation Team
(Statistics, ML, R)
Workflows OrchestrationMonitoring
Data
Compute
Data
Mining
Private
Network
+
HTTPS
ProActive
Cloud Watch
Environment Environment
MachineLearninginITLogAnalysisforErrorDetection&PredictioninFinancialMarket
Analysis &
Classification
• Machine Learning
• Artificial Intelligence
• Probabilistic Analysis
Resources /
Applications /
Services
Resources /
Applications /
Services
Resources /
Applications /
Services
Business Users
11 1
1
2
3
Collect data from
any sources
Update model
Update event
driven system
Events
Monitoring
Complex Event
Processing
• Rule based
• Actions triggering
3
Alert
Predictive
Incident
Request for incident
analysis
2
Automated
Preventive
Action
Incidents
Incidents
Finance Domain: Deep ML for IT Infrastructure
Main Benefits
Openness and diversity of ML
frameworks to be used (vs.
Splunk)
Both Batch and Streaming
Workflow Expressiveness:
universal & comprehensive
IT Users
Orchestration of RedHat OpenShift On-Prem & On Azure
Orchestrate & Manage all layers: IaaS, PaaS, SaaS.
Multi-Cloud, Hybrid, Scalable,
Digital transformation for manufacturing
BENEFITS
Reduce the distance between the virtual and the
manufacturing process
Take advantage of digitalization in the machine tool
field for intelligent manufacturing and more efficient
production
FEATURES
Cloud-based big data analytics during
machining
Optimization of machining parameters
using workflows
Process simulation and optimization tools
Physical measurements and monitoring
Virtual / real part model correction
Use of AI
TARGETED SECTORS
Manufacturing, automotive, aerospace
Cloud processing services in manufacturing
END USERS
Workflows for HPC multi-physics engineering
simulations in automotive and aerospace
BENEFITS
Thermal resistance for engine partsFEATURES
Parallel evaluation of optimal mesh size for
the best tradeoff between execution time
and result accuracy
Complex workflow management: monitoring,
scheduling and orchestration
Infrastructure management: on-premises and
cloud HPC
Data collection and processing
END USERS
Pollution levels in a district
Workflow for exploration of tradeoff
between execution time and result accuracy
DOMAIN: COMPUTATIONAL FLUID DYNAMICS (CFD) AND POST-PROCESSING TOOLS
Acceleration and Automation of
Design Analysis and Optimizations
Deep Learning forAnomaly Detection in
Satellite Manufacturing
FEATURES
Detection of wires defect on a set of images
from production line using Deep Learning
Deep Learning on images of wires: occlusion,
variation, noise, grayscale, semantic analysis
Detection of defaults using a pre-defined wire
model and computing a distance measure
Workflows for model training and prediction for
parallel execution
BENEFITS
Automatic detection of defaults in hybrid
circuits manufacturing
Higher precision of Machine Learning results
Faster results with parallel execution of
machine learning workflows
Workflows can be used for other applications
Faulty wires come out in red
Big DataAnalysis forAutomatedAnomaly
Tracking in Satellite Communication
FEATURES
Data analysis: checking packets number of service
telemetries, order and type
Incident evolution forecasts
Big data workflows for automation of Test Scenarios
Automatic detection of remote controls that didn’t
receive expected telemetries
Data visualization in browser
BENEFITS
Automatic and early detection of defaults via trends
analysis of test results
Engineering process improvement: margin assessment,
robustness analysis, model elaboration based on actual
behaviors
Workflows allowing to accelerate treatments of fast-
growing test data amounts
Data fetching from many sources
ProActive workflow for service
telemetries verification
Visualisation of anomalies
Acceleration of Non-Destructive Evaluation (NDE)
for Nuclear Energy, Oil & Gas,Aerospace
FEATURES
NDE batch processing, parametric studies,
non-regression tests on multiple clusters
Transfer Input and Output data
Event programming to follow executions
Workflow process definition
Activeeon guidance and support
Cloud version: Execution on Microsoft Azure
with 50 VMs/day per CIVA user  25K
nodes/year
A potential of $1M$ Azure spending per Year
BENEFITS
Flexibility and enabler of interoperability
between heterogeneous infrastructures
Ability to run large POD (Probability of Detection)
computations, which were taking months on a
single computer
Large-scale simulations with Microsoft Azure cloud
Radiography – Pipes weld inspection
ABOUT CIVA NDE SOLUTION:
Multi-technique (Ultrasound, Eddy current,
Radiography) software platform developed
by the CEA LIST and its partners
The software is distributed by EXTENDE
and its distributors
Eddy current - Simulations
END USERS
Nuclear Energy, Oil & Gas, Aeronautics,
Resource Manager
Scheduler Calendar
Sync
200 to 300 jobs
planned per week
72 000 patient diagnostics
delivered to nurses
Main Benefits
Job Visualization within Calendar
Edit job planning from both
interfaces
Visualize parallel tasks
Visualize task information in one
view
Usage of customer’s external database:
Oracle 11g Database
using Red Hat Hibernate ORM
(Object – Relational – Mapping)
Formerly part of
Task-Centric View Used
Scheduler
Passive
Mediametrie:
TV Audience
Measurement
Scheduler
Active
EC2 Spot Instances
Low costs
EC2 Instances
Regular costs
IaaS
On-Prem
Main Benefits
Deployed On Premise (Capex) or
on a Hosting Service (Opex)
Auto-scaling on infrastructure to
match capacity and demand
Huge costs optimization using only
the VMs needed and interruptible
low cost instances (e.g. EC2 Spot
instances)
CHALLENGES
Process 500 terabytes per year
Flexibility and enabler of interoperability
between heterogeneous services
Job affinity with data location
Transfer sensitive data to the cloud for
processing
RESULTS
Efficient metagenomics pipeline
Granular compute management
User friendly system for maximum utilization
Secure transfers
Simple workflow process definition
Workflow model and data management
Compute migration from on-prem to the
cloud
MAIN DRIVER
REQUIREMENTS
Guidance and support to achieve high
performances
Fit in hybrid architecture multiplatform
Integration with R
FlexLM support (licenses manager)
Remote Visualization for interactive
tasks
COMPANY PROFILE
Industry: BioTech
Product: Metagenomics
Quantitative Metagenomics Platform
for gene profiling and statistical analysis
Domain-specific
Users
Windows
Cluster 1
192 cores
Linux
Cluster 2
366 cores
Scheduler
Web Portal
Total
DNA
QC/Library preparation
SoLiD/Illumina
Sequencing
1TB /
Sequence
Analysis
40TB
Parallel DataBase
Pre, Post Processing of Data Analysis
Flexibility, Speed of Analysis
Granular execution
Fast
Architecture Overview
Paris, Sophia Antipolis, London, San Jose USA @activeeon
contact@activeeon.com
+33 988 777 660
Automate Accelerate & Scale
10K Nodes, 20K Tasks, 1M Jobs
Paris, Sophia Antipolis, London, San Jose USA, Montreal CA

Contenu connexe

Tendances

High Performance Predictive Analytics in R and Hadoop
High Performance Predictive Analytics in R and HadoopHigh Performance Predictive Analytics in R and Hadoop
High Performance Predictive Analytics in R and HadoopRevolution Analytics
 
Fast Cars, Big Data - How Streaming Can Help Formula 1
Fast Cars, Big Data - How Streaming Can Help Formula 1Fast Cars, Big Data - How Streaming Can Help Formula 1
Fast Cars, Big Data - How Streaming Can Help Formula 1Tugdual Grall
 
Using Apache Spark Structured Streaming on Azure Databricks for Predictive M...
 Using Apache Spark Structured Streaming on Azure Databricks for Predictive M... Using Apache Spark Structured Streaming on Azure Databricks for Predictive M...
Using Apache Spark Structured Streaming on Azure Databricks for Predictive M...Databricks
 
Accelerating Analytics for the Future of Genomics
Accelerating Analytics for the Future of GenomicsAccelerating Analytics for the Future of Genomics
Accelerating Analytics for the Future of GenomicsAmazon Web Services
 
The Potential of GPU-driven High Performance Data Analytics in Spark
The Potential of GPU-driven High Performance Data Analytics in SparkThe Potential of GPU-driven High Performance Data Analytics in Spark
The Potential of GPU-driven High Performance Data Analytics in SparkSpark Summit
 
Insight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital TransformationInsight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital TransformationMapR Technologies
 
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...Sarah Aerni
 
How to deploy machine learning models into production
How to deploy machine learning models into productionHow to deploy machine learning models into production
How to deploy machine learning models into productionDataWorks Summit
 
Hadoop Graph Processing with Apache Giraph
Hadoop Graph Processing with Apache GiraphHadoop Graph Processing with Apache Giraph
Hadoop Graph Processing with Apache GiraphDataWorks Summit
 
R and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with HadoopR and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with HadoopRevolution Analytics
 
Galvanise NYC - Scaling R with Hadoop & Spark. V1.0
Galvanise NYC - Scaling R with Hadoop & Spark. V1.0Galvanise NYC - Scaling R with Hadoop & Spark. V1.0
Galvanise NYC - Scaling R with Hadoop & Spark. V1.0vithakur
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsIgor José F. Freitas
 
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg Schad
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg SchadSmack Stack and Beyond—Building Fast Data Pipelines with Jorg Schad
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg SchadSpark Summit
 
Apache Kylin - OLAP Cubes for SQL on Hadoop
Apache Kylin - OLAP Cubes for SQL on HadoopApache Kylin - OLAP Cubes for SQL on Hadoop
Apache Kylin - OLAP Cubes for SQL on HadoopTed Dunning
 
Build a Time Series Application with Apache Spark and Apache HBase
Build a Time Series Application with Apache Spark and Apache  HBaseBuild a Time Series Application with Apache Spark and Apache  HBase
Build a Time Series Application with Apache Spark and Apache HBaseCarol McDonald
 
Getting more out of your big data
Getting more out of your big dataGetting more out of your big data
Getting more out of your big dataNathan Bijnens
 
Predicting Flight Delays with Spark Machine Learning
Predicting Flight Delays with Spark Machine LearningPredicting Flight Delays with Spark Machine Learning
Predicting Flight Delays with Spark Machine LearningCarol McDonald
 
Auto Scaling Systems With Elastic Spark Streaming: Spark Summit East talk by ...
Auto Scaling Systems With Elastic Spark Streaming: Spark Summit East talk by ...Auto Scaling Systems With Elastic Spark Streaming: Spark Summit East talk by ...
Auto Scaling Systems With Elastic Spark Streaming: Spark Summit East talk by ...Spark Summit
 

Tendances (20)

High Performance Predictive Analytics in R and Hadoop
High Performance Predictive Analytics in R and HadoopHigh Performance Predictive Analytics in R and Hadoop
High Performance Predictive Analytics in R and Hadoop
 
Fast Cars, Big Data - How Streaming Can Help Formula 1
Fast Cars, Big Data - How Streaming Can Help Formula 1Fast Cars, Big Data - How Streaming Can Help Formula 1
Fast Cars, Big Data - How Streaming Can Help Formula 1
 
Using Apache Spark Structured Streaming on Azure Databricks for Predictive M...
 Using Apache Spark Structured Streaming on Azure Databricks for Predictive M... Using Apache Spark Structured Streaming on Azure Databricks for Predictive M...
Using Apache Spark Structured Streaming on Azure Databricks for Predictive M...
 
Accelerating Analytics for the Future of Genomics
Accelerating Analytics for the Future of GenomicsAccelerating Analytics for the Future of Genomics
Accelerating Analytics for the Future of Genomics
 
The Potential of GPU-driven High Performance Data Analytics in Spark
The Potential of GPU-driven High Performance Data Analytics in SparkThe Potential of GPU-driven High Performance Data Analytics in Spark
The Potential of GPU-driven High Performance Data Analytics in Spark
 
Insight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital TransformationInsight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital Transformation
 
Architecting a multi-tenanted platform
Architecting a multi-tenanted platform Architecting a multi-tenanted platform
Architecting a multi-tenanted platform
 
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
 
How to deploy machine learning models into production
How to deploy machine learning models into productionHow to deploy machine learning models into production
How to deploy machine learning models into production
 
Apache Spark Overview
Apache Spark OverviewApache Spark Overview
Apache Spark Overview
 
Hadoop Graph Processing with Apache Giraph
Hadoop Graph Processing with Apache GiraphHadoop Graph Processing with Apache Giraph
Hadoop Graph Processing with Apache Giraph
 
R and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with HadoopR and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with Hadoop
 
Galvanise NYC - Scaling R with Hadoop & Spark. V1.0
Galvanise NYC - Scaling R with Hadoop & Spark. V1.0Galvanise NYC - Scaling R with Hadoop & Spark. V1.0
Galvanise NYC - Scaling R with Hadoop & Spark. V1.0
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systems
 
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg Schad
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg SchadSmack Stack and Beyond—Building Fast Data Pipelines with Jorg Schad
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg Schad
 
Apache Kylin - OLAP Cubes for SQL on Hadoop
Apache Kylin - OLAP Cubes for SQL on HadoopApache Kylin - OLAP Cubes for SQL on Hadoop
Apache Kylin - OLAP Cubes for SQL on Hadoop
 
Build a Time Series Application with Apache Spark and Apache HBase
Build a Time Series Application with Apache Spark and Apache  HBaseBuild a Time Series Application with Apache Spark and Apache  HBase
Build a Time Series Application with Apache Spark and Apache HBase
 
Getting more out of your big data
Getting more out of your big dataGetting more out of your big data
Getting more out of your big data
 
Predicting Flight Delays with Spark Machine Learning
Predicting Flight Delays with Spark Machine LearningPredicting Flight Delays with Spark Machine Learning
Predicting Flight Delays with Spark Machine Learning
 
Auto Scaling Systems With Elastic Spark Streaming: Spark Summit East talk by ...
Auto Scaling Systems With Elastic Spark Streaming: Spark Summit East talk by ...Auto Scaling Systems With Elastic Spark Streaming: Spark Summit East talk by ...
Auto Scaling Systems With Elastic Spark Streaming: Spark Summit East talk by ...
 

Similaire à Activeeon - Scale Beyond Limits

Activeeon technology for Big Compute and cloud migration
Activeeon technology for Big Compute and cloud migrationActiveeon technology for Big Compute and cloud migration
Activeeon technology for Big Compute and cloud migrationActiveeon
 
Activeeon use cases for cloud, digital transformation, IoT and big data autom...
Activeeon use cases for cloud, digital transformation, IoT and big data autom...Activeeon use cases for cloud, digital transformation, IoT and big data autom...
Activeeon use cases for cloud, digital transformation, IoT and big data autom...Activeeon
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooJason Dai
 
DDDP 2019 - Brown to Green
DDDP 2019  - Brown to GreenDDDP 2019  - Brown to Green
DDDP 2019 - Brown to GreenJohn Archer
 
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & AlluxioAlluxio, Inc.
 
Distributed Trace & Log Analysis using ML
Distributed Trace & Log Analysis using MLDistributed Trace & Log Analysis using ML
Distributed Trace & Log Analysis using MLJorge Cardoso
 
Red hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategyRed hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategyOrgad Kimchi
 
General Introduction to technologies that will be seen in the school
General Introduction to technologies that will be seen in the school General Introduction to technologies that will be seen in the school
General Introduction to technologies that will be seen in the school ISSGC Summer School
 
Google Cloud Platform: Prototype ->Production-> Planet scale
Google Cloud Platform: Prototype ->Production-> Planet scaleGoogle Cloud Platform: Prototype ->Production-> Planet scale
Google Cloud Platform: Prototype ->Production-> Planet scaleIdan Tohami
 
Device to Intelligence, IOT and Big Data in Oracle
Device to Intelligence, IOT and Big Data in OracleDevice to Intelligence, IOT and Big Data in Oracle
Device to Intelligence, IOT and Big Data in OracleJunSeok Seo
 
Critical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsCritical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsData Driven Innovation
 
HIPAS UCP HSP Openstack Sascha Oehl
HIPAS UCP HSP Openstack Sascha OehlHIPAS UCP HSP Openstack Sascha Oehl
HIPAS UCP HSP Openstack Sascha OehlSascha Oehl
 
AIOps: Anomalous Span Detection in Distributed Traces Using Deep Learning
AIOps: Anomalous Span Detection in Distributed Traces Using Deep LearningAIOps: Anomalous Span Detection in Distributed Traces Using Deep Learning
AIOps: Anomalous Span Detection in Distributed Traces Using Deep LearningJorge Cardoso
 
On the Application of AI for Failure Management: Problems, Solutions and Algo...
On the Application of AI for Failure Management: Problems, Solutions and Algo...On the Application of AI for Failure Management: Problems, Solutions and Algo...
On the Application of AI for Failure Management: Problems, Solutions and Algo...Jorge Cardoso
 
Octo and the DevSecOps Evolution at Oracle by Ian Van Hoven
Octo and the DevSecOps Evolution at Oracle by Ian Van HovenOcto and the DevSecOps Evolution at Oracle by Ian Van Hoven
Octo and the DevSecOps Evolution at Oracle by Ian Van HovenInfluxData
 
PrestoCloud project: Big Data Orchestration for Batch+Stream in Clouds&Fogs w...
PrestoCloud project: Big Data Orchestration for Batch+Stream in Clouds&Fogs w...PrestoCloud project: Big Data Orchestration for Batch+Stream in Clouds&Fogs w...
PrestoCloud project: Big Data Orchestration for Batch+Stream in Clouds&Fogs w...OW2
 
#OSSPARIS17 - PrestoCloud Project: Big Data Orchestration for Batch+Stream in...
#OSSPARIS17 - PrestoCloud Project: Big Data Orchestration for Batch+Stream in...#OSSPARIS17 - PrestoCloud Project: Big Data Orchestration for Batch+Stream in...
#OSSPARIS17 - PrestoCloud Project: Big Data Orchestration for Batch+Stream in...Paris Open Source Summit
 

Similaire à Activeeon - Scale Beyond Limits (20)

Activeeon technology for Big Compute and cloud migration
Activeeon technology for Big Compute and cloud migrationActiveeon technology for Big Compute and cloud migration
Activeeon technology for Big Compute and cloud migration
 
Activeeon use cases for cloud, digital transformation, IoT and big data autom...
Activeeon use cases for cloud, digital transformation, IoT and big data autom...Activeeon use cases for cloud, digital transformation, IoT and big data autom...
Activeeon use cases for cloud, digital transformation, IoT and big data autom...
 
Big Data and OSS at IBM
Big Data and OSS at IBMBig Data and OSS at IBM
Big Data and OSS at IBM
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics Zoo
 
Enterprise Data Lakes
Enterprise Data LakesEnterprise Data Lakes
Enterprise Data Lakes
 
DDDP 2019 - Brown to Green
DDDP 2019  - Brown to GreenDDDP 2019  - Brown to Green
DDDP 2019 - Brown to Green
 
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
 
Distributed Trace & Log Analysis using ML
Distributed Trace & Log Analysis using MLDistributed Trace & Log Analysis using ML
Distributed Trace & Log Analysis using ML
 
Red hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategyRed hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategy
 
General Introduction to technologies that will be seen in the school
General Introduction to technologies that will be seen in the school General Introduction to technologies that will be seen in the school
General Introduction to technologies that will be seen in the school
 
Google Cloud Platform: Prototype ->Production-> Planet scale
Google Cloud Platform: Prototype ->Production-> Planet scaleGoogle Cloud Platform: Prototype ->Production-> Planet scale
Google Cloud Platform: Prototype ->Production-> Planet scale
 
BigData_Krishna Kumar Sharma
BigData_Krishna Kumar SharmaBigData_Krishna Kumar Sharma
BigData_Krishna Kumar Sharma
 
Device to Intelligence, IOT and Big Data in Oracle
Device to Intelligence, IOT and Big Data in OracleDevice to Intelligence, IOT and Big Data in Oracle
Device to Intelligence, IOT and Big Data in Oracle
 
Critical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsCritical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and Analytics
 
HIPAS UCP HSP Openstack Sascha Oehl
HIPAS UCP HSP Openstack Sascha OehlHIPAS UCP HSP Openstack Sascha Oehl
HIPAS UCP HSP Openstack Sascha Oehl
 
AIOps: Anomalous Span Detection in Distributed Traces Using Deep Learning
AIOps: Anomalous Span Detection in Distributed Traces Using Deep LearningAIOps: Anomalous Span Detection in Distributed Traces Using Deep Learning
AIOps: Anomalous Span Detection in Distributed Traces Using Deep Learning
 
On the Application of AI for Failure Management: Problems, Solutions and Algo...
On the Application of AI for Failure Management: Problems, Solutions and Algo...On the Application of AI for Failure Management: Problems, Solutions and Algo...
On the Application of AI for Failure Management: Problems, Solutions and Algo...
 
Octo and the DevSecOps Evolution at Oracle by Ian Van Hoven
Octo and the DevSecOps Evolution at Oracle by Ian Van HovenOcto and the DevSecOps Evolution at Oracle by Ian Van Hoven
Octo and the DevSecOps Evolution at Oracle by Ian Van Hoven
 
PrestoCloud project: Big Data Orchestration for Batch+Stream in Clouds&Fogs w...
PrestoCloud project: Big Data Orchestration for Batch+Stream in Clouds&Fogs w...PrestoCloud project: Big Data Orchestration for Batch+Stream in Clouds&Fogs w...
PrestoCloud project: Big Data Orchestration for Batch+Stream in Clouds&Fogs w...
 
#OSSPARIS17 - PrestoCloud Project: Big Data Orchestration for Batch+Stream in...
#OSSPARIS17 - PrestoCloud Project: Big Data Orchestration for Batch+Stream in...#OSSPARIS17 - PrestoCloud Project: Big Data Orchestration for Batch+Stream in...
#OSSPARIS17 - PrestoCloud Project: Big Data Orchestration for Batch+Stream in...
 

Plus de Activeeon

Different usages of Machine Learning Open Studio
Different usages of Machine Learning Open StudioDifferent usages of Machine Learning Open Studio
Different usages of Machine Learning Open StudioActiveeon
 
ProActive Workflows & Scheduling
ProActive Workflows & Scheduling ProActive Workflows & Scheduling
ProActive Workflows & Scheduling Activeeon
 
Machine Learning open studio solution for data scientists & developers
Machine Learning open studio solution for data scientists & developersMachine Learning open studio solution for data scientists & developers
Machine Learning open studio solution for data scientists & developersActiveeon
 
Infinite power at your fingertips with Microsoft Azure Cloud & ActiveEon
Infinite power at your fingertips with Microsoft Azure Cloud & ActiveEonInfinite power at your fingertips with Microsoft Azure Cloud & ActiveEon
Infinite power at your fingertips with Microsoft Azure Cloud & ActiveEonActiveeon
 
Machine learning-for-dummies-andrews-sobral-activeeon
Machine learning-for-dummies-andrews-sobral-activeeonMachine learning-for-dummies-andrews-sobral-activeeon
Machine learning-for-dummies-andrews-sobral-activeeonActiveeon
 
ActiveEon - Automate, Accelerate, Scale
ActiveEon - Automate, Accelerate, ScaleActiveEon - Automate, Accelerate, Scale
ActiveEon - Automate, Accelerate, ScaleActiveeon
 

Plus de Activeeon (6)

Different usages of Machine Learning Open Studio
Different usages of Machine Learning Open StudioDifferent usages of Machine Learning Open Studio
Different usages of Machine Learning Open Studio
 
ProActive Workflows & Scheduling
ProActive Workflows & Scheduling ProActive Workflows & Scheduling
ProActive Workflows & Scheduling
 
Machine Learning open studio solution for data scientists & developers
Machine Learning open studio solution for data scientists & developersMachine Learning open studio solution for data scientists & developers
Machine Learning open studio solution for data scientists & developers
 
Infinite power at your fingertips with Microsoft Azure Cloud & ActiveEon
Infinite power at your fingertips with Microsoft Azure Cloud & ActiveEonInfinite power at your fingertips with Microsoft Azure Cloud & ActiveEon
Infinite power at your fingertips with Microsoft Azure Cloud & ActiveEon
 
Machine learning-for-dummies-andrews-sobral-activeeon
Machine learning-for-dummies-andrews-sobral-activeeonMachine learning-for-dummies-andrews-sobral-activeeon
Machine learning-for-dummies-andrews-sobral-activeeon
 
ActiveEon - Automate, Accelerate, Scale
ActiveEon - Automate, Accelerate, ScaleActiveEon - Automate, Accelerate, Scale
ActiveEon - Automate, Accelerate, Scale
 

Dernier

Strategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsStrategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsJean Silva
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringHironori Washizaki
 
eSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolseSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolsosttopstonverter
 
Amazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilitiesAmazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilitiesKrzysztofKkol1
 
Patterns for automating API delivery. API conference
Patterns for automating API delivery. API conferencePatterns for automating API delivery. API conference
Patterns for automating API delivery. API conferencessuser9e7c64
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Rob Geurden
 
What’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesWhat’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesVictoriaMetrics
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalLionel Briand
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jNeo4j
 
SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?Alexandre Beguel
 
Keeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository worldKeeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository worldRoberto Pérez Alcolea
 
2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shardsChristopher Curtin
 
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxReal-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxRTS corp
 
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...OnePlan Solutions
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slidesvaideheekore1
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
Zer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdfZer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdfmaor17
 
Osi security architecture in network.pptx
Osi security architecture in network.pptxOsi security architecture in network.pptx
Osi security architecture in network.pptxVinzoCenzo
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingShane Coughlan
 
Understanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM ArchitectureUnderstanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM Architecturerahul_net
 

Dernier (20)

Strategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsStrategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero results
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their Engineering
 
eSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolseSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration tools
 
Amazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilitiesAmazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilities
 
Patterns for automating API delivery. API conference
Patterns for automating API delivery. API conferencePatterns for automating API delivery. API conference
Patterns for automating API delivery. API conference
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...
 
What’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesWhat’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 Updates
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive Goal
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?
 
Keeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository worldKeeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository world
 
2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards
 
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxReal-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
 
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slides
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
Zer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdfZer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdf
 
Osi security architecture in network.pptx
Osi security architecture in network.pptxOsi security architecture in network.pptx
Osi security architecture in network.pptx
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
 
Understanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM ArchitectureUnderstanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM Architecture
 

Activeeon - Scale Beyond Limits

  • 1. Paris, Sophia Antipolis, London, San Jose USA A Key Technology Provider and Actor in the Cloud Migration within all Big Compute verticals and at the heart of IA/Machine Learning revolution Paris, Sophia Antipolis, London, San Jose USA, Montreal CA
  • 2. 1. Company / Team 400 man / year of R&D 2 patents 30 highly qualified engineers out of which 17 are PhD’s References in all Industries in the US and EMEA
  • 3. Global Locations Partnerships Key information Management Denis Caromel, CEO François Tournesac, CSO Fabien Viale, CTO Marco Castigliego, CAO Company ISV Founded in 2007 by Denis Caromel in Sophia-Antipolis, Spin-off of INRIA Addressing $80 Billion Hybrid Cloud Market with 27% CAGR Disruptive Patented Technology w/ Exceptional Business Outcomes 60% of the revenue from international Sophia-Antipolis (France) Paris (France) London (United Kingdom) San-Jose (United States) Montreal (Canada) Fribourg (Switzerland) Dakar (Senegal) ProActive Solution Job Scheduling, Workload Automation Orchestration & Meta-Scheduling On-premises and on all clouds Open Source
  • 4. 2005 An R&D Team of 45 persons headed by Denis Caromel developing a Core Kernel for Distributed, Parallel & Cloud at INRIA (largest EU Computer Science Research Institute, 6 000 persons). Foundation of ActiveEon Co-development between INRIA Team & ActiveEon IP Technology Transfer from INRIA to ActiveEon 2007 2009 Scheduler added to the Core 2011 Resource Manager added 2013 2014 2016 2017 Orchestration with Powerful Workflows added First very large customer references in Production International Expansions in UK, USA, Africa ActiveEon Story R&D Investment 250 M/Y at INRIA + 150 M/Y at AE = 400 Man/Year 2018 Machine Learning Open Studio added to ProActive International Expansions in Canada
  • 5. Express Business Needs with Granular Workflows • Distributed & Parallel Computing • On-premises & Cloud Architectures A 30+ PhD/Engineer team with focused fields of high expertise: • Big Data • IA, ML • IoT • Finance • Gov. • HPC • ... • Data Science, Machine Learning, IA, Matlab, R • Big Compute and HPC Visdom ActiveEon Technology R&D Investment: 250 M/Y at INRIA + 150 M/Y at AE = 400 Man/Year
  • 6. Process Flow & Operation Execute & Monitor Design Schedule, Monitor & Connect the resources IT department, Data Scientist, Business Lines, Activeeon Services IT department, Business owner Operational team 1 2 3 PROACTIVE STUDIO PROACTIVE RESOURCES MANAGER PROACTIVE SCHEDULER PROACTIVE AUTOMATION PORTAL
  • 7. Next Generation Scheduler/Orchestration Scheduler and Orchestration Priority & Planning Parallel Executions Error Management Multi Users </>OpenRESTAPI Resource Management and Monitoring Slurm SGE PBS LSF Multi- platform Local Machine Network Resource Batch Scheduler Cloud Processing and Automation Workflows Any language Secured Data Transfers Meta- scheduler ETL, ERP, ELT, … Full integration Translator
  • 9. Machine Learning Open Studio https://www.youtube.com/watch?v=mbrQxCf4lqM
  • 10. Automation Dashboard - Catalog Workflows stored in buckets in the Catalog RBAC support for each bucket / Users can share workflows and templates Keep track of the revisions with a versioning feature integrated
  • 11. Job Planner DefineCalendars AssociateWorkflowstoCalendars VisualizeExecutionPlanning Manage recurring Jobs Forecast and check future Executions Control recurring jobs from one endpoint Schedule Exceptions through Exclusion Calendars & Inclusion Calendars
  • 12. Cloud Automation: On-demand PaaS On-Demand PaaS Services with full Life-Cycle Management
  • 13. Scalable & Elastic resources Incremental resource deployment 100% resources usage, no waste Smart scale down Provides cloud computing power according to your needs. Minimize costs by deploying VMs only when needed (configurable load factor). Never exceed your budget (min/max VMs threshold). Smart and fully configurable elastic policy. Shutdown unused VMs whenever it's possible. Prevent time-consuming re- deployments by adjusting idle nodes’ release delay (avoid scale up/down cycles).
  • 14. Global Locations Some Supported Languages and Connectors AWSAzure GCP Docker OpenShiftLinux Windows Solaris VMware Openstack Infrastructure Slurm PBS LSF LSF HPC Schedulers Google Cloud Platform PBS Works Cmd Java Scala Javascript Groovy Ruby Jython Python Perle PHP R Cron LDAPPowerShell Languages and Predefined Tasks FTPURL SFTP MySQL Oracle Data Connectors Linux Bash MongoDB Cassandra AWS-S3 Clouds PostgreSQL Greenplum SQL Server Azure Storage Azure Data Lake KafkaZookeeper Spark Hadoop Big Data Azure Databricks Hadoop HDFS Twitter LogstashSAP Elasticsearch SGE SGE CNTK Keras PyTorch YOLO H2OTensoFlow Machine Learning & Artificial Intelligence Caffe Spark MLlib Pandas JupyterLab Visdom Visualization KibanaSwarm Storm Clearwater Cognitive Services Grafana Scikit-Learn MXNet AIX AS/400 C++/C# Cuda Specialized Open/CL FPGA DLib BigDL DeepLearning G4J Kubernetes
  • 15. 20 000 Cores Azure Benchmarks With ActiveEon Workflows & Scheduler: 15 mn to trigger and acquire 20 000 Azure Cores and to schedule 20 000 Tasks 99% of requests having less than 90ms response-time On 20K Cores, with 19K running Tasks, only 5 sec. to detect a software failure and redeploy the Task With 19K running tasks, only 30.8 sec. to execute a Job with 10 Tasks of each 30 sec., 97.4% efficiency.
  • 16. 2. Some Typical Customer Cases: Capabilities & Portfolio Revue Large Worldwide International Companies Early Adopters Using ActiveEon for Critical Business Applications Finance IoT Gov. Manufacturing Automotive Aerospace Nuclear RedHat OpenShift
  • 17. Some Major Customers Telco & IT Bio Tech & Health FinanceEngineering Aeronautics Energy & Space Some Partners: Media Distribution Government IoTCosmetics
  • 18. L&G a leading multinational finance and insurance company with headquarters in London Situation Comply with new European regulations: Solvency II, Basel III, etc. Transform legacy system and embrace cloud computing Solution Activeeon ProActive and migration to the Cloud have enabled faster and more reliable execution: • Cloud bursting • Error management • Prioritization Benefits From 18 hours to 2 hours for priority reports Agile development with an objective of 4,000 cores  $1.2m / year committed spent on Cloud Finance Time 64VMs,eachwith16vCPUs
  • 19. Home Hoffice is the UK Ministry of Interior. They are using ActiveEon for 2 critical applications: • Visa Delivery Process, and • Big Data & Analytics platform for Crime Reduction (HODAC). Situation In need to integrate 25 different sources of Data in order to build a consolidated Data Lake and analytics platform to be used for many Home Land security applications. Solution ActiveEon used as the central Orchestrator to Schedule and Meta-Schedule all the Big Data, ETL, Analytics, Machine Learnigs software appliance of the platform (Hadoop, SAS, TIBCO Spotfire, Python, Anaconda, GreenPlum, ElasticSearch, …). Benefits • Central Orchestration Tool • Workflow Expressiveness: universal & comprehensive • Management of Security for highly sensitive environments • Management of Resources for all appliances (SAS, TIBCO,… ). « ActiveEon is the only solution capable to Schedule any Big Data Analytics, mono-threaded, multi-threaded, multi- core, parallel and distributed » Cap Gemini Lead Engineer for Home Office Gov.: UK Ministry of Interior
  • 20. Komatsu is a Japanese multinational corporation It manufactures construction, mining, industrial and military equipment. Situation ActiveEon Orchestrates on Cloud execution over hot and cold storage for streaming and batch analytics > 1,200 tasks executed per hour Solution Activeeon ProActive has enabled control over and scheduling over execution: • Error Management – Notification, Automated Recovery • Job Planner • Distribution & Parallelization Benefits • Reliable execution to orchestrate multiple services and resources • Provide consistent results and KPIs to end users and BI Tools IoT
  • 21. PEPs is the French platform that offers access to the products of the Sentinel satellites (S1A and S1B, S2A and S2B, S3A and S3B) of the European Union Program for Earth observation and monitoring Copernicus Situation Make Sentinel data available to the greatest number and encourage the development of applications using them (agriculture, maritime field...) 1 petabyte (1015 bytes) in 20 years and 7 petabytes in 2 years! Solution Proactive Solution provided by ActiveEon to execute on Azure in hybrid mode allows enhancing PEPS data and making them available to API providers : • Multi-Cloud Ecosystem Platform • Remove complexity for Data Scientists • Provide Cloud performance Benefits • Faster execution, Optimisation of On-Prem ressources & Clouds, • Easier to use by end-users Space & Image Processing
  • 22. Platform for Cosmetic Formulation for 2000 persons around the world and for Innovation Team. (Statistic, Machine Learning, Use of Language R) 2 000 persons around the World Innovation Team (Statistics, ML, R) Workflows OrchestrationMonitoring Data Compute Data Mining Private Network + HTTPS
  • 23. ProActive Cloud Watch Environment Environment MachineLearninginITLogAnalysisforErrorDetection&PredictioninFinancialMarket Analysis & Classification • Machine Learning • Artificial Intelligence • Probabilistic Analysis Resources / Applications / Services Resources / Applications / Services Resources / Applications / Services Business Users 11 1 1 2 3 Collect data from any sources Update model Update event driven system Events Monitoring Complex Event Processing • Rule based • Actions triggering 3 Alert Predictive Incident Request for incident analysis 2 Automated Preventive Action Incidents Incidents Finance Domain: Deep ML for IT Infrastructure Main Benefits Openness and diversity of ML frameworks to be used (vs. Splunk) Both Batch and Streaming Workflow Expressiveness: universal & comprehensive IT Users
  • 24. Orchestration of RedHat OpenShift On-Prem & On Azure Orchestrate & Manage all layers: IaaS, PaaS, SaaS. Multi-Cloud, Hybrid, Scalable,
  • 25. Digital transformation for manufacturing BENEFITS Reduce the distance between the virtual and the manufacturing process Take advantage of digitalization in the machine tool field for intelligent manufacturing and more efficient production FEATURES Cloud-based big data analytics during machining Optimization of machining parameters using workflows Process simulation and optimization tools Physical measurements and monitoring Virtual / real part model correction Use of AI TARGETED SECTORS Manufacturing, automotive, aerospace Cloud processing services in manufacturing END USERS
  • 26. Workflows for HPC multi-physics engineering simulations in automotive and aerospace BENEFITS Thermal resistance for engine partsFEATURES Parallel evaluation of optimal mesh size for the best tradeoff between execution time and result accuracy Complex workflow management: monitoring, scheduling and orchestration Infrastructure management: on-premises and cloud HPC Data collection and processing END USERS Pollution levels in a district Workflow for exploration of tradeoff between execution time and result accuracy DOMAIN: COMPUTATIONAL FLUID DYNAMICS (CFD) AND POST-PROCESSING TOOLS Acceleration and Automation of Design Analysis and Optimizations
  • 27. Deep Learning forAnomaly Detection in Satellite Manufacturing FEATURES Detection of wires defect on a set of images from production line using Deep Learning Deep Learning on images of wires: occlusion, variation, noise, grayscale, semantic analysis Detection of defaults using a pre-defined wire model and computing a distance measure Workflows for model training and prediction for parallel execution BENEFITS Automatic detection of defaults in hybrid circuits manufacturing Higher precision of Machine Learning results Faster results with parallel execution of machine learning workflows Workflows can be used for other applications Faulty wires come out in red
  • 28. Big DataAnalysis forAutomatedAnomaly Tracking in Satellite Communication FEATURES Data analysis: checking packets number of service telemetries, order and type Incident evolution forecasts Big data workflows for automation of Test Scenarios Automatic detection of remote controls that didn’t receive expected telemetries Data visualization in browser BENEFITS Automatic and early detection of defaults via trends analysis of test results Engineering process improvement: margin assessment, robustness analysis, model elaboration based on actual behaviors Workflows allowing to accelerate treatments of fast- growing test data amounts Data fetching from many sources ProActive workflow for service telemetries verification Visualisation of anomalies
  • 29. Acceleration of Non-Destructive Evaluation (NDE) for Nuclear Energy, Oil & Gas,Aerospace FEATURES NDE batch processing, parametric studies, non-regression tests on multiple clusters Transfer Input and Output data Event programming to follow executions Workflow process definition Activeeon guidance and support Cloud version: Execution on Microsoft Azure with 50 VMs/day per CIVA user  25K nodes/year A potential of $1M$ Azure spending per Year BENEFITS Flexibility and enabler of interoperability between heterogeneous infrastructures Ability to run large POD (Probability of Detection) computations, which were taking months on a single computer Large-scale simulations with Microsoft Azure cloud Radiography – Pipes weld inspection ABOUT CIVA NDE SOLUTION: Multi-technique (Ultrasound, Eddy current, Radiography) software platform developed by the CEA LIST and its partners The software is distributed by EXTENDE and its distributors Eddy current - Simulations END USERS Nuclear Energy, Oil & Gas, Aeronautics,
  • 30. Resource Manager Scheduler Calendar Sync 200 to 300 jobs planned per week 72 000 patient diagnostics delivered to nurses Main Benefits Job Visualization within Calendar Edit job planning from both interfaces Visualize parallel tasks Visualize task information in one view Usage of customer’s external database: Oracle 11g Database using Red Hat Hibernate ORM (Object – Relational – Mapping) Formerly part of Task-Centric View Used
  • 31. Scheduler Passive Mediametrie: TV Audience Measurement Scheduler Active EC2 Spot Instances Low costs EC2 Instances Regular costs IaaS On-Prem Main Benefits Deployed On Premise (Capex) or on a Hosting Service (Opex) Auto-scaling on infrastructure to match capacity and demand Huge costs optimization using only the VMs needed and interruptible low cost instances (e.g. EC2 Spot instances)
  • 32. CHALLENGES Process 500 terabytes per year Flexibility and enabler of interoperability between heterogeneous services Job affinity with data location Transfer sensitive data to the cloud for processing RESULTS Efficient metagenomics pipeline Granular compute management User friendly system for maximum utilization Secure transfers Simple workflow process definition Workflow model and data management Compute migration from on-prem to the cloud MAIN DRIVER REQUIREMENTS Guidance and support to achieve high performances Fit in hybrid architecture multiplatform Integration with R FlexLM support (licenses manager) Remote Visualization for interactive tasks COMPANY PROFILE Industry: BioTech Product: Metagenomics
  • 33. Quantitative Metagenomics Platform for gene profiling and statistical analysis Domain-specific Users Windows Cluster 1 192 cores Linux Cluster 2 366 cores Scheduler Web Portal Total DNA QC/Library preparation SoLiD/Illumina Sequencing 1TB / Sequence Analysis 40TB Parallel DataBase Pre, Post Processing of Data Analysis Flexibility, Speed of Analysis Granular execution Fast Architecture Overview
  • 34. Paris, Sophia Antipolis, London, San Jose USA @activeeon contact@activeeon.com +33 988 777 660 Automate Accelerate & Scale 10K Nodes, 20K Tasks, 1M Jobs Paris, Sophia Antipolis, London, San Jose USA, Montreal CA