SlideShare une entreprise Scribd logo
1  sur  43
The CERN OpenStack Cloud
Compute Resource Provisioning for the
Large Hadron Collider
2
Jan van Eldik
for the CERN Cloud Team
OpenStack Days Nordic
Stockholm
September 22, 2016
3The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Agenda
• Introduction to CERN
• Computing at CERN scale
• Cloud Service Overview
• Operations
• Performance
• Outlook
CERN: home.cern
• European Organization for Nuclear
Research (Conseil Européen pour la Recherche Nucléaire)
- Founded in 1954, today 22 member states
- World’s largest particle physics laboratory
- Located at Franco-Swiss border near Geneva
- ~2’300 staff members, >12’500 users
- Budget: ~1000 MCHF (2016)
• CERN’s mission
- Answer fundamental questions of the universe
- Advance the technology frontiers
- Train the scientists and engineers of tomorrow
- Bring nations together
4The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
The Large Hadron Collider (LHC)
5
Largest machine on earth: 27km circumference
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
LHC: 9’600 Magnets for Beam Control
6
1232 superconducting dipoles for bending: 14m, 35t, 8.3T, 12kA
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
LHC: Coldest Temperature
7
World’s largest cryogenic system: colder than outer space (1.9K/2.7K), 120t of He
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
LHC: Highest Vacuum
8
Vacuum system: 104 km of pipes, 10-10-10-11 mbar (comparable to the moon)
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
9
LHC: Detectors
Four main experiments to study the fundamental properties of the universe
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
10
A collision at LHC
January 2013 - The LHC Computing Grid - Nils
Hoymir
Ian.Bird@cern.ch 11
The Data Acquisition
January 2013 - The LHC Computing Grid - Nils
Hoymir
Tier 0 at CERN: Acquisition, First pass reconstruction,
Storage & Distribution
1.25 GB/sec (ions)
2011: 400-500 MB/sec
2011: 4-6 GB/sec
Solution: the Grid
• Use the Grid to unite computing
resources of particle physics institutes
around the world
The World Wide Web
provides seamless access to
information that is stored in
many millions of different
geographical locations
The Grid is an infrastructure
that provides seamless
access to computing power
and data storage capacity
distributed over the globe
14
LHC: World-wide Computing Grid
TIER-1:
permanent storage,
re-processing,
analysis
TIER-0 (CERN):
data recording,
reconstruction and
distribution
TIER-2:
simulation,
end-user analysis
> 2 million jobs/day
~350’000 cores
500 PB of storage
nearly 170 sites,
40 countries
10-100 Gb links
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
LHC: Data & Compute Growth
0
20
40
60
80
100
120
140
160
Run 1 Run 2 Run 3 Run 4
GRID
ATLAS
CMS
LHCb
ALICE
0.0
50.0
100.0
150.0
200.0
250.0
300.0
350.0
400.0
450.0
Run 1 Run 2 Run 3 Run 4
CMS
ATLAS
ALICE
LHCb
PB/year109HS06sec/month
What we
can afford
15The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Collisions produce ~1 PB/s
2012: Enter the cloud
• Aim: virtualize all the machines
• Unless really, really, really not possible
• Offer Cloud endpoints to users
• Scale horizontally
• Consolidate server provisioning
• Yes, use the private cloud for server
consolidation usecases as well
16The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
In production:
• 4 clouds
• >200K cores
• >8,000
hypervisors
90% of CERN’s
compute resources
are now delivered
on top of OpenStack
OpenStack at CERN
17The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Cloud Service Context
• CERN IT to enable the laboratory to fulfill its mission
- Main data center on the Geneva site
- Wigner data center, Budapest, 23ms distance
- Connected via two dedicated 100Gbs links
• CERN Cloud Service one of the three
major components in IT’s AI project
- Policy: Servers in CERN IT shall be virtual
• Based on OpenStack
- Production service since July 2013
- Performed 4 rolling upgrades since
- Currently in transition from Liberty to Mitaka
- Nova, Glance, Keystone, Horizon, Cinder, Ceilometer, Heat, Neutron, Magnum,
Barbican
18The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
http://goo.gl/maps/K5SoG
CERN Cloud Architecture (1)
19
• Deployment spans our two
data centers
- 1 region (to have 1 API), ~40 cells
- Cells map use cases
hardware, hypervisor type, location, users, …
• Top cell on physical and virtual
nodes in HA
- Clustered RabbitMQ with mirrored queues
- API servers are VMs in various child cells
• Child cell controllers are OpenStack VMs
- One controller per cell
- Tradeoff between complexity and failure impact
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
CERN Cloud Architecture (2)
20
nova-cells
rabbitmq
Top cell controller API server
nova-api
rabbitmq
nova-cells
nova-api
nova-scheduler
nova-conductor
nova-network
Child cell controller
Compute node
nova-compute
rabbitmq
nova-cells
nova-api
nova-scheduler
nova-conductor
nova-network
Child cell controller
Compute node
nova-compute
DB infrastructure
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
CERN Cloud in Numbers (1)
• ~6700 hypervisors in production
- Split over 40+ Nova cells
- Vast majority qemu/kvm now on CERN CentOS 7 (~150 Hyper-V hosts)
- ~2’100 HVs at Wigner in Hungary (batch, compute, services)
- 370 HVs on critical power
• 190k Cores
• ~430 TB RAM
• ~20’000 VMs
• Big increase during 2016!
- +57k cores in spring
- +40k cores in autumn
21The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
CERN Cloud in Numbers (2)
• 2’700 images/snapshots
- Glance on Ceph
• 2’300 volumes
- Cinder on Ceph (& NetApp) in GVA & Wigner
22
Only issue during
2 years in prod:
Ceph Issue 6480
Rally down
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Every 10s a VM
gets created or
deleted in our
cloud!
Software Deployment
23The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
• Deployment based on CentOS and RDO
- Upstream, only patched where necessary
(e.g. nova/neutron for CERN networks)
- Some few customizations
- Works well for us
• Puppet for config’ management
- Introduced with the adoption of AI paradigm
• We submit upstream whenever possible
- openstack, openstack-puppet, RDO, …
• Updates done service-by-service over several months
- Running services on dedicated (virtual) servers helps
(Exception: ceilometer and nova on compute nodes)
• Upgrade testing done with packstack and devstack
- Depends on service: from simple DB upgrades to full shadow installations
‘dev’ environment
24The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
• Simulate the full CERN cloud environment
on your laptop, even offline
• Docker containers in a Kubernetes cluster
- Clone of central Puppet configuration
- Mock-up of
- our two Ceph clusters
- our secret store
- our network DB & DHCP
- Central DB & Rabbit instances
- One POD per service
• Change & test in seconds
• Full upgrade testing
25The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
(*) Pilot
ESSEX
Nova (*)
Swift
Glance (*)
Horizon (*)
Keystone (*)
FOLSOM
Nova (*)
Swift
Glance (*)
Horizon (*)
Keystone (*)
Quantum
Cinder
GRIZZLY
Nova
Swift
Glance
Horizon
Keystone
Quantum
Cinder
Ceilometer (*)
HAVANA
Nova
Swift
Glance
Horizon
Keystone
Neutron
Cinder
Ceilometer (*)
Heat
ICEHOUSE
Nova
Swift
Glance
Horizon
Keystone
Neutron
Cinder
Ceilometer
Heat
JUNO
Nova
Swift
Glance
Horizon
Keystone
Neutron
Cinder
Ceilometer
Heat (*)
Rally (*)
5 April 2012 27 September 2012 4 April 2013 17 October 2013 17 April 2014 16 October 2014
July 2013
CERN OpenStack
Production Service
February 2014
CERN OpenStack
Havana Release
October 2014
CERN OpenStack
Icehouse Release
30 April 2015
March2015
CERN OpenStack
Juno Release
LIBERTY
Nova
Swift
Glance
Horizon
Keystone
Neutron (*)
Cinder
Ceilometer
Heat
Rally
Magnum (*)
Barbican (*)
15 October 2015
September 2015
CERN OpenStack
Kilo Release
KILO
Nova
Swift
Glance
Horizon
Keystone
Neutron
Cinder
Ceilometer
Heat
Rally
May 2016
CERN OpenStack
ongoing Liberty
Cloud Service Release Evolution
Rich Usage Spectrum …
26The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
• Batch service
- Physics data analysis
• IT Services
- Sometimes built on top of other
virtualised services
• Experiment services
- E.g. build machines
• Engineering services
- E.g. micro-electronics/chip design
• Infrastructure services
- E.g. hostel booking, car rental, …
• Personal VMs
- Development
… rich requirement spectrum!
Usecases (1)
Server consolidation:
• Service nodes, dev boxes, Personal VMs, …
• Performance less important than “durability”
• Live-migration is desirable
• Persistent block storage is required
• Linux VM @ KVM, Windows VMs @ HyperV
• Starting to run Win VMs under KVM
• “Pets usecase”: 32K cores, 7500 VMs
27The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Usecases (2)
• Compute workloads
• Optimize for compute efficiency
• CPU passthrough, NUMA aware flavours
• Still, very different workloads
• IT Batch: LSF and HTCondor, longlived VMs,
8 and 16-core VMs, “full-node” flavors
• CMS Tier-0: medium-long, 8-core VMs
• LHCb Vcycle: short-lived, single core VMs
• Low-SLA, “cattle usecase”
• 150K cores, 12500 VMs @ 6000 compute nodes
28The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Wide Hardware Spectrum
29The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
• The ~6700 hypervisors differ in …
- Processor architectures: AMD vs. Intel (av. features, NUMA, …)
- Core-to-RAM ratio (1:2, 1:4, 1:1.5, ...)
- Core-to-disk ratio (going down with SSDs!)
- Disk layout (2 HDDs, 3 HDDs, 2 HDDs + 1 SSD, 2 SSDs, …)
- Network (1GbE vs 10 GbE)
- Critical vs physics power
- Physical location (Geneva vs. Budapest)
- Network domain (LCG vs. GPN vs. TN)
- CERN CentOS 7, RHEL7, SLC6, Windows
- …
• Variety reflected/accessible via instance types,
cells, projects … variety not necessarily visible to
users!
- We try to keep things simple and hide some of the complexity
- We can react to (most of the) requests with special needs
Basic Building Blocks: Instance Types
30The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Name vCPUs RAM [GB] Ephemeral
[GB]
m2.small 1 1.875 10
m2.medium 2 3.750 20
m2.large 4 7.5 40
m2.xlarge 8 15 80
m2.2xlarge 16 30 160
m2.3xlarge 32 60 320
~80 flavors in total (swap,
additional/large ephemerals, core-to-
RAM, …)
Basic Building Blocks: Volume Types
31The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Name IOPS b/w [MB/s] feature Location Backend
standard 100 80 GVA
io1 500 120 fast GVA
cp1 100 80 critical GVA
cpio1 500 120 critical/fast GVA
cp2 n.a. 120 critical/Windows GVA
wig-cp1 100 80 critical WIG
wig-cpio1 500 120 critical/fast WIG
m2.* flavor family plus volumes as
basic building blocks for services
Automate provisioning with
32The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Automate routine procedures
- Common place for workflows
- Clean web interface
- Scheduled jobs, cron-style
- Traceability and auditing
- Fine-grained access control
- …
Procedures for
- OpenStack project creation
- OpenStack quota changes
- Notifications of VM owners
- Usage and health reports
- …
Disable
compute node
• Disable nova-service
• Switch Alarms OFF
• Update Service-Now ticket
Notifications
• Send e-mail to VM owners
Other tasks
Post new
message broker
Add remote AT job
Save intervention
details
Send calendar
invitation
Operations: Retirement Campaign
33The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
• About 1’600 nodes to retire from the service by 3Q16
- ~1’200 from compute, ~400 with services (hosting ~5000 VMs)
• We have gained quite some experience with (manual)
migration
- Live where possible and cold where necessary
- Works reliably (where it can)
• We have developed a tool that you can instruct to drain
hypervisor (or simply migrate given VMs)
- Main tasks are VM classification and progress monitoring
- The nova scheduler will pick the target (nova patch)
• We are using the “IP service bridging” to handle
CERN network specifics
Operations: Network Migration
34
• We’ll need to replace nova-network
- It’s going to be deprecated (really really really this time)
• New potential features (Tenant networks, LBaaS,
Floating IPs, …)
• We have a number of patches to adapt to the CERN network
constraints
- We patched nova for each release …
- … neutron allows for out-of-tree plugins!
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
35
• We have ~5 Neutron cells in production
- Neutron control plane in Liberty (fully HA)
- Bridge agent in Kilo (nova)
• And it is very stable
• All new cells will be Neutron cells
• “As mentioned, there is currently no way to
cleanly migrate from nova-network to neutron.”
- All efforts to establish a general migration path failed so far
- Should be OK for us, various options (incl. in-place, w/ migration, …)
Operations: Neutron Status
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
36
• Magnum: OpenStack project to treat Container
Orchestration Engines (COEs) as 1st class
resources
• Pre-production service available
- Support for Docker Swarm, Kubernetes, Mesos
• Many users interested, usage ramping up
- GitLab CI, Jupyter/Swan, FTS, …
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Operations: Containers
Operations: Federated Clouds
Public Cloud such
as Rackspace or IBM
CERN Private Cloud
160K cores
ATLAS Trigger
28K cores
ALICE Trigger
9K cores
CMS Trigger
13K cores
INFN
Italy
Brookhaven
National Labs
NecTAR
Australia
Many Others
on Their Way
37The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
• Access to EduGAIN users via Horizon
- Allow (limited) access given appropriate membership
CPU Performance Issues
44
• The benchmarks on full-node VMs was about
20% lower than the one of the underlying host
- Smaller VMs much better
• Investigated various tuning options
- KSM*, EPT**, PAE, Pinning, … +hardware type dependencies
- Discrepancy down to ~10% between virtual and physical
• Comparison with Hyper-V: no general issue
- Loss w/o tuning ~3% (full-node), <1% for small VMs
- … NUMA-awareness!
*KSM on/off: beware of memory reclaim! **EPT on/off: beware of expensive page table walks!
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
CPU: NUMA
45
• NUMA-awareness identified as most
efficient setting
- Full node VMs have ~3% overhead in HS06
• “EPT-off” side-effect
- Small number of hosts, but very
visible there
• Use 2MB Huge Pages
- Keep the “EPT off” performance gain
with “EPT on”
• More details in this talk
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Operations: NUMA/THP Roll-out
46
• Rolled out on ~2’000 batch hypervisors (~6’000 VMs)
- HP allocation as boot parameter  reboot
- VM NUMA awareness as flavor metadata  delete/recreate
• Cell-by-cell (~200 hosts):
- Queue-reshuffle to minimize resource impact
- Draining & deletion of batch VMs
- Hypervisor reconfiguration (Puppet) & reboot
- Recreation of batch VMs
• Whole update took about 8 weeks
- Organized between batch and cloud teams
- No performance issue observed since
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Future Plans
47
• Investigate Ironic (Bare metal provisioning)
- OpenStack as one interface for compute resource provisioning
- Allow for complete accounting
- Use physical machines for containers
• Replace Hyper-V by qemu/kvm
- Windows expertise is a scarce resource in our team
- Reduce complexity in service setup
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
Summary
48
• OpenStack at CERN in production since 3 years
- We’re working closely with the various communities
- OpenStack, Ceph, RDO, Puppet, …
• Cloud service continues to grow and mature
- While experimental, good experience with Nova cells for scaling
- Experience gained helps with general resource provisioning
- New features added (containers, identity federation)
- Expansion planned (bare metal provisioning)
• Confronting some major operational challenges
- Transparent retirement of service hosts
- Replacement of network layer
• http://openstack-in-production.blogspot.com
(read about our recent 2M req/s Magnum & Kubernetes!)
The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
The OpenStack Cloud at CERN - OpenStack Nordic

Contenu connexe

Tendances

Future Science on Future OpenStack
Future Science on Future OpenStackFuture Science on Future OpenStack
Future Science on Future OpenStackBelmiro Moreira
 
20121017 OpenStack CERN Accelerating Science
20121017 OpenStack CERN Accelerating Science20121017 OpenStack CERN Accelerating Science
20121017 OpenStack CERN Accelerating ScienceTim Bell
 
CERN OpenStack Cloud Control Plane - From VMs to K8s
CERN OpenStack Cloud Control Plane - From VMs to K8sCERN OpenStack Cloud Control Plane - From VMs to K8s
CERN OpenStack Cloud Control Plane - From VMs to K8sBelmiro Moreira
 
Containers on Baremetal and Preemptible VMs at CERN and SKA
Containers on Baremetal and Preemptible VMs at CERN and SKAContainers on Baremetal and Preemptible VMs at CERN and SKA
Containers on Baremetal and Preemptible VMs at CERN and SKABelmiro Moreira
 
OpenStack at CERN : A 5 year perspective
OpenStack at CERN : A 5 year perspectiveOpenStack at CERN : A 5 year perspective
OpenStack at CERN : A 5 year perspectiveTim Bell
 
20190620 accelerating containers v3
20190620 accelerating containers v320190620 accelerating containers v3
20190620 accelerating containers v3Tim Bell
 
Moving from CellsV1 to CellsV2 at CERN
Moving from CellsV1 to CellsV2 at CERNMoving from CellsV1 to CellsV2 at CERN
Moving from CellsV1 to CellsV2 at CERNBelmiro Moreira
 
20181219 ucc open stack 5 years v3
20181219 ucc open stack 5 years v320181219 ucc open stack 5 years v3
20181219 ucc open stack 5 years v3Tim Bell
 
Learning to Scale OpenStack
Learning to Scale OpenStackLearning to Scale OpenStack
Learning to Scale OpenStackRainya Mosher
 
Unveiling CERN Cloud Architecture - October, 2015
Unveiling CERN Cloud Architecture - October, 2015Unveiling CERN Cloud Architecture - October, 2015
Unveiling CERN Cloud Architecture - October, 2015Belmiro Moreira
 
Deep Dive Into the CERN Cloud Infrastructure - November, 2013
Deep Dive Into the CERN Cloud Infrastructure - November, 2013Deep Dive Into the CERN Cloud Infrastructure - November, 2013
Deep Dive Into the CERN Cloud Infrastructure - November, 2013Belmiro Moreira
 
Tips Tricks and Tactics with Cells and Scaling OpenStack - May, 2015
Tips Tricks and Tactics with Cells and Scaling OpenStack - May, 2015Tips Tricks and Tactics with Cells and Scaling OpenStack - May, 2015
Tips Tricks and Tactics with Cells and Scaling OpenStack - May, 2015Belmiro Moreira
 
Evolution of Openstack Networking at CERN
Evolution of Openstack Networking at CERNEvolution of Openstack Networking at CERN
Evolution of Openstack Networking at CERNBelmiro Moreira
 
The OpenStack Cloud at CERN
The OpenStack Cloud at CERNThe OpenStack Cloud at CERN
The OpenStack Cloud at CERNArne Wiebalck
 
CERN IT Monitoring
CERN IT Monitoring CERN IT Monitoring
CERN IT Monitoring Tim Bell
 
Manila on CephFS at CERN (OpenStack Summit Boston, 11 May 2017)
Manila on CephFS at CERN (OpenStack Summit Boston, 11 May 2017)Manila on CephFS at CERN (OpenStack Summit Boston, 11 May 2017)
Manila on CephFS at CERN (OpenStack Summit Boston, 11 May 2017)Arne Wiebalck
 
Operators experience and perspective on SDN with VLANs and L3 Networks
Operators experience and perspective on SDN with VLANs and L3 NetworksOperators experience and perspective on SDN with VLANs and L3 Networks
Operators experience and perspective on SDN with VLANs and L3 NetworksJakub Pavlik
 
Integrating Bare-metal Provisioning into CERN's Private Cloud
Integrating Bare-metal Provisioning into CERN's Private CloudIntegrating Bare-metal Provisioning into CERN's Private Cloud
Integrating Bare-metal Provisioning into CERN's Private CloudArne Wiebalck
 
OpenContrail Implementations
OpenContrail ImplementationsOpenContrail Implementations
OpenContrail ImplementationsJakub Pavlik
 
OpenStack Paris 2014 - Federation, are we there yet ?
OpenStack Paris 2014 - Federation, are we there yet ?OpenStack Paris 2014 - Federation, are we there yet ?
OpenStack Paris 2014 - Federation, are we there yet ?Tim Bell
 

Tendances (20)

Future Science on Future OpenStack
Future Science on Future OpenStackFuture Science on Future OpenStack
Future Science on Future OpenStack
 
20121017 OpenStack CERN Accelerating Science
20121017 OpenStack CERN Accelerating Science20121017 OpenStack CERN Accelerating Science
20121017 OpenStack CERN Accelerating Science
 
CERN OpenStack Cloud Control Plane - From VMs to K8s
CERN OpenStack Cloud Control Plane - From VMs to K8sCERN OpenStack Cloud Control Plane - From VMs to K8s
CERN OpenStack Cloud Control Plane - From VMs to K8s
 
Containers on Baremetal and Preemptible VMs at CERN and SKA
Containers on Baremetal and Preemptible VMs at CERN and SKAContainers on Baremetal and Preemptible VMs at CERN and SKA
Containers on Baremetal and Preemptible VMs at CERN and SKA
 
OpenStack at CERN : A 5 year perspective
OpenStack at CERN : A 5 year perspectiveOpenStack at CERN : A 5 year perspective
OpenStack at CERN : A 5 year perspective
 
20190620 accelerating containers v3
20190620 accelerating containers v320190620 accelerating containers v3
20190620 accelerating containers v3
 
Moving from CellsV1 to CellsV2 at CERN
Moving from CellsV1 to CellsV2 at CERNMoving from CellsV1 to CellsV2 at CERN
Moving from CellsV1 to CellsV2 at CERN
 
20181219 ucc open stack 5 years v3
20181219 ucc open stack 5 years v320181219 ucc open stack 5 years v3
20181219 ucc open stack 5 years v3
 
Learning to Scale OpenStack
Learning to Scale OpenStackLearning to Scale OpenStack
Learning to Scale OpenStack
 
Unveiling CERN Cloud Architecture - October, 2015
Unveiling CERN Cloud Architecture - October, 2015Unveiling CERN Cloud Architecture - October, 2015
Unveiling CERN Cloud Architecture - October, 2015
 
Deep Dive Into the CERN Cloud Infrastructure - November, 2013
Deep Dive Into the CERN Cloud Infrastructure - November, 2013Deep Dive Into the CERN Cloud Infrastructure - November, 2013
Deep Dive Into the CERN Cloud Infrastructure - November, 2013
 
Tips Tricks and Tactics with Cells and Scaling OpenStack - May, 2015
Tips Tricks and Tactics with Cells and Scaling OpenStack - May, 2015Tips Tricks and Tactics with Cells and Scaling OpenStack - May, 2015
Tips Tricks and Tactics with Cells and Scaling OpenStack - May, 2015
 
Evolution of Openstack Networking at CERN
Evolution of Openstack Networking at CERNEvolution of Openstack Networking at CERN
Evolution of Openstack Networking at CERN
 
The OpenStack Cloud at CERN
The OpenStack Cloud at CERNThe OpenStack Cloud at CERN
The OpenStack Cloud at CERN
 
CERN IT Monitoring
CERN IT Monitoring CERN IT Monitoring
CERN IT Monitoring
 
Manila on CephFS at CERN (OpenStack Summit Boston, 11 May 2017)
Manila on CephFS at CERN (OpenStack Summit Boston, 11 May 2017)Manila on CephFS at CERN (OpenStack Summit Boston, 11 May 2017)
Manila on CephFS at CERN (OpenStack Summit Boston, 11 May 2017)
 
Operators experience and perspective on SDN with VLANs and L3 Networks
Operators experience and perspective on SDN with VLANs and L3 NetworksOperators experience and perspective on SDN with VLANs and L3 Networks
Operators experience and perspective on SDN with VLANs and L3 Networks
 
Integrating Bare-metal Provisioning into CERN's Private Cloud
Integrating Bare-metal Provisioning into CERN's Private CloudIntegrating Bare-metal Provisioning into CERN's Private Cloud
Integrating Bare-metal Provisioning into CERN's Private Cloud
 
OpenContrail Implementations
OpenContrail ImplementationsOpenContrail Implementations
OpenContrail Implementations
 
OpenStack Paris 2014 - Federation, are we there yet ?
OpenStack Paris 2014 - Federation, are we there yet ?OpenStack Paris 2014 - Federation, are we there yet ?
OpenStack Paris 2014 - Federation, are we there yet ?
 

Similaire à The OpenStack Cloud at CERN - OpenStack Nordic

20181219 ucc open stack 5 years v3
20181219 ucc open stack 5 years v320181219 ucc open stack 5 years v3
20181219 ucc open stack 5 years v3Tim Bell
 
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...Databricks
 
Toward 10,000 Containers on OpenStack
Toward 10,000 Containers on OpenStackToward 10,000 Containers on OpenStack
Toward 10,000 Containers on OpenStackTon Ngo
 
CERN Mass and Agility talk at OSCON 2014
CERN Mass and Agility talk at OSCON 2014CERN Mass and Agility talk at OSCON 2014
CERN Mass and Agility talk at OSCON 2014Tim Bell
 
Openstackoverview-DEC2013
Openstackoverview-DEC2013Openstackoverview-DEC2013
Openstackoverview-DEC2013Michael Lessard
 
All about open stack
All about open stackAll about open stack
All about open stackDataCentred
 
OpenStack Ecosystem – Xen Cloud Platform and Integration into OpenStack - in...
OpenStack Ecosystem – Xen Cloud Platform and Integration into OpenStack -  in...OpenStack Ecosystem – Xen Cloud Platform and Integration into OpenStack -  in...
OpenStack Ecosystem – Xen Cloud Platform and Integration into OpenStack - in...IndicThreads
 
Openstack For Beginners
Openstack For BeginnersOpenstack For Beginners
Openstack For Beginnerscpallares
 
OpenstackOverview.pdf
OpenstackOverview.pdfOpenstackOverview.pdf
OpenstackOverview.pdfKevinBuck30
 
OpenCloud - A Research Cloud
OpenCloud - A Research CloudOpenCloud - A Research Cloud
OpenCloud - A Research CloudON.Lab
 
From OpenStack to Docker swarm
From OpenStack to Docker swarmFrom OpenStack to Docker swarm
From OpenStack to Docker swarmNikolay Milovanov
 
CERN Data Centre Evolution
CERN Data Centre EvolutionCERN Data Centre Evolution
CERN Data Centre EvolutionGavin McCance
 
Cisco: Cassandra adoption on Cisco UCS & OpenStack
Cisco: Cassandra adoption on Cisco UCS & OpenStackCisco: Cassandra adoption on Cisco UCS & OpenStack
Cisco: Cassandra adoption on Cisco UCS & OpenStackDataStax Academy
 
PSOCLD-1006 Cisco Cloud Architectures on OpenStack - Cisco Live! US 2015 San ...
PSOCLD-1006 Cisco Cloud Architectures on OpenStack - Cisco Live! US 2015 San ...PSOCLD-1006 Cisco Cloud Architectures on OpenStack - Cisco Live! US 2015 San ...
PSOCLD-1006 Cisco Cloud Architectures on OpenStack - Cisco Live! US 2015 San ...Rohit Agarwalla
 
What is OpenStack and the added value of IBM solutions
What is OpenStack and the added value of IBM solutionsWhat is OpenStack and the added value of IBM solutions
What is OpenStack and the added value of IBM solutionsSasha Lazarevic
 
HNSciCloud represented at HUAWEI CONNECT 2017 in Shanghai
HNSciCloud represented at HUAWEI CONNECT 2017 in ShanghaiHNSciCloud represented at HUAWEI CONNECT 2017 in Shanghai
HNSciCloud represented at HUAWEI CONNECT 2017 in ShanghaiHelix Nebula The Science Cloud
 
Ceph for Big Science - Dan van der Ster
Ceph for Big Science - Dan van der SterCeph for Big Science - Dan van der Ster
Ceph for Big Science - Dan van der SterCeph Community
 

Similaire à The OpenStack Cloud at CERN - OpenStack Nordic (20)

20181219 ucc open stack 5 years v3
20181219 ucc open stack 5 years v320181219 ucc open stack 5 years v3
20181219 ucc open stack 5 years v3
 
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
 
Toward 10,000 Containers on OpenStack
Toward 10,000 Containers on OpenStackToward 10,000 Containers on OpenStack
Toward 10,000 Containers on OpenStack
 
CERN Mass and Agility talk at OSCON 2014
CERN Mass and Agility talk at OSCON 2014CERN Mass and Agility talk at OSCON 2014
CERN Mass and Agility talk at OSCON 2014
 
Openstackoverview-DEC2013
Openstackoverview-DEC2013Openstackoverview-DEC2013
Openstackoverview-DEC2013
 
All about open stack
All about open stackAll about open stack
All about open stack
 
Hybrid Cloud for CERN
Hybrid Cloud for CERN Hybrid Cloud for CERN
Hybrid Cloud for CERN
 
OpenStack Ecosystem – Xen Cloud Platform and Integration into OpenStack - in...
OpenStack Ecosystem – Xen Cloud Platform and Integration into OpenStack -  in...OpenStack Ecosystem – Xen Cloud Platform and Integration into OpenStack -  in...
OpenStack Ecosystem – Xen Cloud Platform and Integration into OpenStack - in...
 
Openstack For Beginners
Openstack For BeginnersOpenstack For Beginners
Openstack For Beginners
 
OpenstackOverview.pdf
OpenstackOverview.pdfOpenstackOverview.pdf
OpenstackOverview.pdf
 
OpenCloud - A Research Cloud
OpenCloud - A Research CloudOpenCloud - A Research Cloud
OpenCloud - A Research Cloud
 
From OpenStack to Docker swarm
From OpenStack to Docker swarmFrom OpenStack to Docker swarm
From OpenStack to Docker swarm
 
CERN Data Centre Evolution
CERN Data Centre EvolutionCERN Data Centre Evolution
CERN Data Centre Evolution
 
Cisco: Cassandra adoption on Cisco UCS & OpenStack
Cisco: Cassandra adoption on Cisco UCS & OpenStackCisco: Cassandra adoption on Cisco UCS & OpenStack
Cisco: Cassandra adoption on Cisco UCS & OpenStack
 
Openstack 101
Openstack 101Openstack 101
Openstack 101
 
PSOCLD-1006 Cisco Cloud Architectures on OpenStack - Cisco Live! US 2015 San ...
PSOCLD-1006 Cisco Cloud Architectures on OpenStack - Cisco Live! US 2015 San ...PSOCLD-1006 Cisco Cloud Architectures on OpenStack - Cisco Live! US 2015 San ...
PSOCLD-1006 Cisco Cloud Architectures on OpenStack - Cisco Live! US 2015 San ...
 
What is OpenStack and the added value of IBM solutions
What is OpenStack and the added value of IBM solutionsWhat is OpenStack and the added value of IBM solutions
What is OpenStack and the added value of IBM solutions
 
HNSciCloud represented at HUAWEI CONNECT 2017 in Shanghai
HNSciCloud represented at HUAWEI CONNECT 2017 in ShanghaiHNSciCloud represented at HUAWEI CONNECT 2017 in Shanghai
HNSciCloud represented at HUAWEI CONNECT 2017 in Shanghai
 
Ceph for Big Science - Dan van der Ster
Ceph for Big Science - Dan van der SterCeph for Big Science - Dan van der Ster
Ceph for Big Science - Dan van der Ster
 
Oow2016 review-iaas-paas-13th-18thoctober
Oow2016 review-iaas-paas-13th-18thoctoberOow2016 review-iaas-paas-13th-18thoctober
Oow2016 review-iaas-paas-13th-18thoctober
 

Plus de Tim Bell

CERN Status at OpenStack Shanghai Summit November 2019
CERN Status at OpenStack Shanghai Summit November 2019CERN Status at OpenStack Shanghai Summit November 2019
CERN Status at OpenStack Shanghai Summit November 2019Tim Bell
 
20190314 cern register v3
20190314 cern register v320190314 cern register v3
20190314 cern register v3Tim Bell
 
20141103 cern open_stack_paris_v3
20141103 cern open_stack_paris_v320141103 cern open_stack_paris_v3
20141103 cern open_stack_paris_v3Tim Bell
 
20140509 cern open_stack_linuxtag_v3
20140509 cern open_stack_linuxtag_v320140509 cern open_stack_linuxtag_v3
20140509 cern open_stack_linuxtag_v3Tim Bell
 
Open stack operations feedback loop v1.4
Open stack operations feedback loop v1.4Open stack operations feedback loop v1.4
Open stack operations feedback loop v1.4Tim Bell
 
CERN clouds and culture at GigaOm London 2013
CERN clouds and culture at GigaOm London 2013CERN clouds and culture at GigaOm London 2013
CERN clouds and culture at GigaOm London 2013Tim Bell
 
20130529 openstack cee_day_v6
20130529 openstack cee_day_v620130529 openstack cee_day_v6
20130529 openstack cee_day_v6Tim Bell
 
Academic cloud experiences cern v4
Academic cloud experiences cern v4Academic cloud experiences cern v4
Academic cloud experiences cern v4Tim Bell
 
Ceilometer lsf-intergration-openstack-summit
Ceilometer lsf-intergration-openstack-summitCeilometer lsf-intergration-openstack-summit
Ceilometer lsf-intergration-openstack-summitTim Bell
 
Havana survey results-final-v2
Havana survey results-final-v2Havana survey results-final-v2
Havana survey results-final-v2Tim Bell
 
Havana survey results-final
Havana survey results-finalHavana survey results-final
Havana survey results-finalTim Bell
 
20121205 open stack_accelerating_science_v3
20121205 open stack_accelerating_science_v320121205 open stack_accelerating_science_v3
20121205 open stack_accelerating_science_v3Tim Bell
 
20121115 open stack_ch_user_group_v1.2
20121115 open stack_ch_user_group_v1.220121115 open stack_ch_user_group_v1.2
20121115 open stack_ch_user_group_v1.2Tim Bell
 
20121017 OpenStack Accelerating Science
20121017 OpenStack Accelerating Science20121017 OpenStack Accelerating Science
20121017 OpenStack Accelerating ScienceTim Bell
 
Accelerating science with Puppet
Accelerating science with PuppetAccelerating science with Puppet
Accelerating science with PuppetTim Bell
 
20120524 cern data centre evolution v2
20120524 cern data centre evolution v220120524 cern data centre evolution v2
20120524 cern data centre evolution v2Tim Bell
 

Plus de Tim Bell (16)

CERN Status at OpenStack Shanghai Summit November 2019
CERN Status at OpenStack Shanghai Summit November 2019CERN Status at OpenStack Shanghai Summit November 2019
CERN Status at OpenStack Shanghai Summit November 2019
 
20190314 cern register v3
20190314 cern register v320190314 cern register v3
20190314 cern register v3
 
20141103 cern open_stack_paris_v3
20141103 cern open_stack_paris_v320141103 cern open_stack_paris_v3
20141103 cern open_stack_paris_v3
 
20140509 cern open_stack_linuxtag_v3
20140509 cern open_stack_linuxtag_v320140509 cern open_stack_linuxtag_v3
20140509 cern open_stack_linuxtag_v3
 
Open stack operations feedback loop v1.4
Open stack operations feedback loop v1.4Open stack operations feedback loop v1.4
Open stack operations feedback loop v1.4
 
CERN clouds and culture at GigaOm London 2013
CERN clouds and culture at GigaOm London 2013CERN clouds and culture at GigaOm London 2013
CERN clouds and culture at GigaOm London 2013
 
20130529 openstack cee_day_v6
20130529 openstack cee_day_v620130529 openstack cee_day_v6
20130529 openstack cee_day_v6
 
Academic cloud experiences cern v4
Academic cloud experiences cern v4Academic cloud experiences cern v4
Academic cloud experiences cern v4
 
Ceilometer lsf-intergration-openstack-summit
Ceilometer lsf-intergration-openstack-summitCeilometer lsf-intergration-openstack-summit
Ceilometer lsf-intergration-openstack-summit
 
Havana survey results-final-v2
Havana survey results-final-v2Havana survey results-final-v2
Havana survey results-final-v2
 
Havana survey results-final
Havana survey results-finalHavana survey results-final
Havana survey results-final
 
20121205 open stack_accelerating_science_v3
20121205 open stack_accelerating_science_v320121205 open stack_accelerating_science_v3
20121205 open stack_accelerating_science_v3
 
20121115 open stack_ch_user_group_v1.2
20121115 open stack_ch_user_group_v1.220121115 open stack_ch_user_group_v1.2
20121115 open stack_ch_user_group_v1.2
 
20121017 OpenStack Accelerating Science
20121017 OpenStack Accelerating Science20121017 OpenStack Accelerating Science
20121017 OpenStack Accelerating Science
 
Accelerating science with Puppet
Accelerating science with PuppetAccelerating science with Puppet
Accelerating science with Puppet
 
20120524 cern data centre evolution v2
20120524 cern data centre evolution v220120524 cern data centre evolution v2
20120524 cern data centre evolution v2
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Dernier (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

The OpenStack Cloud at CERN - OpenStack Nordic

  • 1.
  • 2. The CERN OpenStack Cloud Compute Resource Provisioning for the Large Hadron Collider 2 Jan van Eldik for the CERN Cloud Team OpenStack Days Nordic Stockholm September 22, 2016
  • 3. 3The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 Agenda • Introduction to CERN • Computing at CERN scale • Cloud Service Overview • Operations • Performance • Outlook
  • 4. CERN: home.cern • European Organization for Nuclear Research (Conseil Européen pour la Recherche Nucléaire) - Founded in 1954, today 22 member states - World’s largest particle physics laboratory - Located at Franco-Swiss border near Geneva - ~2’300 staff members, >12’500 users - Budget: ~1000 MCHF (2016) • CERN’s mission - Answer fundamental questions of the universe - Advance the technology frontiers - Train the scientists and engineers of tomorrow - Bring nations together 4The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 5. The Large Hadron Collider (LHC) 5 Largest machine on earth: 27km circumference The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 6. LHC: 9’600 Magnets for Beam Control 6 1232 superconducting dipoles for bending: 14m, 35t, 8.3T, 12kA The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 7. LHC: Coldest Temperature 7 World’s largest cryogenic system: colder than outer space (1.9K/2.7K), 120t of He The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 8. LHC: Highest Vacuum 8 Vacuum system: 104 km of pipes, 10-10-10-11 mbar (comparable to the moon) The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 9. 9 LHC: Detectors Four main experiments to study the fundamental properties of the universe The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 10. 10 A collision at LHC January 2013 - The LHC Computing Grid - Nils Hoymir
  • 11. Ian.Bird@cern.ch 11 The Data Acquisition January 2013 - The LHC Computing Grid - Nils Hoymir
  • 12. Tier 0 at CERN: Acquisition, First pass reconstruction, Storage & Distribution 1.25 GB/sec (ions) 2011: 400-500 MB/sec 2011: 4-6 GB/sec
  • 13. Solution: the Grid • Use the Grid to unite computing resources of particle physics institutes around the world The World Wide Web provides seamless access to information that is stored in many millions of different geographical locations The Grid is an infrastructure that provides seamless access to computing power and data storage capacity distributed over the globe
  • 14. 14 LHC: World-wide Computing Grid TIER-1: permanent storage, re-processing, analysis TIER-0 (CERN): data recording, reconstruction and distribution TIER-2: simulation, end-user analysis > 2 million jobs/day ~350’000 cores 500 PB of storage nearly 170 sites, 40 countries 10-100 Gb links The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 15. LHC: Data & Compute Growth 0 20 40 60 80 100 120 140 160 Run 1 Run 2 Run 3 Run 4 GRID ATLAS CMS LHCb ALICE 0.0 50.0 100.0 150.0 200.0 250.0 300.0 350.0 400.0 450.0 Run 1 Run 2 Run 3 Run 4 CMS ATLAS ALICE LHCb PB/year109HS06sec/month What we can afford 15The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 Collisions produce ~1 PB/s
  • 16. 2012: Enter the cloud • Aim: virtualize all the machines • Unless really, really, really not possible • Offer Cloud endpoints to users • Scale horizontally • Consolidate server provisioning • Yes, use the private cloud for server consolidation usecases as well 16The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 17. In production: • 4 clouds • >200K cores • >8,000 hypervisors 90% of CERN’s compute resources are now delivered on top of OpenStack OpenStack at CERN 17The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 18. Cloud Service Context • CERN IT to enable the laboratory to fulfill its mission - Main data center on the Geneva site - Wigner data center, Budapest, 23ms distance - Connected via two dedicated 100Gbs links • CERN Cloud Service one of the three major components in IT’s AI project - Policy: Servers in CERN IT shall be virtual • Based on OpenStack - Production service since July 2013 - Performed 4 rolling upgrades since - Currently in transition from Liberty to Mitaka - Nova, Glance, Keystone, Horizon, Cinder, Ceilometer, Heat, Neutron, Magnum, Barbican 18The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 http://goo.gl/maps/K5SoG
  • 19. CERN Cloud Architecture (1) 19 • Deployment spans our two data centers - 1 region (to have 1 API), ~40 cells - Cells map use cases hardware, hypervisor type, location, users, … • Top cell on physical and virtual nodes in HA - Clustered RabbitMQ with mirrored queues - API servers are VMs in various child cells • Child cell controllers are OpenStack VMs - One controller per cell - Tradeoff between complexity and failure impact The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 20. CERN Cloud Architecture (2) 20 nova-cells rabbitmq Top cell controller API server nova-api rabbitmq nova-cells nova-api nova-scheduler nova-conductor nova-network Child cell controller Compute node nova-compute rabbitmq nova-cells nova-api nova-scheduler nova-conductor nova-network Child cell controller Compute node nova-compute DB infrastructure The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 21. CERN Cloud in Numbers (1) • ~6700 hypervisors in production - Split over 40+ Nova cells - Vast majority qemu/kvm now on CERN CentOS 7 (~150 Hyper-V hosts) - ~2’100 HVs at Wigner in Hungary (batch, compute, services) - 370 HVs on critical power • 190k Cores • ~430 TB RAM • ~20’000 VMs • Big increase during 2016! - +57k cores in spring - +40k cores in autumn 21The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 22. CERN Cloud in Numbers (2) • 2’700 images/snapshots - Glance on Ceph • 2’300 volumes - Cinder on Ceph (& NetApp) in GVA & Wigner 22 Only issue during 2 years in prod: Ceph Issue 6480 Rally down The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 Every 10s a VM gets created or deleted in our cloud!
  • 23. Software Deployment 23The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 • Deployment based on CentOS and RDO - Upstream, only patched where necessary (e.g. nova/neutron for CERN networks) - Some few customizations - Works well for us • Puppet for config’ management - Introduced with the adoption of AI paradigm • We submit upstream whenever possible - openstack, openstack-puppet, RDO, … • Updates done service-by-service over several months - Running services on dedicated (virtual) servers helps (Exception: ceilometer and nova on compute nodes) • Upgrade testing done with packstack and devstack - Depends on service: from simple DB upgrades to full shadow installations
  • 24. ‘dev’ environment 24The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 • Simulate the full CERN cloud environment on your laptop, even offline • Docker containers in a Kubernetes cluster - Clone of central Puppet configuration - Mock-up of - our two Ceph clusters - our secret store - our network DB & DHCP - Central DB & Rabbit instances - One POD per service • Change & test in seconds • Full upgrade testing
  • 25. 25The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 (*) Pilot ESSEX Nova (*) Swift Glance (*) Horizon (*) Keystone (*) FOLSOM Nova (*) Swift Glance (*) Horizon (*) Keystone (*) Quantum Cinder GRIZZLY Nova Swift Glance Horizon Keystone Quantum Cinder Ceilometer (*) HAVANA Nova Swift Glance Horizon Keystone Neutron Cinder Ceilometer (*) Heat ICEHOUSE Nova Swift Glance Horizon Keystone Neutron Cinder Ceilometer Heat JUNO Nova Swift Glance Horizon Keystone Neutron Cinder Ceilometer Heat (*) Rally (*) 5 April 2012 27 September 2012 4 April 2013 17 October 2013 17 April 2014 16 October 2014 July 2013 CERN OpenStack Production Service February 2014 CERN OpenStack Havana Release October 2014 CERN OpenStack Icehouse Release 30 April 2015 March2015 CERN OpenStack Juno Release LIBERTY Nova Swift Glance Horizon Keystone Neutron (*) Cinder Ceilometer Heat Rally Magnum (*) Barbican (*) 15 October 2015 September 2015 CERN OpenStack Kilo Release KILO Nova Swift Glance Horizon Keystone Neutron Cinder Ceilometer Heat Rally May 2016 CERN OpenStack ongoing Liberty Cloud Service Release Evolution
  • 26. Rich Usage Spectrum … 26The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 • Batch service - Physics data analysis • IT Services - Sometimes built on top of other virtualised services • Experiment services - E.g. build machines • Engineering services - E.g. micro-electronics/chip design • Infrastructure services - E.g. hostel booking, car rental, … • Personal VMs - Development … rich requirement spectrum!
  • 27. Usecases (1) Server consolidation: • Service nodes, dev boxes, Personal VMs, … • Performance less important than “durability” • Live-migration is desirable • Persistent block storage is required • Linux VM @ KVM, Windows VMs @ HyperV • Starting to run Win VMs under KVM • “Pets usecase”: 32K cores, 7500 VMs 27The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 28. Usecases (2) • Compute workloads • Optimize for compute efficiency • CPU passthrough, NUMA aware flavours • Still, very different workloads • IT Batch: LSF and HTCondor, longlived VMs, 8 and 16-core VMs, “full-node” flavors • CMS Tier-0: medium-long, 8-core VMs • LHCb Vcycle: short-lived, single core VMs • Low-SLA, “cattle usecase” • 150K cores, 12500 VMs @ 6000 compute nodes 28The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 29. Wide Hardware Spectrum 29The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 • The ~6700 hypervisors differ in … - Processor architectures: AMD vs. Intel (av. features, NUMA, …) - Core-to-RAM ratio (1:2, 1:4, 1:1.5, ...) - Core-to-disk ratio (going down with SSDs!) - Disk layout (2 HDDs, 3 HDDs, 2 HDDs + 1 SSD, 2 SSDs, …) - Network (1GbE vs 10 GbE) - Critical vs physics power - Physical location (Geneva vs. Budapest) - Network domain (LCG vs. GPN vs. TN) - CERN CentOS 7, RHEL7, SLC6, Windows - … • Variety reflected/accessible via instance types, cells, projects … variety not necessarily visible to users! - We try to keep things simple and hide some of the complexity - We can react to (most of the) requests with special needs
  • 30. Basic Building Blocks: Instance Types 30The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 Name vCPUs RAM [GB] Ephemeral [GB] m2.small 1 1.875 10 m2.medium 2 3.750 20 m2.large 4 7.5 40 m2.xlarge 8 15 80 m2.2xlarge 16 30 160 m2.3xlarge 32 60 320 ~80 flavors in total (swap, additional/large ephemerals, core-to- RAM, …)
  • 31. Basic Building Blocks: Volume Types 31The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 Name IOPS b/w [MB/s] feature Location Backend standard 100 80 GVA io1 500 120 fast GVA cp1 100 80 critical GVA cpio1 500 120 critical/fast GVA cp2 n.a. 120 critical/Windows GVA wig-cp1 100 80 critical WIG wig-cpio1 500 120 critical/fast WIG m2.* flavor family plus volumes as basic building blocks for services
  • 32. Automate provisioning with 32The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 Automate routine procedures - Common place for workflows - Clean web interface - Scheduled jobs, cron-style - Traceability and auditing - Fine-grained access control - … Procedures for - OpenStack project creation - OpenStack quota changes - Notifications of VM owners - Usage and health reports - … Disable compute node • Disable nova-service • Switch Alarms OFF • Update Service-Now ticket Notifications • Send e-mail to VM owners Other tasks Post new message broker Add remote AT job Save intervention details Send calendar invitation
  • 33. Operations: Retirement Campaign 33The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 • About 1’600 nodes to retire from the service by 3Q16 - ~1’200 from compute, ~400 with services (hosting ~5000 VMs) • We have gained quite some experience with (manual) migration - Live where possible and cold where necessary - Works reliably (where it can) • We have developed a tool that you can instruct to drain hypervisor (or simply migrate given VMs) - Main tasks are VM classification and progress monitoring - The nova scheduler will pick the target (nova patch) • We are using the “IP service bridging” to handle CERN network specifics
  • 34. Operations: Network Migration 34 • We’ll need to replace nova-network - It’s going to be deprecated (really really really this time) • New potential features (Tenant networks, LBaaS, Floating IPs, …) • We have a number of patches to adapt to the CERN network constraints - We patched nova for each release … - … neutron allows for out-of-tree plugins! The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 35. 35 • We have ~5 Neutron cells in production - Neutron control plane in Liberty (fully HA) - Bridge agent in Kilo (nova) • And it is very stable • All new cells will be Neutron cells • “As mentioned, there is currently no way to cleanly migrate from nova-network to neutron.” - All efforts to establish a general migration path failed so far - Should be OK for us, various options (incl. in-place, w/ migration, …) Operations: Neutron Status The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 36. 36 • Magnum: OpenStack project to treat Container Orchestration Engines (COEs) as 1st class resources • Pre-production service available - Support for Docker Swarm, Kubernetes, Mesos • Many users interested, usage ramping up - GitLab CI, Jupyter/Swan, FTS, … The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 Operations: Containers
  • 37. Operations: Federated Clouds Public Cloud such as Rackspace or IBM CERN Private Cloud 160K cores ATLAS Trigger 28K cores ALICE Trigger 9K cores CMS Trigger 13K cores INFN Italy Brookhaven National Labs NecTAR Australia Many Others on Their Way 37The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016 • Access to EduGAIN users via Horizon - Allow (limited) access given appropriate membership
  • 38. CPU Performance Issues 44 • The benchmarks on full-node VMs was about 20% lower than the one of the underlying host - Smaller VMs much better • Investigated various tuning options - KSM*, EPT**, PAE, Pinning, … +hardware type dependencies - Discrepancy down to ~10% between virtual and physical • Comparison with Hyper-V: no general issue - Loss w/o tuning ~3% (full-node), <1% for small VMs - … NUMA-awareness! *KSM on/off: beware of memory reclaim! **EPT on/off: beware of expensive page table walks! The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 39. CPU: NUMA 45 • NUMA-awareness identified as most efficient setting - Full node VMs have ~3% overhead in HS06 • “EPT-off” side-effect - Small number of hosts, but very visible there • Use 2MB Huge Pages - Keep the “EPT off” performance gain with “EPT on” • More details in this talk The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 40. Operations: NUMA/THP Roll-out 46 • Rolled out on ~2’000 batch hypervisors (~6’000 VMs) - HP allocation as boot parameter  reboot - VM NUMA awareness as flavor metadata  delete/recreate • Cell-by-cell (~200 hosts): - Queue-reshuffle to minimize resource impact - Draining & deletion of batch VMs - Hypervisor reconfiguration (Puppet) & reboot - Recreation of batch VMs • Whole update took about 8 weeks - Organized between batch and cloud teams - No performance issue observed since The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 41. Future Plans 47 • Investigate Ironic (Bare metal provisioning) - OpenStack as one interface for compute resource provisioning - Allow for complete accounting - Use physical machines for containers • Replace Hyper-V by qemu/kvm - Windows expertise is a scarce resource in our team - Reduce complexity in service setup The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016
  • 42. Summary 48 • OpenStack at CERN in production since 3 years - We’re working closely with the various communities - OpenStack, Ceph, RDO, Puppet, … • Cloud service continues to grow and mature - While experimental, good experience with Nova cells for scaling - Experience gained helps with general resource provisioning - New features added (containers, identity federation) - Expansion planned (bare metal provisioning) • Confronting some major operational challenges - Transparent retirement of service hosts - Replacement of network layer • http://openstack-in-production.blogspot.com (read about our recent 2M req/s Magnum & Kubernetes!) The CERN OpenStack Cloud - OpenStack Days Nordic - Sept 22,2016