SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
Accelerating the adoption of Cloud Computing
Beyond Installation:
Managing Your OpenStack Cloud
May 6th, 2014
!  Ken Pepple is the co-founder and Chief
Technology Officer of Solinea
!  Prior to founding Solinea, he led the introduction
of Internap's OpenStack-based public cloud
services while serving as their Director of Cloud
Development
!  Code contributor since Bexar release of
OpenStack
!  Author of O'Reilly Media's "Deploying OpenStack"
and several other books
2
Speakers
Introduction
!  Installing OpenStack gets all the attention …
!  … but distributions like Red Hat OSP and
Cloudscaling are attacking this problem
!  They will beat it. Then what ?
!  The reality is that OpenStack management is
what we should be focusing on.
!  Installation is 2 – 3 weeks … management is
forever
3
OpenStack Architecture
OpenStack Object Store
OpenStack
Image Service OpenStack Compute
OpenStack
Dashboard
OpenStack
Identity
Service
OpenStack Compute
API /
Admin API
keystone
(service & admin APIs)
nova-api
(OS, EC2, Metadata, Admin)
nova-consoleauth
nova-cert/
objectstore
nova-console
nova-*proxy
VNC/Spice
OpenStack
Object API
http://www.solinea.com
Queue
nova-compute
nova-scheduler
nova
database
OpenStack
Compute API
OpenStack
Image API
Horizon
OpenStack
Image API
identity
backend
swift-proxy
objectcontaineraccount
object
store
account
DB
container
DB
OpenStack
Object API
HTTP(S)
OpenStack
Object API
OpenStack
Identity API
OpenStack
Identity
API
OpenStack
Identity
API
OpenStack
Image API
OpenStack
Identity
API
OpenStack
Image
API
catalog
backend
token backend
OpenStack
Identity
API
hypervisor
libvirt, XenAPI, etc.
HTTP(S)
Amazon
Web Services
EC2 API
Internet / Enterprise Network
OpenStack
Network Service
glance-api
glance
database
OpenStack
Block Storage
OpenStack
Block Storage API
cinder-api
cinder-volume
neutron-server
neutron
plugin(s)
OpenStack
Identity
API
cinder-scheduler
cinder
database
OpenStack
Network API
network
provider
OpenStack
Block
Storage
API
OpenStack
Network API
policy
backend
Queue
OpenStack
Network API
neutron
database
neutron
agent(s)
nova-conductor
memcached
OpenStack
Identity
API
⁃ OpenStack Command Line Tools (nova-client, swift-client, etc.)
⁃ Cloud Management Tools (Rightscale, Enstratius, etc.)
⁃ GUI tools (Cyberduck, iPhone client, etc.)
volume provider
cinder-backup
OpenStack Object API
OpenStack
Identity
API
Queue
OpenStack
Block Storage API
OpenStack
Orchestration
heat-api
heat-engine
heat
database
Queue
cloudwatch-api
OpenStack
Orchestration API
OpenStack
Identity
API
OpenStack Compute
API
OpenStack Bock Storage
API
OpenStack Network API
glance-api
OpenStack
Database
trove-api
trove-taskmgr
trove
database
trove-conductor
OpenStack
Identity
API
OpenStack
Database API
OpenStack Block Storage API
OpenStack Orchestration API
OpenStack Compute API
Agent
Queue
4
* Ceilometer omitted for clarity
OpenStack Deployment
5
OpenStack Management Basics
!  Development and test cluster
–  Smaller, but representative
–  Same set and version of services
–  Reproduce problems, test fixes and practice upgrades
!  Configuration management system
–  Chef, Puppet, Ansible, SaltStack, etc.
–  Your OpenStack distribution already uses one
–  Pick one and stick with it – everything falls under it
!  Skilled and trained staff
–  Experienced Linux admins with virtualization skills
–  Network architects that understand cloud
–  Trained for OpenStack
6
Developing Toolkit for Management
!  Troubleshooting tools
–  Operating system level tools
–  OpenStack specific tools
!  Administration tools
–  OpenStack specific tools
!  Monitoring tools
–  Monitoring platforms
–  Log management tools
!  Specialized tools
–  Image creation
7
Troubleshooting Tools
!  Tools used to investigate or fix problems within
your stack
!  Mostly Linux tools but some are specific to
OpenStack
!  These need to span virtualization, networking
and normal system administration
8
Troubleshooting Hypervisor
!  Vary by hypervisor, each one has it’s own tooling
!  Map VM to hypervisor by OpenStack CLI with
nova show!
!  Investigate hypervisor through virsh tools
!  Also can access backing store for VM through
hypervisor mount point or Cinder volume
9
VM Troubleshooting
# nova list!
+---------------------+-------+---------+------------+-------------+-------------------------------------+!
| ID | Name | Status | Task State | Power State | Networks |!
+---------------------+-------+---------+------------+-------------+-------------------------------------+!
| f94b097d-b030-473b- | ken | ACTIVE | - | Running | rdonet=192.168.90.11 |!
+---------------------+-------+---------+------------+-------------+-------------------------------------+!
# nova show f94b097d-b030-473b-86a3-d501091c650b!
+--------------------------------------+------------------------------------------------------------+!
| Property | Value |!
+--------------------------------------+------------------------------------------------------------+!
| OS-EXT-AZ:availability_zone | nova |!
| OS-EXT-SRV-ATTR:host | localhost.localdomain |!
| OS-EXT-SRV-ATTR:hypervisor_hostname | localhost.localdomain |!
| OS-EXT-SRV-ATTR:instance_name | instance-0000000e |!
| OS-EXT-STS:power_state | 1 |!
| OS-EXT-STS:task_state | - |!
| OS-EXT-STS:vm_state | active |!
| OS-SRV-USG:launched_at | 2014-05-06T06:13:01.000000 |!
| created | 2014-05-06T06:11:55Z |!
| flavor | m1.small (2) |!
| hostId | 7e31bda83a3586907464e8e68f83a035bf9fa500d9579b2b807fa9f0 |!
| id | f94b097d-b030-473b-86a3-d501091c650b |!
| image | cirros-0.3.2-x86_64 (f66d54e8-f8bd-4220-930f-86b6b44dfe4d) |!
| rdonet network | 192.168.90.11 |!
| security_groups | default |!
| status | ACTIVE |!
+--------------------------------------+------------------------------------------------------------+!
# vish list!
Id Name State!
----------------------------------------------------!
1 instance-0000000e running!
10
Troubleshooting Backing Store (Ephemeral)
# cd /var/lib/nova/!
# ls!
buckets CA images instances keys networks tmp!
# cd instances/!
# ll!
total 16!
drwxr-xr-x. 2 nova nova 4096 May 6 09:03 13e86b72-7e14-43f5-ab2f-
e7abf117213f!
drwxr-xr-x. 2 nova nova 4096 May 2 11:25 _base!
-rw-r--r--. 1 nova nova 45 May 5 23:18 compute_nodes!
drwxr-xr-x. 2 nova nova 4096 Apr 30 19:28 locks!
# cd 13e86b72-7e14-43f5-ab2f-e7abf117213f/!
# ll!
total 208!
-rw-rw----. 1 qemu qemu 0 May 6 09:04 console.log!
-rw-r--r--. 1 qemu qemu 262656 May 6 09:03 disk!
-rw-r--r--. 1 nova nova 79 May 6 09:03 disk.info!
-rw-r--r--. 1 nova nova 1529 May 6 09:04 libvirt.xml!
# file disk!
disk: Qemu Image, Format: Qcow , Version: 2!
11
The ‘disk’ file is
our qcow image.
The XML file is
the KVM
template.
Troubleshooting Network
!  Combination of Linux, OpenvSwitch and OpenStack
tools
!  OpenStack tools will show logical configuration of
Neutron’s ports, routers and subnets
–  neutron port-list, net-list and router-
list!
!  OpenvSwitch will map internal and external bridges
–  ovs-vsctl and ovs-dpctl!
!  Linux tools will show you inside VLAN and Linux
namespaces
–  ip netns, iptables and tcpdump!
12
Network Troubleshooting: Router
# ip netns show!
qdhcp-8a496b23-ef2c-4170-9919-611d9a12180f!
qrouter-e439ff2a-1973-4cda-86a4-20c977eec828!
qdhcp-16b5549e-3a1a-4254-b122-f7507f003597!
!
!
# ip netns exec qrouter-e439ff2a-1973-4cda-86a4-20c977eec828 ifconfig!
qg-bbe18331-0c Link encap:Ethernet HWaddr FA:16:3E:47:63:D0!
inet addr:192.168.57.132 Bcast:192.168.57.255 Mask:255.255.255.0!
inet6 addr: fe80::f816:3eff:fe47:63d0/64 Scope:Link!
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1!
RX packets:1335 errors:0 dropped:0 overruns:0 frame:0!
TX packets:145 errors:0 dropped:0 overruns:0 carrier:0!
collisions:0 txqueuelen:1000!
RX bytes:197195 (192.5 KiB) TX bytes:13110 (12.8 KiB)!
!
qr-c4e2b047-4a Link encap:Ethernet HWaddr FA:16:3E:FD:4E:A9!
inet addr:192.168.90.1 Bcast:192.168.90.255 Mask:255.255.255.0!
inet6 addr: fe80::f816:3eff:fefd:4ea9/64 Scope:Link!
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1!
RX packets:364 errors:0 dropped:0 overruns:0 frame:0!
TX packets:309 errors:0 dropped:0 overruns:0 carrier:0!
collisions:0 txqueuelen:1000!
RX bytes:34760 (33.9 KiB) TX bytes:36569 (35.7 KiB)!
!
!
!
# ip netns exec qrouter-e439ff2a-1973-4cda-86a4-20c977eec828 netstat -nr!
Kernel IP routing table!
Destination Gateway Genmask Flags MSS Window irtt Iface!
192.168.57.0 0.0.0.0 255.255.255.0 U 0 0 0 qg-bbe18331-0c!
192.168.90.0 0.0.0.0 255.255.255.0 U 0 0 0 qr-c4e2b047-4a!
13
Troubleshooting OVS Bridges
# ovs-vsctl show!
06667946-811b-4c7b-97a5-eafc8386e9ff!
Bridge br-int!
Port "qvo246622d1-02"!
tag: 2!
Interface "qvo246622d1-02"!
Port "tap3dfc8b70-ee"!
tag: 1!
Interface "tap3dfc8b70-ee"!
Port "tapecef7610-4f"!
tag: 2!
Interface "tapecef7610-4f"!
Port "tapc4e2b047-4a"!
tag: 2!
Interface "tapc4e2b047-4a"!
Port br-int!
Interface br-int!
type: internal!
Bridge br-ex!
Port br-ex!
Interface br-ex!
type: internal!
Port "eth1"!
Interface "eth1"!
Port "tapbbe18331-0c"!
Interface "tapbbe18331-0c"!
ovs_version: "1.11.0"!
14
Neutron’s integration bridge
connecting VMs
Neutron’s external bridge
Physical NIC for internet
access
Monitoring
!  Metering is not monitoring
–  Ceilometer isn’t a monitoring solution
!  Horizon doesn’t save history
!  Monitor for FCAPS: fault, configuration,
accounting, performance and security
!  Needs to be instrumented at multiple levels
–  Hardware/Operating System, OpenStack, VM
–  Although VM monitoring may be left to the user
!  Needs to be used across all elements and
processes
15
Operating System Monitoring
!  Required set of information as any other set of
systems
–  CPU, memory, availability, etc.
!  Process level information
–  RabbitMQ, database, OpenStack processes, etc.
!  Should rely on host sending information to
monitoring server (not ping model)
!  Ideally has APIs and strong discovery to aid
automation
16
!  Installed as part of
many distributions
!  Open source
!  Easy installation and
usage
!  API is an add-on
module
Nagios
17
Nagios OpenStack Plugin
!  Add service checks for some OpenStack
services
–  Glance
–  Keystone
–  Nova
–  Swift API and dispersion
!  Available in most Linux distributions
–  # sudo apt-get install nagios-plugins-openstack!
!  More information and checks available at
http://openstack.prov12n.com/monitoring-
openstack-nagios-3/
18
!  Open source
monitoring tool used
at several large
service provider
clouds
!  Strong API and
discovery modes
!  Templates can be
applied to host groups
for monitoring
Zabbix Console
19
Zabbix Templates
!  Templates created for each type of server
–  Compute nodes, controllers, Swift object servers, etc.
!  Each template checks processes running and
configuration management running
–  Should issue commands against processes not rely
on process table to catch hung processes
!  All nodes also get default OS template
!  Alerting set for pagerduty
20
Log Management
!  More than just for error viewing
!  Primary source of OpenStack data
!  Useful for
–  Finding OpenStack bugs
–  Understanding event timings (spin new VM)
–  Visualizing cluster level statistics (VMs running)
–  Creating dashboards
!  Can be challenging to store, query and interpret
data
–  Clusters can generate GBs per day
–  Use dedicated tools and data stores
–  May be required for legal / audit reasons
21
!  Commercial log
management solution
!  Visualization, ad hoc
queries, post
processing and add-
ons
!  Easy to setup
dashboards
!  Supported with
relatively easy
installer
Splunk
22
!  Open source
alternative to Splunk
!  Requires more
complicated setup to
parse logs correctly
!  Provides ad hoc
queries as well as
dashboards
!  Active community
Logstash, Kibana and ElasticSearch
23
Interesting Uses for Log Data
!  VMs
–  CPUs/Instances by hypervisor (scheduler efficiency)
–  Total vCPUs/CPUs in cluster available versus used
–  Spawn success and failures
–  Spawn time
–  Top Users of VMs/vCPUs
!  Authentication
–  Tokens generated versus invalidated
–  Failed authentications
!  Errors
–  All error messages / stack traces create alert
!  Logs that have stopped (zombie processes)
24
“Canary” Scripts
!  Highest level check for cloud infrastructure: “Can
we spin a new VM ?”
–  Custom written script that starts VM, attaches block
storage, assign IP address, pings outside world then
terminates
–  Logs to all actions with timings into log management
solution
!  Run every 5 to 15 minutes
!  Also can be run interactively
!  This should be written for your own site
25
Specialized Tools
!  Many sites will want to be able to create their
own custom images
–  CI/CD “golden images”
!  Several commercial and open source
alternatives
–  CohesiveFT Server3
–  Elasticbox (https://www.elasticbox.com/)
–  Packer (http://www.packer.io/)
!  All provide ability to create images with specified
software pre-installed via command line
26
Manageability Improvements in Icehouse
!  Nova live upgrade
!  Swift discovery API
27
Rolling (“live”) Upgrades
!  Ability to upgrade a running cluster to new
release
!  Upgrade controller(s) first then individual
compute nodes
!  Requires several pre-conditions
–  Neutron upgraded first
–  Nova-conductor being used to isolate DB schemas
–  Set icehouse compatibility mode
/etc/nova/nova.conf

# Set a version cap for messages sent to compute services. If

# you plan to do a live upgrade from havana to icehouse, you

# should set this option to "icehouse-compat" before beginning

# the live upgrade procedure. (string value)

compute=icehouse-compat

28
!  API calls to /info
will return information
about the cluster
!  Users now able to
take advantage of the
unique features
available in each
cluster
!  Turned on by default
but can be disabled
# swift capabilities!
Core: swift!
Options:!
account_listing_limit: 10000!
container_listing_limit: 10000!
max_account_name_length: 256!
max_container_name_length: 256!
max_file_size: 5368709122!
max_meta_count: 90!
max_meta_name_length: 128!
max_meta_value_length: 256!
max_object_name_length: 1024!
strict_cors_mode: True!
version: 1.13.1!
Additional middleware:
keystoneauth!
Additional middleware: staticweb!
Additional middleware: tempurl!
Options:!
methods: ['GET', 'HEAD', 'PUT']!
29
Swift Discoverability
Oreilly solinea-managing-openstack

Contenu connexe

Tendances

Tendances (19)

Openstack Testbed_ovs_virtualbox_devstack_single node
Openstack Testbed_ovs_virtualbox_devstack_single nodeOpenstack Testbed_ovs_virtualbox_devstack_single node
Openstack Testbed_ovs_virtualbox_devstack_single node
 
Curso de MySQL 5.7
Curso de MySQL 5.7Curso de MySQL 5.7
Curso de MySQL 5.7
 
Step by Step to Install oracle grid 11.2.0.3 on solaris 11.1
Step by Step to Install oracle grid 11.2.0.3 on solaris 11.1Step by Step to Install oracle grid 11.2.0.3 on solaris 11.1
Step by Step to Install oracle grid 11.2.0.3 on solaris 11.1
 
Fiware testbed from hardware to openstack
Fiware testbed from hardware to openstackFiware testbed from hardware to openstack
Fiware testbed from hardware to openstack
 
OSDC 2014: Nat Morris - Open Network Install Environment
OSDC 2014: Nat Morris - Open Network Install EnvironmentOSDC 2014: Nat Morris - Open Network Install Environment
OSDC 2014: Nat Morris - Open Network Install Environment
 
Jurijs Velikanovs Direct NFS - Why and How?
Jurijs Velikanovs Direct NFS - Why and How?Jurijs Velikanovs Direct NFS - Why and How?
Jurijs Velikanovs Direct NFS - Why and How?
 
Ef09 installing-alfresco-components-1-by-1
Ef09 installing-alfresco-components-1-by-1Ef09 installing-alfresco-components-1-by-1
Ef09 installing-alfresco-components-1-by-1
 
How the Internet Works
How the Internet WorksHow the Internet Works
How the Internet Works
 
Erp installation r12.2
Erp installation r12.2Erp installation r12.2
Erp installation r12.2
 
Troubleshooting Firewalls (2012 San Diego)
Troubleshooting Firewalls (2012 San Diego)Troubleshooting Firewalls (2012 San Diego)
Troubleshooting Firewalls (2012 San Diego)
 
EBS in an hour: Build a Vision instance - FAST - in Oracle Virtualbox
EBS in an hour: Build a Vision instance - FAST - in Oracle VirtualboxEBS in an hour: Build a Vision instance - FAST - in Oracle Virtualbox
EBS in an hour: Build a Vision instance - FAST - in Oracle Virtualbox
 
OTRS
OTRSOTRS
OTRS
 
RAC 12c
RAC 12cRAC 12c
RAC 12c
 
(SDD419) Amazon EC2 Networking Deep Dive and Best Practices | AWS re:Invent 2014
(SDD419) Amazon EC2 Networking Deep Dive and Best Practices | AWS re:Invent 2014(SDD419) Amazon EC2 Networking Deep Dive and Best Practices | AWS re:Invent 2014
(SDD419) Amazon EC2 Networking Deep Dive and Best Practices | AWS re:Invent 2014
 
Honeypots - November 8th Misec presentation
Honeypots - November 8th Misec presentationHoneypots - November 8th Misec presentation
Honeypots - November 8th Misec presentation
 
Rolling with the Times: Using wheels, pbr, and Twine for Distributing and Ins...
Rolling with the Times: Using wheels, pbr, and Twine for Distributing and Ins...Rolling with the Times: Using wheels, pbr, and Twine for Distributing and Ins...
Rolling with the Times: Using wheels, pbr, and Twine for Distributing and Ins...
 
11g r2 rac_guide
11g r2 rac_guide11g r2 rac_guide
11g r2 rac_guide
 
NTTドコモ様 導入事例 OpenStack Summit 2016 Barcelona 講演「Expanding and Deepening NTT D...
NTTドコモ様 導入事例 OpenStack Summit 2016 Barcelona 講演「Expanding and Deepening NTT D...NTTドコモ様 導入事例 OpenStack Summit 2016 Barcelona 講演「Expanding and Deepening NTT D...
NTTドコモ様 導入事例 OpenStack Summit 2016 Barcelona 講演「Expanding and Deepening NTT D...
 
UKOUG Tech17 - Stay Secure With Oracle Solaris
UKOUG Tech17 - Stay Secure With Oracle SolarisUKOUG Tech17 - Stay Secure With Oracle Solaris
UKOUG Tech17 - Stay Secure With Oracle Solaris
 

Similaire à Oreilly solinea-managing-openstack

Similaire à Oreilly solinea-managing-openstack (20)

VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...
VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...
VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...
 
IxVM on CML
IxVM on CMLIxVM on CML
IxVM on CML
 
Why OpenStack on UCS? An Introduction to Red Hat and Cisco OpenStack Solution
Why OpenStack on UCS? An Introduction to Red Hat and Cisco OpenStack SolutionWhy OpenStack on UCS? An Introduction to Red Hat and Cisco OpenStack Solution
Why OpenStack on UCS? An Introduction to Red Hat and Cisco OpenStack Solution
 
OpenStack API's and WSGI
OpenStack API's and WSGIOpenStack API's and WSGI
OpenStack API's and WSGI
 
Support of containerized workloads in ONAP
Support of containerized workloads in ONAPSupport of containerized workloads in ONAP
Support of containerized workloads in ONAP
 
Openstack Summit Tokyo 2015 - Building a private cloud to efficiently handle ...
Openstack Summit Tokyo 2015 - Building a private cloud to efficiently handle ...Openstack Summit Tokyo 2015 - Building a private cloud to efficiently handle ...
Openstack Summit Tokyo 2015 - Building a private cloud to efficiently handle ...
 
PLNOG16: Obsługa 100M pps na platformie PC , Przemysław Frasunek, Paweł Mała...
PLNOG16: Obsługa 100M pps na platformie PC, Przemysław Frasunek, Paweł Mała...PLNOG16: Obsługa 100M pps na platformie PC, Przemysław Frasunek, Paweł Mała...
PLNOG16: Obsługa 100M pps na platformie PC , Przemysław Frasunek, Paweł Mała...
 
OSDC 2014: Yves Fauser - OpenStack Networking (Neutron) - Overview of network...
OSDC 2014: Yves Fauser - OpenStack Networking (Neutron) - Overview of network...OSDC 2014: Yves Fauser - OpenStack Networking (Neutron) - Overview of network...
OSDC 2014: Yves Fauser - OpenStack Networking (Neutron) - Overview of network...
 
Osdc2014 openstack networking yves_fauser
Osdc2014 openstack networking yves_fauserOsdc2014 openstack networking yves_fauser
Osdc2014 openstack networking yves_fauser
 
Open stack networking_101_update_2014
Open stack networking_101_update_2014Open stack networking_101_update_2014
Open stack networking_101_update_2014
 
Hardware accelerated switching with Linux @ SWLUG Talks May 2014
Hardware accelerated switching with Linux @ SWLUG Talks May 2014Hardware accelerated switching with Linux @ SWLUG Talks May 2014
Hardware accelerated switching with Linux @ SWLUG Talks May 2014
 
DR_PRESENT 1
DR_PRESENT 1DR_PRESENT 1
DR_PRESENT 1
 
OpenStack Discovery and Networking Assurance - Koren Lev - Meetup
OpenStack Discovery and Networking Assurance - Koren Lev - MeetupOpenStack Discovery and Networking Assurance - Koren Lev - Meetup
OpenStack Discovery and Networking Assurance - Koren Lev - Meetup
 
Hyper v and live migration on cisco unified computing system - virtualized on...
Hyper v and live migration on cisco unified computing system - virtualized on...Hyper v and live migration on cisco unified computing system - virtualized on...
Hyper v and live migration on cisco unified computing system - virtualized on...
 
Harmonia open iris_basic_v0.1
Harmonia open iris_basic_v0.1Harmonia open iris_basic_v0.1
Harmonia open iris_basic_v0.1
 
Azure Kubernetes Service - benefits and challenges
Azure Kubernetes Service - benefits and challengesAzure Kubernetes Service - benefits and challenges
Azure Kubernetes Service - benefits and challenges
 
infraXstructure Alexis Dacquay, "How to win back visibility into your network...
infraXstructure Alexis Dacquay, "How to win back visibility into your network...infraXstructure Alexis Dacquay, "How to win back visibility into your network...
infraXstructure Alexis Dacquay, "How to win back visibility into your network...
 
Intel's Out of the Box Network Developers Ireland Meetup on March 29 2017 - ...
Intel's Out of the Box Network Developers Ireland Meetup on March 29 2017  - ...Intel's Out of the Box Network Developers Ireland Meetup on March 29 2017  - ...
Intel's Out of the Box Network Developers Ireland Meetup on March 29 2017 - ...
 
Stacks and Layers: Integrating P4, C, OVS and OpenStack
Stacks and Layers: Integrating P4, C, OVS and OpenStackStacks and Layers: Integrating P4, C, OVS and OpenStack
Stacks and Layers: Integrating P4, C, OVS and OpenStack
 
OpenStack hands-on (All-in-One)
OpenStack hands-on (All-in-One)OpenStack hands-on (All-in-One)
OpenStack hands-on (All-in-One)
 

Plus de Vietnam Open Infrastructure User Group

Plus de Vietnam Open Infrastructure User Group (20)

Room 3 - 5 - Nguyễn Văn Hoàn - 101 Bugs, issues when I work with Ceph
Room 3 - 5 - Nguyễn Văn Hoàn - 101 Bugs, issues when I work with CephRoom 3 - 5 - Nguyễn Văn Hoàn - 101 Bugs, issues when I work with Ceph
Room 3 - 5 - Nguyễn Văn Hoàn - 101 Bugs, issues when I work with Ceph
 
Room 2 - 3 - Nguyễn Hoài Nam & Nguyễn Việt Hùng - Terraform & Pulumi Comparin...
Room 2 - 3 - Nguyễn Hoài Nam & Nguyễn Việt Hùng - Terraform & Pulumi Comparin...Room 2 - 3 - Nguyễn Hoài Nam & Nguyễn Việt Hùng - Terraform & Pulumi Comparin...
Room 2 - 3 - Nguyễn Hoài Nam & Nguyễn Việt Hùng - Terraform & Pulumi Comparin...
 
Room 3 - 6 - Nguyễn Văn Thắng & Dzung Nguyen - Ứng dụng openzfs làm lưu trữ t...
Room 3 - 6 - Nguyễn Văn Thắng & Dzung Nguyen - Ứng dụng openzfs làm lưu trữ t...Room 3 - 6 - Nguyễn Văn Thắng & Dzung Nguyen - Ứng dụng openzfs làm lưu trữ t...
Room 3 - 6 - Nguyễn Văn Thắng & Dzung Nguyen - Ứng dụng openzfs làm lưu trữ t...
 
Room 3 - 4 - Lê Quang Hiếu - How to be a cool dad: Leverage DIY Home Automati...
Room 3 - 4 - Lê Quang Hiếu - How to be a cool dad: Leverage DIY Home Automati...Room 3 - 4 - Lê Quang Hiếu - How to be a cool dad: Leverage DIY Home Automati...
Room 3 - 4 - Lê Quang Hiếu - How to be a cool dad: Leverage DIY Home Automati...
 
Room 3 - 2 - Trần Tuấn Anh - Defending Software Supply Chain Security in Bank...
Room 3 - 2 - Trần Tuấn Anh - Defending Software Supply Chain Security in Bank...Room 3 - 2 - Trần Tuấn Anh - Defending Software Supply Chain Security in Bank...
Room 3 - 2 - Trần Tuấn Anh - Defending Software Supply Chain Security in Bank...
 
Room 3 - 7 - Nguyễn Như Phúc Huy - Vitastor: a fast and simple Ceph-like bloc...
Room 3 - 7 - Nguyễn Như Phúc Huy - Vitastor: a fast and simple Ceph-like bloc...Room 3 - 7 - Nguyễn Như Phúc Huy - Vitastor: a fast and simple Ceph-like bloc...
Room 3 - 7 - Nguyễn Như Phúc Huy - Vitastor: a fast and simple Ceph-like bloc...
 
Room 3 - 1 - Nguyễn Xuân Trường Lâm - Zero touch on-premise storage infrastru...
Room 3 - 1 - Nguyễn Xuân Trường Lâm - Zero touch on-premise storage infrastru...Room 3 - 1 - Nguyễn Xuân Trường Lâm - Zero touch on-premise storage infrastru...
Room 3 - 1 - Nguyễn Xuân Trường Lâm - Zero touch on-premise storage infrastru...
 
Room 2 - 2 - Giang Thiên Phú - Kinh nghiệm tối ưu mongodb với database hơn 10...
Room 2 - 2 - Giang Thiên Phú - Kinh nghiệm tối ưu mongodb với database hơn 10...Room 2 - 2 - Giang Thiên Phú - Kinh nghiệm tối ưu mongodb với database hơn 10...
Room 2 - 2 - Giang Thiên Phú - Kinh nghiệm tối ưu mongodb với database hơn 10...
 
Room 2 - 6 - Đinh Tuấn Phong - Migrate opensource database to Kubernetes easi...
Room 2 - 6 - Đinh Tuấn Phong - Migrate opensource database to Kubernetes easi...Room 2 - 6 - Đinh Tuấn Phong - Migrate opensource database to Kubernetes easi...
Room 2 - 6 - Đinh Tuấn Phong - Migrate opensource database to Kubernetes easi...
 
Room 2 - 4 - Juncheng Anthony Lin - Redhat - A Practical Approach to Traditio...
Room 2 - 4 - Juncheng Anthony Lin - Redhat - A Practical Approach to Traditio...Room 2 - 4 - Juncheng Anthony Lin - Redhat - A Practical Approach to Traditio...
Room 2 - 4 - Juncheng Anthony Lin - Redhat - A Practical Approach to Traditio...
 
Room 2 - 7 - Lã Mạnh Hà - Agile + DevOps = A great combination
Room 2 - 7 - Lã Mạnh Hà - Agile + DevOps = A great combinationRoom 2 - 7 - Lã Mạnh Hà - Agile + DevOps = A great combination
Room 2 - 7 - Lã Mạnh Hà - Agile + DevOps = A great combination
 
Room 2 - 1 - Phạm Quang Minh - A real DevOps culture in practice
Room 2 - 1 - Phạm Quang Minh - A real DevOps culture in practiceRoom 2 - 1 - Phạm Quang Minh - A real DevOps culture in practice
Room 2 - 1 - Phạm Quang Minh - A real DevOps culture in practice
 
Room 2 - 5 - Seong Soo - NHN Cloud - Upstream contribution mentoring program ...
Room 2 - 5 - Seong Soo - NHN Cloud - Upstream contribution mentoring program ...Room 2 - 5 - Seong Soo - NHN Cloud - Upstream contribution mentoring program ...
Room 2 - 5 - Seong Soo - NHN Cloud - Upstream contribution mentoring program ...
 
Room 1 - 2 - Nguyễn Văn Thắng & Dzung Nguyen - Proxmox VE và ZFS over iscsi
Room 1 - 2 - Nguyễn Văn Thắng & Dzung Nguyen - Proxmox VE và ZFS over iscsiRoom 1 - 2 - Nguyễn Văn Thắng & Dzung Nguyen - Proxmox VE và ZFS over iscsi
Room 1 - 2 - Nguyễn Văn Thắng & Dzung Nguyen - Proxmox VE và ZFS over iscsi
 
Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...
Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...
Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...
 
Room 1 - 3 - Lê Anh Tuấn - Build a High Performance Identification at GHTK wi...
Room 1 - 3 - Lê Anh Tuấn - Build a High Performance Identification at GHTK wi...Room 1 - 3 - Lê Anh Tuấn - Build a High Performance Identification at GHTK wi...
Room 1 - 3 - Lê Anh Tuấn - Build a High Performance Identification at GHTK wi...
 
Room 1 - 7 - Lê Quốc Đạt - Upgrading network of Openstack to SDN with Tungste...
Room 1 - 7 - Lê Quốc Đạt - Upgrading network of Openstack to SDN with Tungste...Room 1 - 7 - Lê Quốc Đạt - Upgrading network of Openstack to SDN with Tungste...
Room 1 - 7 - Lê Quốc Đạt - Upgrading network of Openstack to SDN with Tungste...
 
Room 1 - 5 - Thủy Đặng - Load balancing k8s services on baremetal with Cilium...
Room 1 - 5 - Thủy Đặng - Load balancing k8s services on baremetal with Cilium...Room 1 - 5 - Thủy Đặng - Load balancing k8s services on baremetal with Cilium...
Room 1 - 5 - Thủy Đặng - Load balancing k8s services on baremetal with Cilium...
 
Room 1 - 4 - Phạm Tường Chiến & Trần Văn Thắng - Deliver managed Kubernetes C...
Room 1 - 4 - Phạm Tường Chiến & Trần Văn Thắng - Deliver managed Kubernetes C...Room 1 - 4 - Phạm Tường Chiến & Trần Văn Thắng - Deliver managed Kubernetes C...
Room 1 - 4 - Phạm Tường Chiến & Trần Văn Thắng - Deliver managed Kubernetes C...
 
Room 1 - 1 - Benoit TELLIER - On premise email inbound service with Apache James
Room 1 - 1 - Benoit TELLIER - On premise email inbound service with Apache JamesRoom 1 - 1 - Benoit TELLIER - On premise email inbound service with Apache James
Room 1 - 1 - Benoit TELLIER - On premise email inbound service with Apache James
 

Dernier

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 

Oreilly solinea-managing-openstack

  • 1. Accelerating the adoption of Cloud Computing Beyond Installation: Managing Your OpenStack Cloud May 6th, 2014
  • 2. !  Ken Pepple is the co-founder and Chief Technology Officer of Solinea !  Prior to founding Solinea, he led the introduction of Internap's OpenStack-based public cloud services while serving as their Director of Cloud Development !  Code contributor since Bexar release of OpenStack !  Author of O'Reilly Media's "Deploying OpenStack" and several other books 2 Speakers
  • 3. Introduction !  Installing OpenStack gets all the attention … !  … but distributions like Red Hat OSP and Cloudscaling are attacking this problem !  They will beat it. Then what ? !  The reality is that OpenStack management is what we should be focusing on. !  Installation is 2 – 3 weeks … management is forever 3
  • 4. OpenStack Architecture OpenStack Object Store OpenStack Image Service OpenStack Compute OpenStack Dashboard OpenStack Identity Service OpenStack Compute API / Admin API keystone (service & admin APIs) nova-api (OS, EC2, Metadata, Admin) nova-consoleauth nova-cert/ objectstore nova-console nova-*proxy VNC/Spice OpenStack Object API http://www.solinea.com Queue nova-compute nova-scheduler nova database OpenStack Compute API OpenStack Image API Horizon OpenStack Image API identity backend swift-proxy objectcontaineraccount object store account DB container DB OpenStack Object API HTTP(S) OpenStack Object API OpenStack Identity API OpenStack Identity API OpenStack Identity API OpenStack Image API OpenStack Identity API OpenStack Image API catalog backend token backend OpenStack Identity API hypervisor libvirt, XenAPI, etc. HTTP(S) Amazon Web Services EC2 API Internet / Enterprise Network OpenStack Network Service glance-api glance database OpenStack Block Storage OpenStack Block Storage API cinder-api cinder-volume neutron-server neutron plugin(s) OpenStack Identity API cinder-scheduler cinder database OpenStack Network API network provider OpenStack Block Storage API OpenStack Network API policy backend Queue OpenStack Network API neutron database neutron agent(s) nova-conductor memcached OpenStack Identity API ⁃ OpenStack Command Line Tools (nova-client, swift-client, etc.) ⁃ Cloud Management Tools (Rightscale, Enstratius, etc.) ⁃ GUI tools (Cyberduck, iPhone client, etc.) volume provider cinder-backup OpenStack Object API OpenStack Identity API Queue OpenStack Block Storage API OpenStack Orchestration heat-api heat-engine heat database Queue cloudwatch-api OpenStack Orchestration API OpenStack Identity API OpenStack Compute API OpenStack Bock Storage API OpenStack Network API glance-api OpenStack Database trove-api trove-taskmgr trove database trove-conductor OpenStack Identity API OpenStack Database API OpenStack Block Storage API OpenStack Orchestration API OpenStack Compute API Agent Queue 4 * Ceilometer omitted for clarity
  • 6. OpenStack Management Basics !  Development and test cluster –  Smaller, but representative –  Same set and version of services –  Reproduce problems, test fixes and practice upgrades !  Configuration management system –  Chef, Puppet, Ansible, SaltStack, etc. –  Your OpenStack distribution already uses one –  Pick one and stick with it – everything falls under it !  Skilled and trained staff –  Experienced Linux admins with virtualization skills –  Network architects that understand cloud –  Trained for OpenStack 6
  • 7. Developing Toolkit for Management !  Troubleshooting tools –  Operating system level tools –  OpenStack specific tools !  Administration tools –  OpenStack specific tools !  Monitoring tools –  Monitoring platforms –  Log management tools !  Specialized tools –  Image creation 7
  • 8. Troubleshooting Tools !  Tools used to investigate or fix problems within your stack !  Mostly Linux tools but some are specific to OpenStack !  These need to span virtualization, networking and normal system administration 8
  • 9. Troubleshooting Hypervisor !  Vary by hypervisor, each one has it’s own tooling !  Map VM to hypervisor by OpenStack CLI with nova show! !  Investigate hypervisor through virsh tools !  Also can access backing store for VM through hypervisor mount point or Cinder volume 9
  • 10. VM Troubleshooting # nova list! +---------------------+-------+---------+------------+-------------+-------------------------------------+! | ID | Name | Status | Task State | Power State | Networks |! +---------------------+-------+---------+------------+-------------+-------------------------------------+! | f94b097d-b030-473b- | ken | ACTIVE | - | Running | rdonet=192.168.90.11 |! +---------------------+-------+---------+------------+-------------+-------------------------------------+! # nova show f94b097d-b030-473b-86a3-d501091c650b! +--------------------------------------+------------------------------------------------------------+! | Property | Value |! +--------------------------------------+------------------------------------------------------------+! | OS-EXT-AZ:availability_zone | nova |! | OS-EXT-SRV-ATTR:host | localhost.localdomain |! | OS-EXT-SRV-ATTR:hypervisor_hostname | localhost.localdomain |! | OS-EXT-SRV-ATTR:instance_name | instance-0000000e |! | OS-EXT-STS:power_state | 1 |! | OS-EXT-STS:task_state | - |! | OS-EXT-STS:vm_state | active |! | OS-SRV-USG:launched_at | 2014-05-06T06:13:01.000000 |! | created | 2014-05-06T06:11:55Z |! | flavor | m1.small (2) |! | hostId | 7e31bda83a3586907464e8e68f83a035bf9fa500d9579b2b807fa9f0 |! | id | f94b097d-b030-473b-86a3-d501091c650b |! | image | cirros-0.3.2-x86_64 (f66d54e8-f8bd-4220-930f-86b6b44dfe4d) |! | rdonet network | 192.168.90.11 |! | security_groups | default |! | status | ACTIVE |! +--------------------------------------+------------------------------------------------------------+! # vish list! Id Name State! ----------------------------------------------------! 1 instance-0000000e running! 10
  • 11. Troubleshooting Backing Store (Ephemeral) # cd /var/lib/nova/! # ls! buckets CA images instances keys networks tmp! # cd instances/! # ll! total 16! drwxr-xr-x. 2 nova nova 4096 May 6 09:03 13e86b72-7e14-43f5-ab2f- e7abf117213f! drwxr-xr-x. 2 nova nova 4096 May 2 11:25 _base! -rw-r--r--. 1 nova nova 45 May 5 23:18 compute_nodes! drwxr-xr-x. 2 nova nova 4096 Apr 30 19:28 locks! # cd 13e86b72-7e14-43f5-ab2f-e7abf117213f/! # ll! total 208! -rw-rw----. 1 qemu qemu 0 May 6 09:04 console.log! -rw-r--r--. 1 qemu qemu 262656 May 6 09:03 disk! -rw-r--r--. 1 nova nova 79 May 6 09:03 disk.info! -rw-r--r--. 1 nova nova 1529 May 6 09:04 libvirt.xml! # file disk! disk: Qemu Image, Format: Qcow , Version: 2! 11 The ‘disk’ file is our qcow image. The XML file is the KVM template.
  • 12. Troubleshooting Network !  Combination of Linux, OpenvSwitch and OpenStack tools !  OpenStack tools will show logical configuration of Neutron’s ports, routers and subnets –  neutron port-list, net-list and router- list! !  OpenvSwitch will map internal and external bridges –  ovs-vsctl and ovs-dpctl! !  Linux tools will show you inside VLAN and Linux namespaces –  ip netns, iptables and tcpdump! 12
  • 13. Network Troubleshooting: Router # ip netns show! qdhcp-8a496b23-ef2c-4170-9919-611d9a12180f! qrouter-e439ff2a-1973-4cda-86a4-20c977eec828! qdhcp-16b5549e-3a1a-4254-b122-f7507f003597! ! ! # ip netns exec qrouter-e439ff2a-1973-4cda-86a4-20c977eec828 ifconfig! qg-bbe18331-0c Link encap:Ethernet HWaddr FA:16:3E:47:63:D0! inet addr:192.168.57.132 Bcast:192.168.57.255 Mask:255.255.255.0! inet6 addr: fe80::f816:3eff:fe47:63d0/64 Scope:Link! UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1! RX packets:1335 errors:0 dropped:0 overruns:0 frame:0! TX packets:145 errors:0 dropped:0 overruns:0 carrier:0! collisions:0 txqueuelen:1000! RX bytes:197195 (192.5 KiB) TX bytes:13110 (12.8 KiB)! ! qr-c4e2b047-4a Link encap:Ethernet HWaddr FA:16:3E:FD:4E:A9! inet addr:192.168.90.1 Bcast:192.168.90.255 Mask:255.255.255.0! inet6 addr: fe80::f816:3eff:fefd:4ea9/64 Scope:Link! UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1! RX packets:364 errors:0 dropped:0 overruns:0 frame:0! TX packets:309 errors:0 dropped:0 overruns:0 carrier:0! collisions:0 txqueuelen:1000! RX bytes:34760 (33.9 KiB) TX bytes:36569 (35.7 KiB)! ! ! ! # ip netns exec qrouter-e439ff2a-1973-4cda-86a4-20c977eec828 netstat -nr! Kernel IP routing table! Destination Gateway Genmask Flags MSS Window irtt Iface! 192.168.57.0 0.0.0.0 255.255.255.0 U 0 0 0 qg-bbe18331-0c! 192.168.90.0 0.0.0.0 255.255.255.0 U 0 0 0 qr-c4e2b047-4a! 13
  • 14. Troubleshooting OVS Bridges # ovs-vsctl show! 06667946-811b-4c7b-97a5-eafc8386e9ff! Bridge br-int! Port "qvo246622d1-02"! tag: 2! Interface "qvo246622d1-02"! Port "tap3dfc8b70-ee"! tag: 1! Interface "tap3dfc8b70-ee"! Port "tapecef7610-4f"! tag: 2! Interface "tapecef7610-4f"! Port "tapc4e2b047-4a"! tag: 2! Interface "tapc4e2b047-4a"! Port br-int! Interface br-int! type: internal! Bridge br-ex! Port br-ex! Interface br-ex! type: internal! Port "eth1"! Interface "eth1"! Port "tapbbe18331-0c"! Interface "tapbbe18331-0c"! ovs_version: "1.11.0"! 14 Neutron’s integration bridge connecting VMs Neutron’s external bridge Physical NIC for internet access
  • 15. Monitoring !  Metering is not monitoring –  Ceilometer isn’t a monitoring solution !  Horizon doesn’t save history !  Monitor for FCAPS: fault, configuration, accounting, performance and security !  Needs to be instrumented at multiple levels –  Hardware/Operating System, OpenStack, VM –  Although VM monitoring may be left to the user !  Needs to be used across all elements and processes 15
  • 16. Operating System Monitoring !  Required set of information as any other set of systems –  CPU, memory, availability, etc. !  Process level information –  RabbitMQ, database, OpenStack processes, etc. !  Should rely on host sending information to monitoring server (not ping model) !  Ideally has APIs and strong discovery to aid automation 16
  • 17. !  Installed as part of many distributions !  Open source !  Easy installation and usage !  API is an add-on module Nagios 17
  • 18. Nagios OpenStack Plugin !  Add service checks for some OpenStack services –  Glance –  Keystone –  Nova –  Swift API and dispersion !  Available in most Linux distributions –  # sudo apt-get install nagios-plugins-openstack! !  More information and checks available at http://openstack.prov12n.com/monitoring- openstack-nagios-3/ 18
  • 19. !  Open source monitoring tool used at several large service provider clouds !  Strong API and discovery modes !  Templates can be applied to host groups for monitoring Zabbix Console 19
  • 20. Zabbix Templates !  Templates created for each type of server –  Compute nodes, controllers, Swift object servers, etc. !  Each template checks processes running and configuration management running –  Should issue commands against processes not rely on process table to catch hung processes !  All nodes also get default OS template !  Alerting set for pagerduty 20
  • 21. Log Management !  More than just for error viewing !  Primary source of OpenStack data !  Useful for –  Finding OpenStack bugs –  Understanding event timings (spin new VM) –  Visualizing cluster level statistics (VMs running) –  Creating dashboards !  Can be challenging to store, query and interpret data –  Clusters can generate GBs per day –  Use dedicated tools and data stores –  May be required for legal / audit reasons 21
  • 22. !  Commercial log management solution !  Visualization, ad hoc queries, post processing and add- ons !  Easy to setup dashboards !  Supported with relatively easy installer Splunk 22
  • 23. !  Open source alternative to Splunk !  Requires more complicated setup to parse logs correctly !  Provides ad hoc queries as well as dashboards !  Active community Logstash, Kibana and ElasticSearch 23
  • 24. Interesting Uses for Log Data !  VMs –  CPUs/Instances by hypervisor (scheduler efficiency) –  Total vCPUs/CPUs in cluster available versus used –  Spawn success and failures –  Spawn time –  Top Users of VMs/vCPUs !  Authentication –  Tokens generated versus invalidated –  Failed authentications !  Errors –  All error messages / stack traces create alert !  Logs that have stopped (zombie processes) 24
  • 25. “Canary” Scripts !  Highest level check for cloud infrastructure: “Can we spin a new VM ?” –  Custom written script that starts VM, attaches block storage, assign IP address, pings outside world then terminates –  Logs to all actions with timings into log management solution !  Run every 5 to 15 minutes !  Also can be run interactively !  This should be written for your own site 25
  • 26. Specialized Tools !  Many sites will want to be able to create their own custom images –  CI/CD “golden images” !  Several commercial and open source alternatives –  CohesiveFT Server3 –  Elasticbox (https://www.elasticbox.com/) –  Packer (http://www.packer.io/) !  All provide ability to create images with specified software pre-installed via command line 26
  • 27. Manageability Improvements in Icehouse !  Nova live upgrade !  Swift discovery API 27
  • 28. Rolling (“live”) Upgrades !  Ability to upgrade a running cluster to new release !  Upgrade controller(s) first then individual compute nodes !  Requires several pre-conditions –  Neutron upgraded first –  Nova-conductor being used to isolate DB schemas –  Set icehouse compatibility mode /etc/nova/nova.conf
 # Set a version cap for messages sent to compute services. If
 # you plan to do a live upgrade from havana to icehouse, you
 # should set this option to "icehouse-compat" before beginning
 # the live upgrade procedure. (string value)
 compute=icehouse-compat
 28
  • 29. !  API calls to /info will return information about the cluster !  Users now able to take advantage of the unique features available in each cluster !  Turned on by default but can be disabled # swift capabilities! Core: swift! Options:! account_listing_limit: 10000! container_listing_limit: 10000! max_account_name_length: 256! max_container_name_length: 256! max_file_size: 5368709122! max_meta_count: 90! max_meta_name_length: 128! max_meta_value_length: 256! max_object_name_length: 1024! strict_cors_mode: True! version: 1.13.1! Additional middleware: keystoneauth! Additional middleware: staticweb! Additional middleware: tempurl! Options:! methods: ['GET', 'HEAD', 'PUT']! 29 Swift Discoverability