2. Have you been stalking
your servers?
Marji Cermak
Sysadmin & DevOps Engineer at Morpht
marji@morpht.com
@cermakm
3. The rule of 3 things
picture: http://www.flickr.com/photos/helenaperezgarcia/5692392667/
4. The rule of 3 things
1. What is monitoring and why do you want to
monitor
2. Some monitoring tools available for you
3. It is easy to start with monitoring.
7. Monitoring
Monitoring is an intermittent (regular or
irregular) series of observations in time,
carried out to show the extent of compliance
with a formulated standard or degree of
deviation from an expected norm.
J. M. Hellawell (1991), modified by A. Brown
(2000), http://jncc.defra.gov.uk/page-2268
nature conservation area
8. Why you need to monitor
● to know about the bad news before your
customers (or your boss)
9. Why you need to monitor
● to know about the bad news before your
customers (or your boss)
● to scale up your server in advance
10. Why you need to monitor
● to know about the bad news before your
customers (or your boss)
● to scale up your server in advance
● to tune up your app
11. Why you need to monitor (cont.)
● to prove your uptime of 99.999 :)
12. The fun of the nines
Source: http://en.wikipedia.org/wiki/High_availability
Nines: http://en.wikipedia.org/wiki/List_of_unusual_units_of_measurement#Nines
13. Why you need to monitor (cont.)
● to prove your uptime of 99.999 :)
● to minimise downtime (expensive)
14. Why you need to monitor (cont.)
● to prove your uptime of 99.999 :)
● to minimise downtime (expensive)
● to capture customer information
15. Why you need to monitor (cont.)
● to have data / metrics to diagnose
36. Nagios /ˈnɑːɡiːoʊs/
Provides monitoring of:
● network services (SMTP, POP3, HTTP,
NNTP, ICMP, SNMP, FTP, SSH),
● host resources (processor load, disk usage,
system logs),
● anything else like probes (temperature,
alarms, etc).
Many plugins available.
37. Nagios /ˈnɑːɡiːoʊs/
Name and Pronunciation:
● NetSaint -> "Nagios Ain't Gonna Insist On
Sainthood"
● Agios' a transliteration of the Greek word
άγιος (saint)
38. Nagios /ˈnɑːɡiːoʊs/
● alerts by email/pager/IM...
● alerts to different contacts
● notification escalation
● service / host dependencies
● soft / hard states
45. Munin
● master / node architecture
● connects to all nodes at regular intervals
● it uses the RRDtool (round robin database
tool, handles time-series data)
51. ● they complement each other
● nagios normally alerts on one “service”
● munin can be used to correlate different
things
Nagios & Munin
52. APC - what is it?
The Alternative PHP Cache (APC) is a free
and open opcode cache for PHP.
53. APC - what is it?
The Alternative PHP Cache (APC) is a free
and open opcode cache for PHP.
Its goal is to provide a free, open, and robust
framework for caching and optimising PHP
intermediate code.
Inside your webserver (not a webcache)
60. How to install these tools?
Munin
sudo apt-get install munin munin-node
Nagios
sudo apt-get install nagios3
APC dashboard
php.apc script from php-apc package
61. How to configure these?
● It is a bit fiddly
● There are many guides targeting beginners
● You don’t want to do it again and again
62. puppet – a quick way to start
system for automating system administration
tasks
63. puppet – a quick way to start
● a declarative language for expressing
system configuration,
64. puppet – a quick way to start
● a declarative language for expressing
system configuration,
● a client and server for distributing it
65. puppet – a quick way to start
● a declarative language for expressing
system configuration,
● a client and server for distributing it
● and a library for realising the configuration.
79. Questions
Here is the get started monitoring repo:
https://github.com/morpht/stalk-your-box
Marji Cermak
Sysadmin & DevOps Engineer at Morpht
marji@morpht.com
@cermakm
81. THANK YOU!
WHAT DID YOU THINK?
Locate this session at the
DrupalCon Prague website:
http://prague2013.drupal.org/schedule
Click the “Take the survey” link