Munin User Group Japan http://munin.jp/
Masahito Zembutsu @zembutsu
September 8, 2012 OpenSource Conference 2012 Tokyo/Fall (#osc12tk)

“Practical Resource Monitoring with Munin - English Edition”
Nice to meet you. I’m @zembutsu.
Thank you for giving me the opportunity to present!




These are characters from Touhou Project, and "Please take it easy!!" (yukkuri site itte ne!) is a famous piece of slang in Japan.
                                       http://en.wikipedia.org/wiki/Touhou_Project
Today’s topic is…
   Recipe for monitoring
I expect that the various burdens on
operations engineers can be reduced by using

         Munin.
And now…
Why Am I Here?                                                                     this is me


•          Masahito ZEMBUTSU @zembutsu

    – Solutions Engineer ( fiery zeal Otaku mind engineer )
       • Working as a server infrastructure engineer.
       • I want to provide relaxation and rest for the engineers. (Operation/Monitoring/Automation)
    – Open source and cloud computing communities
       • My website http://pocketstudio.jp/
    – Experience                                                http://opencloud.jp/                http://jaws-ug.jp/
       •   April 2000 - Support engineer of server hosting and the ISP
       •   May 2008 - Company internal network management and support
       •   November 2010 - Service development and upper escalation operation
                                                                              Don’t mind the careful thing!
       •   July 2012 – Operation, Development, Research
                        at datacenter somewhere.
filleting mackerel
It resembles server operations.


                       DevOps!
―Don't forget. always, somewhere,
                   someone is fighting for you.
               ―As long as you remember her.
                              you are not alone.
Operation      (Reference: “Puella Magi Madoka Magica” Episode 12 “My Very Best Friend” )




Monitoring
This is an illustrative photo of a data center like the one I work in.




                                             This Photo is under creative commons license by torkildr
                                             http://www.flickr.com/photos/torkildr/3462606643/sizes/l/in/photostream/
A Dedicated Hosting Services
A HUMAN WORK
         Shutdown Attack, An Unfamiliar Specifications,
         Cloudcomputing’s Arrival in Japan, Shape of Server,
         A Business That’s Changing, My Purest Heart for Our Customers.

Troubleshooting
DECISIVE BATTLE
          The Phone That Never Stops Ringing, The Day a Datacenter Stood Still,
          The Choice of Priority, In sickness unto shutdown, and…, Sales
          Representative’s Invasion, Customer’s office the Throne of Souls, Tears.


You’re a loser only when you fail to try
We can (not) advance.
          The Birth of Special task force, The Value of Miracles, At Least, Be Human.
If you are a server administrator,
you have probably thought this at least once.
                              Perhaps…
My Little Servers Can't Be This Heavy...
         But Munin may help,
         and may lead to a solution to the problem.
This is what I want to share today.
• I think resource monitoring needs to be part of
  the operations workflow.

• As a result, it may reduce the burden on
  administrators. I'm extremely happy. XD

• What we need is a culture of leaving the office on time!!
  (Or is that just me?)
Agenda
 1. What is Munin?

 2. Munin’s Architecture

 3. How to use Munin

 4. Practical troubleshooting!

 5. MY VERY BEST MONITORING TOOL
I hope…
1. Let's obtain a weapon
   called “resource monitoring” for us.
                                                                                              Wille zur Macht




2. Let's improve the efficiency of
   our work (server and network operations).

“Let's find happiness together.”
(Reference: Kiichi Goto, Patlabor: The Movie, 1989) I guess everybody's happy, that's fine.
Munin User Group Japan
     Munin Community in Japan
Munin.jp
• Munin User Group Japan
   – http://munin.jp/

• Wiki
   – http://munin.jp/wiki/

• Demo
   – http://demo.munin.jp/

• How to join us
   – http://munin.jp/mailing-list/
#1
What is Munin?
I dare to say, Munin is
a resource monitoring tool.
 Munin is a networked resource monitoring tool.
                                           I dare to say!!
This is the overview (front page) of Munin 2.0.
When you click any graph, the day / week / month / year graphs are displayed.
This zooming function is convenient. It is useful when I make reports.
The vertical axis is the servers and the horizontal axis is the metrics. The grouping function is a distinctive feature.
The Melancholy of
Server Administrators
  Monitoring is like a box of chocolate
By the Time we Realized It, It Had Already Begun.

• Troubles that alert systems cannot detect are increasing
  – Mainly with customers running social networking services
  – By the time an alert threshold is exceeded, it is already too late.

• Customers demand a rapid response
  – Because the loss per second is orders of magnitude larger
    than before.
  – A loss of several hundred dollars per minute :(
“There is something weird, will you check servers? :)”
                                     A request from one of our customers
• A very difficult request...
  – Clearly identifying the cause often takes time.

• I want to do even better!
   – Yes!! I rouse myself and go to work.
     But the administrators get exhausted…
   – I want to aim at improving the service, but this
     way of thinking is bad. Why? Let’s see the next slide.
An old network configuration.

                BIND

                                                                 A typical web server
                                                                 setup was, at most,
                                                                 this kind of
                                                                 configuration.

                       One web server and one database server.
                                  It’s very simple!
On the other hand, at present…
This Just Can't Be Right!!




                 BIND




The number of servers and of managed objects has increased compared with the past.
Therefore support takes more time, and the degree of difficulty rises, too.
Why did this happen?
• Because the environment is changing
  – Network
  – Server
  – Software
  – Middleware
  – Application
  – etc
Be freed from CONSOLE
     Ace Console: Fires of Liberation
The most important thing in troubleshooting
• Investigating the cause has top priority.
  “When we act, the first condition is to notice.
    Having technique does not mean that anything can be
    settled. Noticing becomes necessary before technique.
    Japan has any number of technical experts, yet problems
    cannot be readily settled. The reason is that they
    do not notice.”
    Soichiro Honda (2008) “Zakkubaran” (candidness) PHP inc, 10pp.
    http://en.wikipedia.org/wiki/Soichiro_Honda
You sure that’s enough armor(tools)?
• “No problem. Everything’s fine.”
   – ps
   – top
   – vmstat
   – iostat
   – free
   – sar (sysstat) …etc
                       Really?
Past   Present day
Situation has changed
Past                                    Now (present day present time, hahaha!!)
• One or several servers                • Multiple servers in the same
• Apache, Sendmail, Perl                  network (we assume)
• PostgreSQL, MySQL                     • Conventional software +
• Network appliance                       nginx, Tomcat, Ruby, PHP, Python,
  (sometimes)                             memcached, Key-Value Store, Hadoop,
                                          Cassandra, MongoDB… etc

• No need to scale                      • The need for scalability
• Upgrading is effective                • Upgrading is not effective

               I think that one of the answers to this problem
                     is resource monitoring using Munin.
The essence of Munin is
visualization of many resources
    I Know What Your Server Did Last Summer
MRTG has declined
 Is This MRTG? No, This Is Munin.

        We have lost a hero to our glorious and noble cause, but does this foreshadow our defeat?
        No. It is a new beginning. Compared to Cloud Computing Federation the national resources
        of Dedicated Server are less than one thirtieth of theirs. Despite this major difference,
        how is it that we have been able to fight the fight for so long? It is because our goal in
        this war is a righteous one. It’s been over fifty years since the elite of Cloud Computing,
        consumed by greed took control of the Cloud Computing Federation. We want our freedom.
        Never forget the times when the Federation has trampled us! We, the Principality of
        Dedicated Server, have had a long and arduous struggle to achieve freedom for all
        engineers of our great network. Our fight is sacred, our cause divine.
        My beloved brother, MRTG, was sacrificed. Why? The war is at a stalemate.
Comparing resource monitoring tools

http://www.munin-monitoring.org/   http://oss.oetiker.ch/rrdtool/   http://oss.oetiker.ch/mrtg/
                                                       I’m interested, too!


                                                                      http://www.zabbix.com/

          http://ganglia.info/
                                     http://www.cacti.net/             http://www.nagios.org/
Comparative table
 Tool name   Type                           Datastore                Config    Web interface    Alerting
 Munin       Resource monitoring            RRDTool                  CUI       Reference only
 Cacti       Resource monitoring            RRDTool & MySQL          CUI/GUI
 MRTG        Resource monitoring            original                 CUI       Reference only   ×
 Zabbix      IT infrastructure monitoring   MySQL, PostgreSQL, etc   GUI
 Nagios      IT infrastructure monitoring   MySQL or PostgreSQL      CUI/GUI

  We are friends all the time...

                              Each has both good points and bad points.
                    My team uses Munin and a Nagios-based tool, each where it fits.
What is Munin?
The Munin We Saw That Day
About Munin                                                                 Be alert!

• http://munin-monitoring.org/

• Resource monitoring tool
   – Munin can analyze resource trends
   – “what just happened to kill our performance?”

• Plug and Play architecture
   – It can monitor many items by default
  Munin is a networked resource monitoring tool that can help analyze resource trends and
  "what just happened to kill our performance?" problems. It is designed to be very plug and
  play. A default installation provides a lot of graphs with almost no work.
Developers
          Munin project                                github
•   http://munin-monitoring.org/       •   https://github.com/munin-monitoring




       Documents / FAQ / Trac / Wiki            Repository / tools / plugins
Progress in development
• Community based
  – Github
     • https://github.com/munin-monitoring
  – Mailing list
     • https://lists.sourceforge.net/lists/listinfo/munin-users
  – IRC
     • irc://irc.oftc.net/#munin

• Licence
  – GNU General Public License version 2
  – There is no commercial support
History
• 2002 - project began
  – The original name is “LRRD”

• 2004 - Munin 1.0 released
  – The name “munin-eye” was changed to “munin-node”
  – It took a long time, and daily improvement continued

• 2009 - Munin 1.4 released
  – This is probably the most widely deployed version of the 1.x series.

• May 30, 2012 - Munin 2.0 (stable) released
Where is the Japanese information?
   • NOT YET!
   • Let’s make it together now!
       – How about writing something on the wiki first?
          • http://munin.jp/wiki/


“I’m sorry, my apologies…”
“Is the number of invitations to the Munin user group ZERO again this week?
 Hmm? Do you have the will to do it?”
Munin?
The Secret Of Munin
This Photo is under creative commons license
http://en.wikipedia.org/wiki/File:Odin_hrafnar.jpg
What obstacle
                                      factors there are!!
                                      Are you getting
                                      wise with me?




Speaking munin-eye’s mind ( now, munin-node )
Summarize the points
• Munin is a resource monitoring tool. (GPL v2)

• Simple and powerful architecture.

• Munin frees us from a console. (effectiveness)

• Munin means “memory”.
           You are never alone!
    Munin is always here for you 24x7x365
#2
Munin’s Architecture
An Amazing Simple Munin
     The Architecture Of Munin
User’s viewpoint




                          The structure is simple.
                   The user just views the data on the server.
Client Server Model
  Munin Master And Munin Node
Let's look at how it works
  in a little more detail.
This is the data which we
referred to a little while ago.
This is the main work of the Munin master; its programs are
executed by cron.
They carry out data collection, threshold checking, and
generation of HTML files and graphs, one after another.
This is the “munin-node” agent. The Munin
 master acquires data via plugins.
munin-update connects to munin-node.
The port number of munin-node is TCP 4949.
Plugins are executed by munin-node; each plugin is a
script that acquires various data. munin-update stores the
acquired data in RRDtool.
Then munin-limits checks the thresholds.
Then munin-graph and munin-html
generate graphs and HTML from the
data (.rrd) stored by
RRDtool.
This flow is the basic operation of Munin.
  I think that it is really simple and cool!
Components of Munin
master ( SERVER )      munin-node ( CLIENT )
• Perl Libs            • Perl Libs
   – Munin::Common        – Munin::Common
• munin-cron           • munin-node
   –   munin-update       – config: munin-node.conf
   –   munin-limits       – Plugins
   –   munin-html      • Tools
   –   munin-graph        – munin-node-configure
• config: munin.conf      – munin-cron
About data collection
• munin-node collects various data.
• Port 4949(TCP)
  – Munin protocol
    •   LIST
    •   CONFIG
    •   FETCH
    •   VERSION
    •   QUIT
                     (T_T)4949
                     “4949” is a Japanese onomatopoeia for a "tearful face".
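The exchange on TCP 4949 can be tried by hand. The following is a minimal sketch, assuming a munin-node listening on localhost and the `nc` (netcat) command being installed. `munin_session` is a hypothetical helper name, not a Munin tool. Note that munin-node expects the commands in lower case.

```shell
#!/bin/sh
# Sketch: build the command sequence that a client sends to munin-node.
# munin_session is a hypothetical helper, not part of Munin.
munin_session() {
    plugin=$1
    # list:   show the plugin names on the node
    # config: graph definition of one plugin
    # fetch:  current values of one plugin
    # quit:   close the connection
    printf 'list\nconfig %s\nfetch %s\nquit\n' "$plugin" "$plugin"
}

# Usage (assumes munin-node on localhost:4949 and nc installed):
#   munin_session load | nc localhost 4949
```

Keeping the command text in a function makes it easy to look at what would be sent before any connection is opened.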
Data storage and graph generation are
the work of RRDtool
• Data format is RRD (round robin database)
  – /var/lib/munin/<group>/<hostname>-<plugin>-<field>-<type>.rrd
    -rw-r--r-- 1 munin munin 50612 Oct 18   2010 localhost-cpu-idle-d.rrd
    -rw-r--r-- 1 munin munin 50612 Oct 18   2010 localhost-cpu-iowait-d.rrd
    -rw-r--r-- 1 munin munin 50612 Oct 18   2010 localhost-cpu-irq-d.rrd
    -rw-r--r-- 1 munin munin 50612 Oct 18   2010 localhost-cpu-nice-d.rrd
    -rw-r--r-- 1 munin munin 50612 Oct 18   2010 localhost-cpu-softirq-d.rrd
    -rw-r--r-- 1 munin munin 50612 Oct 18   2010 localhost-cpu-steal-d.rrd
    -rw-r--r-- 1 munin munin 50612 Oct 18   2010 localhost-cpu-system-d.rrd
    -rw-r--r-- 1 munin munin 50612 Oct 18   2010 localhost-cpu-user-d.rrd



• About 50 KB per RRD file
  – More than 200 KB per plugin (MUST)
  – 150 to 250 files per munin-node (about 8 to 15 MB per node in total)
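These figures give a quick way to estimate disk usage on the master. The following is a minimal sketch, assuming about 50 KB per RRD file as stated on the slide. `estimate_rrd_mb` is a hypothetical helper name, and the node and file counts are examples only.

```shell
#!/bin/sh
# Sketch: estimate RRD disk usage on the Munin master, in MB (integer).
# Arguments: number of nodes, RRD files per node, KB per RRD file.
estimate_rrd_mb() {
    nodes=$1
    files_per_node=$2
    kb_per_file=${3:-50}   # about 50 KB per file, as on the slide
    echo $(( nodes * files_per_node * kb_per_file / 1024 ))
}

# Example: 300 nodes with 200 RRD files each
estimate_rrd_mb 300 200    # prints 2929
```

With several hundred nodes the total is only a few gigabytes, so storage is rarely the limiting factor.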
Graphs are generated per plugin
     Munin Master And Munin Node
Munin comes with many plugins
• System resources
  – CPU, memory, Load Average, disk, S.M.A.R.T…
• Network
  – Traffic, SNMP, HTTP loadtime, TCP, UDP, ICMP…
• Applications, middleware
  – Apache, Nginx, Sendmail, Postfix, MySQL,
    PostgreSQL, MongoDB, memcached, PHP… etc
Ex) Load Average plugin
• /etc/munin/plugins/load
  – “Load average” here is the five-minute average
  – It’s a symbolic link
    • The original is /usr/share/munin/plugins/load
  – Simple shell script
    echo -n "load.value "
    cut -f2 -d' ' < /proc/loadavg
  – Output
    load.value 3.22
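A plugin also has to answer the `config` request that defines its graph. The following is a minimal sketch of a complete load-style plugin, not the script shipped with Munin. The `LOADAVG_FILE` variable is an assumption added here so that the function can be tried on a machine without /proc/loadavg.

```shell
#!/bin/sh
# Sketch of a load-style plugin (not the script shipped with Munin).
# LOADAVG_FILE is a hypothetical override used only for trying it out.
load_plugin() {
    case "$1" in
        config)
            # Graph definition, returned for a "config" request
            echo "graph_title Load average"
            echo "graph_vlabel load"
            echo "load.label load"
            ;;
        *)
            # Value, returned for a "fetch" request:
            # field 2 of /proc/loadavg is the five-minute average
            printf 'load.value '
            cut -f2 -d' ' < "${LOADAVG_FILE:-/proc/loadavg}"
            ;;
    esac
}

# When saved as a plugin file, the last line would be:
#   load_plugin "$1"
```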
#3
How to use Munin
Munin setup!
   Make My Day
Environment
• Perl5
• OS
  – Linux
     • Source code ( version 2.0.6 )
     • Binary Package
          – Red Hat Enterprise Linux family ( EPEL )
          – Debian
          – openSUSE
  – MacOS X
  – Windows
Setup flow
•   Install Munin and Perl Libraries
•   Edit the config file ( munin.conf )
•   Set up munin-node ( munin-node.conf )
•   Check its graphs
Case) Red Hat Enterprise Linux
• Use EPEL*1(testing repository) package or source
• procedure
   – 1.   enabling EPEL
   – 2.   “yum install munin”
   – 3.   configure munin.conf
   – 4.   turn on munin-node and setup
   – 5.   check
 *1 Extra Packages for Enterprise Linux(EPEL) https://fedoraproject.org/wiki/EPEL
Case) Debian / Ubuntu
• Use apt (Debian PTS is testing) or Source
• Procedure
  – 1.   setting up Perl libraries (via apt-get)
  – 2.   install munin
  – 3.   configure munin.conf
  – 4.   turn on munin-node and setup
  – 5.   check
Basics of configuration
     How To Configure
Config files
Munin Master                         munin-node
• /etc/munin/munin.conf              • /etc/munin/munin-node.conf
  – Host tree (targeting nodes)        – Access control
  – Graph strategy                         • Host (IP address)
     • Cron or realtime generation         • Network CIDR
  – Paths                              – Node’s hostname
     • RRD files                       – Port number
     • logfiles                            • Default: TCP 4949 (T_T)
                                       – Plugin’s option
[munin.conf] set target node
[GroupName;node1.pocketstudio.net]
     address 127.0.0.1
     use_node_name yes
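When many nodes have to be registered, the host tree entries can be generated instead of typed. The following is a minimal sketch, assuming each node needs only the two settings shown above. `munin_host_entry`, the host names and the addresses are hypothetical examples.

```shell
#!/bin/sh
# Sketch: print one munin.conf host-tree entry.
# Arguments: group name, node host name, node address.
munin_host_entry() {
    printf '[%s;%s]\n' "$1" "$2"
    printf '    address %s\n' "$3"
    printf '    use_node_name yes\n\n'
}

# Example: two nodes in the same group (example names and addresses)
munin_host_entry GroupName node1.example.net 192.0.2.11
munin_host_entry GroupName node2.example.net 192.0.2.12
```

The output can be reviewed and then appended to munin.conf by hand.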
[munin-node.conf] Access control
• allow ^127\.0\.0\.1$
  – Regular expression (escape the dots)

• cidr_allow 192.0.2.0/24
  – Not a regular expression

• If you change these files, you must restart
  munin-node!
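Because `allow` takes a regular expression, the anchors and the escaped dots matter. The sketch below approximates the check with `grep -E`. munin-node itself evaluates the pattern in Perl, so this is only an illustration, and `allow_matches` is a hypothetical helper.

```shell
#!/bin/sh
# Sketch: test whether an address matches an "allow" pattern.
# grep -E only approximates the Perl regex used by munin-node.
allow_matches() {
    pattern=$1
    address=$2
    if printf '%s\n' "$address" | grep -Eq "$pattern"; then
        echo "allowed"
    else
        echo "denied"
    fi
}

allow_matches '^192\.0\.2\.1$' 192.0.2.1     # allowed
allow_matches '^192\.0\.2\.1$' 192.0.2.10    # denied
allow_matches '^192\.0\.2\.1'  192.0.2.10    # allowed: the missing "$" is too loose
```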
Basics of plugins
   How To Configure plugin
Basic knowledge of Munin plugin
• The original files are here ( shell or Perl scripts )
  – /usr/share/munin/plugins/
• How to use
  – Make a symbolic link in /etc/munin/plugins
  – Configure munin-node.conf
  – Restart munin-node (MUST)
  – Check the graphs and HTML
How to debug a plugin
• /usr/sbin/munin-run <plugin-name>
  – “--debug” shows more detail
  – The behavior is the same as when run by munin-node
  – Useful


• Command line tools ( made by me )
  – muninwalk & muninget ; Perl scripts
    https://github.com/zembutsu/muninwalk
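The output of `munin-run` can also be checked mechanically. The following is a minimal sketch, assuming a plain (non-multigraph) plugin whose output consists only of `<field>.value <number>` lines. `check_plugin_output` is a hypothetical helper, not part of Munin, and it accepts `U` (unknown) as a value.

```shell
#!/bin/sh
# Sketch: verify that every line on stdin looks like "<field>.value <number>".
# Prints the offending lines and returns non-zero if any are found.
check_plugin_output() {
    awk '
        !/^[A-Za-z_][A-Za-z0-9_]*\.value (U|-?[0-9]+(\.[0-9]+)?)$/ {
            print "bad line: " $0
            bad = 1
        }
        END { exit bad + 0 }
    '
}

# Usage (assumes Munin is installed on this host):
#   /usr/sbin/munin-run load | check_plugin_output && echo OK
```

A stray space such as `load .value 3.22` is reported as a bad line, which is an easy mistake to miss by eye.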
Plugins catalog
 How To Configure plugin
Apache
• Symbolic link
   # ln -s /usr/share/munin/plugins/apache_* /etc/munin/plugins/


• munin-node.conf
   [apache_*]
      env.url http://127.0.0.1:%d/server-status?auto
      env.ports 80


• httpd.conf
   ExtendedStatus On

   <Location /server-status>
     SetHandler server-status
     Order deny,allow
     Deny from all
     Allow from 127.0.0.1
   </Location>
MySQL
• Symbolic link
   # ln -s /usr/share/munin/plugins/mysql_* /etc/munin/plugins/


• munin-node.conf
   [mysql*]
     env.mysqlopts -u root -pPASSWORD
     env.mysqladmin /usr/bin/mysqladmin
BIND
• Symbolic link
   # ln -s /usr/share/munin/plugins/bind9_rndc /etc/munin/plugins/


• munin-node.conf
   [bind9_rndc]
      env.rndc         /usr/sbin/rndc
      env.querystats     /var/named/chroot/var/named/data/named_stats.txt
      user           root


• named.conf
    statistics-file "/var/named/data/named_stats.txt";
Plugins I made
   Plugins Catalog
https://github.com/zembutsu/AWS-EstimateCharge
Amazon Web Services: estimated charges in real time via the API
Power generation volume of the electric power company (TEPCO),
and the electricity usage rate (TEPCO).
Anything can be made into a Munin plugin if it can be expressed as a number.
How to make a plugin
      Take It Easy
Sample case; httping plugin
• http://www.vanheusden.com/httping/
              • "httping" is a command-line tool which can check
                response time of the Web server like a “ping”
                command.
              • If you set the -S option, you can check the time
                split into connect time and processing time.
                $ httping -S http://210.239.46.254/
                PING 210.239.46.254:80 (http://210.239.46.254/):
                connected to 210.239.46.254:80 (380 bytes), seq=0   time=0.10+0.69=0.79 ms
                connected to 210.239.46.254:80 (380 bytes), seq=1   time=0.08+0.47=0.55 ms
                connected to 210.239.46.254:80 (380 bytes), seq=2   time=0.07+0.68=0.75 ms
                connected to 210.239.46.254:80 (380 bytes), seq=3   time=0.12+0.66=0.77 ms
                Got signal 2
                --- http://210.239.46.254/ ping statistics ---
                4 connects, 4 ok, 0.00% failed
Plugin: httping_
#!/bin/sh
#
# Plugin to monitor HTTP response (httping)

#%# family=auto
#%# capabilities=autoconf

URL=${URL:-"http://localhost/"}
COUNT=${COUNT:-"5"}
httping_bin=$(which httping)

if [ "$1" = "autoconf" ]; then
       echo yes
       exit 0
fi

# Define graphing
if [ "$1" = "config" ] ; then
       echo "graph_args -r --lower-limit 0"
       echo "graph_title http response $URL"
       echo "graph_category httping"
       echo "graph_info httping response time: $URL"
       echo "graph_vlabel msec"
       echo "connect.label connect time"
       echo "connect.draw AREA"
       echo "connect.type GAUGE"
       echo "processing.label processing time"
       echo "processing.draw STACK"
       echo "processing.type GAUGE"
       exit 0
fi

# Output format is "xxx.value ***"
# format for httping 1.5.3 http://www.vanheusden.com/httping/
$httping_bin -c "$COUNT" -G -S "$URL" | tr '+|=' ' ' | awk '{connect+=$9; processing+=$10} END{print "connect.value",connect/'$COUNT'"\n""processing.value",processing/'$COUNT'}'

This is the substance of the httping plugin; the file itself is a simple shell script.
The contents are the definition of the graph and the commands that actually acquire the values.
The point is simply to acquire data, so a plugin can be written in any language, including Perl and PHP.
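The averaging step of the plugin can be tried offline on saved output. The following is a minimal sketch, assuming the `httping -S` line format shown in the sample output above (httping 1.5.3). `httping_average` is a hypothetical helper and, unlike the plugin, it counts only the lines that start with `connected to` and divides by the number of such lines.

```shell
#!/bin/sh
# Sketch: average connect and processing time from "httping -S" output on stdin.
httping_average() {
    # Turn "time=0.10+0.69=0.79" into "time 0.10 0.69 0.79"
    tr '+=' '  ' | awk '
        /^connected to/ { connect += $9; processing += $10; n++ }
        END {
            if (n > 0) {
                print "connect.value", connect / n
                print "processing.value", processing / n
            }
        }
    '
}

# Usage (assumes httping is installed):
#   httping -c 5 -G -S http://localhost/ | httping_average
```

Dividing by the number of matched lines keeps the average correct even when some requests fail.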
Config: httping_
• /etc/munin/plugin-conf.d/httping
      [httping_localhost]
         env.URL        http://pocketstudio.jp/
         env.COUNT      5

      [httping_blog]
         env.URL        http://pocketstudio.jp/log3/
         env.COUNT      5

      [httping_node1]
         env.URL      http://node1.pocketstudio.net/
         env.COUNT    5

•   # ln -s /usr/share/munin/plugins/httping_ /etc/munin/plugins/httping_localhost
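With several targets, the symbolic links can be created in a loop. The following is a minimal sketch, assuming the section names from the config above. `link_httping` is a hypothetical helper, and the directories are passed as arguments so that it can be tried outside /etc/munin.

```shell
#!/bin/sh
# Sketch: create one symlink per target for the wildcard plugin "httping_".
# Arguments: plugin source dir, plugin link dir, then the target names.
link_httping() {
    src_dir=$1
    link_dir=$2
    shift 2
    for name in "$@"; do
        ln -s "$src_dir/httping_" "$link_dir/httping_$name"
    done
}

# Usage (as root, then restart munin-node):
#   link_httping /usr/share/munin/plugins /etc/munin/plugins localhost blog node1
```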
httping live demo
• http://demo.munin.jp/munin2/httping-day.html




Left: This server has no problem with either response time or processing time.
Right: For this server group, the blue part (processing time) is large. A certain CMS takes up the processing time.
       On the other hand, we can see that the network is fine.
#4
Practical troubleshooting!
Never say never.
• Agility is the pivot of the service (in my case)
  – Look out for the cause of the trouble in order to solve it
     • Hardware, software, or network?
  – We need to investigate promptly
     • where the problem is happening
Live Munin demo
• http://demo.munin.jp/
  – Let's observe the resource situation through
    this Munin demonstration site.
     • Where is the bottleneck, or where will it be?
     • Even without logging in to a server, you can
       look at many resources.
This Just Can’t Be Right
      A Real Troubleshooting
Case) Identifying unauthorized access
• By the Time we Realized It, It Had Already
  Begun.
• Situation
  – 1. Error emails began to arrive at postmaster
  – 2. There was no alert from the monitoring tool
  – 3. So I first checked the resources in Munin
  – 4. From the situation, I identified that the CMS had a
    vulnerability, and acted promptly.
 Thanks to Munin, I was able to do all of the above in a short time.
How to find it.
Sendmail’s queue rose suddenly    Load average showed no problem
I confirmed the time                            MySQL’s queries
       when traffic was strange                          rose suddenly, too




From the situation above, I suspected unauthorized access to the CMS. Indeed, when I investigated
the logs for that time, I found an attack on a specific URL.
Identifying the cause and acting on it would have taken more time if I had not used Munin.
#5
My Very Best Monitoring Tool
New Features of Munin 2.0
    I think that Munin is truly wonderful
Munin 2.0 has new features!
• Better UI and CGI integration
  – New look, Graph Zooming, FastCGI
• asynchronous I/O support
  – Better performance
• Native SSH transport
  – secure (port 22) & easy setup
• asynchronous proxy support
  – async-server substitutes for munin-node
• And more…
  – https://github.com/munin-monitoring/munin/blob/devel/Announce-2.0
[RemoteNetwork;backend-DB]
   address ssh://210.239.46.254/opt/munin/lib/munin-async-client --spooldir /var/opt/munin/spool/192.168.0.230 --spoolfetch
   use_node_name yes
No munin, No Troubleshoot.
     I'm Not Afraid of Anything Anymore
Munin changed support flow (my case)
• If I don’t use tools
   – Troubleshooting means executing various commands (sysstat) and
     investigating the log files.
   – But this method takes a long time and many people,
     and is bad for the service.

• If I use Munin (now)
   – Even if I do not log in, I can understand the situation.
   – I can spot abnormalities visually
      • “I see the ending of this troubleshooting!”
   – Agile Support
      • Troubleshooting that follows Plan-Do-Check-Action (PDCA) cycles.
In my dedicated server hosting work
• I really depend on Munin
  – I always set up Munin.
  – Munin runs on almost all of the several
    hundred servers which I manage directly.
  – I think that Munin is indispensable to
    improving our service quality.
                                        Neat
                                        I cannot part with Munin for my work.
                                        You believe it!
                                        BAM  BAM!
Troubleshoot PDCA (Law of Cycles)
• Plan: Detecting the problem and the situation
   – “Presage!!” “What are these alerts?” “For real?”
• Do: Suppose a cause
   – “OK, Munin. Please tell me where the trouble lies hidden.” “Fire!” “Please stop!!”
• Check: Check resources remotely
   – “I'm just telling you what I just saw in Munin!!”
• Action: Log in and execute commands
   – “Wow!” “click-clack click-clack”
You are never alone!
Munin is always here for you
       24 x 7 x 365
   The Only Thing I Have Left To Guide Me
Munin’s overview
・Munin is a resource monitoring tool that specializes
 in helping you notice problems through visualization.

・Simple architecture, and many plug-ins.

・It is most suitable for systems that
 need quick support in a short time.
Conclusion                                * This is my personal impression.




 No munin, No Operation.
 While there’s Munin, there’s hope.
 MY VERY BEST MONITORING TOOL.
 Thank you for MUNIN. Good-bye to MRTG.
I wish…
• If my presentation has made you interested
  in Munin, I would be glad if you tried it.

• Tomorrow is another day. Up to you.

                    Squidn’t you use Munin?
                    (Shouldn’t)
Questions?
• Do you have any questions about Munin?


              I'm glad you asked.
              As a reward, I will give you the right to buy Opoona.
              (But it is in the bargain bin...)
References
•    Munin
      – http://munin-monitoring.org/

•    Munin User Group Japan
      – http://munin.jp/
      – http://munin.jp/wiki/

•    Website
      – Waiting for Munin 2.0 – Introduction – Personal Workflow Blog
           •   http://blog.pwkf.org/post/2010/06/Waiting-for-Munin-2.0-Introduction


      – /tags/2.0.0/ChangeLog – Munin – Trac
           •   http://munin-monitoring.org/browser/tags/2.0.0/ChangeLog


    Please send feedback to zem@pocketstudio.jp or @zembutsu ( twitter )
    Thank you for reading!

Practical resource monitoring with Munin (English edition)

  • 1. Munin User Group Japan http://munin.jp/ Masahito Zembutsu @zembutsu September 8, 2012 OpenSource Conference 2012 Tokyo/Fall (#osc12tk) “Practical Resource Monitoring with Munin - English Edition”
  • 2. Nice to meet you. I’m @zembutsu. Thank you for giving me the opportunity to present! They are characters of Touhou Project, and "Please take it easy!!"(yukkuri site itte ne!) is a famous slang phrase in Japan. http://en.wikipedia.org/wiki/Touhou_Project
  • 3. Today’s topic is… Recipe for monitoring
  • 4.
  • 5. I expect that utilizing Munin will reduce various burdens on operations engineers.
  • 7. Why Am I Here? this is me • Masahito ZEMBUTSU @zembutsu – Solutions Engineer ( fiery zeal Otaku mind engineer ) • Working as a server infrastructure engineer. • I want to provide relaxation and rest for the engineers. (Operation/Monitoring/Automation) – Open source and cloud computing communities http://opencloud.jp/ http://jaws-ug.jp/ • My website http://pocketstudio.jp/ – Experience • April 2000 - Support engineer for server hosting and an ISP • May 2008 - Company internal network management and support • November 2010 - Service development and upper escalation operation • July 2012 – Operation, Development, Research at datacenter somewhere. Don’t mind the careful thing!
  • 8. filleting mackerel It resembles server operations. DevOps!
  • 9. ―Don't forget. always, somewhere, someone is fighting for you. ―As long as you remember her. you are not alone. Operation (Reference: “Puella Magi Madoka Magica” Episode 12 “My Very Best Friend” ) Monitoring
  • 10. This is an illustrative photo of a data center like the one where I work. This photo is under a Creative Commons license, by torkildr http://www.flickr.com/photos/torkildr/3462606643/sizes/l/in/photostream/
  • 11. A Dedicated Hosting Services A HUMAN WORK Shutdown Attack, An Unfamiliar Specifications, Cloudcomputing’s Arrival in Japan, Shape of Server, A Business That’s Changing, My Purest Heart for Our Customers. Troubleshooting DECISIVE BATTLE The Phone That Never Stop Ringing, The Day a Datacenter Stood Still, The Choice of Priority, In sickness unto shutdown, and…, Sales Representative’s Invasion, Customer’s office the Throne of Souls, Tears. You’re a loser only when you fail to try We can (not) advance. The Birth of Special task force, The Value of Miracles, At Least, Be Human.
  • 12. If you are a server administrator, you have probably thought this at least once…
  • 13. My Little Servers Can't Be This Heavy... But with Munin, you may find a solution to the problem.
  • 14. This is what I want to share today. • I think it is necessary to build resource monitoring into the operations workflow. • As a result, it may reduce the burden on administrators. I'm extremely happy. XD • What we need is a culture of leaving the office on time!! (Is it only me?)
  • 15. Agenda 1. What is Munin? 2. Munin’s Architecture 3. How to use Munin 4. Practical troubleshooting! 5. MY VERY BEST MONITORING TOOL
  • 16. I hope… 1. Let's obtain a weapon called “resource monitoring” for us. Wille zur Macht 2. We improve the efficiency of our working (server and network operations). “Let's find happiness together.” (Reference: Kiichi Goto, Patlabor: The Movie, 1989) I guess everybody's happy, that's fine.
  • 17. Munin User Group Japan Munin Community in Japan
  • 18. Munin.jp • Munin User Group Japan – http://munin.jp/ • Wiki – http://munin.jp/wiki/ • Demo – http://demo.munin.jp/ • How to join us – http://munin.jp/mailing-list/
  • 20. I dare to say, Munin is a resource monitoring tool. Munin is a networked resource monitoring tool. I dare to say!!
  • 21. This is the overview (front page) of Munin 2.0
  • 22. When you click any of them, graphs for the day / week / month / year are displayed
  • 23. This zooming function is convenient. It is useful for me to make a report.
  • 24. The vertical axis is servers, the horizontal axis is metrics. The grouping function is characteristic.
  • 25. The Melancholy of Server Administrators Monitoring is like a box of chocolates
  • 26. By the Time we Realized It, It Had Already Begun. • Troubles that alert systems can’t detect (increasing) – Mainly with Social Networking Service clients – By the time an alert threshold is exceeded, it is already too late. • Client demands – Rapid response – Because the loss per second is orders of magnitude larger than before. – A loss of several hundred dollars per minute :(
  • 27. “There is something weird, will you check servers? :)” A request from our customer • A very difficult request... – Clear cause identification often takes time. • I want to do my best even more! – Yes!! I stir myself and go to work. Administrators get exhausted… – I want to aim at improving the service, but this way of thinking is bad. Why? Let’s see the next slide.
  • 28. An old network configuration. One web server and one database server. It’s very simple!
  • 29. An old network configuration. A typical web server setup looked like this, with BIND at most. One web server and one database server. It’s very simple!
  • 30. On the other hand, at present…
  • 31. This Just Can't Be Right!! BIND The number of servers and of things to manage has increased compared with the past. Therefore support takes more time, and the degree of difficulty rises, too.
  • 32. Why did this happen? • Because the environment is changing – Network – Server – Software – Middleware – Application – etc
  • 33. Be freed from CONSOLE Ace Console: Fires of Liberation
  • 34. The most important thing in troubleshooting • Cause investigation work has top priority. “When we act, it is a first thing to do condition to notice. If there is a technique, anything cannot be settled. It becomes necessary to notice before a technique. The technical expert is in Japan no matter how much, but cannot be readily settled. The reason is because it does not notice.” Soichiro Honda (2008) "akku baran” (candidness ) PHP inc, 10pp. http://en.wikipedia.org/wiki/Soichiro_Honda
  • 35. You sure that’s enough armor(tools)? • “No problem. Everything’s fine.” – ps – top – vmstat – iostat – free – sar (sysstat) …etc Really?
  • 36. Past Present day
  • 37. Situation has changed. Past: • One or several servers • Apache, Sendmail, Perl • PostgreSQL, MySQL • Network appliance (sometimes) • No scale • Upgrading is effective. Now (present day present time, hahaha!!): • Plural servers in the same network (we assume) • Conventional software + nginx, Tomcat, ruby, PHP, Python, memcached, Key-Value Store, Hadoop, Cassandra, MongoDB…etc • The need for scalability • Upgrading is not effective. I think that one of the answers to this problem is resource monitoring using Munin.
  • 38. The essence of Munin is the visualization of many resources I Know What Your Server Did Last Summer
  • 39. MRTG has declined Is This MRTG? No, This Is Munin. We have lost a hero to our glorious and noble cause, but does this foreshadow our defeat? No. It is a new beginning. Compared to Cloud Computing Federation the national resources of Dedicated Server are less than one thirtieth of theirs. Despite this major difference, how is it that we have been able to fight the fight for so long? It is because our goal in this war is a righteous one. It’s been over fifty years since the elite of Cloud Computing, consumed by greed took control of the Cloud Computing Federation. We want our freedom. Never forget the times when the Federation has trampled us! We, the Principality of Dedicated Server, have had a long and arduous struggle to achieve freedom for all engineers of our great network. Our fight is sacred, our cause divine. My beloved brother, MRTG, was sacrificed. Why? The war is at a stalemate.
  • 40. Comparing resource monitoring tool http://www.munin-monitoring.org/ http://oss.oetiker.ch/rrdtool/ http://oss.oetiker.ch/mrtg/ I’m interested, too! http://www.zabbix.com/ http://ganglia.info/ http://www.cacti.net/ http://www.nagios.org/
  • 41. Comparing resource monitoring tool http://www.munin-monitoring.org/ http://oss.oetiker.ch/rrdtool/ http://oss.oetiker.ch/mrtg/ snmpd http://www.zabbix.com/ http://ganglia.info/ http://www.cacti.net/ http://www.nagios.org/ C
  • 42. Comparative table
    Tool name | Type | Datastore | Config | Web interface | Alerting
    Munin | Resource monitoring | RRDTool | CUI | Reference only |
    Cacti | Resource monitoring | RRDTool & MySQL | CUI/GUI | |
    MRTG | Resource monitoring | original | CUI | Reference only | ×
    Zabbix | IT infrastructure monitoring | MySQL, PostgreSQL, etc. | GUI | |
    Nagios | IT infrastructure monitoring | MySQL or PostgreSQL | CUI/GUI | |
    “We are friends all the time...”
    Each tool has both good points and bad points. My team uses Munin and a Nagios-based tool, each for what it does best.
  • 43. What is Munin? The Munin We Saw That Day
  • 44. About Munin Be alert! • http://munin-monitoring.org/ • Resource monitoring tool – Munin can analyze resource trends – “what just happened to kill our performance?” • Plug and Play architecture – It can monitor many items by default Munin is a networked resource monitoring tool that can help analyze resource trends and "what just happened to kill our performance?" problems. It is designed to be very plug and play. A default installation provides a lot of graphs with almost no work.
  • 45. Developers Munin project github • http://munin-monitoring.org/ • https://github.com/munin-monitoring Documents / FAQ / Trac / Wiki Repository / tools / plugins
  • 46. Progress in development • Community based – Github • https://github.com/munin-monitoring – Mailing list • https://lists.sourceforge.net/lists/listinfo/munin-users – IRC • irc://irc.oftc.net/#munin • License – GNU General Public License version 2 – There is no commercial support
  • 47. History • 2002 - project began – The original name was “LRRD” • 2004 - Munin 1.0 released – The name “munin-eye” was changed to “munin-node” – It took a long time, and day-to-day improvement continued • 2009 - Munin 1.4 released – I think this is probably the most widely used of the 1.x versions. • May 30, 2012 - Munin 2.0 (stable) released
  • 48. Where is the Japanese information? • NOT YET! • Let’s make it together now! – How about writing something on the wiki first? • http://munin.jp/wiki/ “Is the number of invitations to the Munin user group ZERO this week, too? Hmm? Do you have a mind to do it?” “I’m sorry, my apologies…”
  • 50. This Photo is under creative commons license http://en.wikipedia.org/wiki/File:Odin_hrafnar.jpg
  • 51. What obstacle factors there are!! Are you getting wise with me? Speaking munin-eye’s mind ( now, munin-node )
  • 52. Summary of the points • Munin is a resource monitoring tool. (GPL v2) • Simple and powerful architecture. • Munin frees us from the console. (effectiveness) • “Munin” means “memory”. You are never alone! Munin is always here for you 24x7x365
  • 54. An Amazing Simple Munin The Architecture Of Munin
  • 55. User’s viewpoint The structure is simple. The user refers to the data on the server.
  • 57. Client Server Model Munin Master And Munin Node
  • 58. Let's look at how it works in a little more detail.
  • 59. This is the data which we referred to a moment ago.
  • 60. This is the main work of the Munin master, a program executed by cron. It collects the data, checks thresholds, and generates HTML files and graphs, one step after another.
  • 61. This is “munin-node” agent. Munin Master acquires data via plugins.
  • 62. munin-update connects to munin-node. munin-node listens on TCP port 4949.
  • 63. Plugins are executed by munin-node; each plugin is a script that acquires some kind of data. munin-update stores the acquired data with RRDtool. Then munin-limits checks the thresholds.
  • 64. munin-graph and munin-html then generate graphs and HTML from the data (.rrd) stored by RRDtool.
  • 65. This flow is the basic operation of Munin. I think that it is really simple and cool!
  • 66.
  • 67. Constitution of Munin
    master ( SERVER ): • Perl Libs – Munin::Common • munin-cron – munin-update – munin-limits – munin-html – munin-graph • config: munin.conf
    munin-node ( CLIENT ): • Perl Libs – Munin::Common • munin-node – config: munin-node.conf – Plugins • Tools – munin-node-configure – munin-cron
  • 68. About data collection • munin-node collects various data. • Port 4949 (TCP) – Munin protocol • LIST • CONFIG • FETCH • VERSION • QUIT (T_T)4949 In Japanese, “4949” is an onomatopoeia for a "tearful face".
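The commands on this slide can be tried against a running munin-node. The following is only a minimal sketch, assuming the usual plain-text behaviour of the protocol: the node greets with a banner line, commands are sent in lowercase, and a `fetch` reply ends with a line containing a single dot. The function names `parse_values` and `fetch` are made up for this example and are not part of Munin.

```python
import socket


def parse_values(lines):
    """Turn 'field.value 1.23' lines from a fetch reply into a dict.

    Values are kept as strings because a node may answer 'U' (unknown).
    """
    values = {}
    for line in lines:
        name, _, raw = line.partition(" ")
        if name.endswith(".value"):
            values[name[: -len(".value")]] = raw.strip()
    return values


def fetch(plugin, host="127.0.0.1", port=4949, timeout=5.0):
    """Ask one munin-node for the current values of one plugin.

    Assumption: the node sends a banner first and terminates the
    fetch reply with a line that contains only '.'.
    """
    with socket.create_connection((host, port), timeout=timeout) as sock, \
            sock.makefile("rw", encoding="utf-8", newline="\n") as stream:
        stream.readline()                  # banner line from the node
        stream.write(f"fetch {plugin}\n")
        stream.flush()
        lines = []
        for line in stream:
            line = line.rstrip("\n")
            if line == ".":                # end of the reply
                break
            lines.append(line)
        stream.write("quit\n")
        stream.flush()
    return parse_values(lines)
```

For example, `fetch("load")` on a host that runs munin-node and allows connections from 127.0.0.1 should return a small dictionary with a `load` key. `telnet localhost 4949` shows the same conversation by hand.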
  • 69. Data storage and graph generation are the work of RRDtool
    • Data format is RRD (round robin database) – /var/lib/munin/<hostname>/<plugin’s name>.rrd
        -rw-r--r-- 1 munin munin 50612 Oct 18 2010 localhost-cpu-idle-d.rrd
        -rw-r--r-- 1 munin munin 50612 Oct 18 2010 localhost-cpu-iowait-d.rrd
        -rw-r--r-- 1 munin munin 50612 Oct 18 2010 localhost-cpu-irq-d.rrd
        -rw-r--r-- 1 munin munin 50612 Oct 18 2010 localhost-cpu-nice-d.rrd
        -rw-r--r-- 1 munin munin 50612 Oct 18 2010 localhost-cpu-softirq-d.rrd
        -rw-r--r-- 1 munin munin 50612 Oct 18 2010 localhost-cpu-steal-d.rrd
        -rw-r--r-- 1 munin munin 50612 Oct 18 2010 localhost-cpu-system-d.rrd
        -rw-r--r-- 1 munin munin 50612 Oct 18 2010 localhost-cpu-user-d.rrd
    • About 50 KB per RRD file – More than 200 KB per plugin (MUST) – 150 to 250 files per munin-node (about 8 to 15 MB per node in total)
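The disk figures on this slide are simple multiplication, and they can be checked with a few lines. This is a rough sketch only: the function name is invented for the example, 1 MB is taken as 1000 KB, and the 50 KB default comes from the file listing on the slide.

```python
def estimate_rrd_storage_mb(files_per_node, kb_per_file=50, nodes=1):
    """Estimate the disk space used by RRD files on the Munin master, in MB."""
    return files_per_node * kb_per_file * nodes / 1000


# 150 to 250 files per node at about 50 KB each
low = estimate_rrd_storage_mb(150)
high = estimate_rrd_storage_mb(250)
print(f"one node: {low} to {high} MB")
print(f"300 nodes: {estimate_rrd_storage_mb(200, nodes=300)} MB")
```

The result for one node lands in the same range as the 8 to 15 MB quoted on the slide. RRD files have a fixed size, so the total grows with the number of nodes and plugins, not with time.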
  • 70. generate graphs per plugin Munin Master And Munin Node
  • 71. Munin provides many plugins • System resources – CPU, memory, Load Average, disk, S.M.A.R.T… • Network – Traffic, SNMP, HTTP loadtime, TCP, UDP, ICMP… • Applications, middleware – Apache, Nginx, Sendmail, Postfix, MySQL, PostgreSQL, MongoDB, memcached, PHP… etc
  • 72. Ex) Load Average plugin
    • /etc/munin/plugins/load – “Load average” is the five-minute average – It’s a symbolic link • The original is /usr/share/munin/plugins/load – Simple shell script
        echo -n "load.value "
        cut -f2 -d' ' < /proc/loadavg
    • Output
        load.value 3.22
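A plugin only has to print `field.value number`, so the same idea can be written in other languages as well. Below is a minimal Python sketch of the same logic; the function names are invented for this example, and the real `load` plugin shipped with Munin is the shell script shown on the slide.

```python
def load_value(loadavg_text):
    """Build the plugin output line from the contents of /proc/loadavg.

    The second field is the five-minute load average, the same field
    that `cut -f2 -d' '` selects in the shell version.
    """
    fields = loadavg_text.split()
    return f"load.value {fields[1]}"


def read_load(path="/proc/loadavg"):
    """Read the kernel's load average file (Linux only) and format it."""
    with open(path) as f:
        return load_value(f.read())
```

On a Linux host, `print(read_load())` prints a line such as `load.value 3.22`, which is the format munin-node expects from a plugin.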
  • 73.
  • 74. #3 How to use Munin
  • 75. Munin setup! Make My Day
  • 76. Environment • Perl5 • OS – Linux • Source code ( version 2.0.6 ) • Binary Package – Red Hat Enterprise Linux family ( EPEL ) – Debian – openSUSE – MacOS X – Windows
  • 77. Setup flow • Install Munin and Perl libraries • Edit the config file ( munin.conf ) • Set up munin-node ( munin-node.conf ) • Check the graphs
  • 78. Case) Red Hat Enterprise Linux • Use EPEL*1(testing repository) package or source • procedure – 1. enabling EPEL – 2. “yum install munin” – 3. configure munin.conf – 4. turn on munin-node and setup – 5. check *1 Extra Packages for Enterprise Linux(EPEL) https://fedoraproject.org/wiki/EPEL
  • 79. Case) Debian / Ubuntu • Use apt (Debian PTS is testing) or Source • Procedure – 1. setting up Perl libraries (via apt-get) – 2. install munin – 3. configure munin.conf – 4. turn on munin-node and setup – 5. check
  • 80. Basic of the setting How To Configure
  • 81. Config files
    Munin Master: • /etc/munin/munin.conf – Host tree (targeting nodes) – Graph strategy • Cron or realtime generation – Paths • RRD files • logfiles
    munin-node: • /etc/munin/munin-node.conf – Access control • Host (IP address) • Network CIDR – Node’s hostname – Port number • Default: TCP 4949 (T_T) – Plugin’s option
  • 82. [munin.conf] set target node
        [GroupName;node1.pocketstudio.net]
            address 127.0.0.1
            use_node_name yes
  • 83. [munin-node.conf] Access control • allow ^127\.0\.0\.1$ – Regular expression (escape the dots) • cidr_allow 192.0.2.0/24 – Not a regular expression • If you change the file, you must restart munin-node!
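The difference between the two directives is easy to get wrong, so here is a small illustration. It is only a sketch of the matching rules, written with Python's `re` and `ipaddress` modules and not with munin-node's own Perl code; the helper names are invented for this example.

```python
import ipaddress
import re


def allowed_by_regex(pattern, client_ip):
    """'allow' style: the pattern is a regular expression on the address text."""
    return re.search(pattern, client_ip) is not None


def allowed_by_cidr(network, client_ip):
    """'cidr_allow' style: the address must belong to the given network."""
    return ipaddress.ip_address(client_ip) in ipaddress.ip_network(network)


# An escaped pattern matches only the exact address.
print(allowed_by_regex(r"^127\.0\.0\.1$", "127.0.0.1"))    # True
# With unescaped dots, each '.' matches any single character.
print(allowed_by_regex(r"^127.0.0.1$", "127a0b0c1"))       # True
print(allowed_by_cidr("192.0.2.0/24", "192.0.2.10"))       # True
print(allowed_by_cidr("192.0.2.0/24", "192.0.3.10"))       # False
```

For a whole subnet, the CIDR form is usually easier to read and harder to get wrong than a hand-written regular expression.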
  • 84. Basic of the plugin How To Configure plugin
  • 85. Basic knowledge of Munin plugins • The original files are here ( shell or perl scripts ) – /usr/share/munin/plugins/ • How to use – Make a symbolic link in /etc/munin/plugins – Configure munin-node.conf – Restart munin-node (MUST) – Check the graphs and HTML
  • 86. How to debug plugin • /usr/sbin/munin-run <plugin-name> – “--debug” shows more detail – behavior is same as munin-node – useful • Command line tool ( I made ) – muninwalk & muninget ; perl script https://github.com/zembutsu/muninwalk
  • 87. Plugins catalog How To Configure plugin
  • 88. Apache
    • Symbolic link
        # ln -s /usr/share/munin/plugins/apache_* /etc/munin/plugins/
    • munin-node.conf
        [apache_*]
        env.url http://127.0.0.1:%d/server-status?auto
        env.ports 80
    • httpd.conf
        ExtendedStatus On
        <Location /server-status>
            SetHandler server-status
            Order deny,allow
            Deny from all
            Allow from 127.0.0.1
        </Location>
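The `?auto` suffix makes mod_status answer in a machine-readable `Key: value` form, which is what the apache_* plugins read. The sketch below shows how such a reply could be parsed; it is an illustration only, the helper names are invented, and the numbers in the sample are made up.

```python
def status_url(port, template="http://127.0.0.1:%d/server-status?auto"):
    """Fill the %d placeholder of env.url with one port from env.ports."""
    return template % port


def parse_server_status(text):
    """Parse 'Key: value' lines from a server-status?auto reply into a dict."""
    status = {}
    for line in text.splitlines():
        key, sep, value = line.partition(": ")
        if sep:                      # skip lines without 'Key: value' form
            status[key] = value.strip()
    return status


# Sample reply with made-up numbers, for illustration only.
sample = """Total Accesses: 1234
Total kBytes: 5678
BusyWorkers: 3
IdleWorkers: 7
Scoreboard: _W_K......
"""
print(status_url(80))
print(parse_server_status(sample)["BusyWorkers"])
```

If the graphs stay empty, fetching the URL by hand from the node itself (for example with `curl`) is a quick way to check that the `<Location /server-status>` block really allows 127.0.0.1.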
  • 89. MySQL
    • Symbolic link
        # ln -s /usr/share/munin/plugins/mysql_* /etc/munin/plugins/
    • munin-node.conf
        [mysql*]
        env.mysqlopts -u root -pPASSWORD
        env.mysqladmin /usr/bin/mysqladmin
  • 90. BIND
    • Symbolic link
        # ln -s /usr/share/munin/plugins/bind9_rndc /etc/munin/plugins/
    • munin-node.conf
        [bind9_rndc]
        env.rndc /usr/sbin/rndc
        env.querystats /var/named/chroot/var/named/data/named_stats.txt
        user root
    • named.conf
        statistics-file "/var/named/data/named_stats.txt";
  • 91. Plugins I made Plugins Catalog
  • 93. Power generation of the electric power company (TEPCO)
  • 94. And the electricity consumption rate (TEPCO). Munin can turn anything into a plugin as long as it can be expressed as numbers.
  • 95. How to make plugin Take It Easy
  • 96. Sample case; httping plugin • http://www.vanheusden.com/httping/ • "httping" is a command-line tool that checks the response time of a Web server, like a “ping” command. • If you set the -S option, you can see the connect time and the processing time separately.
        $ httping -S http://210.239.46.254/
        PING 210.239.46.254:80 (http://210.239.46.254/):
        connected to 210.239.46.254:80 (380 bytes), seq=0 time=0.10+0.69=0.79 ms
        connected to 210.239.46.254:80 (380 bytes), seq=1 time=0.08+0.47=0.55 ms
        connected to 210.239.46.254:80 (380 bytes), seq=2 time=0.07+0.68=0.75 ms
        connected to 210.239.46.254:80 (380 bytes), seq=3 time=0.12+0.66=0.77 ms
        Got signal 2
        --- http://210.239.46.254/ ping statistics ---
        4 connects, 4 ok, 0.00% failed
  • 97. Plugin: httping_
        #!/bin/sh
        #
        # Plugin to monitor HTTP response (httping)
        #%# family=auto
        #%# capabilities=autoconf

        URL=${URL:-"http://localhost/"}
        COUNT=${COUNT:-"5"}
        httping_bin=$(which httping)

        if [ "$1" = "autoconf" ]; then
            echo yes
            exit 0
        fi

        # Define graphing
        if [ "$1" = "config" ] ; then
            echo "graph_args -r --lower-limit 0 ";
            echo "graph_title http response $URL";
            echo "graph_category httping";
            echo "graph_info httping response time: $URL";
            echo 'graph_vlabel msec'
            echo "connect.label connect time"
            echo "connect.draw AREA"
            echo "connect.type GAUGE"
            echo "processing.label processing time"
            echo "processing.draw STACK"
            echo "processing.type GAUGE"
            exit
        fi

        # format for httping 1.5.3 http://www.vanheusden.com/httping/
        $httping_bin -c $COUNT -G -S $URL | tr '+|=' ' ' | awk '{connect+=$9; processing+=$10} END{print "connect.value",connect/'$COUNT'"\n""processing.value",processing/'$COUNT'}'
    This is the body of the httping plugin; the file itself is a simple shell script. It contains the definition of the graph and the commands that actually acquire the values. The point is only to acquire data, so a plugin can be written in any language, including Perl and PHP. The output format is “xxx.value ***”.
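To show that the language really does not matter, here is the averaging step of the plugin as a Python sketch. It is an illustration and not the plugin from this deck: the function names are invented, it assumes the `time=connect+processing=total ms` line format of the httping 1.5.3 output shown earlier, and unlike the awk one-liner it divides by the number of replies actually received instead of by COUNT.

```python
import re

# Matches e.g. "time=0.10+0.69=0.79 ms" in a line of `httping -S` output.
_TIME = re.compile(r"time=([\d.]+)\+([\d.]+)=([\d.]+) ms")


def average_times(output):
    """Return (average connect ms, average processing ms), or None if no replies."""
    connect, processing = [], []
    for line in output.splitlines():
        match = _TIME.search(line)
        if match:
            connect.append(float(match.group(1)))
            processing.append(float(match.group(2)))
    if not connect:
        return None
    return sum(connect) / len(connect), sum(processing) / len(processing)


def munin_lines(output):
    """Format the averages as plugin output; 'U' marks an unknown value."""
    averages = average_times(output)
    if averages is None:
        return ["connect.value U", "processing.value U"]
    return [f"connect.value {averages[0]:.3f}",
            f"processing.value {averages[1]:.3f}"]
```

A complete plugin would still need the `autoconf` and `config` branches from the shell version, and it would run httping itself, for example with `subprocess.run`.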
  • 98. Config: httping_
    • /etc/munin/plugin-conf.d/httping
        [httping_localhost]
        env.URL http://pocketstudio.jp/
        env.COUNT 5

        [httping_blog]
        env.URL http://pocketstudio.jp/log3/
        env.COUNT 5

        [httping_node1]
        env.URL http://node1.pocketstudio.net/
        env.COUNT 5
    • # ln -s /usr/share/munin/plugins/httping_ /etc/munin/plugins/httping_localhost
  • 99. httping live demo • http://demo.munin.jp/munin2/httping-day.html
    • This server has no problem with either response time or processing time.
    • For this server group, the blue part (processing time) is large: a certain CMS takes up the processing time. On the other hand, I can see that the network is fine.
  • 101. Never say never. • Agility is the pivot of the service (in my case) – LOOK OUT for trouble, find its cause and solve it • Hardware or software or network? – We need to investigate promptly • where the problem is happening
  • 102. Live Munin demo • http://demo.munin.jp/ – Let's observe the resource situation through this Munin demonstration site. • Where is the bottleneck? Or where will it be? • Even if you do not log in to a server, you can look at many of its resources.
  • 103. This Just Can’t Be Right A Real Troubleshooting
  • 104. Case) Identifying unauthorized access • By the Time we Realized It, It Had Already Begun. • Situation – 1. Error emails began to arrive at postmaster – 2. The monitoring tool raised no alert – 3. So I first checked the resources in Munin – 4. From the situation, I identified that the CMS had a vulnerability and acted promptly. Thanks to Munin, I was able to do all of the above in a short time.
  • 105. How to find it. Sendmail’s queue rose suddenly. Load Average had no problem.
  • 106. I confirmed the time when the traffic was strange. MySQL’s queries rose suddenly, too. From this situation, I suspected unauthorized access to the CMS. In fact, when I investigated the logs for that time, I found an attack on a specific URL. Identifying the cause and acting on it would have taken more time if I had not used Munin.
  • 107. #5 My Bery Vest Monitoring tool
  • 108. New Features of Munin 2.0 I think that Munin is truly wonderful
  • 109. Munin 2.0 has new features! • Better UI and CGI integration – New look, Graph Zooming, FastCGI • Asynchronous I/O support – Better performance • Native SSH transport – secure (port 22) & easy setup • Asynchronous proxy support – async-server substitutes for munin-node • And more… – https://github.com/munin-monitoring/munin/blob/devel/Announce-2.0
  • 110. [RemoteNetwork;backend-DB]
        address ssh://210.239.46.254/opt/munin/lib/munin-async-client --spooldir /var/opt/munin/spool/192.168.0.230 --spoolfetch
        use_node_name yes
  • 111. No munin, No Troubleshoot. I'm Not Afraid of Anything Anymore
  • 112. Munin changed my support flow (my case) • If I don’t use tools – Troubleshooting means running various commands (sysstat) and investigating log files. – But this method needs a long time and many people, and it is bad for the service. • If I use Munin (now) – Even if I do not log in, I can understand the situation. – I can judge abnormality visually • “I see the ending of this troubleshooting!” – Agile Support • Troubleshooting that follows Plan-Do-Check-Action (PDCA) cycles.
  • 113. In my dedicated server hosting work • I really depend on Munin – I always set up Munin. – Munin is on almost all of the several hundred servers which I manage directly. – I think that Munin is indispensable to improving our service quality. “Neat” “I cannot part with Munin for my work.” “You believe it!” “BAM BAM!”
  • 115. Troubleshoot PDCA (Law of Cycles) • Plan: Detecting problem and situation “What are these alerts? For real?” “Presage!!”
  • 116. Troubleshoot PDCA (Law of Cycles) • Plan: Detecting problem and situation • Do: Suppose a cause “What are these alerts? For real?” “OK, Munin. Please tell me where the trouble lies hidden.” “Presage!!” “Fire!” “Please stop!!”
  • 117. Troubleshoot PDCA (Law of Cycles) • Plan: Detecting problem and situation • Do: Suppose a cause • Check: To check resources remotely “What are these alerts? For real?” “OK, Munin. Please tell me where the trouble lies hidden.” “Presage!!” “Fire!” “I just talk about what I just looked in Munin!!” “Please stop!!”
  • 118. Troubleshoot PDCA (Law of Cycles) • Plan: Detecting problem and situation • Do: Suppose a cause • Check: To check resources remotely • Action: Log in and execute commands “What are these alerts? For real?” “Wow!” “click-clack click-clack” “OK, Munin. Please tell me where the trouble lies hidden.” “Presage!!” “Fire!” “I just talk about what I just looked in Munin!!” “Please stop!!”
  • 119. You are never alone! Munin is always here for you 24 x 7 x 365 The Only Thing I Have Left To Guide Me
  • 120. Munin’s overview ・Munin is a resource monitoring tool that specializes in helping you notice problems through visualization. ・Simple architecture, and many plugins. ・It is most suitable for systems that need quick support in a short time.
  • 121. Conclusion * This is my personal impression. No munin, No Operation. While there’s Munin, there’s hope. MY VERY BEST MONITORING TOOL. Thank you for MUNIN. Good-bye to MRTG.
  • 122. I wish… • If my presentation made you interested in Munin, I would appreciate it if you tried it. • Tomorrow is another day. Up to you. Squidn’t you use Munin? (Shouldn’t)
  • 123. Questions? • Do you have any questions about Munin? I'm glad you asked. As a reward, I will give you the right to buy Opoona. (But it is in the bargain bin...)
  • 124. References
    • Munin – http://munin-monitoring.org/
    • Munin User Group Japan – http://munin.jp/ – http://munin.jp/wiki/
    • Website
      – Waiting for Munin 2.0 – Introduction – Personal Workflow Blog • http://blog.pwkf.org/post/2010/06/Waiting-for-Munin-2.0-Introduction
      – /tags/2.0.0/ChangeLog – Munin – Trac • http://munin-monitoring.org/browser/tags/2.0.0/ChangeLog
    Please send feedback to zem@pocketstudio.jp or @zembutsu ( twitter ). Thank you for reading!