SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
Christian Ehrhardt
IBM Research and Development




  Red Hat Enterprise Linux on IBM System z
  Performance Evaluation




                                             © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Agenda

 Performance Evaluation Results
   – Environment
   – Noteworthy improvements
   – Changes you should be aware of

 Performance evaluation Summary
   – Improvements and degradations per area
   – Summarized comparison




2/24                                                              © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary



Environment
 Host – System z10 (2097-E269)                                    HW-Platform
       –   FICON 4 Gbps                                              – Linux on LPAR
       –   FCP 4 Gbps                                                – Linux in z/VM 5.4 guest
       –   HiperSockets
       –   OSA Express 3 1GbE + 10GbE
                                                                   Verified on
 Storage – DS8300 (2107-922 )                                       – System z Enterprise (z196)
       – HDD's                                                       – Linux in z/VM 6.1
       – FICON 4 Gbps
       – FCP 4 Gbps




3/24                                                                                                © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Compared Versions

 Compared Set
   – RHEL5 U4 (2.6.18-194.el5)
   – RHEL6-GA (2.6.32-71.el6)
   – RHEL6-GA + tuning
            • our recommended tuning (in RH Tech Notes)
            • workarounds for known issues

 Measurements
   – Base regression set covering most customer use cases as good as possible
   – Focus on areas where performance issues are more likely
   – Just the top level summary, based on thousands of comparisons
   – Special case studies for non-common features and setups

 Terminology
   – Throughput – “How much could I transfer once?”
   – Latency – “How long do I have to wait for event X?”
   – Normalized cpu consumption - “How much cpu per byte/transaction/... do I need?”
4/24                                                                            © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary



New process scheduler (CFS)


 Goals of CFS
       – Models “ideal, precise multi-tasking CPU”
       – Fair scheduling based on virtual runtime

 Changes you might notice when switching from O(1) to CFS
       –   Lower response times for I/O, signals, …
       –   Balanced distribution of process time-slices
       –   Improved distribution across processors
       –   Shorter consecutive time-slices
       –   More context switches

 Improved balancing
       – Topology support can be activated via the topology=on kernel parameter
       – This makes the scheduler aware of the cpu hierarchy

 You really get something from fairness as well
       – Improved worst case latency and throughput
       – By that CFS can ease QoS commitments
5/24                                                                              © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Benchmark descriptions - File system / LVM / Scaling

 Filesystem benchmark dbench
       – Emulation of Netbench benchmark
       – Generates file system load on the Linux VFS
       – Does the same I/O calls like smbd server in Samba (without networking calls)

 Simulation
       –   Workload simulates client and server (Emulation of Netbench benchmark)
       –   Mainly memory operations for scaling
       –   Low main memory and lvm setup for mixed I/O and lvm performance
       –   Mixed file operations workload for each process: create, write, read, append, delete
       –   8 CPUs, 2GB memory and scaling from 4 to 62 processes (clients)
       –   Measures throughput of transferred data




6/24                                                                                              © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




File system benchmark – process scaling
                                                                           RHEL5-U4                                                                                    RHEL5-U4
                                    Throughput (16 CPU)                    RHEL6-GA                                             CPU consumption (16 CPU)               RHEL6-GA




                                                                                      Total cpu in micro s econds per MiB
                                  6000
   throughput by dbench in MB/s




                                  5000
                                  4000




                                                                                                                       better
                                  3000
                                  2000
                                  1000
                                    0
                                         1   4    8   12    16   20   26   32   40                                               1   4      8    12    16   20   26   32   40

                                             number of proces s es                                                                       number of proces s es


 Improved scalability
                            – Especially improves large workloads
                            – Lower cross process deviation improves QoS
 Increased CPU consumption due to
                            – CFS is striving for better interactivity and fairness
                            – Changes affecting the writeback of dirty pages
                                         • Rule of thumb – now about twice as aggressive
                                         • One might want to tune dirty ratios in /proc/sys/vm/dirty_*


7/24                                                                                                                                                                        © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Benchmark descriptions - Java

 evaluates server side Java
       – 3-tier system
                     • Random input from user
                     • Middle tier business logic implemented in Java
                     • No explicit database --> emulated by Java objects
                     • Scales warehouses

 stressed components
       – Java
                   • Virtual Machine (VM)
                   • Just-In-Time compiler (JIT)
                   • Garbage Collection (GC)
       – Linux operating system
                   • Threads
                   • Scheduler
                   • Caches and Memory




8/24                                                                       © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Java (31/64 Bit)
                                     Throughput
 SPECjbb2005 bops




                                                                                           64b-RH5-U4
                                                                                           64b-RH6-GA
                                                                                           64b-RH6-GA-tuned
                                                                                           31b-RH5-U4
                                                                                           31b-RH6-GA
                                                                                           31b-RH6-GA-tuned




                    6   7   8   9   10     11     12     13       14   15   16   17   18


                                      Number of Warehouses

       64b – RHEL6-GA -4.8% vs RHEL5-U4; tuned at least only -3.8%
       31b – RHEL6-GA almost equal to RHEL5-U4, with tuning even +2.5%
       This is caused by a bit of over-optimization for desktop latency in the new scheduler
       System z recommended tunables are not set by default, but are part of the RHEL6 tech notes

9/24                                                                                                   © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Benchmark descriptions - Webserving

 Webserver Benchmark
        – Static website content read
        – Variable number of connections
        – Measures throughput via network connection

 Server side
        – Apache
        – HTML content

 Client side
        – 3 clients connected to webserver
        – Number of active requests scaled from 1 to 20 connections per client




10/24                                                                            © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Webserving – example for improved CPU scaling
                                                                               RHEL5-U4
                                 Simple Webserving Workload (60 Clients)       RHEL6-GA
                                                                               RHEL6-GA-tuned
 Page Elements / sec




                                1 CPU                 2 CPU                4 CPU


                                               Number of Processes

 Improved CPU scalability compared to RHEL5-U4
                       – Again the recommended scheduler tuning can add a further improvement
 Additional CFS effect
                       – Lower worst-case response time



11/24                                                                                           © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Benchmark descriptions - Network
 Network Benchmark which simulates several workloads

 Transactional Workloads
        – 2 types
                    • RR – A connection to the server is opened once for a 5 minute time frame
                    • CRR – A connection is opened and closed for every request/response
        – 4 sizes
                    • RR 1x1 – Simulating low latency keepalives
                    • RR 200x1000 – Simulating online transactions
                    • RR 200x32k – Simulating database query
                    • CRR 64x8k – Simulating website access

 Streaming Workloads – 2 types
        – STRP/STRG – Simulating incoming/outgoing large file transfers (20mx20)

 All tests are done with 1, 10 and 50 simultaneous connections
 All that across on multiple connection types (different cards and MTU configurations)


12/24                                                                                            © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Network Throughput
   10 Gigabit Ethernet OSA Express3 MTU 8992                                     Vswitch Guest to Guest MTU 1492
                                                               RHEL5-U4                                                                     RHEL5-U4
                                                               RHEL6-GA                                                                     RHEL6-GA
   Trans actions per Second




                                                                                  Trans actions per Second
                                      1             10           50
                                  1x1           1x1          1x1                                                   k1           10
                                                                                                                                           2k
                                                                                                                                              50
                                                                                                                x32          2k
                                                                                                             200          0x3          0x3
                                                                                                                        20           20
                                              Workload                                                                   Workload

 Connection scaling is good - parallel scenarios improved a lot
                              – Up by +6% to +29% for 50 concurrent connections
                              – Workloads with larger transferred sizes actually benefit a bit more
 Single connection Latency can be an issue
                              – 1x1 is shown here as it forces max overhead and latency per transferred byte
                              – Down by -5% to -15% on single connections
 Large improvements for vswitch guest to guest
                              – Up to +54% using 50 concurrent connections with MTU 1492 and 200x32k workload
13/24                                                                                                                                              © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Network CPU consumption
   10 Gigabit Ethernet OSA Express 3 MTU 8992 Client                                   Vswitch guest to guest 1492 Client
                                                                           RHEL5-GA                                                                        RHEL5-U4
                                                                           RHEL6-GA                                                                        RHEL6-GA




                                                                                         CPU cons umption per Trans action
        CPU cons umption per Trans action




                                                    1           10           50                                                    k1           10           50
                                                1x1         1x1          1x1                                                    x32          2k           2k
                                                                                                                             200          0x3          0x3
                                                                                                                                        20           20
                                                          Workload                                                                       Workload

 CPU consumption increased for a lot of workloads (roughly 2/3 of the types we distinguish)
        –                                   The range of is from +35% improvement down to -44% degradation
        –                                   Most loads degrade by an average around -15%
        –                                   This is part of the trade-offs for better scalability
        –                                   Also partially a scheduler/caching effect
 Some improvements for loads with large mtu's
        – Usually seen on the sender side
        – That implies it is beneficial for data sources not for data sinks
14/24                                                                                                                                                             © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Benchmark descriptions - Disk I/O

 Workload
        –   Threaded I/O benchmark
        –   Each process writes or reads to a single file, volume or disk
        –   Benchmark can be configured to run with and without page cache (direct I/O)
        –   Operating modes: Sequential write/rewrite/read + Random write/read

 Setup
        –   Main memory was restricted to 256MB
        –   File size (overall): 2GB, Record size: 64kB
        –   Scaling over 1, 2, 4, 8, 16, 32, 64 processes
        –   Sequential run: write, rewrite, read
        –   Random run: write, read (with previous sequential write)
        –   Once using page cache and once using Direct I/O (bypass the page cache)
        –   Sync and Drop Caches prior to every invocation to reduce noise




15/24                                                                                     © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary

                                                                                            Throughput for seq. readers


Page cache based disk I/O read issue




                                                                                    Throughput
                                                                                                 1   2      4     8    16    32
                                                                                                     number of proces s es
 Caused as corner case by memory management “improvements”
 Real World - Backups
   – It can hold a lot of data to scan by the backup software (→ a lot of seq. read)
   – A lot of data is usually split across many discs on s390 (→ concurrent access)
   – Overcommitment/ballooning effects or sized too small (→ memory constraint)
                           Systems              “protected”         “easily freeable”
                                                                                                      Sizing/ballooning
                         base footprint          (new part)       Need for read ahead allocs

 Detection
   – Most workloads won't see the impact or more than that benefit from these changes
   – Check sysstat which should report a huge amount of pgscand/s
   – Run “sync; echo 3 > /proc/sys/vm/drop_caches”
               • Should hurt throughput, huge improvements mean you are probably affected
 Workarounds other than “more memory”
   – Drop caches if there is a single time this happens (e.g. on nightly backup)
   – Use direct I/O or shrink read ahead if applicable
   – Fix got upstream accepted in 2.6.37-rc1

16/24                                                                                                       © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Disk I/O – New FICON features

 HyperPAV
   – Avoid subchannel busy
   – Automatic management of subchannel assignment/usage
   – No need of multipath daemon
   – Especially useful for concurrent disk accesses

 Read-Write Track Data
   – Allows to read/write up to a full track in one command word
   – Especially useful for huge requests and streaming sequential loads

 High Performance Ficon
   – New metadata format reduces overhead
   – Especially useful for small requests




17/24                                                                     © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary



Disk I/O – FICON – HyperPAV
                                                       FICON                                                                  FICON
             Throughput sequential writers             FICON+HPAV                Throughput sequential readers                FICON+HPAV
Throughput




                                                                    Throughput
                 1    2     4       8      16     32     64                          1    2     4       8      16        32     64

                          number of proces s es                                               number of proces s es

                                                                                                                              FICON
 Using 4 disks (4 ranks) with 3 aliases per rank                                Throughput random readers                    FICON+HPAV


 Without PAV/HyperPAV
              – Access could become contented (subchannel busy)
              – Throughput stays constant >1 proc per disk

                                                                    Throughput
 Solution: Having multiple subchannels per device
              – PAV: Aliases for devices
              – HyperPAV: Pool of aliases defined per rank
              – Throughput increased up to 3.5 x in our scenario                     1    2     4       8      16        32     64
→ Usage of HyperPAV can be highly recommended                                                 number of proces s es


18/24                                                                                                                 © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Disk I/O – FICON – effect of RWTD/HPF – Throughput

                                                               RHEL5-U4                                                               RHEL5-U4
                                                               RHEL6-GA                                                               RHEL6-GA
                  Throughput for initial writers                                           Throughput for readers
  Throughput in KB/s




                                                                          Throughput in KB/s
                          1   2    4       8      16      32     64                            1     2    4       8      16      32     64

                                  number of proces s es                                                  number of proces s es




 IOzone sequential write/read using direct I/O
                       – Huge throughput improvements
                                   • Write throughput up to 26%
                                   • Read throughput up to 82%
                       – Normalized I/O consumption stays about the same
                                   • despite the much larger throughput


19/24                                                                                                                                        © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Disk I/O – FICON – effect of RWTD/HPF – random workloads

                                                                RHEL5-U4                                                               RHEL5-U4
                                                                RHEL6-GA                                                               RHEL6-GA
                   Throughput for random writers                                            Throughput for random readers
  Throughput in KB/s




                                                                           Throughput in KB/s
                          1    2    4       8      16      32     64                            1    2     4       8      16      32     64

                                   number of proces s es                                                  number of proces s es




 IOzone random write/read using direct I/O
                       – Huge throughput improvements
                                   • Read throughput up to +81%
                                   • Write throughput up to +23%




20/24                                                                                                                                         © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary



Hints - General


 Cgroup memory support
        –   This is a feature coming with newer kernels
        –   Recommended by some management tools to enforce very customizable memory constraints
        –   Has a rather large footprint by consuming 1% of the memory
        –   Activated by default
        –   In a consolidation environment it is actually 1% multiplied by your virtual/real ratio
        –   Not pageable by linux, but fortunately by z/VM

        – This can be overridden with a kernel parameter (reboot required):
                        cgroup_disable=memory




21/24                                                                                      © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Improvements and Degradations of RHEL6 per area

                         vs. RHEL5-U4
    Improvements                  Degradations
    FICON I/O                     CPU consumption*
    Process scaling               OSA single C. Latency
    CPU scaling                   I/O corner cases via page cache*
    Compiler
    Multiconn. Networking
    Disk I/O via page cache




 Improvements in almost every area
    – Especially for large workloads
 Degradations for corner cases and cpu consumption
      * = Partially or completely avoidable due to tunings/workarounds as shown before



22/24                                                                            © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




Summary


 RHEL 6 performance is good
   – With some trade-offs roughly equal to RHEL5-U4
              • A common trade-off is increased cpu consumption for better scalability
   – Our recommended tunings/workarounds help in some known cases
              • Upcoming RHEL6.1 will further reduce the amount of manual tuning needed
   – Almost generally recommendable
              • An exception are very cpu consumption sensitive environments
              • Here upgrades have to be considered carefully

 Improvements and degradations
        Base            New                      Improved         No difference   Degraded
                                                                   or Trade-off

        RH5U4           RH6                           27               22           33
        RH5U4           RH6 tune & w.                 34               48            0

23/24                                                                                        © 2011 IBM Corporation
Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary




References

 Some links the presentation referred to
        – Linux on System z – Tuning hints and tipps
          http://www.ibm.com/developerworks/linux/linux390/perf/index.html
        – Live Virtual Classes for z/VM and Linux
          http://www.vm.ibm.com/education/lvc/
        – Red Hat Enterprise Linux 6 Tech Notes
          http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html-single/Technical_Notes/index.html




                                                               Christian Ehrhardt         Schönaicher Strasse 220
                                                               Linux on System z
                                                                                          71032 Böblingen, Germany
                                                               Performance and Evaluation
                                                                                                       ehrhardt@de.ibm.com




IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions
worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at
“Copyright and trademark information” at www.ibm.com/legal/copytrade.shtml.
24/24                                                                                                                           © 2011 IBM Corporation

Contenu connexe

Tendances

DB2 for z/OS and DASD-based Disaster Recovery - Blowing away the myths
DB2 for z/OS and DASD-based Disaster Recovery - Blowing away the mythsDB2 for z/OS and DASD-based Disaster Recovery - Blowing away the myths
DB2 for z/OS and DASD-based Disaster Recovery - Blowing away the mythsFlorence Dubois
 
IDUG NA 2014 / 11 tips for DB2 11 for z/OS
IDUG NA 2014 / 11 tips for DB2 11 for z/OSIDUG NA 2014 / 11 tips for DB2 11 for z/OS
IDUG NA 2014 / 11 tips for DB2 11 for z/OSCuneyt Goksu
 
Lessons learned from Isbank - A Story of a DB2 for z/OS Initiative
Lessons learned from Isbank - A Story of a DB2 for z/OS InitiativeLessons learned from Isbank - A Story of a DB2 for z/OS Initiative
Lessons learned from Isbank - A Story of a DB2 for z/OS InitiativeCuneyt Goksu
 
Spectrum Scale Best Practices by Olaf Weiser
Spectrum Scale Best Practices by Olaf WeiserSpectrum Scale Best Practices by Olaf Weiser
Spectrum Scale Best Practices by Olaf WeiserSandeep Patil
 
Using Release(deallocate) and Painful Lessons to be learned on DB2 locking
Using Release(deallocate) and Painful Lessons to be learned on DB2 lockingUsing Release(deallocate) and Painful Lessons to be learned on DB2 locking
Using Release(deallocate) and Painful Lessons to be learned on DB2 lockingJohn Campbell
 
Reconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
Reconfigurable Coprocessors Synthesis in the MPEG-RVC DomainReconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
Reconfigurable Coprocessors Synthesis in the MPEG-RVC DomainMDC_UNICA
 
Db2 analytics accelerator technical update
Db2 analytics accelerator  technical updateDb2 analytics accelerator  technical update
Db2 analytics accelerator technical updateCuneyt Goksu
 
FlashCopy and DB2 for z/OS
FlashCopy and DB2 for z/OSFlashCopy and DB2 for z/OS
FlashCopy and DB2 for z/OSFlorence Dubois
 
Robert Hensel resume v111416 Lnkedin
Robert Hensel resume v111416 LnkedinRobert Hensel resume v111416 Lnkedin
Robert Hensel resume v111416 LnkedinBob Hensel
 
Solaris cluster customer presentation
Solaris cluster customer presentationSolaris cluster customer presentation
Solaris cluster customer presentationxKinAnx
 
Flex your Database on 12c's Flex ASM Flex Cluster
Flex your Database on 12c's Flex ASM Flex ClusterFlex your Database on 12c's Flex ASM Flex Cluster
Flex your Database on 12c's Flex ASM Flex ClusterALI ANWAR, OCP®
 
Quattor
QuattorQuattor
QuattorInria
 
Db2 for z/OS and FlashCopy - Practical use cases (June 2019 Edition)
Db2 for z/OS and FlashCopy - Practical use cases (June 2019 Edition)Db2 for z/OS and FlashCopy - Practical use cases (June 2019 Edition)
Db2 for z/OS and FlashCopy - Practical use cases (June 2019 Edition)Florence Dubois
 
Open world exadata_top_10_lessons_learned
Open world exadata_top_10_lessons_learnedOpen world exadata_top_10_lessons_learned
Open world exadata_top_10_lessons_learnedchet justice
 

Tendances (15)

DB2 for z/OS and DASD-based Disaster Recovery - Blowing away the myths
DB2 for z/OS and DASD-based Disaster Recovery - Blowing away the mythsDB2 for z/OS and DASD-based Disaster Recovery - Blowing away the myths
DB2 for z/OS and DASD-based Disaster Recovery - Blowing away the myths
 
IDUG NA 2014 / 11 tips for DB2 11 for z/OS
IDUG NA 2014 / 11 tips for DB2 11 for z/OSIDUG NA 2014 / 11 tips for DB2 11 for z/OS
IDUG NA 2014 / 11 tips for DB2 11 for z/OS
 
Lessons learned from Isbank - A Story of a DB2 for z/OS Initiative
Lessons learned from Isbank - A Story of a DB2 for z/OS InitiativeLessons learned from Isbank - A Story of a DB2 for z/OS Initiative
Lessons learned from Isbank - A Story of a DB2 for z/OS Initiative
 
Spectrum Scale Best Practices by Olaf Weiser
Spectrum Scale Best Practices by Olaf WeiserSpectrum Scale Best Practices by Olaf Weiser
Spectrum Scale Best Practices by Olaf Weiser
 
Using Release(deallocate) and Painful Lessons to be learned on DB2 locking
Using Release(deallocate) and Painful Lessons to be learned on DB2 lockingUsing Release(deallocate) and Painful Lessons to be learned on DB2 locking
Using Release(deallocate) and Painful Lessons to be learned on DB2 locking
 
RHEL roadmap
RHEL roadmapRHEL roadmap
RHEL roadmap
 
Reconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
Reconfigurable Coprocessors Synthesis in the MPEG-RVC DomainReconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
Reconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
 
Db2 analytics accelerator technical update
Db2 analytics accelerator  technical updateDb2 analytics accelerator  technical update
Db2 analytics accelerator technical update
 
FlashCopy and DB2 for z/OS
FlashCopy and DB2 for z/OSFlashCopy and DB2 for z/OS
FlashCopy and DB2 for z/OS
 
Robert Hensel resume v111416 Lnkedin
Robert Hensel resume v111416 LnkedinRobert Hensel resume v111416 Lnkedin
Robert Hensel resume v111416 Lnkedin
 
Solaris cluster customer presentation
Solaris cluster customer presentationSolaris cluster customer presentation
Solaris cluster customer presentation
 
Flex your Database on 12c's Flex ASM Flex Cluster
Flex your Database on 12c's Flex ASM Flex ClusterFlex your Database on 12c's Flex ASM Flex Cluster
Flex your Database on 12c's Flex ASM Flex Cluster
 
Quattor
QuattorQuattor
Quattor
 
Db2 for z/OS and FlashCopy - Practical use cases (June 2019 Edition)
Db2 for z/OS and FlashCopy - Practical use cases (June 2019 Edition)Db2 for z/OS and FlashCopy - Practical use cases (June 2019 Edition)
Db2 for z/OS and FlashCopy - Practical use cases (June 2019 Edition)
 
Open world exadata_top_10_lessons_learned
Open world exadata_top_10_lessons_learnedOpen world exadata_top_10_lessons_learned
Open world exadata_top_10_lessons_learned
 

En vedette

Evaluation of performance appraisal system
Evaluation of performance appraisal systemEvaluation of performance appraisal system
Evaluation of performance appraisal systembarnesali609
 
Performance appraisal evaluation
Performance appraisal evaluationPerformance appraisal evaluation
Performance appraisal evaluationjuniecarter5
 
Evaluation of performance appraisal
Evaluation of performance appraisalEvaluation of performance appraisal
Evaluation of performance appraisalbutterangela123
 
Performance evaluation appraisal
Performance evaluation appraisalPerformance evaluation appraisal
Performance evaluation appraisalmariavernon59
 
Performance appraisal system cdac
Performance appraisal system cdacPerformance appraisal system cdac
Performance appraisal system cdacKamleshwar Pokhrail
 
Presentation on Performance Appraisal System
Presentation on Performance Appraisal SystemPresentation on Performance Appraisal System
Presentation on Performance Appraisal SystemPawan Bahuguna
 
Инструкция по управлению виджетом
Инструкция по управлению виджетомИнструкция по управлению виджетом
Инструкция по управлению виджетомmonpages
 
MBANY-Mitigation Risks in Construction Lending
MBANY-Mitigation Risks in Construction LendingMBANY-Mitigation Risks in Construction Lending
MBANY-Mitigation Risks in Construction LendingBrooks S. Clark
 
Multi Rug Ranges By Louis De Poortere
Multi Rug Ranges By Louis De Poortere Multi Rug Ranges By Louis De Poortere
Multi Rug Ranges By Louis De Poortere The Rug Shop UK
 
Conferencia que camino elijo para llegar a mimeta
Conferencia que camino elijo para llegar a mimetaConferencia que camino elijo para llegar a mimeta
Conferencia que camino elijo para llegar a mimetaJavier Fernández Higarza
 
LED LIGHT catalogue
LED LIGHT catalogueLED LIGHT catalogue
LED LIGHT catalogueSherry Cao
 
Democracia en América. La sombra de la desigualdad.
Democracia en América. La sombra de la desigualdad.Democracia en América. La sombra de la desigualdad.
Democracia en América. La sombra de la desigualdad.stalinjap
 
Acuerdo de Trabajo y Estrategia de Acción del LLECE. Carta compromiso ministe...
Acuerdo de Trabajo y Estrategia de Acción del LLECE. Carta compromiso ministe...Acuerdo de Trabajo y Estrategia de Acción del LLECE. Carta compromiso ministe...
Acuerdo de Trabajo y Estrategia de Acción del LLECE. Carta compromiso ministe...eraser Juan José Calderón
 
Hospitales líquidos Guatemala
Hospitales líquidos GuatemalaHospitales líquidos Guatemala
Hospitales líquidos GuatemalaOscar Iván Robles
 

En vedette (20)

Evaluation of performance appraisal system
Evaluation of performance appraisal systemEvaluation of performance appraisal system
Evaluation of performance appraisal system
 
Performance appraisal evaluation
Performance appraisal evaluationPerformance appraisal evaluation
Performance appraisal evaluation
 
Evaluation of performance appraisal
Evaluation of performance appraisalEvaluation of performance appraisal
Evaluation of performance appraisal
 
Performance evaluation appraisal
Performance evaluation appraisalPerformance evaluation appraisal
Performance evaluation appraisal
 
Performance appraisal system cdac
Performance appraisal system cdacPerformance appraisal system cdac
Performance appraisal system cdac
 
Major project
Major projectMajor project
Major project
 
Presentation on Performance Appraisal System
Presentation on Performance Appraisal SystemPresentation on Performance Appraisal System
Presentation on Performance Appraisal System
 
BRIK MONOCORDIO
BRIK MONOCORDIOBRIK MONOCORDIO
BRIK MONOCORDIO
 
Инструкция по управлению виджетом
Инструкция по управлению виджетомИнструкция по управлению виджетом
Инструкция по управлению виджетом
 
MBANY-Mitigation Risks in Construction Lending
MBANY-Mitigation Risks in Construction LendingMBANY-Mitigation Risks in Construction Lending
MBANY-Mitigation Risks in Construction Lending
 
Multi Rug Ranges By Louis De Poortere
Multi Rug Ranges By Louis De Poortere Multi Rug Ranges By Louis De Poortere
Multi Rug Ranges By Louis De Poortere
 
Conferencia que camino elijo para llegar a mimeta
Conferencia que camino elijo para llegar a mimetaConferencia que camino elijo para llegar a mimeta
Conferencia que camino elijo para llegar a mimeta
 
Fundasiertos del tao oficial
Fundasiertos del tao oficialFundasiertos del tao oficial
Fundasiertos del tao oficial
 
LED LIGHT catalogue
LED LIGHT catalogueLED LIGHT catalogue
LED LIGHT catalogue
 
Democracia en América. La sombra de la desigualdad.
Democracia en América. La sombra de la desigualdad.Democracia en América. La sombra de la desigualdad.
Democracia en América. La sombra de la desigualdad.
 
Acuerdo de Trabajo y Estrategia de Acción del LLECE. Carta compromiso ministe...
Acuerdo de Trabajo y Estrategia de Acción del LLECE. Carta compromiso ministe...Acuerdo de Trabajo y Estrategia de Acción del LLECE. Carta compromiso ministe...
Acuerdo de Trabajo y Estrategia de Acción del LLECE. Carta compromiso ministe...
 
Standing on the Shoulders of Giants
Standing on the Shoulders of GiantsStanding on the Shoulders of Giants
Standing on the Shoulders of Giants
 
Politica 2.0 Diez Reflexiones
Politica 2.0 Diez ReflexionesPolitica 2.0 Diez Reflexiones
Politica 2.0 Diez Reflexiones
 
Talentoynegocio Nº11
Talentoynegocio Nº11Talentoynegocio Nº11
Talentoynegocio Nº11
 
Hospitales líquidos Guatemala
Hospitales líquidos GuatemalaHospitales líquidos Guatemala
Hospitales líquidos Guatemala
 

Similaire à Red Hat Enterprise Linux on IBM System z Performance Evaluation

Shak larry-jeder-perf-and-tuning-summit14-part1-final
Shak larry-jeder-perf-and-tuning-summit14-part1-finalShak larry-jeder-perf-and-tuning-summit14-part1-final
Shak larry-jeder-perf-and-tuning-summit14-part1-finalTommy Lee
 
2009-01-28 DOI NBC Red Hat on System z Performance Considerations
2009-01-28 DOI NBC Red Hat on System z Performance Considerations2009-01-28 DOI NBC Red Hat on System z Performance Considerations
2009-01-28 DOI NBC Red Hat on System z Performance ConsiderationsShawn Wells
 
Oow 2008 yahoo_pie-db
Oow 2008 yahoo_pie-dbOow 2008 yahoo_pie-db
Oow 2008 yahoo_pie-dbbohanchen
 
Designing for High Performance Ceph at Scale
Designing for High Performance Ceph at ScaleDesigning for High Performance Ceph at Scale
Designing for High Performance Ceph at ScaleJames Saint-Rossy
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community
 
Shak larry-jeder-perf-and-tuning-summit14-part2-final
Shak larry-jeder-perf-and-tuning-summit14-part2-finalShak larry-jeder-perf-and-tuning-summit14-part2-final
Shak larry-jeder-perf-and-tuning-summit14-part2-finalTommy Lee
 
New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...
New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...
New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...Filipe Miranda
 
Quick-and-Easy Deployment of a Ceph Storage Cluster
Quick-and-Easy Deployment of a Ceph Storage ClusterQuick-and-Easy Deployment of a Ceph Storage Cluster
Quick-and-Easy Deployment of a Ceph Storage ClusterPatrick Quairoli
 
Presentation db2 best practices for optimal performance
Presentation   db2 best practices for optimal performancePresentation   db2 best practices for optimal performance
Presentation db2 best practices for optimal performancesolarisyougood
 
Tuning Linux for your database FLOSSUK 2016
Tuning Linux for your database FLOSSUK 2016Tuning Linux for your database FLOSSUK 2016
Tuning Linux for your database FLOSSUK 2016Colin Charles
 
The state of Hive and Spark in the Cloud (July 2017)
The state of Hive and Spark in the Cloud (July 2017)The state of Hive and Spark in the Cloud (July 2017)
The state of Hive and Spark in the Cloud (July 2017)Nicolas Poggi
 
Linux one vs x86 18 july
Linux one vs x86 18 julyLinux one vs x86 18 july
Linux one vs x86 18 julyDiego Rodriguez
 
Db2 analytics accelerator on ibm integrated analytics system technical over...
Db2 analytics accelerator on ibm integrated analytics system   technical over...Db2 analytics accelerator on ibm integrated analytics system   technical over...
Db2 analytics accelerator on ibm integrated analytics system technical over...Daniel Martin
 
Presentation db2 best practices for optimal performance
Presentation   db2 best practices for optimal performancePresentation   db2 best practices for optimal performance
Presentation db2 best practices for optimal performancexKinAnx
 
MySQL Performance - Best practices
MySQL Performance - Best practices MySQL Performance - Best practices
MySQL Performance - Best practices Ted Wennmark
 
AIOUG-GroundBreakers-Jul 2019 - 19c RAC
AIOUG-GroundBreakers-Jul 2019 - 19c RACAIOUG-GroundBreakers-Jul 2019 - 19c RAC
AIOUG-GroundBreakers-Jul 2019 - 19c RACSandesh Rao
 

Similaire à Red Hat Enterprise Linux on IBM System z Performance Evaluation (20)

Shak larry-jeder-perf-and-tuning-summit14-part1-final
Shak larry-jeder-perf-and-tuning-summit14-part1-finalShak larry-jeder-perf-and-tuning-summit14-part1-final
Shak larry-jeder-perf-and-tuning-summit14-part1-final
 
2009-01-28 DOI NBC Red Hat on System z Performance Considerations
2009-01-28 DOI NBC Red Hat on System z Performance Considerations2009-01-28 DOI NBC Red Hat on System z Performance Considerations
2009-01-28 DOI NBC Red Hat on System z Performance Considerations
 
Oow 2008 yahoo_pie-db
Oow 2008 yahoo_pie-dbOow 2008 yahoo_pie-db
Oow 2008 yahoo_pie-db
 
Designing for High Performance Ceph at Scale
Designing for High Performance Ceph at ScaleDesigning for High Performance Ceph at Scale
Designing for High Performance Ceph at Scale
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph
 
Shak larry-jeder-perf-and-tuning-summit14-part2-final
Shak larry-jeder-perf-and-tuning-summit14-part2-finalShak larry-jeder-perf-and-tuning-summit14-part2-final
Shak larry-jeder-perf-and-tuning-summit14-part2-final
 
New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...
New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...
New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...
 
Quick-and-Easy Deployment of a Ceph Storage Cluster
Quick-and-Easy Deployment of a Ceph Storage ClusterQuick-and-Easy Deployment of a Ceph Storage Cluster
Quick-and-Easy Deployment of a Ceph Storage Cluster
 
Presentation db2 best practices for optimal performance
Presentation   db2 best practices for optimal performancePresentation   db2 best practices for optimal performance
Presentation db2 best practices for optimal performance
 
Tuning Linux for your database FLOSSUK 2016
Tuning Linux for your database FLOSSUK 2016Tuning Linux for your database FLOSSUK 2016
Tuning Linux for your database FLOSSUK 2016
 
The state of Hive and Spark in the Cloud (July 2017)
The state of Hive and Spark in the Cloud (July 2017)The state of Hive and Spark in the Cloud (July 2017)
The state of Hive and Spark in the Cloud (July 2017)
 
Linux one vs x86
Linux one vs x86 Linux one vs x86
Linux one vs x86
 
Linux one vs x86 18 july
Linux one vs x86 18 julyLinux one vs x86 18 july
Linux one vs x86 18 july
 
Db2 analytics accelerator on ibm integrated analytics system technical over...
Db2 analytics accelerator on ibm integrated analytics system   technical over...Db2 analytics accelerator on ibm integrated analytics system   technical over...
Db2 analytics accelerator on ibm integrated analytics system technical over...
 
Presentation db2 best practices for optimal performance
Presentation   db2 best practices for optimal performancePresentation   db2 best practices for optimal performance
Presentation db2 best practices for optimal performance
 
OpenPOWER Webinar
OpenPOWER Webinar OpenPOWER Webinar
OpenPOWER Webinar
 
What's New in RHEL 6 for Linux on System z?
What's New in RHEL 6 for Linux on System z?What's New in RHEL 6 for Linux on System z?
What's New in RHEL 6 for Linux on System z?
 
MySQL Performance - Best practices
MySQL Performance - Best practices MySQL Performance - Best practices
MySQL Performance - Best practices
 
tburke_rhel6_summit.pdf
tburke_rhel6_summit.pdftburke_rhel6_summit.pdf
tburke_rhel6_summit.pdf
 
AIOUG-GroundBreakers-Jul 2019 - 19c RAC
AIOUG-GroundBreakers-Jul 2019 - 19c RACAIOUG-GroundBreakers-Jul 2019 - 19c RAC
AIOUG-GroundBreakers-Jul 2019 - 19c RAC
 

Plus de IBM India Smarter Computing

Using the IBM XIV Storage System in OpenStack Cloud Environments
Using the IBM XIV Storage System in OpenStack Cloud Environments Using the IBM XIV Storage System in OpenStack Cloud Environments
Using the IBM XIV Storage System in OpenStack Cloud Environments IBM India Smarter Computing
 
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...IBM India Smarter Computing
 
A Comparison of PowerVM and Vmware Virtualization Performance
A Comparison of PowerVM and Vmware Virtualization PerformanceA Comparison of PowerVM and Vmware Virtualization Performance
A Comparison of PowerVM and Vmware Virtualization PerformanceIBM India Smarter Computing
 
IBM pureflex system and vmware vcloud enterprise suite reference architecture
IBM pureflex system and vmware vcloud enterprise suite reference architectureIBM pureflex system and vmware vcloud enterprise suite reference architecture
IBM pureflex system and vmware vcloud enterprise suite reference architectureIBM India Smarter Computing
 

Plus de IBM India Smarter Computing (20)

Using the IBM XIV Storage System in OpenStack Cloud Environments
Using the IBM XIV Storage System in OpenStack Cloud Environments Using the IBM XIV Storage System in OpenStack Cloud Environments
Using the IBM XIV Storage System in OpenStack Cloud Environments
 
All-flash Needs End to End Storage Efficiency
All-flash Needs End to End Storage EfficiencyAll-flash Needs End to End Storage Efficiency
All-flash Needs End to End Storage Efficiency
 
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...
 
IBM FlashSystem 840 Product Guide
IBM FlashSystem 840 Product GuideIBM FlashSystem 840 Product Guide
IBM FlashSystem 840 Product Guide
 
IBM System x3250 M5
IBM System x3250 M5IBM System x3250 M5
IBM System x3250 M5
 
IBM NeXtScale nx360 M4
IBM NeXtScale nx360 M4IBM NeXtScale nx360 M4
IBM NeXtScale nx360 M4
 
IBM System x3650 M4 HD
IBM System x3650 M4 HDIBM System x3650 M4 HD
IBM System x3650 M4 HD
 
IBM System x3300 M4
IBM System x3300 M4IBM System x3300 M4
IBM System x3300 M4
 
IBM System x iDataPlex dx360 M4
IBM System x iDataPlex dx360 M4IBM System x iDataPlex dx360 M4
IBM System x iDataPlex dx360 M4
 
IBM System x3500 M4
IBM System x3500 M4IBM System x3500 M4
IBM System x3500 M4
 
IBM System x3550 M4
IBM System x3550 M4IBM System x3550 M4
IBM System x3550 M4
 
IBM System x3650 M4
IBM System x3650 M4IBM System x3650 M4
IBM System x3650 M4
 
IBM System x3500 M3
IBM System x3500 M3IBM System x3500 M3
IBM System x3500 M3
 
IBM System x3400 M3
IBM System x3400 M3IBM System x3400 M3
IBM System x3400 M3
 
IBM System x3250 M3
IBM System x3250 M3IBM System x3250 M3
IBM System x3250 M3
 
IBM System x3200 M3
IBM System x3200 M3IBM System x3200 M3
IBM System x3200 M3
 
IBM PowerVC Introduction and Configuration
IBM PowerVC Introduction and ConfigurationIBM PowerVC Introduction and Configuration
IBM PowerVC Introduction and Configuration
 
A Comparison of PowerVM and Vmware Virtualization Performance
A Comparison of PowerVM and Vmware Virtualization PerformanceA Comparison of PowerVM and Vmware Virtualization Performance
A Comparison of PowerVM and Vmware Virtualization Performance
 
IBM pureflex system and vmware vcloud enterprise suite reference architecture
IBM pureflex system and vmware vcloud enterprise suite reference architectureIBM pureflex system and vmware vcloud enterprise suite reference architecture
IBM pureflex system and vmware vcloud enterprise suite reference architecture
 
X6: The sixth generation of EXA Technology
X6: The sixth generation of EXA TechnologyX6: The sixth generation of EXA Technology
X6: The sixth generation of EXA Technology
 

Red Hat Enterprise Linux on IBM System z Performance Evaluation

  • 1. Christian Ehrhardt IBM Research and Development Red Hat Enterprise Linux on IBM System z Performance Evaluation © 2011 IBM Corporation
  • 2. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Agenda  Performance Evaluation Results – Environment – Noteworthy improvements – Changes you should be aware of  Performance evaluation Summary – Improvements and degradations per area – Summarized comparison 2/24 © 2011 IBM Corporation
  • 3. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Environment  Host – System z10 (2097-E269)  HW-Platform – FICON 4 Gbps – Linux on LPAR – FCP 4 Gbps – Linux in z/VM 5.4 guest – HiperSockets – OSA Express 3 1GbE + 10GbE  Verified on  Storage – DS8300 (2107-922 ) – System z Enterprise (z196) – HDD's – Linux in z/VM 6.1 – FICON 4 Gbps – FCP 4 Gbps 3/24 © 2011 IBM Corporation
  • 4. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Compared Versions  Compared Set – RHEL5 U4 (2.6.18-194.el5) – RHEL6-GA (2.6.32-71.el6) – RHEL6-GA + tuning • our recommended tuning (in RH Tech Notes) • workarounds for known issues  Measurements – Base regression set covering most customer use cases as good as possible – Focus on areas where performance issues are more likely – Just the top level summary, based on thousands of comparisons – Special case studies for non-common features and setups  Terminology – Throughput – “How much could I transfer once?” – Latency – “How long do I have to wait for event X?” – Normalized cpu consumption - “How much cpu per byte/transaction/... do I need?” 4/24 © 2011 IBM Corporation
  • 5. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary New process scheduler (CFS)  Goals of CFS – Models “ideal, precise multi-tasking CPU” – Fair scheduling based on virtual runtime  Changes you might notice when switching from O(1) to CFS – Lower response times for I/O, signals, … – Balanced distribution of process time-slices – Improved distribution across processors – Shorter consecutive time-slices – More context switches  Improved balancing – Topology support can be activated via the topology=on kernel parameter – This makes the scheduler aware of the cpu hierarchy  You really get something from fairness as well – Improved worst case latency and throughput – By that CFS can ease QoS commitments 5/24 © 2011 IBM Corporation
  • 6. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Benchmark descriptions - File system / LVM / Scaling  Filesystem benchmark dbench – Emulation of Netbench benchmark – Generates file system load on the Linux VFS – Does the same I/O calls like smbd server in Samba (without networking calls)  Simulation – Workload simulates client and server (Emulation of Netbench benchmark) – Mainly memory operations for scaling – Low main memory and lvm setup for mixed I/O and lvm performance – Mixed file operations workload for each process: create, write, read, append, delete – 8 CPUs, 2GB memory and scaling from 4 to 62 processes (clients) – Measures throughput of transferred data 6/24 © 2011 IBM Corporation
  • 7. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary File system benchmark – process scaling RHEL5-U4 RHEL5-U4 Throughput (16 CPU) RHEL6-GA CPU consumption (16 CPU) RHEL6-GA Total cpu in micro s econds per MiB 6000 throughput by dbench in MB/s 5000 4000 better 3000 2000 1000 0 1 4 8 12 16 20 26 32 40 1 4 8 12 16 20 26 32 40 number of proces s es number of proces s es  Improved scalability – Especially improves large workloads – Lower cross process deviation improves QoS  Increased CPU consumption due to – CFS is striving for better interactivity and fairness – Changes affecting the writeback of dirty pages • Rule of thumb – now about twice as aggressive • One might want to tune dirty ratios in /proc/sys/vm/dirty_* 7/24 © 2011 IBM Corporation
  • 8. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Benchmark descriptions - Java  evaluates server side Java – 3-tier system • Random input from user • Middle tier business logic implemented in Java • No explicit database --> emulated by Java objects • Scales warehouses  stressed components – Java • Virtual Machine (VM) • Just-In-Time compiler (JIT) • Garbage Collection (GC) – Linux operating system • Threads • Scheduler • Caches and Memory 8/24 © 2011 IBM Corporation
  • 9. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Java (31/64 Bit) Throughput SPECjbb2005 bops 64b-RH5-U4 64b-RH6-GA 64b-RH6-GA-tuned 31b-RH5-U4 31b-RH6-GA 31b-RH6-GA-tuned 6 7 8 9 10 11 12 13 14 15 16 17 18 Number of Warehouses  64b – RHEL6-GA -4.8% vs RHEL5-U4; tuned at least only -3.8%  31b – RHEL6-GA almost equal to RHEL5-U4, with tuning even +2.5%  This is caused by a bit of over-optimization for desktop latency in the new scheduler  System z recommended tunables are not set by default, but are part of the RHEL6 tech notes 9/24 © 2011 IBM Corporation
  • 10. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Benchmark descriptions - Webserving  Webserver Benchmark – Static website content read – Variable number of connections – Measures throughput via network connection  Server side – Apache – HTML content  Client side – 3 clients connected to webserver – Number of active requests scaled from 1 to 20 connections per client 10/24 © 2011 IBM Corporation
  • 11. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Webserving – example for improved CPU scaling RHEL5-U4 Simple Webserving Workload (60 Clients) RHEL6-GA RHEL6-GA-tuned Page Elements / sec 1 CPU 2 CPU 4 CPU Number of Processes  Improved CPU scalability compared to RHEL5-U4 – Again the recommended scheduler tuning can add a further improvement  Additional CFS effect – Lower worst-case response time 11/24 © 2011 IBM Corporation
  • 12. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Benchmark descriptions - Network  Network Benchmark which simulates several workloads  Transactional Workloads – 2 types • RR – A connection to the server is opened once for a 5 minute time frame • CRR – A connection is opened and closed for every request/response – 4 sizes • RR 1x1 – Simulating low latency keepalives • RR 200x1000 – Simulating online transactions • RR 200x32k – Simulating database query • CRR 64x8k – Simulating website access  Streaming Workloads – 2 types – STRP/STRG – Simulating incoming/outgoing large file transfers (20mx20)  All tests are done with 1, 10 and 50 simultaneous connections  All that across on multiple connection types (different cards and MTU configurations) 12/24 © 2011 IBM Corporation
  • 13. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Network Throughput 10 Gigabit Ethernet OSA Express3 MTU 8992 Vswitch Guest to Guest MTU 1492 RHEL5-U4 RHEL5-U4 RHEL6-GA RHEL6-GA Trans actions per Second Trans actions per Second 1 10 50 1x1 1x1 1x1 k1 10 2k 50 x32 2k 200 0x3 0x3 20 20 Workload Workload  Connection scaling is good - parallel scenarios improved a lot – Up by +6% to +29% for 50 concurrent connections – Workloads with larger transferred sizes actually benefit a bit more  Single connection Latency can be an issue – 1x1 is shown here as it forces max overhead and latency per transferred byte – Down by -5% to -15% on single connections  Large improvements for vswitch guest to guest – Up to +54% using 50 concurrent connections with MTU 1492 and 200x32k workload 13/24 © 2011 IBM Corporation
  • 14. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Network CPU consumption 10 Gigabit Ethernet OSA Express 3 MTU 8992 Client Vswitch guest to guest 1492 Client RHEL5-GA RHEL5-U4 RHEL6-GA RHEL6-GA CPU cons umption per Trans action CPU cons umption per Trans action 1 10 50 k1 10 50 1x1 1x1 1x1 x32 2k 2k 200 0x3 0x3 20 20 Workload Workload  CPU consumption increased for a lot of workloads (roughly 2/3 of the types we distinguish) – The range of is from +35% improvement down to -44% degradation – Most loads degrade by an average around -15% – This is part of the trade-offs for better scalability – Also partially a scheduler/caching effect  Some improvements for loads with large mtu's – Usually seen on the sender side – That implies it is beneficial for data sources not for data sinks 14/24 © 2011 IBM Corporation
  • 15. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Benchmark descriptions - Disk I/O  Workload – Threaded I/O benchmark – Each process writes or reads to a single file, volume or disk – Benchmark can be configured to run with and without page cache (direct I/O) – Operating modes: Sequential write/rewrite/read + Random write/read  Setup – Main memory was restricted to 256MB – File size (overall): 2GB, Record size: 64kB – Scaling over 1, 2, 4, 8, 16, 32, 64 processes – Sequential run: write, rewrite, read – Random run: write, read (with previous sequential write) – Once using page cache and once using Direct I/O (bypass the page cache) – Sync and Drop Caches prior to every invocation to reduce noise 15/24 © 2011 IBM Corporation
  • 16. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Throughput for seq. readers Page cache based disk I/O read issue Throughput 1 2 4 8 16 32 number of proces s es  Caused as corner case by memory management “improvements”  Real World - Backups – It can hold a lot of data to scan by the backup software (→ a lot of seq. read) – A lot of data is usually split across many discs on s390 (→ concurrent access) – Overcommitment/ballooning effects or sized too small (→ memory constraint) Systems “protected” “easily freeable” Sizing/ballooning base footprint (new part) Need for read ahead allocs  Detection – Most workloads won't see the impact or more than that benefit from these changes – Check sysstat which should report a huge amount of pgscand/s – Run “sync; echo 3 > /proc/sys/vm/drop_caches” • Should hurt throughput, huge improvements mean you are probably affected  Workarounds other than “more memory” – Drop caches if there is a single time this happens (e.g. on nightly backup) – Use direct I/O or shrink read ahead if applicable – Fix got upstream accepted in 2.6.37-rc1 16/24 © 2011 IBM Corporation
  • 17. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Disk I/O – New FICON features  HyperPAV – Avoid subchannel busy – Automatic management of subchannel assignment/usage – No need of multipath daemon – Especially useful for concurrent disk accesses  Read-Write Track Data – Allows to read/write up to a full track in one command word – Especially useful for huge requests and streaming sequential loads  High Performance Ficon – New metadata format reduces overhead – Especially useful for small requests 17/24 © 2011 IBM Corporation
  • 18. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Disk I/O – FICON – HyperPAV FICON FICON Throughput sequential writers FICON+HPAV Throughput sequential readers FICON+HPAV Throughput Throughput 1 2 4 8 16 32 64 1 2 4 8 16 32 64 number of proces s es number of proces s es FICON  Using 4 disks (4 ranks) with 3 aliases per rank Throughput random readers FICON+HPAV  Without PAV/HyperPAV – Access could become contented (subchannel busy) – Throughput stays constant >1 proc per disk Throughput  Solution: Having multiple subchannels per device – PAV: Aliases for devices – HyperPAV: Pool of aliases defined per rank – Throughput increased up to 3.5 x in our scenario 1 2 4 8 16 32 64 → Usage of HyperPAV can be highly recommended number of proces s es 18/24 © 2011 IBM Corporation
  • 19. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Disk I/O – FICON – effect of RWTD/HPF – Throughput RHEL5-U4 RHEL5-U4 RHEL6-GA RHEL6-GA Throughput for initial writers Throughput for readers Throughput in KB/s Throughput in KB/s 1 2 4 8 16 32 64 1 2 4 8 16 32 64 number of proces s es number of proces s es  IOzone sequential write/read using direct I/O – Huge throughput improvements • Write throughput up to 26% • Read throughput up to 82% – Normalized I/O consumption stays about the same • despite the much larger throughput 19/24 © 2011 IBM Corporation
  • 20. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Disk I/O – FICON – effect of RWTD/HPF – random workloads RHEL5-U4 RHEL5-U4 RHEL6-GA RHEL6-GA Throughput for random writers Throughput for random readers Throughput in KB/s Throughput in KB/s 1 2 4 8 16 32 64 1 2 4 8 16 32 64 number of proces s es number of proces s es  IOzone random write/read using direct I/O – Huge throughput improvements • Read throughput up to +81% • Write throughput up to +23% 20/24 © 2011 IBM Corporation
  • 21. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Hints - General  Cgroup memory support – This is a feature coming with newer kernels – Recommended by some management tools to enforce very customizable memory constraints – Has a rather large footprint by consuming 1% of the memory – Activated by default – In a consolidation environment it is actually 1% multiplied by your virtual/real ratio – Not pageable by linux, but fortunately by z/VM – This can be overridden with a kernel parameter (reboot required): cgroup_disable=memory 21/24 © 2011 IBM Corporation
  • 22. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Improvements and Degradations of RHEL6 per area vs. RHEL5-U4 Improvements Degradations FICON I/O CPU consumption* Process scaling OSA single C. Latency CPU scaling I/O corner cases via page cache* Compiler Multiconn. Networking Disk I/O via page cache  Improvements in almost every area – Especially for large workloads  Degradations for corner cases and cpu consumption * = Partially or completely avoidable due to tunings/workarounds as shown before 22/24 © 2011 IBM Corporation
  • 23. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary Summary  RHEL 6 performance is good – With some trade-offs roughly equal to RHEL5-U4 • A common trade-off is increased cpu consumption for better scalability – Our recommended tunings/workarounds help in some known cases • Upcoming RHEL6.1 will further reduce the amount of manual tuning needed – Almost generally recommendable • An exception are very cpu consumption sensitive environments • Here upgrades have to be considered carefully  Improvements and degradations Base New Improved No difference Degraded or Trade-off RH5U4 RH6 27 22 33 RH5U4 RH6 tune & w. 34 48 0 23/24 © 2011 IBM Corporation
  • 24. Smarter Systems for a Smarter Planet – RHEL6 Evaluation Summary References  Some links the presentation referred to – Linux on System z – Tuning hints and tipps http://www.ibm.com/developerworks/linux/linux390/perf/index.html – Live Virtual Classes for z/VM and Linux http://www.vm.ibm.com/education/lvc/ – Red Hat Enterprise Linux 6 Tech Notes http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html-single/Technical_Notes/index.html Christian Ehrhardt Schönaicher Strasse 220 Linux on System z 71032 Böblingen, Germany Performance and Evaluation ehrhardt@de.ibm.com IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at “Copyright and trademark information” at www.ibm.com/legal/copytrade.shtml. 24/24 © 2011 IBM Corporation