SlideShare une entreprise Scribd logo
1  sur  31
Télécharger pour lire hors ligne
IBM Power Systems




    IBM i Application Optimization with POWER7




                                                                         Gottfried Schimunek
                                                                                                3605 Highway 52 North
                                                                         Senior Architect       Rochester, MN 55901
                                                                         Application Design
                                                                         IBM STG Software       Tel 507-253-2367
                                                                         Development            Fax 845-491-2347
                                                                         Lab Services
                                                                                                Gottfried@us.ibm.com
                                                                         IBM ISV Enablement


                                                                                                 © 2012 IBM Corporation




IBM Power Systems


Acknowledgements

    Many thanks to:

     – Helen Olson-Williams - Lab Services Performance and Scalability Services




     – Anil Madan - Consulting IT Specialist, Lab Services Performance




     – Charles Farrell - Lab Services Performance and Scalability Services




     – Mark Funk - Software Performance Analyst, Systems HW Development




2                                                                                              © 2012 IBM Corporation




                                                                                                                          1
IBM Power Systems


 Agenda


     POWER7 Architecture and Performance Features

     Application Performance Characteristics

     Performance data collection and analysis

     Tuning applications and system for best performance

     Hardware configuration optimization




 3                                                                                                     © 2012 IBM Corporation




 IBM Power Systems


 Comparative Anatomy … POWER6 vs. POWER7
                       More on Cache: http://en.wikipedia.org/wiki/CPU_cache
Memory DIMM                                          Memory DIMM



Memory DIMM                                          Memory DIMM   Core   Core   Core   Core   Core   Core   Core    Core
                      Core          Core

Memory DIMM                                          Memory DIMM



Memory DIMM                                          Memory DIMM     L2    L2    L2    L2    L2    L2    L2    L2
                                                                   256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB
Memory DIMM

                       L2            L2
                                                     Memory DIMM
                                                                                    Cache
Memory DIMM           4MB           4MB              Memory DIMM



Memory DIMM
                            Cache                    Memory DIMM
                                                                    L3
                                                                   4MB
                                                                           L3
                                                                          4MB
                                                                                  L3
                                                                                 4MB
                                                                                         L3
                                                                                        4MB
                                                                                                L3
                                                                                               4MB
                                                                                                       L3
                                                                                                      4MB
                                                                                                              L3
                                                                                                             4MB
                                                                                                                       L3
                                                                                                                      4MB

Memory DIMM           POWER6 Chip                    Memory DIMM


                                                                                   POWER7 Chip

                                                    Important points:
                          L3 32MB shared
                                                      “Multi-core” is not new.
                             Cache
                                                     – Fewer chips for same core count.
                                                      POWER7 has 4x more cores per chip
                     32       2       2     32
                     MB     Cores   Cores   MB
                                                      Cache topology is very different.
 8-core POWER6                                       – The latencies have improved with POWER7.
   (128 MB L3)                              Cache
                     32       2       2     32       – L3 data laterally cast out into neighbor L3.
                     MB     Cores   Cores   MB
                                                     POWER7 has fewer memory DIMMs per core.
 4                                                                                                     © 2012 IBM Corporation




                                                                                                                                2
IBM Power Systems


Systems Made From Multiple Chips                                                   All memory and contents of
                                                                                   all active cache
                                        750                                        accessible to all processors.
                                        • 1 – 4 Chips                              (Cache-Coherent SMP)
                                        • 8 or 6 physical cores per chip
                                        • 8 DIMM slots per chip


All chiplets (cores and cache) are active (i.e., no CUOD)
OS license count defines cores which execute instructions (i.e., a subset of active/physical cores).
• Single partition … Cores packaged densely.
• Multiple partitions … Each partition packaged densely if possible, partitions on separate chip’s cores per memory.
• Cache Active only on OS-licensed chiplets … Energy Savings.

Active chiplet count based on CUoD … 4, 8, 16, 32, 48, 64 (and 24 in “TurboCore”)
A 770/780 drawer always has 2 chips … 16 physical cores if 8-core chips.
OS licensed core count placed onto active chiplets.




                                         770 and 780
                                         • 1 to 4 drawers             16-core Systems:
                                         • 2 Chips per drawer         • POWER6 …     4 drawers and 8 chips
                                         • 6 or 8 cores per chip      • POWER7 770 … 1 drawer and 2 chips.
                                         • 8 DIMM slots per chip      • POWER7 780 … 2 drawers and 4 chips.
  5                                                                                                  © 2012 IBM Corporation




  IBM Power Systems


  Simultaneous Multi-Threading (SMT) – POWER Systems
 Time (cycles)




SMT=OFF



                                11 cycles                                    11 cycles
                                                                                                       Thread 1
                                                                                                       Thread 2



 SMT=ON                                                                    ... in this trivial example -With SMT:
                                                                                   •Thread 1: 18% longer
                                                                                   •Thread 2: 9% longer
                                13 cycles                                          •69% throughput increase
                                12 cycles                                      Amount of overlap/contention
                                                                               is environment dependent.


                  Capacity is increased but individual thread execution time in CPU is slightly
                  more when running concurrently – because of possible contention.


  6                                                                                                  © 2012 IBM Corporation




                                                                                                                              3
IBM Power Systems


4-WAY SMT (Simultaneous Multi-Threading)
 POWER5/6 have 2-Way SMT
SMT4 Provides Still More Capacity.

Up to 4 thread’s instruction streams executing
in parallel on one core.
• 4 threads share the same cache(s) and “pipes”.
• SMT4 Performance enabled via Faster L2 cache.                                                          2
• 8-core chip can support 32 threads.
• 2-core POWER6 chip supports only 4 threads.                                                            2

SMT works because other
thread’s instructions execute during
• Another thread’s cache miss delays.
• When pipe stage not already used by another.

Cores have a lot more capacity than is used by a single task.
• SMT4 common to have 1.5-2X more capacity over ST.

However, when core executing multiple threads,
each thread’s execution speed is slower.
• SMT4’s 2X capacity increase also means ½ throughput for individual thread.

Cores automatically switch between ST, SMT2, SMT4.


  7                                                                                                                          © 2012 IBM Corporation




  IBM Power Systems                                            More on Cache: http://en.wikipedia.org/wiki/CPU_cache

  Cache Design Enhanced… Lateral Cast-out                                                                     Core    Core
                                                                                                                               POWER6
  (In preparation for explaining “TurboCore”)
                                                                                                                                      Memory DIMM

  Fast L2 for rapid cache fills of L1s.                                                                        L2
                                                                                                              4MB
                                                                                                                       L2
                                                                                                                      4MB             Memory DIMM

  More Cache means
                                                                                                                                      Memory DIMM
  • Better Single-Threaded Performance                                    As data read into L2,
                                                                          other data stored out to L3.           L3                   Memory DIMM
  • More System Capacity                                                  Called a “Cast-Out”.                 32 MB
  • Acts to offset lower frequency.
                                                                                    POWER7
      To make room for cache fills,                                   Memory DIMM

      block in L2 is “Cast-Out” to L3.
                                                                                    Core   Core   Core       Core    Core     Core     Core     Core
      (Data cast-out of L3 written to memory if changed.)             Memory DIMM



                                                                      Memory DIMM
      In order to make room for block cast into L3,
      a block may be “Cast-Out” out to another L3.                    Memory DIMM     L2    L2    L2    L2    L2    L2    L2    L2
                                                                                    256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB
      This is most useful when other cores on chip                    Memory DIMM

      are temporarily or permanently idle.
                                                                      Memory DIMM

                                                                                     L3     L3     L3         L3      L3        L3      L3           L3
      Lateral Cast-Out exists on all chips by default,                Memory DIMM   4MB    4MB    4MB        4MB     4MB       4MB     4MB          4MB
      but only amongst licensed/active chiplets.
                                                                      Memory DIMM


      (Now onto “TurboCore”.)
                                                                                            “Lateral Cast-Out””
  8                                                                                                                          © 2012 IBM Corporation




                                                                                                                                                          4
IBM Power Systems



  “TurboCore”
More Cache And Higher Frequency.
                                                                                             16-way 770/780 Drawer
A chip with 4 Active Cores can                                 Core    Core   Core   Core   Core   Core   Core   Core   Core    Core   Core   Core   Core   Core   Core   Core


more easily be cooled.                                           L2    L2    L2    L2    L2    L2    L2    L2
                                                               256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB
                                                                                                                          L2    L2    L2    L2    L2    L2    L2    L2
                                                                                                                        256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB

This, in turn, allows a higher frequency.                       L3      L3     L3     L3     L3     L3     L3     L3     L3      L3     L3     L3     L3     L3     L3     L3
                                                               4MB     4MB    4MB    4MB    4MB    4MB    4MB    4MB    4MB     4MB    4MB    4MB    4MB    4MB    4MB    4MB

( 3.86 GHz       4.14 GHz )

8-Chiplet chip has 8 L3 caches, so
remaining 4 remain enabled.                                              8-way 780 TurboCore Drawer

Best results expected in multi-partition                        Core   Core   Core   Core   Core   Core   Core   Core    Core   Core   Core   Core   Core   Core   Core   Core


                                                                 L2    L2    L2    L2    L2    L2    L2    L2             L2    L2    L2    L2    L2    L2    L2    L2
environment with partition’s packaged                          256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB          256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB



within individual chips.                                         L3
                                                                4MB
                                                                        L3
                                                                       4MB
                                                                               L3
                                                                              4MB
                                                                                      L3
                                                                                     4MB
                                                                                             L3
                                                                                            4MB
                                                                                                    L3
                                                                                                   4MB
                                                                                                           L3
                                                                                                          4MB
                                                                                                                  L3
                                                                                                                 4MB
                                                                                                                          L3
                                                                                                                         4MB
                                                                                                                                 L3
                                                                                                                                4MB
                                                                                                                                        L3
                                                                                                                                       4MB
                                                                                                                                               L3
                                                                                                                                              4MB
                                                                                                                                                      L3
                                                                                                                                                     4MB
                                                                                                                                                             L3
                                                                                                                                                            4MB
                                                                                                                                                                    L3
                                                                                                                                                                   4MB
                                                                                                                                                                           L3
                                                                                                                                                                          4MB


                                                 16-way 780
                                                 TurboCore
                                                   System                8-way 780 TurboCore Drawer
In some workloads and partition placements,                     Core   Core   Core   Core   Core   Core   Core   Core    Core   Core   Core   Core   Core   Core   Core   Core


cross-chip traffic can offset benefits of TurboCore.             L2    L2    L2    L2    L2    L2    L2    L2             L2    L2    L2    L2    L2    L2    L2    L2
                                                               256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB          256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB
Example:
                                                                 L3     L3     L3     L3     L3     L3     L3      L3     L3     L3     L3     L3     L3     L3     L3      L3
16-core CPW … 4-chip TurboCore has slightly less throughput.    4MB    4MB    4MB    4MB    4MB    4MB    4MB     4MB    4MB    4MB    4MB    4MB    4MB    4MB    4MB     4MB


8-core CPW … 2-chip TurboCore has slightly more throughput.

  9                                                                                                                                            © 2012 IBM Corporation




  IBM Power Systems


 The Conclusion Chart …
  Think of POWER7 Performance as having …

   • Better Core-to-Core Capacity Producing Desired Growth

   • Massive improvement in “socket” capacity …. 8 cores per chip
       • A lot more capacity within a package (card, drawer).

   • A lower frequency (than POWER6), but still better
         • Capacity improvements result from other improvements
         • Many single-threaded workloads execute better than POWER6.

   • Lower energy consumption per core
        • Even more energy savings if you exploit IBM Systems Director Active Energy Manager.




  10                                                                                                                                           © 2012 IBM Corporation




                                                                                                                                                                                 5
IBM Power Systems


Application Performance Characteristics

     Batch performance characteristics; single vs multithread
      – Multithreaded applications can automatically take advantage of new POWER features
           Typically I/O bound
      – Single threaded applications will need some tuning
           Typically uses only a single core at a time (e.g. max 6% of a sixteen way)
           Typically I/O bound

     CPU or disk I/O bound?
      – Low overall CPU utilization, high disk utilization, lots of memory paging
      – Tuning techniques available to make an I/O bound job CPU bound

     Batch jobs waiting for something else?
      – Wait for I/Os, e.g. disk configuration # disk arms, disk speed, IOA, disk cache, etc.
      – Wait for processors, CPU queuing, higher priorities, virtualization, capping, etc.
      – Wait for record locks and seizes, DB design, application architecture
      – Wait for many other potential wait situations, hundreds of wait situations


11                                                                                                           © 2012 IBM Corporation




IBM Power Systems


Multithread vs single thread applications - Example
                                          Start of End-of-Day Process


      Job1                       Job2                      Job3                       Job4                       Job5




                                                     Single thread



      Job1                       Job2                      Job3                       Job4                       Job5


                                           End of End-of-Day Process
                                                    Backup
       Some applications cannot be run in multithread mode, e.g. data integrity, data dependency, etc
       Processor utilization in single threaded is typically below 20%
       Performance improvements in single threaded jobs are limited to one processor unless DB SMP is used
12                                                                                                           © 2012 IBM Corporation




                                                                                                                                      6
IBM Power Systems




Single versus multi-threaded Application runtime

                                                                         Single-threaded
                    A                  B                 C

              Timesingle-threaded


                                    Multi-threaded
                    A                          • Can be multithreads in programs, or more likely
                                                 multi jobs on IBM i

                            B                  • In practice, finding these thread regions is not
                                                 easy but so valuable for performance and
                                                 being handled automatically at the CPU
                        C                        hardware level


              Timemulti-threaded


13                                                                                     © 2012 IBM Corporation




IBM Power Systems


How to multithread an application?
Use DB2 SMP feature to parallelize I/O and index operations
Use Holey inserts to improve concurrent ADDs
Split input stream into multiple concurrent batch jobs so that each job works on separate
parts of the input data

From single thread:
                                           A             B                 C

To multiple threads:
                                           A              B                C

                                           A              B                C

use iStream and BSTAR from Cinimex at http://www.cinimex.co.uk/products/bstar.html
  – Automatically multithreads applications w/o application programming required


14                                                                                     © 2012 IBM Corporation




                                                                                                                7
IBM Power Systems


Finding the optimal number of threads
      The first challenge is to evenly distribute work across threads.
      In this simple example, 8 threads are running concurrently, but the work is not evenly
      distributed
      CPU resources are under-utilized.




Time (cycles)




 15                                                                                      © 2012 IBM Corporation




 IBM Power Systems


Find the optimal number of threads
  On Power 7, for the highest overall capacity,
                  number of concurrent threads = number of cores * 4




                                                                        2 cores

                                                                        8 concurrent threads



      For the best runtime for individual threads,
                        number of concurrent threads = number of cores

                                                                         2 cores

                                                                         2 concurrent threads

                                                                  High priority work

 16                                                                                      © 2012 IBM Corporation




                                                                                                                  8
IBM Power Systems


Power 6 CPU Queuing Time -> Power 7 Dispatched CPU Time


                                                                        Power 6, 5.0 Ghz,
                                                                        3 PUs, 3 Virtual Processor
                                                                        SMT 2 -> 6 threads dispatched




                                                                        Power 7, 3.5 Ghz,
                                                                        2.5 PUs, 3 Virtual Processor
                                                                        SMT 4 -> 12 threads dispatched



                                                                  Note: It appears there are
                                                                  additional CPU resources
                                                                  available on Power 7.
                                                                  Keep in mind that thread
                                                                  execution time is longer when in
                                                                  SMT 4 mode.

17                                                                                             © 2012 IBM Corporation




IBM Power Systems


How to improve I/O bound jobs?
- make them CPU bound..
                                                                          100

   Goal: Eliminate all or reduce most of the I/O waits                        80

   Use SETOBJACC for high volume accessed files and indexes                   60
                                                                                                      I/O
                                                                              40
   Avoid open/close of files, use RETRN vs SETON *LR in RPG                                           CPU
                                                                              20
   Remove deleted records – RGZPFM
                                                                               0
   Use REUSEDLT(*YES) to enhance concurrent writes                                I/O bound CPU bound
   Optimize Load/ Copy/ Save processes by multithreading and/or other techniques
   Change random reads to sequential reads
 • Higher blocking for input/output only operations – SEQONLY((*YES #rcds)) >= 128KB Increase logical
   page size in DB operations
 • Avoid opening Output files as I-O files to take advantage of logical blocking.
   Eliminate FRCRATIO(1)
 • CRTLF PAGESIZ(64 – 512KB)
   Use Expert cache in job pools
   For High volume ADDs, use DB2 SMP for parallel index maintenance and use ALLOCATE(*YES)
   Use SQL to create DB rather than DDS
   If journaling
       – Use larger commit cycles
       – Use HA Journal Performance Enhancement feature (SS1 opt 42)
       – Reduce JRN entry size with RCVSIZOPT(*RMVINTENT)


18                                                                                             © 2012 IBM Corporation




                                                                                                                        9
IBM Power Systems


Analyze Performance by collecting performance data



     What performance tools to use?
      – Collection services – 5 minutes interval for the complete run duration
      – Job Watcher – to identify wait components
      – PEX Analyzer with disk read/writes start/end ops
      – Disk Watcher to review disk and IO related performance




19                                                                               © 2012 IBM Corporation




IBM Power Systems


IBM Systems Director Navigator for i
      Performance, Investigate Data




20                                                                               © 2012 IBM Corporation




                                                                                                          10
IBM Power Systems


Investigate Data                       Perspectives are a logical
                                       grouping of similar or related
                                       views that benefit from being
                                       rendered side-by-side for
                                       reference or context.




                                                Content Package is a set of
                                                perspectives that share a
                                                commonality (major theme).




                                                       !
                                                    New

                                                    Performance Explorer and
21
                                                    Health Indicators2012 IBM Corporation
                                                                    ©




IBM Power Systems


Investigate Data – Select Collection




The Collection boxes
allow you to specify which
collection you want to
work with. Only
collections valid for the
type of chart you select
will be displayed.
22                                                                       © 2012 IBM Corporation




                                                                                                  11
IBM Power Systems


Resource Utilization Overview - Percentages
                                 Nice summary for general overall health:
                                 • CPU Utilization
                                 • Disk Utilization
                                 • Disk Busy




23                                                             © 2012 IBM Corporation




IBM Power Systems


Resource Utilization Overview - Rates

                                Nice summary for general overall health:
                                • 5250 Transactions
                                • I/Os per Second
                                • Page Faults




24                                                             © 2012 IBM Corporation




                                                                                        12
IBM Power Systems


CPU Utilization and Waits Overview




25                                                        © 2012 IBM Corporation




IBM Power Systems


What is iDoctor?


     • Product developed by the Support Center to enable
       effective resolution of customer performance problems
     • Graphical analysis
     • Heavy focus on wait analysis
     • Main components
         – Job Watcher – detailed job data and analysis
         – PEX Analyzer - PEX data analysis
         – Disk Watcher




26                                                        © 2012 IBM Corporation




                                                                                   13
IBM Power Systems


iDoctor tools (Job Watcher and PEX Analyzer) Take You …

                ………………….…from this level…………..……….
                             CPU Q’ing            Disk I/O                  Other




                ………………….…to this level…………..……….
                                                                                    H
                                                 Sync Read /
     User
     Pgm
            OS/400
             Pgm
                      MI
                     Instr
                               LIC    CPU
                              Assist Queuing        Write      DASD wait /
                                                                  Pending Fault
                                                                    Write
                                                                                    e
                                                                                    l          Other
                                                                                        COMM SeizesLocks        Gates
                                                                                    d



                                                Obj       Op'n              Op'n
                                                                     Pool
                                               Name     Start/Stop         type &
                                                                   (MS/AS)
                                               & Type     Time             length



            CPU Time                                DASD I/O Time                       Other Wait Time
27                                                                                                    © 2012 IBM Corporation




IBM Power Systems



 Waits Overview – iDoctor Job Watcher




28                                                                                                    © 2012 IBM Corporation




                                                                                                                               14
IBM Power Systems



 Waits Overview – iDoctor Job Watcher – Example Physical IO
                    USER000001




29                                                       © 2012 IBM Corporation




IBM Power Systems



Waits Overview – iDoctor Job Watcher – Example Physical IO
                    USER000001




      JOB1NUM001

      JOB2NUM002

      JOB3NUM003




30                                                       © 2012 IBM Corporation




                                                                                  15
IBM Power Systems



 Waits Overview – iDoctor Job Watcher – Objects Waited On
                    USER000001



                                        JOB2NUM002
     File000001 MBR1

     File000002 MBR1

     File000003 MBR1

      File000004 MBR1

      File000005 MBR1

      File000006 MBR1

      File000007 MBR1

      File000008 MBR1

      File000009 MBR1

      File000010 MBR1




31                                                                                    © 2012 IBM Corporation




IBM Power Systems


Power Solid State Drive
     Processors           Memory                        SSD                           Disk

     Very, very,         Very, very,                                              Very, very slow
     very, very,         very fast                                                comparatively
     very fast                                          Fast

                                                                                    1,000,000 -
      < 10’s ns            ~100 ns                   ~200,000 ns
                                                                                   8,000,000 ns

                                            Access Speed

           Sweet spots
           1. Batch window reduction for disk bound applications
              You can cut up to 40-50% off window
           2. Response time - transaction/data base for disk bound applications

       – Key points
       – -- A modest quantity of SSD can make a big difference
       –   -- Biggest performance boost for random read workload




32                                                                                    © 2012 IBM Corporation




                                                                                                               16
IBM Power Systems


Mixed SSD + HDD – cold and hot data
     It is typical for data bases to have a large percentage of data which is infrequently used
     (“cold”) and a small percentage of data which is frequently used (“hot”)




                    Hot data may be only 10-20% capacity, but represent 80-90% activity


     SSD offers best price performance when focused on “hot” data
     HDD offers best storage cost, so focus it on “cold” data …. sort of a hierarchal approach




        Cold                                                   May be able to use larger HDD and/or a larger % capacity used
        Hot         Can run SSD closer
                    to 100% capacity
33                                                                                                                  © 2012 IBM Corporation




IBM Power Systems



Waits Overview – iDoctor Job Watcher – Before and After


                                                  JOB2NUM002
                USER000001
                                         JOB2NUM002




                USER000001                        JOB2NUM002

                                         JOB2NUM002




34                                                                                                                  © 2012 IBM Corporation




                                                                                                                                             17
IBM Power Systems




Practical experiences and other optimization
examples from recent engagements……




35                                                                                     © 2012 IBM Corporation




IBM Power Systems


Application scalability and tuning engagement for Business Partner

     Results
      – Max tps increased from 2000 -> 6139,

     Changes made during Performance and Scalability Test
      – Created multiple copies of TR files to reduce contention (gate wait) when inserting
        records to the transaction files.
      – Pre-allocate log file to reduce seize contention when files extend.
      – Switch from varchar to fixed length field in log file since most of the accesses are the
        max length, to reduce seize contention on auxiliary space.
      – Turn on journal caching.
      – Increase jorecra threshold to reduce seize contention on index.
      – Contention on an application userqueue - balance workload simulation. Will need to
        consider adding more queues in the future.
      – Change frcwrtratio from 1 to *none and specify Seqonly *no to reduce dasd wait time
        and to reduce contention when multiple jobs force a file to disk
      – Other application 'clean up' activities
           set index size to *max1tb for a few files
           set reuse deleted to *yes
36                                                                                     © 2012 IBM Corporation




                                                                                                                18
IBM Power Systems


Before tuning
2000 tps, cpu utilization 20%




37                              © 2012 IBM Corporation




IBM Power Systems


After tuning
6139 tps, cpu utilization 40%

     Chart coming




38                              © 2012 IBM Corporation




                                                         19
IBM Power Systems


End of Day Batch – scalability testing and tuning for growth
     2X, 3X, 4X growth scenarios, some results:

       2X – End of Day          2X – End of Month       4X – End of Day     4X – End of Month


       Power 6, 10 cores        Power 6, 10 cores       Power 7, 32 cores   Power 7, 32 cores
       220gb memory             220gb memory            586gb memory        586gb memory
                                                        Appl and system     Appl and system
                                                        tuning changes      tuning changes
       5 hrs 6 min              12 hrs 23 min           < 4 hrs             7 hrs 4 min


     Changes made during Performance and Scalability Test
      – Application changes:
           Reduced full opens of db files
           Increased parallelism in batch stream
           Used setobjacc to load key files in memory
      – System and application settings
           Turn on Journal cache
           Re-use deleted records to take advantage of concurrent write
           DB2 fix to reduce impact of seize when records added to file
           Increased SMAPP recovery threshold

39                                                                                        © 2012 IBM Corporation




IBM Power Systems


2X – EOM
Before adding hardware and tuning changes




40                                                                                        © 2012 IBM Corporation




                                                                                                                   20
IBM Power Systems


4X - EOM
Additional cpu and memory, tuning changes




41                                          © 2012 IBM Corporation




IBM Power Systems


2X – EOD
Before adding hardware and tuning changes




42                                          © 2012 IBM Corporation




                                                                     21
IBM Power Systems


4X - EOD
Additional cpu and memory, tuning changes




43                                                                         © 2012 IBM Corporation




IBM Power Systems


Application and System Tuning to achieve SLAs for quarterly pricing
update
     Results:
      – Increased number of cpu cores by 50%
      – Tuned application and system
      – 86% improvement in throughput

     Tuning changes made during Performance and Scalability Test
      – Application changes:
           Reduce date conversions
           Change random to sequential IO
      – System and application settings
           Tuned/reduced the number of concurrent jobs
           Tuned memory settings
           Apply PTF to take advantage of skip-locked-row sql processing




44                                                                         © 2012 IBM Corporation




                                                                                                    22
IBM Power Systems


Before tuning application, system and adding hardware
8.7 million updates per hour




45                                                        © 2012 IBM Corporation




IBM Power Systems


After application and system tuning changes, adding cpu
16.2 million updates per hour




46                                                        © 2012 IBM Corporation




                                                                                   23
IBM Power Systems


Tuning applications and system for best performance


     Memory pools, activity level, expert cache, SETOBJACC, etc.
      – Proper memory pool sizing and tuning
           Move all available memory to the batch pool, or
           Use QPFRADJ(2 or 3)
           Make sure memory pool is enabled for expert cache

     Application tuning, e.g. SEQONLY((*YES nbrof rcds)), NBRRCDS(nbrofrcds)
       – SEQONLY is for program buffer (for input only, for output only)
       – NBRRCDS is used for SLIC buffer
       – Try to set as large as possible, e.g. 128KB or even 256KB

     SQL vs native I/O
      – SQL can take advantage of multithreading in IBM i, e.g. parallel ADD, large page support, etc.

     Journal caching
       – Reduces write waits and number of I/Os significantly
       – IBM i option- fee

     DB2 SMP
      – Enables many multithreaded features for the DB, e.g. index maintenance, index builds
      – IBM i option - fee


47                                                                                                  © 2012 IBM Corporation




IBM Power Systems


Hardware configuration optimization
     # of processors allocated, # of VPs
       – Ensure you always have enough CPU capacity for the complete run, e.g. <100%

     SSD – Solid Sate Drives vs HDDs performance characteristics and when to use
       – Use SSDs for select DB objects or hot pages traced from previous batch runs
       – Use the QIBMSSD tool to determine if SSDs will help,
       – use TRCASPBAL to trace and collect information of disk usage
       – And STRASPBAL to move hot objects onto SSDs, or back to HDDs
       – Investigate hot data and
       – Run TRCASPBAL and STRASPBAL *HSM (6.1.1) to move hot data to SSDs
            Run STRASPBAL *SSSM (7.1) to move hot to SSDs + move cold data back to HDDs
            Move complete files or libraries with CRTxF or CHGxF or CREATE/ALTER TABLE/INDEX
                  CRTLIB ASP(ssd-ASP), CREATE SCHEMA x IN ASP n

     Proper disk arm and disk IOA caching options
       – The more disk adaptors, the better
       – The more cache, the better
       – RAID5 typically provides better write performance
       – Use the WLE tool to actually use collected disk performance data and estimate most performance optimal
          disk configuration

     Disk adapter cache for read and write ops
       – The larger the cache, the better the performance

     Power and energy saving options vs Performance options
       – Watch out for energy saving options set in HMC or Active Energy Manager – AEM
       – Can significantly influence performance
48                                                                                                  © 2012 IBM Corporation




                                                                                                                             24
IBM Power Systems


 Energy Saving 101
                                                              If I have 8 lights (i.e. Cores)
                                                              that produce 8000 lumens, and I only need 4000 …

  My options include …

       - Turn some off … Cease clocking of some cores (OS-based)
            - Nap cores (End clocking core, cache remains active)
                   - OS returns unused cores to hypervisor for napping.
            - Sleep chiplets (End clocking core and cache)
                   - Sleep used if cores not licensed or if not used for an extended period.

       - Dim them … Slow the Frequency (TPMD-based)
            - Dynamic Power Save option in Systems Director.



Frequency Adjusting Modes … Three options:
- Static Power Save …. Frequency Lowered by Fixed Percentage below nominal to save power.
- Dynamic Power Save - Normal
        Variably adjusts frequency to reduce power. Frequency returns to nominal when required by processor utilization.
- Dynamic Power Save – Favor Performance
        Same as “Normal”, but frequency can reach a super-nominal frequency if power/cooling limits allow.


  49                                                                                                             © 2012 IBM Corporation




  IBM Power Systems


  Summary and references


       Performance of POWER6 and POWER7 systems are excellent – take
       advantage

       Some applications may not be able to take advantage of the POWER6 or
       POWER7 architecture features
        – Check out best practices and apply system and application tuning
          techniques

       IBM can help Clients, BP and ISVs to analyze and optimize their solutions
       to take advantage of POWER6 and POWER7
         – Let me know if you have a question or need some help with performance
               Send an email to: stgls@us.ibm.com



  50                                                                                                             © 2012 IBM Corporation




                                                                                                                                          25
IBM Power Systems




     Questions




51                       © 2012 IBM Corporation




IBM Power Systems


Requirements



If you have items your
business requires,
Submit a
requirement!


Help IBM understand
how we can help you.




52                       © 2012 IBM Corporation




                                                  26
IBM Power Systems




53                  © 2012 IBM Corporation




IBM Power Systems




54                  © 2012 IBM Corporation




                                             27
IBM Power Systems


Performance and Scalability Services
     The IBM i Performance and Scalability Services Center can provide facilities and hardware
     IN ROCHESTER to assist you in testing hardware or software changes

           “Traditional” benchmarks
           Release-to-release upgrades
           Assess application performance when migrating to a new release of IBM I
           Stress test your system
           Determine impact of application changes
           Proofs of Concept (e.g. HA alternatives; SSD analysis, external storage, etc.)
           Evaluate application scalability
           Capacity planning

     … all with the availability of Lab Services IBM i experts and development personnel


     To request any of these services, submit at:

              http://www-03.ibm.com/systems/services/labservices/psscontact.html

55                                                                                              © 2012 IBM Corporation




IBM Power Systems



IBM Systems Lab Services and Training
                                                                                   Mainframe Systems

 Our Mission and Profile
     Support the IBM Systems Agenda and accelerate the adoption of new
                                                                                      Power Systems
     products and solutions
     Maximize performance of our clients’ existing IBM systems
     Deliver technical training, conferences, and other services tailored to     System x & Bladecenter
     meet client needs
     Team with IBM Service Providers to optimize the deployment of IBM
     solutions (GTS, GBS, SWG Lab Services and our IBM Business                      System Storage
     Partners)

 Our Competitive Advantage
                                                                               IT Infrastructure Optimization
     Leverage relationships with the IBM development labs to build deep
     technical skills and exploit the expertise of our developers
     Combined expertise of Lab Services and the Training for Systems team
                                                                                   Data Center Services
     Skills can be deployed worldwide to assure all client needs can be met

                         Successful worldwide history:
                                                                                    Training Services
           17 years in Americas, 9 years in Europe/Middle East/Africa,
                             5 years in Asia Pacific


     www.ibm.com/systems/services/labservices           stgls@us.ibm.com
56                                                                                              © 2012 IBM Corporation




                                                                                                                         28
IBM Power Systems


Special notices
   This document was developed for IBM offerings in the United States as of the date of publication. IBM may not make these offerings available in
   other countries, and the information is subject to change without notice. Consult your local IBM business contact for information on the IBM
   offerings available in your area.
   Information in this document concerning non-IBM products was obtained from the suppliers of these products or other public sources. Questions
   on the capabilities of non-IBM products should be addressed to the suppliers of those products.
   IBM may have patents or pending patent applications covering subject matter in this document. The furnishing of this document does not give
   you any license to these patents. Send license inquires, in writing, to IBM Director of Licensing, IBM Corporation, New Castle Drive, Armonk, NY
   10504-1785 USA.
   All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives
   only.
   The information contained in this document has not been submitted to any formal IBM test and is provided "AS IS" with no warranties or
   guarantees either expressed or implied.
   All examples cited or described in this document are presented as illustrations of the manner in which some IBM products can be used and the
   results that may be achieved. Actual environmental costs and performance characteristics will vary depending on individual client configurations
   and conditions.
   IBM Global Financing offerings are provided through IBM Credit Corporation in the United States and other IBM subsidiaries and divisions
   worldwide to qualified commercial and government clients. Rates are based on a client's credit rating, financing terms, offering type, equipment
   type and options, and may vary by country. Other restrictions may apply. Rates and offerings are subject to change, extension or withdrawal
   without notice.
   IBM is not responsible for printing errors in this document that result in pricing or information inaccuracies.
   All prices shown are IBM's United States suggested list prices and are subject to change without notice; reseller prices may vary.
   IBM hardware products are manufactured from new parts, or new and serviceable used parts. Regardless, our warranty terms apply.
   Any performance data contained in this document was determined in a controlled environment. Actual results may vary significantly and are
   dependent on many factors including system hardware configuration and software design and configuration. Some measurements quoted in this
   document may have been made on development-level systems. There is no guarantee these measurements will be the same on generally-
   available systems. Some measurements quoted in this document may have been estimated through extrapolation. Users of this document
   should verify the applicable data for their specific environment.



                                                                                                                                               Revised September 26, 2006


57                                                                                                                                                          © 2012 IBM Corporation




IBM Power Systems


Special notices (cont.)
IBM, the IBM logo, ibm.com AIX, AIX (logo), AIX 5L, AIX 6 (logo), AS/400, BladeCenter, Blue Gene, ClusterProven, DB2, ESCON, i5/OS, i5/OS (logo), IBM Business
Partner (logo), IntelliStation, LoadLeveler, Lotus, Lotus Notes, Notes, Operating System/400, OS/400, PartnerLink, PartnerWorld, PowerPC, pSeries, Rational, RISC
System/6000, RS/6000, THINK, Tivoli, Tivoli (logo), Tivoli Management Environment, WebSphere, xSeries, z/OS, zSeries, Active Memory, Balanced Warehouse,
CacheFlow, Cool Blue, IBM Systems Director VMControl, pureScale, TurboCore, Chiphopper, Cloudscape, DB2 Universal Database, DS4000, DS6000, DS8000,
EnergyScale, Enterprise Workload Manager, General Parallel File System, , GPFS, HACMP, HACMP/6000, HASM, IBM Systems Director Active Energy Manager,
iSeries, Micro-Partitioning, POWER, PowerExecutive, PowerVM, PowerVM (logo), PowerHA, Power Architecture, Power Everywhere, Power Family, POWER Hypervisor,
Power Systems, Power Systems (logo), Power Systems Software, Power Systems Software (logo), POWER2, POWER3, POWER4, POWER4+, POWER5, POWER5+,
POWER6, POWER6+, POWER7, System i, System p, System p5, System Storage, System z, TME 10, Workload Partitions Manager and X-Architecture are trademarks
or registered trademarks of International Business Machines Corporation in the United States, other countries, or both. If these and other IBM trademarked terms are
marked on their first occurrence in this information with a trademark symbol (® or ™), these symbols indicate U.S. registered or common law trademarks owned by IBM at
the time this information was published. Such trademarks may also be registered or common law trademarks in other countries.

A full list of U.S. trademarks owned by IBM may be found at: http://www.ibm.com/legal/copytrade.shtml.

Adobe, the Adobe logo, PostScript, and the PostScript logo are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States, and/or
other countries.
AltiVec is a trademark of Freescale Semiconductor, Inc.
AMD Opteron is a trademark of Advanced Micro Devices, Inc.
InfiniBand, InfiniBand Trade Association and the InfiniBand design marks are trademarks and/or service marks of the InfiniBand Trade Association.
Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered
trademarks of Intel Corporation or its subsidiaries in the United States and other countries.
IT Infrastructure Library is a registered trademark of the Central Computer and Telecommunications Agency which is now part of the Office of Government Commerce.
Java and all Java-based trademarks and logos are trademarks or registered trademarks of Oracle and/or its affiliates.
Linear Tape-Open, LTO, the LTO Logo, Ultrium, and the Ultrium logo are trademarks of HP, IBM Corp. and Quantum in the U.S. and other countries.
Linux is a registered trademark of Linus Torvalds in the United States, other countries or both.
Microsoft, Windows and the Windows logo are registered trademarks of Microsoft Corporation in the United States, other countries or both.
NetBench is a registered trademark of Ziff Davis Media in the United States, other countries or both.
SPECint, SPECfp, SPECjbb, SPECweb, SPECjAppServer, SPEC OMP, SPECviewperf, SPECapc, SPEChpc, SPECjvm, SPECmail, SPECimap and SPECsfs are
trademarks of the Standard Performance Evaluation Corp (SPEC).
The Power Architecture and Power.org wordmarks and the Power and Power.org logos and related marks are trademarks and service marks licensed by Power.org.
TPC-C and TPC-H are trademarks of the Transaction Performance Processing Council (TPPC).
UNIX is a registered trademark of The Open Group in the United States, other countries or both.

Other company, product and service names may be trademarks or service marks of others.
                                                                                                                                               Revised December 2, 2010


58                                                                                                                                                          © 2012 IBM Corporation




                                                                                                                                                                                     29
IBM Power Systems


Notes on benchmarks and values
The IBM benchmarks results shown herein were derived using particular, well configured, development-level and generally-available computer systems. Buyers should
consult other sources of information to evaluate the performance of systems they are considering buying and should consider conducting application oriented testing. For
additional information about the benchmarks, values and systems tested, contact your local IBM office or IBM authorized reseller or access the Web site of the benchmark
consortium or benchmark vendor.

IBM benchmark results can be found in the IBM Power Systems Performance Report at http://www.ibm.com/systems/p/hardware/system_perf.html .

All performance measurements were made with AIX or AIX 5L operating systems unless otherwise indicated to have used Linux. For new and upgraded systems, the latest
versions of AIX were used. All other systems used previous versions of AIX. The SPEC CPU2006, LINPACK, and Technical Computing benchmarks were compiled using
IBM's high performance C, C++, and FORTRAN compilers for AIX 5L and Linux. For new and upgraded systems, the latest versions of these compilers were used: XL C for
AIX v11.1, XL C/C++ for AIX v11.1, XL FORTRAN for AIX v13.1, XL C/C++ for Linux v11.1, and XL FORTRAN for Linux v13.1.


For a definition/explanation of each benchmark and the full list of detailed results, visit the Web site of the benchmark consortium or benchmark vendor.

TPC                             http://www.tpc.org
SPEC                            http://www.spec.org
LINPACK                         http://www.netlib.org/benchmark/performance.pdf
Pro/E                           http://www.proe.com
GPC                             http://www.spec.org/gpc
VolanoMark                      http://www.volano.com
STREAM                          http://www.cs.virginia.edu/stream/
SAP                             http://www.sap.com/benchmark/
Oracle, Siebel, PeopleSoft      http://www.oracle.com/apps_benchmark/
Baan                            http://www.ssaglobal.com
Fluent                          http://www.fluent.com/software/fluent/index.htm
TOP500 Supercomputers           http://www.top500.org/
Ideas International             http://www.ideasinternational.com/benchmark/bench.html
Storage Performance Council     http://www.storageperformance.org/results




                                                                                                                                          Revised December 2, 2010


59                                                                                                                                                     © 2012 IBM Corporation




IBM Power Systems


Notes on HPC benchmarks and values
The IBM benchmarks results shown herein were derived using particular, well configured, development-level and generally-available computer systems. Buyers should
consult other sources of information to evaluate the performance of systems they are considering buying and should consider conducting application oriented testing. For
additional information about the benchmarks, values and systems tested, contact your local IBM office or IBM authorized reseller or access the Web site of the benchmark
consortium or benchmark vendor.

IBM benchmark results can be found in the IBM Power Systems Performance Report at http://www.ibm.com/systems/p/hardware/system_perf.html .

All performance measurements were made with AIX or AIX 5L operating systems unless otherwise indicated to have used Linux. For new and upgraded systems, the latest
versions of AIX were used. All other systems used previous versions of AIX. The SPEC CPU2006, LINPACK, and Technical Computing benchmarks were compiled using
IBM's high performance C, C++, and FORTRAN compilers for AIX 5L and Linux. For new and upgraded systems, the latest versions of these compilers were used: XL C for
AIX v11.1, XL C/C++ for AIX v11.1, XL FORTRAN for AIX v13.1, XL C/C++ for Linux v11.1, and XL FORTRAN for Linux v13.1. Linpack HPC (Highly Parallel Computing)
used the current versions of the IBM Engineering and Scientific Subroutine Library (ESSL). For Power7 systems, IBM Engineering and Scientific Subroutine Library (ESSL)
for AIX Version 5.1 and IBM Engineering and Scientific Subroutine Library (ESSL) for Linux Version 5.1 were used.


For a definition/explanation of each benchmark and the full list of detailed results, visit the Web site of the benchmark consortium or benchmark vendor.
SPEC                              http://www.spec.org
LINPACK                           http://www.netlib.org/benchmark/performance.pdf
Pro/E                             http://www.proe.com
GPC                               http://www.spec.org/gpc
STREAM                            http://www.cs.virginia.edu/stream/
Fluent                            http://www.fluent.com/software/fluent/index.htm
TOP500 Supercomputers             http://www.top500.org/
AMBER                             http://amber.scripps.edu/
FLUENT                            http://www.fluent.com/software/fluent/fl5bench/index.htm
GAMESS                            http://www.msg.chem.iastate.edu/gamess
GAUSSIAN                          http://www.gaussian.com
ANSYS                             http://www.ansys.com/services/hardware-support-db.htm
                                  Click on the "Benchmarks" icon on the left hand side frame to expand. Click on "Benchmark Results in a Table" icon for benchmark results.
ABAQUS                            http://www.simulia.com/support/v68/v68_performance.php
ECLIPSE                           http://www.sis.slb.com/content/software/simulation/index.asp?seg=geoquest&
MM5                               http://www.mmm.ucar.edu/mm5/
MSC.NASTRAN                       http://www.mscsoftware.com/support/prod%5Fsupport/nastran/performance/v04_sngl.cfm
STAR-CD                           www.cd-adapco.com/products/STAR-CD/performance/320/index/html
NAMD                              http://www.ks.uiuc.edu/Research/namd
HMMER                             http://hmmer.janelia.org/                                                                                 Revised December 2, 2010
                                  http://powerdev.osuosl.org/project/hmmerAltivecGen2mod

60                                                                                                                                                     © 2012 IBM Corporation




                                                                                                                                                                                30
IBM Power Systems


Notes on performance estimates
         rPerf for AIX

         rPerf (Relative Performance) is an estimate of commercial processing performance relative to other IBM UNIX systems. It is derived from an IBM analytical
            model which uses characteristics from IBM internal workloads, TPC and SPEC benchmarks. The rPerf model is not intended to represent any specific
            public benchmark results and should not be reasonably used in that way. The model simulates some of the system operations such as CPU, cache and
            memory. However, the model does not simulate disk or network I/O operations.

              rPerf estimates are calculated based on systems with the latest levels of AIX and other pertinent software at the time of system announcement. Actual
              performance will vary based on application and configuration specifics. The IBM eServer pSeries 640 is the baseline reference system and has a value
              of 1.0. Although rPerf may be used to approximate relative IBM UNIX commercial processing performance, actual system performance may vary and is
              dependent upon many factors including system hardware configuration and software design and configuration. Note that the rPerf methodology used for
              the POWER6 systems is identical to that used for the POWER5 systems. Variations in incremental system performance may be observed in commercial
              workloads due to changes in the underlying system architecture.

         All performance estimates are provided "AS IS" and no warranties or guarantees are expressed or implied by IBM. Buyers should consult other sources of
             information, including system benchmarks, and application sizing guides to evaluate the performance of a system they are considering buying. For
             additional information about rPerf, contact your local IBM office or IBM authorized reseller.

         ========================================================================

         CPW for IBM i

         Commercial Processing Workload (CPW) is a relative measure of performance of processors running the IBM i operating system. Performance in customer
           environments may vary. The value is based on maximum configurations. More performance information is available in the Performance Capabilities
           Reference at: www.ibm.com/systems/i/solutions/perfmgmt/resource.html




                                                                                                                                                                                       Revised April 2, 2007


61                                                                                                                                                                                                      © 2012 IBM Corporation




IBM Power Systems

Trademarks
 The following are trademarks of the International Business Machines Corporation in the United States, other countries, or both.

     Not all common law marks used by IBM are listed on this page. Failure of a mark to appear does not mean that IBM does not use the mark nor does it mean that the product is not actively marketed or is not
     significant within its relevant market.
     Those trademarks followed by ® are registered trademarks of IBM in the United States; all others are trademarks or common law marks of IBM in the United States.




      For a complete list of IBM Trademarks, see www.ibm.com/legal/copytrade.shtml:
      *, AS/400®, e business(logo)®, DBE, ESCO, eServer, FICON, IBM®, IBM (logo)®, iSeries®, MVS, OS/390®, pSeries®, RS/6000®, S/30, VM/ESA®, VSE/ESA, WebSphere®, xSeries®,
      z/OS®, zSeries®, z/VM®, System i, System i5, System p, System p5, System x, System z, System z9®, BladeCenter®




 The following are trademarks or registered trademarks of other companies.

     Adobe, the Adobe logo, PostScript, and the PostScript logo are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States, and/or other countries.
     Cell Broadband Engine is a trademark of Sony Computer Entertainment, Inc. in the United States, other countries, or both and is used under license therefrom.
     Java and all Java-based trademarks are trademarks of Sun Microsystems, Inc. in the United States, other countries, or both.
     Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both.
     Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries
     in the United States and other countries.
     UNIX is a registered trademark of The Open Group in the United States and other countries.
     Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both.
     ITIL is a registered trademark, and a registered community trademark of the Office of Government Commerce, and is registered in the U.S. Patent and Trademark Office.
     IT Infrastructure Library is a registered trademark of the Central Computer and Telecommunications Agency, which is now part of the Office of Government Commerce.



 * All other products may be trademarks or registered trademarks of their respective companies.

 Notes:
 Performance is in Internal Throughput Rate (ITR) ratio based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput that any user will
 experience will vary depending upon considerations such as the amount of multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the workload processed.
 Therefore, no assurance can be given that an individual user will achieve throughput improvements equivalent to the performance ratios stated here.
 IBM hardware products are manufactured from new parts, or new and serviceable used parts. Regardless, our warranty terms apply.
 All customer examples cited or described in this presentation are presented as illustrations of the manner in which some customers have used IBM products and the results they may have achieved. Actual
 environmental costs and performance characteristics will vary depending on individual customer configurations and conditions.
 This publication was produced in the United States. IBM may not offer the products, services or features discussed in this document in other countries, and the information may be subject to change without
 notice. Consult your local IBM business contact for information on the product or services available in your area.
 All statements regarding IBM's future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.
 Information about non-IBM products is obtained from the manufacturers of those products or their published announcements. IBM has not tested those products and cannot confirm the performance,
 compatibility, or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products.
 Prices subject to change without notice. Contact your IBM representative or Business Partner for the most current pricing in your geography.


62
62                                                                                                                                                                                                      © 2012 IBM Corporation




                                                                                                                                                                                                                                            31

Contenu connexe

En vedette

Las Generaciones En El Tiempo
Las Generaciones En El TiempoLas Generaciones En El Tiempo
Las Generaciones En El Tiempoguestfee961a
 
1000 1200 _stained_glass
1000 1200 _stained_glass1000 1200 _stained_glass
1000 1200 _stained_glassStefanie Pender
 
R3: Ruta Entrerrías Vilaboa-Pontevedra - bicicleando.es
R3: Ruta Entrerrías Vilaboa-Pontevedra - bicicleando.esR3: Ruta Entrerrías Vilaboa-Pontevedra - bicicleando.es
R3: Ruta Entrerrías Vilaboa-Pontevedra - bicicleando.esBicicleando
 
Análisis Urbano "Cono de Seguridad del Aeropuerto de Barquisimeto, Venezuela"
Análisis Urbano "Cono de Seguridad del Aeropuerto de Barquisimeto, Venezuela"Análisis Urbano "Cono de Seguridad del Aeropuerto de Barquisimeto, Venezuela"
Análisis Urbano "Cono de Seguridad del Aeropuerto de Barquisimeto, Venezuela"Scarlett Velasquez
 
Clase1 Fundamentos de Comunicaciones Electronicas
Clase1 Fundamentos de Comunicaciones ElectronicasClase1 Fundamentos de Comunicaciones Electronicas
Clase1 Fundamentos de Comunicaciones ElectronicasJohan Silva Cueva
 
Diagramas de Fase, Mecanica de Materiales
Diagramas de Fase, Mecanica de MaterialesDiagramas de Fase, Mecanica de Materiales
Diagramas de Fase, Mecanica de MaterialesNoble Drilling
 
Photochemical Micropatterning of Carbohydrates on a Surface
Photochemical Micropatterning of Carbohydrates on a SurfacePhotochemical Micropatterning of Carbohydrates on a Surface
Photochemical Micropatterning of Carbohydrates on a SurfaceGregory Carroll
 
Cuento Espectáculo "CALLES" Engrata Teatre
Cuento Espectáculo "CALLES" Engrata TeatreCuento Espectáculo "CALLES" Engrata Teatre
Cuento Espectáculo "CALLES" Engrata TeatreJose Angel Gomis
 
Collar de perlas negras definitivo
Collar de perlas negras definitivoCollar de perlas negras definitivo
Collar de perlas negras definitivoAlejandra Jara
 
PROYECTO EDUCATIVO DEL ÁREA DE INGLÉS 2010
PROYECTO EDUCATIVO DEL ÁREA DE INGLÉS 2010PROYECTO EDUCATIVO DEL ÁREA DE INGLÉS 2010
PROYECTO EDUCATIVO DEL ÁREA DE INGLÉS 2010It's English Time
 
Recopilatorio de gadgets
Recopilatorio de gadgetsRecopilatorio de gadgets
Recopilatorio de gadgetsxilefo56
 
How to be a Good Project Manager - Advice from top influencers in Project Man...
How to be a Good Project Manager - Advice from top influencers in Project Man...How to be a Good Project Manager - Advice from top influencers in Project Man...
How to be a Good Project Manager - Advice from top influencers in Project Man...Donnie MacNicol
 

En vedette (15)

Las Generaciones En El Tiempo
Las Generaciones En El TiempoLas Generaciones En El Tiempo
Las Generaciones En El Tiempo
 
1000 1200 _stained_glass
1000 1200 _stained_glass1000 1200 _stained_glass
1000 1200 _stained_glass
 
R3: Ruta Entrerrías Vilaboa-Pontevedra - bicicleando.es
R3: Ruta Entrerrías Vilaboa-Pontevedra - bicicleando.esR3: Ruta Entrerrías Vilaboa-Pontevedra - bicicleando.es
R3: Ruta Entrerrías Vilaboa-Pontevedra - bicicleando.es
 
Análisis Urbano "Cono de Seguridad del Aeropuerto de Barquisimeto, Venezuela"
Análisis Urbano "Cono de Seguridad del Aeropuerto de Barquisimeto, Venezuela"Análisis Urbano "Cono de Seguridad del Aeropuerto de Barquisimeto, Venezuela"
Análisis Urbano "Cono de Seguridad del Aeropuerto de Barquisimeto, Venezuela"
 
Clase1 Fundamentos de Comunicaciones Electronicas
Clase1 Fundamentos de Comunicaciones ElectronicasClase1 Fundamentos de Comunicaciones Electronicas
Clase1 Fundamentos de Comunicaciones Electronicas
 
Diagramas de Fase, Mecanica de Materiales
Diagramas de Fase, Mecanica de MaterialesDiagramas de Fase, Mecanica de Materiales
Diagramas de Fase, Mecanica de Materiales
 
Photochemical Micropatterning of Carbohydrates on a Surface
Photochemical Micropatterning of Carbohydrates on a SurfacePhotochemical Micropatterning of Carbohydrates on a Surface
Photochemical Micropatterning of Carbohydrates on a Surface
 
Cuento Espectáculo "CALLES" Engrata Teatre
Cuento Espectáculo "CALLES" Engrata TeatreCuento Espectáculo "CALLES" Engrata Teatre
Cuento Espectáculo "CALLES" Engrata Teatre
 
criteris
criteris criteris
criteris
 
Transbeneficios
TransbeneficiosTransbeneficios
Transbeneficios
 
Collar de perlas negras definitivo
Collar de perlas negras definitivoCollar de perlas negras definitivo
Collar de perlas negras definitivo
 
Lenguaje musical
Lenguaje musicalLenguaje musical
Lenguaje musical
 
PROYECTO EDUCATIVO DEL ÁREA DE INGLÉS 2010
PROYECTO EDUCATIVO DEL ÁREA DE INGLÉS 2010PROYECTO EDUCATIVO DEL ÁREA DE INGLÉS 2010
PROYECTO EDUCATIVO DEL ÁREA DE INGLÉS 2010
 
Recopilatorio de gadgets
Recopilatorio de gadgetsRecopilatorio de gadgets
Recopilatorio de gadgets
 
How to be a Good Project Manager - Advice from top influencers in Project Man...
How to be a Good Project Manager - Advice from top influencers in Project Man...How to be a Good Project Manager - Advice from top influencers in Project Man...
How to be a Good Project Manager - Advice from top influencers in Project Man...
 

Similaire à Application optimization with POWER 7

2013 02 08 annunci power 7 plus sito cta
2013 02 08 annunci power 7 plus sito cta2013 02 08 annunci power 7 plus sito cta
2013 02 08 annunci power 7 plus sito ctaLorenzo Corbetta
 
Sun sparc enterprise t5120 and t5220 servers technical presentation
Sun sparc enterprise t5120 and t5220 servers technical presentationSun sparc enterprise t5120 and t5220 servers technical presentation
Sun sparc enterprise t5120 and t5220 servers technical presentationxKinAnx
 
Dsmp Whitepaper Release 3
Dsmp Whitepaper Release 3Dsmp Whitepaper Release 3
Dsmp Whitepaper Release 3gelfstrom
 
Sun sparc enterprise t5140 and t5240 servers technical presentation
Sun sparc enterprise t5140 and t5240 servers technical presentationSun sparc enterprise t5140 and t5240 servers technical presentation
Sun sparc enterprise t5140 and t5240 servers technical presentationxKinAnx
 
05 2012 power_roadshow_software_on_power
05 2012 power_roadshow_software_on_power05 2012 power_roadshow_software_on_power
05 2012 power_roadshow_software_on_powerGennaro (Rino) Persico
 
Sun sparc enterprise t5440 server technical presentation
Sun sparc enterprise t5440 server technical presentationSun sparc enterprise t5440 server technical presentation
Sun sparc enterprise t5440 server technical presentationxKinAnx
 
IBM System x3850 X5 Technical Presenation abbrv.
IBM System x3850 X5 Technical Presenation abbrv.IBM System x3850 X5 Technical Presenation abbrv.
IBM System x3850 X5 Technical Presenation abbrv.meye0611
 
The Case for using MongoDB in Social Game - Animal Land
The Case for using MongoDB in Social Game - Animal LandThe Case for using MongoDB in Social Game - Animal Land
The Case for using MongoDB in Social Game - Animal LandMasakazu Matsushita
 
HPCMPUG2011 cray tutorial
HPCMPUG2011 cray tutorialHPCMPUG2011 cray tutorial
HPCMPUG2011 cray tutorialJeff Larkin
 
Infrastruttura Efficiente Di Sun E Amd -Virtualise with Confidence
Infrastruttura Efficiente Di Sun E Amd -Virtualise with ConfidenceInfrastruttura Efficiente Di Sun E Amd -Virtualise with Confidence
Infrastruttura Efficiente Di Sun E Amd -Virtualise with ConfidenceWalter Moriconi
 
ELCE 2011 - BZ - Embedded Linux Optimization Techniques - How Not To Be Slow
ELCE 2011 - BZ - Embedded Linux Optimization Techniques - How Not To Be SlowELCE 2011 - BZ - Embedded Linux Optimization Techniques - How Not To Be Slow
ELCE 2011 - BZ - Embedded Linux Optimization Techniques - How Not To Be SlowBenjamin Zores
 
I3 multicore processor
I3 multicore processorI3 multicore processor
I3 multicore processorAmol Barewar
 
In-Memory Computing: Myths and Facts
In-Memory Computing: Myths and FactsIn-Memory Computing: Myths and Facts
In-Memory Computing: Myths and FactsDATAVERSITY
 

Similaire à Application optimization with POWER 7 (20)

Linux on System z – performance update
Linux on System z – performance updateLinux on System z – performance update
Linux on System z – performance update
 
2013 02 08 annunci power 7 plus sito cta
2013 02 08 annunci power 7 plus sito cta2013 02 08 annunci power 7 plus sito cta
2013 02 08 annunci power 7 plus sito cta
 
Sun sparc enterprise t5120 and t5220 servers technical presentation
Sun sparc enterprise t5120 and t5220 servers technical presentationSun sparc enterprise t5120 and t5220 servers technical presentation
Sun sparc enterprise t5120 and t5220 servers technical presentation
 
Ibm power7
Ibm power7Ibm power7
Ibm power7
 
Blackfin optimization for performance
Blackfin optimization for performanceBlackfin optimization for performance
Blackfin optimization for performance
 
Ludden q3 2008_boston
Ludden q3 2008_bostonLudden q3 2008_boston
Ludden q3 2008_boston
 
Dsmp Whitepaper Release 3
Dsmp Whitepaper Release 3Dsmp Whitepaper Release 3
Dsmp Whitepaper Release 3
 
Sun sparc enterprise t5140 and t5240 servers technical presentation
Sun sparc enterprise t5140 and t5240 servers technical presentationSun sparc enterprise t5140 and t5240 servers technical presentation
Sun sparc enterprise t5140 and t5240 servers technical presentation
 
05 2012 power_roadshow_software_on_power
05 2012 power_roadshow_software_on_power05 2012 power_roadshow_software_on_power
05 2012 power_roadshow_software_on_power
 
Sun sparc enterprise t5440 server technical presentation
Sun sparc enterprise t5440 server technical presentationSun sparc enterprise t5440 server technical presentation
Sun sparc enterprise t5440 server technical presentation
 
IBM System x3850 X5 Technical Presenation abbrv.
IBM System x3850 X5 Technical Presenation abbrv.IBM System x3850 X5 Technical Presenation abbrv.
IBM System x3850 X5 Technical Presenation abbrv.
 
The Case for using MongoDB in Social Game - Animal Land
The Case for using MongoDB in Social Game - Animal LandThe Case for using MongoDB in Social Game - Animal Land
The Case for using MongoDB in Social Game - Animal Land
 
HPCMPUG2011 cray tutorial
HPCMPUG2011 cray tutorialHPCMPUG2011 cray tutorial
HPCMPUG2011 cray tutorial
 
Infrastruttura Efficiente Di Sun E Amd -Virtualise with Confidence
Infrastruttura Efficiente Di Sun E Amd -Virtualise with ConfidenceInfrastruttura Efficiente Di Sun E Amd -Virtualise with Confidence
Infrastruttura Efficiente Di Sun E Amd -Virtualise with Confidence
 
Sakar jain
Sakar jainSakar jain
Sakar jain
 
ELCE 2011 - BZ - Embedded Linux Optimization Techniques - How Not To Be Slow
ELCE 2011 - BZ - Embedded Linux Optimization Techniques - How Not To Be SlowELCE 2011 - BZ - Embedded Linux Optimization Techniques - How Not To Be Slow
ELCE 2011 - BZ - Embedded Linux Optimization Techniques - How Not To Be Slow
 
I3 multicore processor
I3 multicore processorI3 multicore processor
I3 multicore processor
 
I3
I3I3
I3
 
In-Memory Computing: Myths and Facts
In-Memory Computing: Myths and FactsIn-Memory Computing: Myths and Facts
In-Memory Computing: Myths and Facts
 
Mca ppt
Mca pptMca ppt
Mca ppt
 

Plus de COMMON Europe

What's New in WebSphere Application Server
What's New in WebSphere Application ServerWhat's New in WebSphere Application Server
What's New in WebSphere Application ServerCOMMON Europe
 
Compiling the Compiler
Compiling the CompilerCompiling the Compiler
Compiling the CompilerCOMMON Europe
 
Workload Groups overview updates
Workload Groups overview updatesWorkload Groups overview updates
Workload Groups overview updatesCOMMON Europe
 
Why i - Common Europe 2012
Why i - Common Europe 2012Why i - Common Europe 2012
Why i - Common Europe 2012COMMON Europe
 
The Ruby OpenSSL extension
The Ruby OpenSSL extensionThe Ruby OpenSSL extension
The Ruby OpenSSL extensionCOMMON Europe
 
Using Ruby on IBM i (i5/OS)
Using Ruby on IBM i (i5/OS)Using Ruby on IBM i (i5/OS)
Using Ruby on IBM i (i5/OS)COMMON Europe
 
IBM Systems Director Navigator for i
IBM Systems Director Navigator for iIBM Systems Director Navigator for i
IBM Systems Director Navigator for iCOMMON Europe
 
IBM i Trends & Directions Common Europe 2012
IBM i Trends & Directions Common Europe 2012IBM i Trends & Directions Common Europe 2012
IBM i Trends & Directions Common Europe 2012COMMON Europe
 
IBM i Technology Refreshes Overview 2012 06-04
IBM i Technology Refreshes Overview 2012 06-04IBM i Technology Refreshes Overview 2012 06-04
IBM i Technology Refreshes Overview 2012 06-04COMMON Europe
 
IBM i 7.1 & TRs CEC 2012
IBM i 7.1 & TRs CEC 2012IBM i 7.1 & TRs CEC 2012
IBM i 7.1 & TRs CEC 2012COMMON Europe
 
DB2 Web Query whats new
DB2 Web Query whats newDB2 Web Query whats new
DB2 Web Query whats newCOMMON Europe
 
Access client solutions overview
Access client solutions overviewAccess client solutions overview
Access client solutions overviewCOMMON Europe
 
What's new with Zend server
What's new with Zend serverWhat's new with Zend server
What's new with Zend serverCOMMON Europe
 
Php arrays for RPG programmers
Php arrays for RPG programmersPhp arrays for RPG programmers
Php arrays for RPG programmersCOMMON Europe
 
Open source report writing tools for IBM i Vienna 2012
Open source report writing tools for IBM i  Vienna 2012Open source report writing tools for IBM i  Vienna 2012
Open source report writing tools for IBM i Vienna 2012COMMON Europe
 
Moving 5.4 to 7.1 AB
Moving 5.4 to 7.1 ABMoving 5.4 to 7.1 AB
Moving 5.4 to 7.1 ABCOMMON Europe
 
Introduction to My SQL
Introduction to My SQLIntroduction to My SQL
Introduction to My SQLCOMMON Europe
 
IBM CEC 2012 Storage june 11, 2012
IBM CEC 2012 Storage june 11, 2012IBM CEC 2012 Storage june 11, 2012
IBM CEC 2012 Storage june 11, 2012COMMON Europe
 
Getting started with PHP on IBM i
Getting started with PHP on IBM iGetting started with PHP on IBM i
Getting started with PHP on IBM iCOMMON Europe
 

Plus de COMMON Europe (20)

What's New in WebSphere Application Server
What's New in WebSphere Application ServerWhat's New in WebSphere Application Server
What's New in WebSphere Application Server
 
Compiling the Compiler
Compiling the CompilerCompiling the Compiler
Compiling the Compiler
 
Workload Groups overview updates
Workload Groups overview updatesWorkload Groups overview updates
Workload Groups overview updates
 
Why i - Common Europe 2012
Why i - Common Europe 2012Why i - Common Europe 2012
Why i - Common Europe 2012
 
The Ruby OpenSSL extension
The Ruby OpenSSL extensionThe Ruby OpenSSL extension
The Ruby OpenSSL extension
 
Using Ruby on IBM i (i5/OS)
Using Ruby on IBM i (i5/OS)Using Ruby on IBM i (i5/OS)
Using Ruby on IBM i (i5/OS)
 
IBM Systems Director Navigator for i
IBM Systems Director Navigator for iIBM Systems Director Navigator for i
IBM Systems Director Navigator for i
 
IBM i Trends & Directions Common Europe 2012
IBM i Trends & Directions Common Europe 2012IBM i Trends & Directions Common Europe 2012
IBM i Trends & Directions Common Europe 2012
 
IBM i Technology Refreshes Overview 2012 06-04
IBM i Technology Refreshes Overview 2012 06-04IBM i Technology Refreshes Overview 2012 06-04
IBM i Technology Refreshes Overview 2012 06-04
 
IBM i 7.1 & TRs CEC 2012
IBM i 7.1 & TRs CEC 2012IBM i 7.1 & TRs CEC 2012
IBM i 7.1 & TRs CEC 2012
 
DB2 Web Query whats new
DB2 Web Query whats newDB2 Web Query whats new
DB2 Web Query whats new
 
Access client solutions overview
Access client solutions overviewAccess client solutions overview
Access client solutions overview
 
What's new with Zend server
What's new with Zend serverWhat's new with Zend server
What's new with Zend server
 
RPG investment
RPG investmentRPG investment
RPG investment
 
Php arrays for RPG programmers
Php arrays for RPG programmersPhp arrays for RPG programmers
Php arrays for RPG programmers
 
Open source report writing tools for IBM i Vienna 2012
Open source report writing tools for IBM i  Vienna 2012Open source report writing tools for IBM i  Vienna 2012
Open source report writing tools for IBM i Vienna 2012
 
Moving 5.4 to 7.1 AB
Moving 5.4 to 7.1 ABMoving 5.4 to 7.1 AB
Moving 5.4 to 7.1 AB
 
Introduction to My SQL
Introduction to My SQLIntroduction to My SQL
Introduction to My SQL
 
IBM CEC 2012 Storage june 11, 2012
IBM CEC 2012 Storage june 11, 2012IBM CEC 2012 Storage june 11, 2012
IBM CEC 2012 Storage june 11, 2012
 
Getting started with PHP on IBM i
Getting started with PHP on IBM iGetting started with PHP on IBM i
Getting started with PHP on IBM i
 

Dernier

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Dernier (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

Application optimization with POWER 7

  • 1. IBM Power Systems IBM i Application Optimization with POWER7 Gottfried Schimunek 3605 Highway 52 North Senior Architect Rochester, MN 55901 Application Design IBM STG Software Tel 507-253-2367 Development Fax 845-491-2347 Lab Services Gottfried@us.ibm.com IBM ISV Enablement © 2012 IBM Corporation IBM Power Systems Acknowledgements Many thanks to: – Helen Olson-Williams - Lab Services Performance and Scalability Services – Anil Madan - Consulting IT Specialist, Lab Services Performance – Charles Farrell - Lab Services Performance and Scalability Services – Mark Funk - Software Performance Analyst, Systems HW Development 2 © 2012 IBM Corporation 1
  • 2. IBM Power Systems Agenda POWER7 Architecture and Performance Features Application Performance Characteristics Performance data collection and analysis Tuning applications and system for best performance Hardware configuration optimization 3 © 2012 IBM Corporation IBM Power Systems Comparative Anatomy … POWER6 vs. POWER7 More on Cache: http://en.wikipedia.org/wiki/CPU_cache Memory DIMM Memory DIMM Memory DIMM Memory DIMM Core Core Core Core Core Core Core Core Core Core Memory DIMM Memory DIMM Memory DIMM Memory DIMM L2 L2 L2 L2 L2 L2 L2 L2 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB Memory DIMM L2 L2 Memory DIMM Cache Memory DIMM 4MB 4MB Memory DIMM Memory DIMM Cache Memory DIMM L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB Memory DIMM POWER6 Chip Memory DIMM POWER7 Chip Important points: L3 32MB shared “Multi-core” is not new. Cache – Fewer chips for same core count. POWER7 has 4x more cores per chip 32 2 2 32 MB Cores Cores MB Cache topology is very different. 8-core POWER6 – The latencies have improved with POWER7. (128 MB L3) Cache 32 2 2 32 – L3 data laterally cast out into neighbor L3. MB Cores Cores MB POWER7 has fewer memory DIMMs per core. 4 © 2012 IBM Corporation 2
  • 3. IBM Power Systems Systems Made From Multiple Chips All memory and contents of all active cache 750 accessible to all processors. • 1 – 4 Chips (Cache-Coherent SMP) • 8 or 6 physical cores per chip • 8 DIMM slots per chip All chiplets (cores and cache) are active (i.e., no CUOD) OS license count defines cores which execute instructions (i.e., a subset of active/physical cores). • Single partition … Cores packaged densely. • Multiple partitions … Each partition packaged densely if possible, partitions on separate chip’s cores per memory. • Cache Active only on OS-licensed chiplets … Energy Savings. Active chiplet count based on CUoD … 4, 8, 16, 32, 48, 64 (and 24 in “TurboCore”) A 770/780 drawer always has 2 chips … 16 physical cores if 8-core chips. OS licensed core count placed onto active chiplets. 770 and 780 • 1 to 4 drawers 16-core Systems: • 2 Chips per drawer • POWER6 … 4 drawers and 8 chips • 6 or 8 cores per chip • POWER7 770 … 1 drawer and 2 chips. • 8 DIMM slots per chip • POWER7 780 … 2 drawers and 4 chips. 5 © 2012 IBM Corporation IBM Power Systems Simultaneous Multi-Threading (SMT) – POWER Systems Time (cycles) SMT=OFF 11 cycles 11 cycles Thread 1 Thread 2 SMT=ON ... in this trivial example -With SMT: •Thread 1: 18% longer •Thread 2: 9% longer 13 cycles •69% throughput increase 12 cycles Amount of overlap/contention is environment dependent. Capacity is increased but individual thread execution time in CPU is slightly more when running concurrently – because of possible contention. 6 © 2012 IBM Corporation 3
  • 4. IBM Power Systems 4-WAY SMT (Simultaneous Multi-Threading) POWER5/6 have 2-Way SMT SMT4 Provides Still More Capacity. Up to 4 thread’s instruction streams executing in parallel on one core. • 4 threads share the same cache(s) and “pipes”. • SMT4 Performance enabled via Faster L2 cache. 2 • 8-core chip can support 32 threads. • 2-core POWER6 chip supports only 4 threads. 2 SMT works because other thread’s instructions execute during • Another thread’s cache miss delays. • When pipe stage not already used by another. Cores have a lot more capacity than is used by a single task. • SMT4 common to have 1.5-2X more capacity over ST. However, when core executing multiple threads, each thread’s execution speed is slower. • SMT4’s 2X capacity increase also means ½ throughput for individual thread. Cores automatically switch between ST, SMT2, SMT4. 7 © 2012 IBM Corporation IBM Power Systems More on Cache: http://en.wikipedia.org/wiki/CPU_cache Cache Design Enhanced… Lateral Cast-out Core Core POWER6 (In preparation for explaining “TurboCore”) Memory DIMM Fast L2 for rapid cache fills of L1s. L2 4MB L2 4MB Memory DIMM More Cache means Memory DIMM • Better Single-Threaded Performance As data read into L2, other data stored out to L3. L3 Memory DIMM • More System Capacity Called a “Cast-Out”. 32 MB • Acts to offset lower frequency. POWER7 To make room for cache fills, Memory DIMM block in L2 is “Cast-Out” to L3. Core Core Core Core Core Core Core Core (Data cast-out of L3 written to memory if changed.) Memory DIMM Memory DIMM In order to make room for block cast into L3, a block may be “Cast-Out” out to another L3. Memory DIMM L2 L2 L2 L2 L2 L2 L2 L2 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB This is most useful when other cores on chip Memory DIMM are temporarily or permanently idle. Memory DIMM L3 L3 L3 L3 L3 L3 L3 L3 Lateral Cast-Out exists on all chips by default, Memory DIMM 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB but only amongst licensed/active chiplets. Memory DIMM (Now onto “TurboCore”.) “Lateral Cast-Out”” 8 © 2012 IBM Corporation 4
  • 5. IBM Power Systems “TurboCore” More Cache And Higher Frequency. 16-way 770/780 Drawer A chip with 4 Active Cores can Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core more easily be cooled. L2 L2 L2 L2 L2 L2 L2 L2 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB L2 L2 L2 L2 L2 L2 L2 L2 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB This, in turn, allows a higher frequency. L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB ( 3.86 GHz 4.14 GHz ) 8-Chiplet chip has 8 L3 caches, so remaining 4 remain enabled. 8-way 780 TurboCore Drawer Best results expected in multi-partition Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 environment with partition’s packaged 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB within individual chips. L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB L3 4MB 16-way 780 TurboCore System 8-way 780 TurboCore Drawer In some workloads and partition placements, Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core cross-chip traffic can offset benefits of TurboCore. L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 L2 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB 256KB Example: L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 L3 16-core CPW … 4-chip TurboCore has slightly less throughput. 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 4MB 8-core CPW … 2-chip TurboCore has slightly more throughput. 9 © 2012 IBM Corporation IBM Power Systems The Conclusion Chart … Think of POWER7 Performance as having … • Better Core-to-Core Capacity Producing Desired Growth • Massive improvement in “socket” capacity …. 8 cores per chip • A lot more capacity within a package (card, drawer). • A lower frequency (than POWER6), but still better • Capacity improvements result from other improvements • Many single-threaded workloads execute better than POWER6. • Lower energy consumption per core • Even more energy savings if you exploit IBM Systems Director Active Energy Manager. 10 © 2012 IBM Corporation 5
  • 6. IBM Power Systems Application Performance Characteristics Batch performance characteristics; single vs multithread – Multithreaded applications can automatically take advantage of new POWER features Typically I/O bound – Single threaded applications will need some tuning Typically uses only a single core at a time (e.g. max 6% of a sixteen way) Typically I/O bound CPU or disk I/O bound? – Low overall CPU utilization, high disk utilization, lots of memory paging – Tuning techniques available to make an I/O bound job CPU bound Batch jobs waiting for something else? – Wait for I/Os, e.g. disk configuration # disk arms, disk speed, IOA, disk cache, etc. – Wait for processors, CPU queuing, higher priorities, virtualization, capping, etc. – Wait for record locks and seizes, DB design, application architecture – Wait for many other potential wait situations, hundreds of wait situations 11 © 2012 IBM Corporation IBM Power Systems Multithread vs single thread applications - Example Start of End-of-Day Process Job1 Job2 Job3 Job4 Job5 Single thread Job1 Job2 Job3 Job4 Job5 End of End-of-Day Process Backup Some applications cannot be run in multithread mode, e.g. data integrity, data dependency, etc Processor utilization in single threaded is typically below 20% Performance improvements in single threaded jobs are limited to one processor unless DB SMP is used 12 © 2012 IBM Corporation 6
  • 7. IBM Power Systems Single versus multi-threaded Application runtime Single-threaded A B C Timesingle-threaded Multi-threaded A • Can be multithreads in programs, or more likely multi jobs on IBM i B • In practice, finding these thread regions is not easy but so valuable for performance and being handled automatically at the CPU C hardware level Timemulti-threaded 13 © 2012 IBM Corporation IBM Power Systems How to multithread an application? Use DB2 SMP feature to parallelize I/O and index operations Use Holey inserts to improve concurrent ADDs Split input stream into multiple concurrent batch jobs so that each job works on separate parts of the input data From single thread: A B C To multiple threads: A B C A B C use iStream and BSTAR from Cinimex at http://www.cinimex.co.uk/products/bstar.html – Automatically multithreads applications w/o application programming required 14 © 2012 IBM Corporation 7
  • 8. IBM Power Systems Finding the optimal number of threads The first challenge is to evenly distribute work across threads. In this simple example, 8 threads are running concurrently, but the work is not evenly distributed CPU resources are under-utilized. Time (cycles) 15 © 2012 IBM Corporation IBM Power Systems Find the optimal number of threads On Power 7, for the highest overall capacity, number of concurrent threads = number of cores * 4 2 cores 8 concurrent threads For the best runtime for individual threads, number of concurrent threads = number of cores 2 cores 2 concurrent threads High priority work 16 © 2012 IBM Corporation 8
  • 9. IBM Power Systems Power 6 CPU Queuing Time -> Power 7 Dispatched CPU Time Power 6, 5.0 Ghz, 3 PUs, 3 Virtual Processor SMT 2 -> 6 threads dispatched Power 7, 3.5 Ghz, 2.5 PUs, 3 Virtual Processor SMT 4 -> 12 threads dispatched Note: It appears there are additional CPU resources available on Power 7. Keep in mind that thread execution time is longer when in SMT 4 mode. 17 © 2012 IBM Corporation IBM Power Systems How to improve I/O bound jobs? - make them CPU bound.. 100 Goal: Eliminate all or reduce most of the I/O waits 80 Use SETOBJACC for high volume accessed files and indexes 60 I/O 40 Avoid open/close of files, use RETRN vs SETON *LR in RPG CPU 20 Remove deleted records – RGZPFM 0 Use REUSEDLT(*YES) to enhance concurrent writes I/O bound CPU bound Optimize Load/ Copy/ Save processes by multithreading and/or other techniques Change random reads to sequential reads • Higher blocking for input/output only operations – SEQONLY((*YES #rcds)) >= 128KB Increase logical page size in DB operations • Avoid opening Output files as I-O files to take advantage of logical blocking. Eliminate FRCRATIO(1) • CRTLF PAGESIZ(64 – 512KB) Use Expert cache in job pools For High volume ADDs, use DB2 SMP for parallel index maintenance and use ALLOCATE(*YES) Use SQL to create DB rather than DDS If journaling – Use larger commit cycles – Use HA Journal Performance Enhancement feature (SS1 opt 42) – Reduce JRN entry size with RCVSIZOPT(*RMVINTENT) 18 © 2012 IBM Corporation 9
  • 10. IBM Power Systems Analyze Performance by collecting performance data What performance tools to use? – Collection services – 5 minutes interval for the complete run duration – Job Watcher – to identify wait components – PEX Analyzer with disk read/writes start/end ops – Disk Watcher to review disk and IO related performance 19 © 2012 IBM Corporation IBM Power Systems IBM Systems Director Navigator for i Performance, Investigate Data 20 © 2012 IBM Corporation 10
  • 11. IBM Power Systems Investigate Data Perspectives are a logical grouping of similar or related views that benefit from being rendered side-by-side for reference or context. Content Package is a set of perspectives that share a commonality (major theme). ! New Performance Explorer and 21 Health Indicators2012 IBM Corporation © IBM Power Systems Investigate Data – Select Collection The Collection boxes allow you to specify which collection you want to work with. Only collections valid for the type of chart you select will be displayed. 22 © 2012 IBM Corporation 11
  • 12. IBM Power Systems Resource Utilization Overview - Percentages Nice summary for general overall health: • CPU Utilization • Disk Utilization • Disk Busy 23 © 2012 IBM Corporation IBM Power Systems Resource Utilization Overview - Rates Nice summary for general overall health: • 5250 Transactions • I/Os per Second • Page Faults 24 © 2012 IBM Corporation 12
  • 13. IBM Power Systems CPU Utilization and Waits Overview 25 © 2012 IBM Corporation IBM Power Systems What is iDoctor? • Product developed by the Support Center to enable effective resolution of customer performance problems • Graphical analysis • Heavy focus on wait analysis • Main components – Job Watcher – detailed job data and analysis – PEX Analyzer - PEX data analysis – Disk Watcher 26 © 2012 IBM Corporation 13
  • 14. IBM Power Systems iDoctor tools (Job Watcher and PEX Analyzer) Take You … ………………….…from this level…………..………. CPU Q’ing Disk I/O Other ………………….…to this level…………..………. H Sync Read / User Pgm OS/400 Pgm MI Instr LIC CPU Assist Queuing Write DASD wait / Pending Fault Write e l Other COMM SeizesLocks Gates d Obj Op'n Op'n Pool Name Start/Stop type & (MS/AS) & Type Time length CPU Time DASD I/O Time Other Wait Time 27 © 2012 IBM Corporation IBM Power Systems Waits Overview – iDoctor Job Watcher 28 © 2012 IBM Corporation 14
  • 15. IBM Power Systems Waits Overview – iDoctor Job Watcher – Example Physical IO USER000001 29 © 2012 IBM Corporation IBM Power Systems Waits Overview – iDoctor Job Watcher – Example Physical IO USER000001 JOB1NUM001 JOB2NUM002 JOB3NUM003 30 © 2012 IBM Corporation 15
  • 16. IBM Power Systems Waits Overview – iDoctor Job Watcher – Objects Waited On USER000001 JOB2NUM002 File000001 MBR1 File000002 MBR1 File000003 MBR1 File000004 MBR1 File000005 MBR1 File000006 MBR1 File000007 MBR1 File000008 MBR1 File000009 MBR1 File000010 MBR1 31 © 2012 IBM Corporation IBM Power Systems Power Solid State Drive Processors Memory SSD Disk Very, very, Very, very, Very, very slow very, very, very fast comparatively very fast Fast 1,000,000 - < 10’s ns ~100 ns ~200,000 ns 8,000,000 ns Access Speed Sweet spots 1. Batch window reduction for disk bound applications You can cut up to 40-50% off window 2. Response time - transaction/data base for disk bound applications – Key points – -- A modest quantity of SSD can make a big difference – -- Biggest performance boost for random read workload 32 © 2012 IBM Corporation 16
  • 17. IBM Power Systems Mixed SSD + HDD – cold and hot data It is typical for data bases to have a large percentage of data which is infrequently used (“cold”) and a small percentage of data which is frequently used (“hot”) Hot data may be only 10-20% capacity, but represent 80-90% activity SSD offers best price performance when focused on “hot” data HDD offers best storage cost, so focus it on “cold” data …. sort of a hierarchal approach Cold May be able to use larger HDD and/or a larger % capacity used Hot Can run SSD closer to 100% capacity 33 © 2012 IBM Corporation IBM Power Systems Waits Overview – iDoctor Job Watcher – Before and After JOB2NUM002 USER000001 JOB2NUM002 USER000001 JOB2NUM002 JOB2NUM002 34 © 2012 IBM Corporation 17
  • 18. IBM Power Systems Practical experiences and other optimization examples from recent engagements…… 35 © 2012 IBM Corporation IBM Power Systems Application scalability and tuning engagement for Business Partner Results – Max tps increased from 2000 -> 6139, Changes made during Performance and Scalability Test – Created multiple copies of TR files to reduce contention (gate wait) when inserting records to the transaction files. – Pre-allocate log file to reduce seize contention when files extend. – Switch from varchar to fixed length field in log file since most of the accesses are the max length, to reduce seize contention on auxiliary space. – Turn on journal caching. – Increase jorecra threshold to reduce seize contention on index. – Contention on an application userqueue - balance workload simulation. Will need to consider adding more queues in the future. – Change frcwrtratio from 1 to *none and specify Seqonly *no to reduce dasd wait time and to reduce contention when multiple jobs force a file to disk – Other application 'clean up' activities set index size to *max1tb for a few files set reuse deleted to *yes 36 © 2012 IBM Corporation 18
  • 19. IBM Power Systems Before tuning 2000 tps, cpu utilization 20% 37 © 2012 IBM Corporation IBM Power Systems After tuning 6139 tps, cpu utilization 40% Chart coming 38 © 2012 IBM Corporation 19
  • 20. IBM Power Systems End of Day Batch – scalability testing and tuning for growth 2X, 3X, 4X growth scenarios, some results: 2X – End of Day 2X – End of Month 4X – End of Day 4X – End of Month Power 6, 10 cores Power 6, 10 cores Power 7, 32 cores Power 7, 32 cores 220gb memory 220gb memory 586gb memory 586gb memory Appl and system Appl and system tuning changes tuning changes 5 hrs 6 min 12 hrs 23 min < 4 hrs 7 hrs 4 min Changes made during Performance and Scalability Test – Application changes: Reduced full opens of db files Increased parallelism in batch stream Used setobjacc to load key files in memory – System and application settings Turn on Journal cache Re-use deleted records to take advantage of concurrent write DB2 fix to reduce impact of seize when records added to file Increased SMAPP recovery threshold 39 © 2012 IBM Corporation IBM Power Systems 2X – EOM Before adding hardware and tuning changes 40 © 2012 IBM Corporation 20
  • 21. IBM Power Systems 4X - EOM Additional cpu and memory, tuning changes 41 © 2012 IBM Corporation IBM Power Systems 2X – EOD Before adding hardware and tuning changes 42 © 2012 IBM Corporation 21
  • 22. IBM Power Systems 4X - EOD Additional cpu and memory, tuning changes 43 © 2012 IBM Corporation IBM Power Systems Application and System Tuning to achieve SLAs for quarterly pricing update Results: – Increased number of cpu cores by 50% – Tuned application and system – 86% improvement in throughput Tuning changes made during Performance and Scalability Test – Application changes: Reduce date conversions Change random to sequential IO – System and application settings Tuned/reduced the number of concurrent jobs Tuned memory settings Apply PTF to take advantage of skip-locked-row sql processing 44 © 2012 IBM Corporation 22
  • 23. IBM Power Systems Before tuning application, system and adding hardware 8.7 million updates per hour 45 © 2012 IBM Corporation IBM Power Systems After application and system tuning changes, adding cpu 16.2 million updates per hour 46 © 2012 IBM Corporation 23
  • 24. IBM Power Systems Tuning applications and system for best performance Memory pools, activity level, expert cache, SETOBJACC, etc. – Proper memory pool sizing and tuning Move all available memory to the batch pool, or Use QPFRADJ(2 or 3) Make sure memory pool is enabled for expert cache Application tuning, e.g. SEQONLY((*YES nbrof rcds)), NBRRCDS(nbrofrcds) – SEQONLY is for program buffer (for input only, for output only) – NBRRCDS is used for SLIC buffer – Try to set as large as possible, e.g. 128KB or even 256KB SQL vs native I/O – SQL can take advantage of multithreading in IBM i, e.g. parallel ADD, large page support, etc. Journal caching – Reduces write waits and number of I/Os significantly – IBM i option- fee DB2 SMP – Enables many multithreaded features for the DB, e.g. index maintenance, index builds – IBM i option - fee 47 © 2012 IBM Corporation IBM Power Systems Hardware configuration optimization # of processors allocated, # of VPs – Ensure you always have enough CPU capacity for the complete run, e.g. <100% SSD – Solid Sate Drives vs HDDs performance characteristics and when to use – Use SSDs for select DB objects or hot pages traced from previous batch runs – Use the QIBMSSD tool to determine if SSDs will help, – use TRCASPBAL to trace and collect information of disk usage – And STRASPBAL to move hot objects onto SSDs, or back to HDDs – Investigate hot data and – Run TRCASPBAL and STRASPBAL *HSM (6.1.1) to move hot data to SSDs Run STRASPBAL *SSSM (7.1) to move hot to SSDs + move cold data back to HDDs Move complete files or libraries with CRTxF or CHGxF or CREATE/ALTER TABLE/INDEX CRTLIB ASP(ssd-ASP), CREATE SCHEMA x IN ASP n Proper disk arm and disk IOA caching options – The more disk adaptors, the better – The more cache, the better – RAID5 typically provides better write performance – Use the WLE tool to actually use collected disk performance data and estimate most performance optimal disk configuration Disk adapter cache for read and write ops – The larger the cache, the better the performance Power and energy saving options vs Performance options – Watch out for energy saving options set in HMC or Active Energy Manager – AEM – Can significantly influence performance 48 © 2012 IBM Corporation 24
  • 25. IBM Power Systems Energy Saving 101 If I have 8 lights (i.e. Cores) that produce 8000 lumens, and I only need 4000 … My options include … - Turn some off … Cease clocking of some cores (OS-based) - Nap cores (End clocking core, cache remains active) - OS returns unused cores to hypervisor for napping. - Sleep chiplets (End clocking core and cache) - Sleep used if cores not licensed or if not used for an extended period. - Dim them … Slow the Frequency (TPMD-based) - Dynamic Power Save option in Systems Director. Frequency Adjusting Modes … Three options: - Static Power Save …. Frequency Lowered by Fixed Percentage below nominal to save power. - Dynamic Power Save - Normal Variably adjusts frequency to reduce power. Frequency returns to nominal when required by processor utilization. - Dynamic Power Save – Favor Performance Same as “Normal”, but frequency can reach a super-nominal frequency if power/cooling limits allow. 49 © 2012 IBM Corporation IBM Power Systems Summary and references Performance of POWER6 and POWER7 systems are excellent – take advantage Some applications may not be able to take advantage of the POWER6 or POWER7 architecture features – Check out best practices and apply system and application tuning techniques IBM can help Clients, BP and ISVs to analyze and optimize their solutions to take advantage of POWER6 and POWER7 – Let me know if you have a question or need some help with performance Send an email to: stgls@us.ibm.com 50 © 2012 IBM Corporation 25
  • 26. IBM Power Systems Questions 51 © 2012 IBM Corporation IBM Power Systems Requirements If you have items your business requires, Submit a requirement! Help IBM understand how we can help you. 52 © 2012 IBM Corporation 26
  • 27. IBM Power Systems 53 © 2012 IBM Corporation IBM Power Systems 54 © 2012 IBM Corporation 27
  • 28. IBM Power Systems Performance and Scalability Services The IBM i Performance and Scalability Services Center can provide facilities and hardware IN ROCHESTER to assist you in testing hardware or software changes “Traditional” benchmarks Release-to-release upgrades Assess application performance when migrating to a new release of IBM I Stress test your system Determine impact of application changes Proofs of Concept (e.g. HA alternatives; SSD analysis, external storage, etc.) Evaluate application scalability Capacity planning … all with the availability of Lab Services IBM i experts and development personnel To request any of these services, submit at: http://www-03.ibm.com/systems/services/labservices/psscontact.html 55 © 2012 IBM Corporation IBM Power Systems IBM Systems Lab Services and Training Mainframe Systems Our Mission and Profile Support the IBM Systems Agenda and accelerate the adoption of new Power Systems products and solutions Maximize performance of our clients’ existing IBM systems Deliver technical training, conferences, and other services tailored to System x & Bladecenter meet client needs Team with IBM Service Providers to optimize the deployment of IBM solutions (GTS, GBS, SWG Lab Services and our IBM Business System Storage Partners) Our Competitive Advantage IT Infrastructure Optimization Leverage relationships with the IBM development labs to build deep technical skills and exploit the expertise of our developers Combined expertise of Lab Services and the Training for Systems team Data Center Services Skills can be deployed worldwide to assure all client needs can be met Successful worldwide history: Training Services 17 years in Americas, 9 years in Europe/Middle East/Africa, 5 years in Asia Pacific www.ibm.com/systems/services/labservices stgls@us.ibm.com 56 © 2012 IBM Corporation 28
  • 29. IBM Power Systems Special notices This document was developed for IBM offerings in the United States as of the date of publication. IBM may not make these offerings available in other countries, and the information is subject to change without notice. Consult your local IBM business contact for information on the IBM offerings available in your area. Information in this document concerning non-IBM products was obtained from the suppliers of these products or other public sources. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. IBM may have patents or pending patent applications covering subject matter in this document. The furnishing of this document does not give you any license to these patents. Send license inquires, in writing, to IBM Director of Licensing, IBM Corporation, New Castle Drive, Armonk, NY 10504-1785 USA. All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only. The information contained in this document has not been submitted to any formal IBM test and is provided "AS IS" with no warranties or guarantees either expressed or implied. All examples cited or described in this document are presented as illustrations of the manner in which some IBM products can be used and the results that may be achieved. Actual environmental costs and performance characteristics will vary depending on individual client configurations and conditions. IBM Global Financing offerings are provided through IBM Credit Corporation in the United States and other IBM subsidiaries and divisions worldwide to qualified commercial and government clients. Rates are based on a client's credit rating, financing terms, offering type, equipment type and options, and may vary by country. Other restrictions may apply. Rates and offerings are subject to change, extension or withdrawal without notice. IBM is not responsible for printing errors in this document that result in pricing or information inaccuracies. All prices shown are IBM's United States suggested list prices and are subject to change without notice; reseller prices may vary. IBM hardware products are manufactured from new parts, or new and serviceable used parts. Regardless, our warranty terms apply. Any performance data contained in this document was determined in a controlled environment. Actual results may vary significantly and are dependent on many factors including system hardware configuration and software design and configuration. Some measurements quoted in this document may have been made on development-level systems. There is no guarantee these measurements will be the same on generally- available systems. Some measurements quoted in this document may have been estimated through extrapolation. Users of this document should verify the applicable data for their specific environment. Revised September 26, 2006 57 © 2012 IBM Corporation IBM Power Systems Special notices (cont.) IBM, the IBM logo, ibm.com AIX, AIX (logo), AIX 5L, AIX 6 (logo), AS/400, BladeCenter, Blue Gene, ClusterProven, DB2, ESCON, i5/OS, i5/OS (logo), IBM Business Partner (logo), IntelliStation, LoadLeveler, Lotus, Lotus Notes, Notes, Operating System/400, OS/400, PartnerLink, PartnerWorld, PowerPC, pSeries, Rational, RISC System/6000, RS/6000, THINK, Tivoli, Tivoli (logo), Tivoli Management Environment, WebSphere, xSeries, z/OS, zSeries, Active Memory, Balanced Warehouse, CacheFlow, Cool Blue, IBM Systems Director VMControl, pureScale, TurboCore, Chiphopper, Cloudscape, DB2 Universal Database, DS4000, DS6000, DS8000, EnergyScale, Enterprise Workload Manager, General Parallel File System, , GPFS, HACMP, HACMP/6000, HASM, IBM Systems Director Active Energy Manager, iSeries, Micro-Partitioning, POWER, PowerExecutive, PowerVM, PowerVM (logo), PowerHA, Power Architecture, Power Everywhere, Power Family, POWER Hypervisor, Power Systems, Power Systems (logo), Power Systems Software, Power Systems Software (logo), POWER2, POWER3, POWER4, POWER4+, POWER5, POWER5+, POWER6, POWER6+, POWER7, System i, System p, System p5, System Storage, System z, TME 10, Workload Partitions Manager and X-Architecture are trademarks or registered trademarks of International Business Machines Corporation in the United States, other countries, or both. If these and other IBM trademarked terms are marked on their first occurrence in this information with a trademark symbol (® or ™), these symbols indicate U.S. registered or common law trademarks owned by IBM at the time this information was published. Such trademarks may also be registered or common law trademarks in other countries. A full list of U.S. trademarks owned by IBM may be found at: http://www.ibm.com/legal/copytrade.shtml. Adobe, the Adobe logo, PostScript, and the PostScript logo are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States, and/or other countries. AltiVec is a trademark of Freescale Semiconductor, Inc. AMD Opteron is a trademark of Advanced Micro Devices, Inc. InfiniBand, InfiniBand Trade Association and the InfiniBand design marks are trademarks and/or service marks of the InfiniBand Trade Association. Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. IT Infrastructure Library is a registered trademark of the Central Computer and Telecommunications Agency which is now part of the Office of Government Commerce. Java and all Java-based trademarks and logos are trademarks or registered trademarks of Oracle and/or its affiliates. Linear Tape-Open, LTO, the LTO Logo, Ultrium, and the Ultrium logo are trademarks of HP, IBM Corp. and Quantum in the U.S. and other countries. Linux is a registered trademark of Linus Torvalds in the United States, other countries or both. Microsoft, Windows and the Windows logo are registered trademarks of Microsoft Corporation in the United States, other countries or both. NetBench is a registered trademark of Ziff Davis Media in the United States, other countries or both. SPECint, SPECfp, SPECjbb, SPECweb, SPECjAppServer, SPEC OMP, SPECviewperf, SPECapc, SPEChpc, SPECjvm, SPECmail, SPECimap and SPECsfs are trademarks of the Standard Performance Evaluation Corp (SPEC). The Power Architecture and Power.org wordmarks and the Power and Power.org logos and related marks are trademarks and service marks licensed by Power.org. TPC-C and TPC-H are trademarks of the Transaction Performance Processing Council (TPPC). UNIX is a registered trademark of The Open Group in the United States, other countries or both. Other company, product and service names may be trademarks or service marks of others. Revised December 2, 2010 58 © 2012 IBM Corporation 29
  • 30. IBM Power Systems Notes on benchmarks and values The IBM benchmarks results shown herein were derived using particular, well configured, development-level and generally-available computer systems. Buyers should consult other sources of information to evaluate the performance of systems they are considering buying and should consider conducting application oriented testing. For additional information about the benchmarks, values and systems tested, contact your local IBM office or IBM authorized reseller or access the Web site of the benchmark consortium or benchmark vendor. IBM benchmark results can be found in the IBM Power Systems Performance Report at http://www.ibm.com/systems/p/hardware/system_perf.html . All performance measurements were made with AIX or AIX 5L operating systems unless otherwise indicated to have used Linux. For new and upgraded systems, the latest versions of AIX were used. All other systems used previous versions of AIX. The SPEC CPU2006, LINPACK, and Technical Computing benchmarks were compiled using IBM's high performance C, C++, and FORTRAN compilers for AIX 5L and Linux. For new and upgraded systems, the latest versions of these compilers were used: XL C for AIX v11.1, XL C/C++ for AIX v11.1, XL FORTRAN for AIX v13.1, XL C/C++ for Linux v11.1, and XL FORTRAN for Linux v13.1. For a definition/explanation of each benchmark and the full list of detailed results, visit the Web site of the benchmark consortium or benchmark vendor. TPC http://www.tpc.org SPEC http://www.spec.org LINPACK http://www.netlib.org/benchmark/performance.pdf Pro/E http://www.proe.com GPC http://www.spec.org/gpc VolanoMark http://www.volano.com STREAM http://www.cs.virginia.edu/stream/ SAP http://www.sap.com/benchmark/ Oracle, Siebel, PeopleSoft http://www.oracle.com/apps_benchmark/ Baan http://www.ssaglobal.com Fluent http://www.fluent.com/software/fluent/index.htm TOP500 Supercomputers http://www.top500.org/ Ideas International http://www.ideasinternational.com/benchmark/bench.html Storage Performance Council http://www.storageperformance.org/results Revised December 2, 2010 59 © 2012 IBM Corporation IBM Power Systems Notes on HPC benchmarks and values The IBM benchmarks results shown herein were derived using particular, well configured, development-level and generally-available computer systems. Buyers should consult other sources of information to evaluate the performance of systems they are considering buying and should consider conducting application oriented testing. For additional information about the benchmarks, values and systems tested, contact your local IBM office or IBM authorized reseller or access the Web site of the benchmark consortium or benchmark vendor. IBM benchmark results can be found in the IBM Power Systems Performance Report at http://www.ibm.com/systems/p/hardware/system_perf.html . All performance measurements were made with AIX or AIX 5L operating systems unless otherwise indicated to have used Linux. For new and upgraded systems, the latest versions of AIX were used. All other systems used previous versions of AIX. The SPEC CPU2006, LINPACK, and Technical Computing benchmarks were compiled using IBM's high performance C, C++, and FORTRAN compilers for AIX 5L and Linux. For new and upgraded systems, the latest versions of these compilers were used: XL C for AIX v11.1, XL C/C++ for AIX v11.1, XL FORTRAN for AIX v13.1, XL C/C++ for Linux v11.1, and XL FORTRAN for Linux v13.1. Linpack HPC (Highly Parallel Computing) used the current versions of the IBM Engineering and Scientific Subroutine Library (ESSL). For Power7 systems, IBM Engineering and Scientific Subroutine Library (ESSL) for AIX Version 5.1 and IBM Engineering and Scientific Subroutine Library (ESSL) for Linux Version 5.1 were used. For a definition/explanation of each benchmark and the full list of detailed results, visit the Web site of the benchmark consortium or benchmark vendor. SPEC http://www.spec.org LINPACK http://www.netlib.org/benchmark/performance.pdf Pro/E http://www.proe.com GPC http://www.spec.org/gpc STREAM http://www.cs.virginia.edu/stream/ Fluent http://www.fluent.com/software/fluent/index.htm TOP500 Supercomputers http://www.top500.org/ AMBER http://amber.scripps.edu/ FLUENT http://www.fluent.com/software/fluent/fl5bench/index.htm GAMESS http://www.msg.chem.iastate.edu/gamess GAUSSIAN http://www.gaussian.com ANSYS http://www.ansys.com/services/hardware-support-db.htm Click on the "Benchmarks" icon on the left hand side frame to expand. Click on "Benchmark Results in a Table" icon for benchmark results. ABAQUS http://www.simulia.com/support/v68/v68_performance.php ECLIPSE http://www.sis.slb.com/content/software/simulation/index.asp?seg=geoquest& MM5 http://www.mmm.ucar.edu/mm5/ MSC.NASTRAN http://www.mscsoftware.com/support/prod%5Fsupport/nastran/performance/v04_sngl.cfm STAR-CD www.cd-adapco.com/products/STAR-CD/performance/320/index/html NAMD http://www.ks.uiuc.edu/Research/namd HMMER http://hmmer.janelia.org/ Revised December 2, 2010 http://powerdev.osuosl.org/project/hmmerAltivecGen2mod 60 © 2012 IBM Corporation 30
  • 31. IBM Power Systems Notes on performance estimates rPerf for AIX rPerf (Relative Performance) is an estimate of commercial processing performance relative to other IBM UNIX systems. It is derived from an IBM analytical model which uses characteristics from IBM internal workloads, TPC and SPEC benchmarks. The rPerf model is not intended to represent any specific public benchmark results and should not be reasonably used in that way. The model simulates some of the system operations such as CPU, cache and memory. However, the model does not simulate disk or network I/O operations. rPerf estimates are calculated based on systems with the latest levels of AIX and other pertinent software at the time of system announcement. Actual performance will vary based on application and configuration specifics. The IBM eServer pSeries 640 is the baseline reference system and has a value of 1.0. Although rPerf may be used to approximate relative IBM UNIX commercial processing performance, actual system performance may vary and is dependent upon many factors including system hardware configuration and software design and configuration. Note that the rPerf methodology used for the POWER6 systems is identical to that used for the POWER5 systems. Variations in incremental system performance may be observed in commercial workloads due to changes in the underlying system architecture. All performance estimates are provided "AS IS" and no warranties or guarantees are expressed or implied by IBM. Buyers should consult other sources of information, including system benchmarks, and application sizing guides to evaluate the performance of a system they are considering buying. For additional information about rPerf, contact your local IBM office or IBM authorized reseller. ======================================================================== CPW for IBM i Commercial Processing Workload (CPW) is a relative measure of performance of processors running the IBM i operating system. Performance in customer environments may vary. The value is based on maximum configurations. More performance information is available in the Performance Capabilities Reference at: www.ibm.com/systems/i/solutions/perfmgmt/resource.html Revised April 2, 2007 61 © 2012 IBM Corporation IBM Power Systems Trademarks The following are trademarks of the International Business Machines Corporation in the United States, other countries, or both. Not all common law marks used by IBM are listed on this page. Failure of a mark to appear does not mean that IBM does not use the mark nor does it mean that the product is not actively marketed or is not significant within its relevant market. Those trademarks followed by ® are registered trademarks of IBM in the United States; all others are trademarks or common law marks of IBM in the United States. For a complete list of IBM Trademarks, see www.ibm.com/legal/copytrade.shtml: *, AS/400®, e business(logo)®, DBE, ESCO, eServer, FICON, IBM®, IBM (logo)®, iSeries®, MVS, OS/390®, pSeries®, RS/6000®, S/30, VM/ESA®, VSE/ESA, WebSphere®, xSeries®, z/OS®, zSeries®, z/VM®, System i, System i5, System p, System p5, System x, System z, System z9®, BladeCenter® The following are trademarks or registered trademarks of other companies. Adobe, the Adobe logo, PostScript, and the PostScript logo are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States, and/or other countries. Cell Broadband Engine is a trademark of Sony Computer Entertainment, Inc. in the United States, other countries, or both and is used under license therefrom. Java and all Java-based trademarks are trademarks of Sun Microsystems, Inc. in the United States, other countries, or both. Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both. Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. UNIX is a registered trademark of The Open Group in the United States and other countries. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. ITIL is a registered trademark, and a registered community trademark of the Office of Government Commerce, and is registered in the U.S. Patent and Trademark Office. IT Infrastructure Library is a registered trademark of the Central Computer and Telecommunications Agency, which is now part of the Office of Government Commerce. * All other products may be trademarks or registered trademarks of their respective companies. Notes: Performance is in Internal Throughput Rate (ITR) ratio based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput that any user will experience will vary depending upon considerations such as the amount of multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve throughput improvements equivalent to the performance ratios stated here. IBM hardware products are manufactured from new parts, or new and serviceable used parts. Regardless, our warranty terms apply. All customer examples cited or described in this presentation are presented as illustrations of the manner in which some customers have used IBM products and the results they may have achieved. Actual environmental costs and performance characteristics will vary depending on individual customer configurations and conditions. This publication was produced in the United States. IBM may not offer the products, services or features discussed in this document in other countries, and the information may be subject to change without notice. Consult your local IBM business contact for information on the product or services available in your area. All statements regarding IBM's future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only. Information about non-IBM products is obtained from the manufacturers of those products or their published announcements. IBM has not tested those products and cannot confirm the performance, compatibility, or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. Prices subject to change without notice. Contact your IBM representative or Business Partner for the most current pricing in your geography. 62 62 © 2012 IBM Corporation 31