SlideShare une entreprise Scribd logo
1  sur  36
Télécharger pour lire hors ligne
™
                 mVerify
                 A Million Users in a Box ™



    ISSRE 2008 Trip Report
                   Presented to the
Chicago Software Process Improvement Network (CSPIN)
                   January 7, 2009

     Robert V. Binder     bob_binder@mverify.com

                        www.mVerify.com
Overview

Software Reliability Engineering (SRE) concepts

Case Study, SRE applied to a Trading System

ISSRE 2008 Highlights

Applying SRE

Q & A


                      © 2009 mVerify Corporation   2
What is SRE?

   Basic Concepts
   Predictive Reliability Modeling
   Profile-based Testing
   Release Management
   Corollary Techniques




                        © 2009 mVerify Corporation   3
Musa’s Observation

Testing driven by an operational profile is
very efficient because it identifies failures
(and hence the faults causing them) on
average, in order of how often they occur.

This approach rapidly increases reliability …
because the failures that occur most
frequently are caused by the faulty
operations used most frequently.


                                                 IEEE Software, March 1993

                    © 2009 mVerify Corporation                               4
Behavioral Focus
 Can a buggy system be reliable?
    It depends on how it is used
 Operational Profile
    A list of interactions, similar to “use cases”
    Relative frequency assigned to each operation
        Three ops: Op1: 60%, Op2, 30%, Op3, 10%.
        Represents expected field usage
    For a good reliability estimate
        Must include all field operations
        Accurate rank order critical, magnitude less so
 Test cases generated/applied accordingly
    Optimal allocation of testing resources, wrt reliability
    High criticality operations tested separately
    Can overlay most system testing process/technology
                                © 2009 mVerify Corporation      5
Reliability Arithmetic

 Reliability: probability of non-failure
 MTTR: mean time to
    recover, repair, restart …
 Availability: percent up-time
    Availability = 1 / 1 + (MTTR  Reliability)
    99.999% availability = 5 min unscheduled downtime
     per year
    “Five Nines”


                         © 2009 mVerify Corporation      6
Some Reliability Data Points
                        Reliability                  Availability,
                 (Failures/million hours)            6 min MTTR

NT 4.0 Desktop                    82,000             0.999000000

Windows 2K Server                 36,013             0.999640000

Common Light Bulb                     1,000          0.999990000

Stepstone OO Framework                           5   0.999999500

Telelabs Digital Cross Connect                  3    0.999999842


                        © 2009 mVerify Corporation                   7
Predictive Reliability Modeling

 Input
   Observed number of failures
   Elapsed time or Execution units
 Model
   Curve fitting
   Extensive research and application
   Many variations
 Output
   CASRE tool
   Estimated Field Failure Intensity (rate), Reliability

                          © 2009 mVerify Corporation        8
Reliability: Failures versus Time

Failure Rate       




Present
                                                                       R = e - t


Objective




             Present
                   Additional Time                                              Execution Time


                                     Lyu. Software Reliability Engineering: A Roadmap
                                          © 2009 mVerify Corporation                             9
Many Models




Broon et al. A New Statistical Software Reliability Tool
                                  © 2009 mVerify Corporation   10
Release Management

 FIO -- Failure Intensity Objective
    Maximum acceptable failure rate for release
    Estimated from failures observed in system test
    Application and industry specific
 FI -- Failure Intensity
    Rate of failures observed
 FI/FIO over time
    Reliability Growth curve
    At 1.0, release goal is achieved

                         © 2009 mVerify Corporation    11
Reliability Growth
            18
            16
                                           Conventional test
            14
            12
            10
FI/FIO
              8
              6
              4
                            Operational-profile-driven test
              2
              0
                  0        0.1          0.2                  0.3   0.4   0.5
                                        Millions of Calls
                                           Mcalls
John Musa. More Reliable Software Faster and Cheaper – An Overview

                                    © 2009 mVerify Corporation                 12
Corollary Techniques

 “Statistical” Testing Strategies
    Randomized selection of test inputs
    Randomized selection of test case values
 Orthogonal Defect Classification
    Process for collecting and analyzing defect data
    Designed to support reliability growth analysis
 Fault-Tolerant Architecture
    Large-grain strategies
    HW/SW focus

                         © 2009 mVerify Corporation     13
Promise of Profile-Based Testing

 Tester's point of view versus reliability analyst
    Maximize reliability within fixed budget
    Measurement of reliability not primary goal

 Profile-Based Testing is optimal when
    Available information already used
    Must allocate resources to test complex SUT

 Many significant practical obstacles


                         © 2009 mVerify Corporation   14
Trading System
 Case Study




     © 2009 mVerify Corporation   15
Case Study – Trading System

E-commerce/securities market, screen-based
 trading over private network

3 million transactions per hour (peak)

15 billion dollars per day (peak)




                      © 2009 mVerify Corporation   16
Profile-based Testing Approach

 Executable operational profile
 Simulator generates realistic unique test suites
 Loosely coupled automated test agents
 Oracle/Comparator automatically evaluate
 Support integration, functional, and stress test




                       © 2009 mVerify Corporation    17
Model-based Testing

Profile alone insufficient
Extended Use Case
    RBSC test methodology
    Defines feature usage profile
    Input conditions, output actions
Mode Machine
Invariant Boundaries


                         © 2009 mVerify Corporation   18
Simulator

 Generate inputs according to OP
 Discrete event simulation
    Generate any distribution with pseudo-random
 Prolog implementation (50 KLOC)
    Rule inversion
 Load Profile
    Time domain variation
    Orthogonal to operational profile
 Each event assigned a "port" and submit time

                        © 2009 mVerify Corporation   19
Automated Run Evaluation
 Oracle accepts output of simulator
 About 500 unique rules
 Verification
    Splainer – result/rule backtracking tool
    Rule/Run coverage analyzer
 Comparator
    Extract transaction log
    Post run database state
    end-to-end invariant
 Stealth requirements engineering

                         © 2009 mVerify Corporation   20
Test Process

 Six development increments
    3 to 5 months
    Test design/implementation parallel with app dev

 Plan each day's test run
    Load profile
    Total volume
    Configuration/operational scenarios


                        © 2009 mVerify Corporation      21
Daily Test Runs

 Run Simulator
     100,000 events per hour
     FTP event files to test agents
   Start SUT
   Test agents automatically start at scheduled time
   Extract results
   Run Oracle/Comparator
   Prepare bug reports


                           © 2009 mVerify Corporation   22
Results

 Revealed about 1,500 bugs over two years
    5% showstoppers
 Five person team, huge productivity increase
 Achieved proven high reliability
    Last pre-release test run: 500,000 events in two
     hours, no failures detected
    No production failures



                         © 2009 mVerify Corporation     23
International
Symposium on Software
 Reliability Engineering
      (ISSRE) 2008
      Trip Report

         © 2009 mVerify Corporation   24
General
 Nineteenth ISSRE                                            Presentations, by type
      200+ attended
      ~1/3 outside US                                                Accept’d   Submit’
                                                                                 d
      ~1/3 practitioners
      ~2/3 researchers/students                         Research        29        116
 Conference focus is broadening
                                                         Industry        25            32
 Incremental research progress
 Practitioner community about the                       Posters         14            31
  same
                                                         Student         11            30
 Student presentations impressive
 No new tool support                                    Workshops      ~25            ?

 2009 at Mysore India, housing included in registration
 Over 100 presentations -- a few snapshots follow
                            © 2009 mVerify Corporation                                     25
Presentation Snapshots

 Test results from hardware/software integration
  testing of space vehicle components
    Composite HW/SW model
    Stochastic Activity Networks
         Blends Petri nets and Markov chain
    Used established SRE reliability and availability
     models for analysis and estimation

 Hecht, McAdams, and Lam. Use of Test Data for Integrated Hardware/Software
 Reliability and Availability Modeling of a Space Vehicle




                                 © 2009 mVerify Corporation                   26
Presentation Snapshots




Hecht, McAdams, and Lam. Use of Test Data for Integrated Hardware/Software Reliability and
Availability Modeling of a Space Vehicle
                                            © 2009 mVerify Corporation                       27
Presentation Snapshots

 Server farm reliability: Service availability is most
  effectively improved by server replication, but at
  the margin, failure prevention is more effective.
    Salfner & Wolter. Replication vs. Failure Prevention: How to Boost Service
    Availability?


 Best reliabilty growth model for an “agile”
  process is linear – faults revealed at a steady
  rate. Classic S curve corresponds to waterfall
  process.
   Baumer, Seidler, Torkar & Feldt. Predicting Fault Inflow in Highly Iterative
   Software Development Processes: An Industrial Evaluation

                                   © 2009 mVerify Corporation                     28
Presentation Snapshots
Baumer et al. Predicting Fault Inflow in Highly Iterative
Software Development Processes: An Industrial Evaluation




                                               © 2009 mVerify Corporation   29
Presentation Snapshots

 Reliability model for
  large campus building
  control system
      Markov model
       corresponds to Poisson
       process, i.e., reliability
       growth.
      Abstraction necessary: “If
       you wants everything,
       you can’t have nothing.”

Avritzer. Applying Markovian Models to
the Reliability Estimation of Large
Distributed Software Systems.



                                         © 2009 mVerify Corporation   30
Presentation Snapshots

 Improving safety in a new NASA system
   Analyzed process metrics, product metrics to identify
    safety-related hazards
   Metrics don’t tell you if its safe, but can indicate risks
   You can’t measure “safety” per se, but you can
    measure its proxies
   Approach applicable for similar characteristics, e.g.,
    security

  Basili. Using Measures and Risk Indicators for Early Insight into Software Product
  Characteristics such as Software Safety.


                                  © 2009 mVerify Corporation                           31
Presentation Snapshots
 Microsoft Crash Telemetry
        How to use feedback from 800 million Windows hosts
        Auto collect app crash signatures, dump, registry, etc.
        100,000+ hosts for Vista Customer Experience
        390,000 unique devices (drivers) in Vista SP1
        Drivers extend the Windows kernel
          25 new daily, 100 patches
          000s changes to the windows kernel
      30x reliability differences between hardware
      Variation by locale: 4x between worst, best

Li et al. Reliability Assessment of Mass-Market Software: Insights from Windows Vista.

                                    © 2009 mVerify Corporation                           32
SRE Concerns
 Speed bumps
      Operational profile can be hard to develop
      Skepticism about cost/benefit
      Too many models
      Restrictive model assumptions
 Testing with an operational profile
      No integrated representation of constraints
      Modes not integrated
      Hand-crafted test suites too brittle
      Scalability
 Not much tool support
    With better automation, applicable to broader class of
     apps/environments
                              © 2009 mVerify Corporation      33
What I’m Working On

 Automatic generation of behavior model from an
  operational profile
    Generate constrained test suites
    Overcome monolithic model obstacle
    Automatic mode partitioning
 Test generation by environment simulation
    Statistical accuracy needed for reliability prediction
    No brittleness
 Tight integration of failure capture and reliability
  dashboard
    Actionable real-time metrics


                              © 2009 mVerify Corporation      34
Summary

 SRE is credible, well-established
    Compatible with almost any process
    Quant models refined with 20 years of peer-reviewed
     research and application
    Proven best practice, 100s of firms
    Sustainable for typical IT workers
    Proven ROI
 Very effective in its sweet spot



                        © 2009 mVerify Corporation         35
Resources
 ISSRE Web Site
      http://www.issre.org
 ISSRE Proceedings
      http://ieeexplore.ieee.org/servlet/opac?punumber=4700288
 Musa’s Web Site
      http://softwarereliabilityengineering.com
 Definitive Industry Reference
    Software Reliability Engineering: More Reliable Software Faster and
     Cheaper. John D. Musa
      http://www.authorhouse.com/BookStore/ItemDetail.aspx?bookid=26806
 Survey of the State of the Art
    Software Reliability Engineering: A Roadmap. Michael R. Lyu
      http://csse.usc.edu/classes/cs589_2007/Reliability.pdf


                                   © 2009 mVerify Corporation              36

Contenu connexe

Tendances

Test and integration in REC
Test and integration in RECTest and integration in REC
Test and integration in RECkatybairstow
 
Risk and Testing (2003)
Risk and Testing (2003)Risk and Testing (2003)
Risk and Testing (2003)Neil Thompson
 
UVM BASED REUSABLE VERIFICATION IP FOR WISHBONE COMPLIANT SPI MASTER CORE
UVM BASED REUSABLE VERIFICATION IP FOR WISHBONE COMPLIANT SPI MASTER COREUVM BASED REUSABLE VERIFICATION IP FOR WISHBONE COMPLIANT SPI MASTER CORE
UVM BASED REUSABLE VERIFICATION IP FOR WISHBONE COMPLIANT SPI MASTER COREVLSICS Design
 
Advanced Verification Methodology for Complex System on Chip Verification
Advanced Verification Methodology for Complex System on Chip VerificationAdvanced Verification Methodology for Complex System on Chip Verification
Advanced Verification Methodology for Complex System on Chip VerificationVLSICS Design
 
Robust design and reliability engineering synergy webinar 2013 04 10
Robust design and reliability engineering synergy webinar   2013 04 10Robust design and reliability engineering synergy webinar   2013 04 10
Robust design and reliability engineering synergy webinar 2013 04 10ASQ Reliability Division
 
Dependablity Engineering 1 (CS 5032 2012)
Dependablity Engineering 1 (CS 5032 2012)Dependablity Engineering 1 (CS 5032 2012)
Dependablity Engineering 1 (CS 5032 2012)Ian Sommerville
 
SEM Test Development Center
SEM Test Development CenterSEM Test Development Center
SEM Test Development CenterESA Italia Srl
 
JoramMQ Entreprise tools and services for the JORAM user, OW2con’12, Paris
JoramMQ Entreprise tools and services for the JORAM user, OW2con’12, ParisJoramMQ Entreprise tools and services for the JORAM user, OW2con’12, Paris
JoramMQ Entreprise tools and services for the JORAM user, OW2con’12, ParisOW2
 
AVIT Brochure
AVIT BrochureAVIT Brochure
AVIT Brochurejohnuzz
 
Accelerated reliability techniques in the 21st century
Accelerated reliability techniques in the 21st centuryAccelerated reliability techniques in the 21st century
Accelerated reliability techniques in the 21st centuryASQ Reliability Division
 
Neil Tompson - SoftTest Ireland
Neil Tompson - SoftTest IrelandNeil Tompson - SoftTest Ireland
Neil Tompson - SoftTest IrelandDavid O'Dowd
 
Plug&Care Connector - SOSMD 2011
Plug&Care Connector - SOSMD 2011Plug&Care Connector - SOSMD 2011
Plug&Care Connector - SOSMD 2011StevenMohr
 
Friedenthal.sandford
Friedenthal.sandfordFriedenthal.sandford
Friedenthal.sandfordNASAPMC
 
Reliability doe the proper analysis approach for life data
Reliability doe the proper analysis approach for life dataReliability doe the proper analysis approach for life data
Reliability doe the proper analysis approach for life dataASQ Reliability Division
 
Effective and pragmatic test driven development by Andrew Rendell, Principal ...
Effective and pragmatic test driven development by Andrew Rendell, Principal ...Effective and pragmatic test driven development by Andrew Rendell, Principal ...
Effective and pragmatic test driven development by Andrew Rendell, Principal ...Valtech UK
 
Probabilistic design for reliability (pdfr) in electronics part1of2
Probabilistic design for reliability (pdfr) in electronics part1of2Probabilistic design for reliability (pdfr) in electronics part1of2
Probabilistic design for reliability (pdfr) in electronics part1of2ASQ Reliability Division
 
Testing in the Oil & Gas Market“
Testing in the Oil & Gas Market“Testing in the Oil & Gas Market“
Testing in the Oil & Gas Market“Ernesto Kiszkurno
 

Tendances (19)

Test and integration in REC
Test and integration in RECTest and integration in REC
Test and integration in REC
 
Risk and Testing (2003)
Risk and Testing (2003)Risk and Testing (2003)
Risk and Testing (2003)
 
UVM BASED REUSABLE VERIFICATION IP FOR WISHBONE COMPLIANT SPI MASTER CORE
UVM BASED REUSABLE VERIFICATION IP FOR WISHBONE COMPLIANT SPI MASTER COREUVM BASED REUSABLE VERIFICATION IP FOR WISHBONE COMPLIANT SPI MASTER CORE
UVM BASED REUSABLE VERIFICATION IP FOR WISHBONE COMPLIANT SPI MASTER CORE
 
Advanced Verification Methodology for Complex System on Chip Verification
Advanced Verification Methodology for Complex System on Chip VerificationAdvanced Verification Methodology for Complex System on Chip Verification
Advanced Verification Methodology for Complex System on Chip Verification
 
Robust design and reliability engineering synergy webinar 2013 04 10
Robust design and reliability engineering synergy webinar   2013 04 10Robust design and reliability engineering synergy webinar   2013 04 10
Robust design and reliability engineering synergy webinar 2013 04 10
 
Dependablity Engineering 1 (CS 5032 2012)
Dependablity Engineering 1 (CS 5032 2012)Dependablity Engineering 1 (CS 5032 2012)
Dependablity Engineering 1 (CS 5032 2012)
 
SEM Test Development Center
SEM Test Development CenterSEM Test Development Center
SEM Test Development Center
 
JoramMQ Entreprise tools and services for the JORAM user, OW2con’12, Paris
JoramMQ Entreprise tools and services for the JORAM user, OW2con’12, ParisJoramMQ Entreprise tools and services for the JORAM user, OW2con’12, Paris
JoramMQ Entreprise tools and services for the JORAM user, OW2con’12, Paris
 
AVIT Brochure
AVIT BrochureAVIT Brochure
AVIT Brochure
 
Accelerated reliability techniques in the 21st century
Accelerated reliability techniques in the 21st centuryAccelerated reliability techniques in the 21st century
Accelerated reliability techniques in the 21st century
 
Neil Tompson - SoftTest Ireland
Neil Tompson - SoftTest IrelandNeil Tompson - SoftTest Ireland
Neil Tompson - SoftTest Ireland
 
Ch1
Ch1Ch1
Ch1
 
Plug&Care Connector - SOSMD 2011
Plug&Care Connector - SOSMD 2011Plug&Care Connector - SOSMD 2011
Plug&Care Connector - SOSMD 2011
 
Friedenthal.sandford
Friedenthal.sandfordFriedenthal.sandford
Friedenthal.sandford
 
Reliability doe the proper analysis approach for life data
Reliability doe the proper analysis approach for life dataReliability doe the proper analysis approach for life data
Reliability doe the proper analysis approach for life data
 
Testing Experience Magazine Vol.14 June 2011
Testing Experience Magazine Vol.14 June 2011Testing Experience Magazine Vol.14 June 2011
Testing Experience Magazine Vol.14 June 2011
 
Effective and pragmatic test driven development by Andrew Rendell, Principal ...
Effective and pragmatic test driven development by Andrew Rendell, Principal ...Effective and pragmatic test driven development by Andrew Rendell, Principal ...
Effective and pragmatic test driven development by Andrew Rendell, Principal ...
 
Probabilistic design for reliability (pdfr) in electronics part1of2
Probabilistic design for reliability (pdfr) in electronics part1of2Probabilistic design for reliability (pdfr) in electronics part1of2
Probabilistic design for reliability (pdfr) in electronics part1of2
 
Testing in the Oil & Gas Market“
Testing in the Oil & Gas Market“Testing in the Oil & Gas Market“
Testing in the Oil & Gas Market“
 

En vedette

Mobile Reliability Challenges
Mobile Reliability ChallengesMobile Reliability Challenges
Mobile Reliability ChallengesBob Binder
 
The Tester’s Dashboard: Release Decision Support
The Tester’s Dashboard: Release Decision SupportThe Tester’s Dashboard: Release Decision Support
The Tester’s Dashboard: Release Decision SupportBob Binder
 
mVerify Investor Overview
mVerify Investor OverviewmVerify Investor Overview
mVerify Investor OverviewBob Binder
 
Experience with a Profile-based Automated Testing Environment
Experience with a Profile-based Automated Testing EnvironmentExperience with a Profile-based Automated Testing Environment
Experience with a Profile-based Automated Testing EnvironmentBob Binder
 
Lessons learned validating 60,000 pages of api documentation
Lessons learned validating 60,000 pages of api documentationLessons learned validating 60,000 pages of api documentation
Lessons learned validating 60,000 pages of api documentationBob Binder
 
MTS: Controllable Test Objects
MTS: Controllable Test ObjectsMTS: Controllable Test Objects
MTS: Controllable Test ObjectsBob Binder
 
Performance Testing Mobile and Multi-Tier Applications
Performance Testing Mobile and Multi-Tier ApplicationsPerformance Testing Mobile and Multi-Tier Applications
Performance Testing Mobile and Multi-Tier ApplicationsBob Binder
 

En vedette (7)

Mobile Reliability Challenges
Mobile Reliability ChallengesMobile Reliability Challenges
Mobile Reliability Challenges
 
The Tester’s Dashboard: Release Decision Support
The Tester’s Dashboard: Release Decision SupportThe Tester’s Dashboard: Release Decision Support
The Tester’s Dashboard: Release Decision Support
 
mVerify Investor Overview
mVerify Investor OverviewmVerify Investor Overview
mVerify Investor Overview
 
Experience with a Profile-based Automated Testing Environment
Experience with a Profile-based Automated Testing EnvironmentExperience with a Profile-based Automated Testing Environment
Experience with a Profile-based Automated Testing Environment
 
Lessons learned validating 60,000 pages of api documentation
Lessons learned validating 60,000 pages of api documentationLessons learned validating 60,000 pages of api documentation
Lessons learned validating 60,000 pages of api documentation
 
MTS: Controllable Test Objects
MTS: Controllable Test ObjectsMTS: Controllable Test Objects
MTS: Controllable Test Objects
 
Performance Testing Mobile and Multi-Tier Applications
Performance Testing Mobile and Multi-Tier ApplicationsPerformance Testing Mobile and Multi-Tier Applications
Performance Testing Mobile and Multi-Tier Applications
 

Similaire à ISSRE 2008 Trip Report

How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...kalichargn70th171
 
NERC CIP - Top Testing & Compliance Challenges, How to Address Them
NERC CIP - Top Testing & Compliance Challenges, How to Address ThemNERC CIP - Top Testing & Compliance Challenges, How to Address Them
NERC CIP - Top Testing & Compliance Challenges, How to Address ThemInflectra
 
Seminar on Software Testing
Seminar on Software TestingSeminar on Software Testing
Seminar on Software TestingBeat Fluri
 
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...kalichargn70th171
 
'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker
'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker
'Automated Reliability Testing via Hardware Interfaces' by Bryan BakkerTEST Huddle
 
Achieving Very High Reliability for Ubiquitous Information Technology
Achieving Very High Reliability for Ubiquitous Information Technology Achieving Very High Reliability for Ubiquitous Information Technology
Achieving Very High Reliability for Ubiquitous Information Technology Bob Binder
 
Telecom software testing for CSPs
Telecom software testing for CSPsTelecom software testing for CSPs
Telecom software testing for CSPsHOT TELECOM
 
DevOps in Practice: When does "Practice" Become "Doing"?
DevOps in Practice: When does "Practice" Become "Doing"?DevOps in Practice: When does "Practice" Become "Doing"?
DevOps in Practice: When does "Practice" Become "Doing"?Michael Elder
 
Mdd test qa_test2014_bryan_bakker
Mdd test qa_test2014_bryan_bakkerMdd test qa_test2014_bryan_bakker
Mdd test qa_test2014_bryan_bakkerBryan Bakker
 
Cloud Reliability: Decreasing outage frequency using fault injection
Cloud Reliability: Decreasing outage frequency using fault injectionCloud Reliability: Decreasing outage frequency using fault injection
Cloud Reliability: Decreasing outage frequency using fault injectionJorge Cardoso
 
SaaS Testing on an Agile World
SaaS Testing on an Agile WorldSaaS Testing on an Agile World
SaaS Testing on an Agile WorldPractiTest
 
Service Virtualization: Delivering Complex Test Environments on Demand
Service Virtualization: Delivering Complex Test Environments on DemandService Virtualization: Delivering Complex Test Environments on Demand
Service Virtualization: Delivering Complex Test Environments on DemandErika Barron
 
Testing a Microservices Architecture
Testing a Microservices ArchitectureTesting a Microservices Architecture
Testing a Microservices ArchitectureParasoft
 
Performance On Demand
Performance On DemandPerformance On Demand
Performance On DemandFranky Smit
 
Keyword Driven Automation
Keyword Driven AutomationKeyword Driven Automation
Keyword Driven AutomationPankaj Goel
 
Automotive communication systems: from dependability to security
Automotive communication systems: from dependability to securityAutomotive communication systems: from dependability to security
Automotive communication systems: from dependability to securityRealTime-at-Work (RTaW)
 
Automotive communication systems: from dependability to security
Automotive communication systems: from dependability to securityAutomotive communication systems: from dependability to security
Automotive communication systems: from dependability to securityNicolas Navet
 
Tips to Improve Retail Mobile App Testing
Tips to Improve Retail Mobile App TestingTips to Improve Retail Mobile App Testing
Tips to Improve Retail Mobile App TestingMatthew Allen
 

Similaire à ISSRE 2008 Trip Report (20)

Mobile Monitoring Best Practices
Mobile Monitoring Best PracticesMobile Monitoring Best Practices
Mobile Monitoring Best Practices
 
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...
 
NERC CIP - Top Testing & Compliance Challenges, How to Address Them
NERC CIP - Top Testing & Compliance Challenges, How to Address ThemNERC CIP - Top Testing & Compliance Challenges, How to Address Them
NERC CIP - Top Testing & Compliance Challenges, How to Address Them
 
Seminar on Software Testing
Seminar on Software TestingSeminar on Software Testing
Seminar on Software Testing
 
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...
How Real Device Cloud Testing Ensures Exceptional Efficiency and Scalability ...
 
'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker
'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker
'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker
 
Achieving Very High Reliability for Ubiquitous Information Technology
Achieving Very High Reliability for Ubiquitous Information Technology Achieving Very High Reliability for Ubiquitous Information Technology
Achieving Very High Reliability for Ubiquitous Information Technology
 
Telecom software testing for CSPs
Telecom software testing for CSPsTelecom software testing for CSPs
Telecom software testing for CSPs
 
DevOps in Practice: When does "Practice" Become "Doing"?
DevOps in Practice: When does "Practice" Become "Doing"?DevOps in Practice: When does "Practice" Become "Doing"?
DevOps in Practice: When does "Practice" Become "Doing"?
 
Mdd test qa_test2014_bryan_bakker
Mdd test qa_test2014_bryan_bakkerMdd test qa_test2014_bryan_bakker
Mdd test qa_test2014_bryan_bakker
 
Cloud Reliability: Decreasing outage frequency using fault injection
Cloud Reliability: Decreasing outage frequency using fault injectionCloud Reliability: Decreasing outage frequency using fault injection
Cloud Reliability: Decreasing outage frequency using fault injection
 
SaaS Testing on an Agile World
SaaS Testing on an Agile WorldSaaS Testing on an Agile World
SaaS Testing on an Agile World
 
Service Virtualization: Delivering Complex Test Environments on Demand
Service Virtualization: Delivering Complex Test Environments on DemandService Virtualization: Delivering Complex Test Environments on Demand
Service Virtualization: Delivering Complex Test Environments on Demand
 
Testing a Microservices Architecture
Testing a Microservices ArchitectureTesting a Microservices Architecture
Testing a Microservices Architecture
 
Performance On Demand
Performance On DemandPerformance On Demand
Performance On Demand
 
Testing a SaaS Platform
Testing a SaaS PlatformTesting a SaaS Platform
Testing a SaaS Platform
 
Keyword Driven Automation
Keyword Driven AutomationKeyword Driven Automation
Keyword Driven Automation
 
Automotive communication systems: from dependability to security
Automotive communication systems: from dependability to securityAutomotive communication systems: from dependability to security
Automotive communication systems: from dependability to security
 
Automotive communication systems: from dependability to security
Automotive communication systems: from dependability to securityAutomotive communication systems: from dependability to security
Automotive communication systems: from dependability to security
 
Tips to Improve Retail Mobile App Testing
Tips to Improve Retail Mobile App TestingTips to Improve Retail Mobile App Testing
Tips to Improve Retail Mobile App Testing
 

Plus de Bob Binder

How to Release Rock-solid RESTful APIs and Ice the Testing BackBlob
How to Release Rock-solid RESTful APIs and Ice the Testing BackBlobHow to Release Rock-solid RESTful APIs and Ice the Testing BackBlob
How to Release Rock-solid RESTful APIs and Ice the Testing BackBlobBob Binder
 
Model-based Testing: Taking BDD/ATDD to the Next Level
Model-based Testing: Taking BDD/ATDD to the Next LevelModel-based Testing: Taking BDD/ATDD to the Next Level
Model-based Testing: Taking BDD/ATDD to the Next LevelBob Binder
 
Model-based Testing: Today And Tomorrow
Model-based Testing: Today And TomorrowModel-based Testing: Today And Tomorrow
Model-based Testing: Today And TomorrowBob Binder
 
Mobile App Assurance: Yesterday, Today, and Tomorrow.
Mobile App Assurance: Yesterday, Today, and Tomorrow.Mobile App Assurance: Yesterday, Today, and Tomorrow.
Mobile App Assurance: Yesterday, Today, and Tomorrow.Bob Binder
 
Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?
Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?
Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?Bob Binder
 
Testing Object-Oriented Systems: Lessons Learned
Testing Object-Oriented Systems: Lessons LearnedTesting Object-Oriented Systems: Lessons Learned
Testing Object-Oriented Systems: Lessons LearnedBob Binder
 
Model-Based Testing: Why, What, How
Model-Based Testing: Why, What, HowModel-Based Testing: Why, What, How
Model-Based Testing: Why, What, HowBob Binder
 
MDD and the Tautology Problem: Discussion Notes.
MDD and the Tautology Problem: Discussion Notes.MDD and the Tautology Problem: Discussion Notes.
MDD and the Tautology Problem: Discussion Notes.Bob Binder
 
Testability: Factors and Strategy
Testability: Factors and StrategyTestability: Factors and Strategy
Testability: Factors and StrategyBob Binder
 
Test Objects -- They Just Work
Test Objects -- They Just WorkTest Objects -- They Just Work
Test Objects -- They Just WorkBob Binder
 
A Million Users in a Box: The WTS Story
A Million Users in a Box: The WTS StoryA Million Users in a Box: The WTS Story
A Million Users in a Box: The WTS StoryBob Binder
 
Software Test Patterns: Successes and Challenges
Software Test Patterns: Successes and ChallengesSoftware Test Patterns: Successes and Challenges
Software Test Patterns: Successes and ChallengesBob Binder
 
Assurance for Cloud Computing
Assurance for Cloud ComputingAssurance for Cloud Computing
Assurance for Cloud ComputingBob Binder
 
The Advanced Mobile Application Testing Environment: Project Report
The Advanced Mobile Application Testing Environment: Project ReportThe Advanced Mobile Application Testing Environment: Project Report
The Advanced Mobile Application Testing Environment: Project ReportBob Binder
 
Software Testing: Models, Patterns, Tools
Software Testing: Models, Patterns, ToolsSoftware Testing: Models, Patterns, Tools
Software Testing: Models, Patterns, ToolsBob Binder
 
The Tester’s Dashboard: Release Decision Support
The Tester’s Dashboard: Release Decision SupportThe Tester’s Dashboard: Release Decision Support
The Tester’s Dashboard: Release Decision SupportBob Binder
 
Testability: Factors and Strategy
Testability: Factors and StrategyTestability: Factors and Strategy
Testability: Factors and StrategyBob Binder
 

Plus de Bob Binder (17)

How to Release Rock-solid RESTful APIs and Ice the Testing BackBlob
How to Release Rock-solid RESTful APIs and Ice the Testing BackBlobHow to Release Rock-solid RESTful APIs and Ice the Testing BackBlob
How to Release Rock-solid RESTful APIs and Ice the Testing BackBlob
 
Model-based Testing: Taking BDD/ATDD to the Next Level
Model-based Testing: Taking BDD/ATDD to the Next LevelModel-based Testing: Taking BDD/ATDD to the Next Level
Model-based Testing: Taking BDD/ATDD to the Next Level
 
Model-based Testing: Today And Tomorrow
Model-based Testing: Today And TomorrowModel-based Testing: Today And Tomorrow
Model-based Testing: Today And Tomorrow
 
Mobile App Assurance: Yesterday, Today, and Tomorrow.
Mobile App Assurance: Yesterday, Today, and Tomorrow.Mobile App Assurance: Yesterday, Today, and Tomorrow.
Mobile App Assurance: Yesterday, Today, and Tomorrow.
 
Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?
Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?
Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?
 
Testing Object-Oriented Systems: Lessons Learned
Testing Object-Oriented Systems: Lessons LearnedTesting Object-Oriented Systems: Lessons Learned
Testing Object-Oriented Systems: Lessons Learned
 
Model-Based Testing: Why, What, How
Model-Based Testing: Why, What, HowModel-Based Testing: Why, What, How
Model-Based Testing: Why, What, How
 
MDD and the Tautology Problem: Discussion Notes.
MDD and the Tautology Problem: Discussion Notes.MDD and the Tautology Problem: Discussion Notes.
MDD and the Tautology Problem: Discussion Notes.
 
Testability: Factors and Strategy
Testability: Factors and StrategyTestability: Factors and Strategy
Testability: Factors and Strategy
 
Test Objects -- They Just Work
Test Objects -- They Just WorkTest Objects -- They Just Work
Test Objects -- They Just Work
 
A Million Users in a Box: The WTS Story
A Million Users in a Box: The WTS StoryA Million Users in a Box: The WTS Story
A Million Users in a Box: The WTS Story
 
Software Test Patterns: Successes and Challenges
Software Test Patterns: Successes and ChallengesSoftware Test Patterns: Successes and Challenges
Software Test Patterns: Successes and Challenges
 
Assurance for Cloud Computing
Assurance for Cloud ComputingAssurance for Cloud Computing
Assurance for Cloud Computing
 
The Advanced Mobile Application Testing Environment: Project Report
The Advanced Mobile Application Testing Environment: Project ReportThe Advanced Mobile Application Testing Environment: Project Report
The Advanced Mobile Application Testing Environment: Project Report
 
Software Testing: Models, Patterns, Tools
Software Testing: Models, Patterns, ToolsSoftware Testing: Models, Patterns, Tools
Software Testing: Models, Patterns, Tools
 
The Tester’s Dashboard: Release Decision Support
The Tester’s Dashboard: Release Decision SupportThe Tester’s Dashboard: Release Decision Support
The Tester’s Dashboard: Release Decision Support
 
Testability: Factors and Strategy
Testability: Factors and StrategyTestability: Factors and Strategy
Testability: Factors and Strategy
 

Dernier

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Dernier (20)

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

ISSRE 2008 Trip Report

  • 1. mVerify A Million Users in a Box ™ ISSRE 2008 Trip Report Presented to the Chicago Software Process Improvement Network (CSPIN) January 7, 2009 Robert V. Binder bob_binder@mverify.com www.mVerify.com
  • 2. Overview Software Reliability Engineering (SRE) concepts Case Study, SRE applied to a Trading System ISSRE 2008 Highlights Applying SRE Q & A © 2009 mVerify Corporation 2
  • 3. What is SRE?  Basic Concepts  Predictive Reliability Modeling  Profile-based Testing  Release Management  Corollary Techniques © 2009 mVerify Corporation 3
  • 4. Musa’s Observation Testing driven by an operational profile is very efficient because it identifies failures (and hence the faults causing them) on average, in order of how often they occur. This approach rapidly increases reliability … because the failures that occur most frequently are caused by the faulty operations used most frequently. IEEE Software, March 1993 © 2009 mVerify Corporation 4
  • 5. Behavioral Focus  Can a buggy system be reliable?  It depends on how it is used  Operational Profile  A list of interactions, similar to “use cases”  Relative frequency assigned to each operation  Three ops: Op1: 60%, Op2, 30%, Op3, 10%.  Represents expected field usage  For a good reliability estimate  Must include all field operations  Accurate rank order critical, magnitude less so  Test cases generated/applied accordingly  Optimal allocation of testing resources, wrt reliability  High criticality operations tested separately  Can overlay most system testing process/technology © 2009 mVerify Corporation 5
  • 6. Reliability Arithmetic  Reliability: probability of non-failure  MTTR: mean time to  recover, repair, restart …  Availability: percent up-time  Availability = 1 / 1 + (MTTR  Reliability)  99.999% availability = 5 min unscheduled downtime per year  “Five Nines” © 2009 mVerify Corporation 6
  • 7. Some Reliability Data Points Reliability Availability, (Failures/million hours) 6 min MTTR NT 4.0 Desktop 82,000 0.999000000 Windows 2K Server 36,013 0.999640000 Common Light Bulb 1,000 0.999990000 Stepstone OO Framework 5 0.999999500 Telelabs Digital Cross Connect 3 0.999999842 © 2009 mVerify Corporation 7
  • 8. Predictive Reliability Modeling  Input  Observed number of failures  Elapsed time or Execution units  Model  Curve fitting  Extensive research and application  Many variations  Output  CASRE tool  Estimated Field Failure Intensity (rate), Reliability © 2009 mVerify Corporation 8
  • 9. Reliability: Failures versus Time Failure Rate  Present R = e - t Objective Present Additional Time Execution Time Lyu. Software Reliability Engineering: A Roadmap © 2009 mVerify Corporation 9
  • 10. Many Models Broon et al. A New Statistical Software Reliability Tool © 2009 mVerify Corporation 10
  • 11. Release Management  FIO -- Failure Intensity Objective  Maximum acceptable failure rate for release  Estimated from failures observed in system test  Application and industry specific  FI -- Failure Intensity  Rate of failures observed  FI/FIO over time  Reliability Growth curve  At 1.0, release goal is achieved © 2009 mVerify Corporation 11
  • 12. Reliability Growth 18 16 Conventional test 14 12 10 FI/FIO 8 6 4 Operational-profile-driven test 2 0 0 0.1 0.2 0.3 0.4 0.5 Millions of Calls Mcalls John Musa. More Reliable Software Faster and Cheaper – An Overview © 2009 mVerify Corporation 12
  • 13. Corollary Techniques  “Statistical” Testing Strategies  Randomized selection of test inputs  Randomized selection of test case values  Orthogonal Defect Classification  Process for collecting and analyzing defect data  Designed to support reliability growth analysis  Fault-Tolerant Architecture  Large-grain strategies  HW/SW focus © 2009 mVerify Corporation 13
  • 14. Promise of Profile-Based Testing  Tester's point of view versus reliability analyst  Maximize reliability within fixed budget  Measurement of reliability not primary goal  Profile-Based Testing is optimal when  Available information already used  Must allocate resources to test complex SUT  Many significant practical obstacles © 2009 mVerify Corporation 14
  • 15. Trading System Case Study © 2009 mVerify Corporation 15
  • 16. Case Study – Trading System E-commerce/securities market, screen-based trading over private network 3 million transactions per hour (peak) 15 billion dollars per day (peak) © 2009 mVerify Corporation 16
  • 17. Profile-based Testing Approach  Executable operational profile  Simulator generates realistic unique test suites  Loosely coupled automated test agents  Oracle/Comparator automatically evaluate  Support integration, functional, and stress test © 2009 mVerify Corporation 17
  • 18. Model-based Testing Profile alone insufficient Extended Use Case  RBSC test methodology  Defines feature usage profile  Input conditions, output actions Mode Machine Invariant Boundaries © 2009 mVerify Corporation 18
  • 19. Simulator  Generate inputs according to OP  Discrete event simulation  Generate any distribution with pseudo-random  Prolog implementation (50 KLOC)  Rule inversion  Load Profile  Time domain variation  Orthogonal to operational profile  Each event assigned a "port" and submit time © 2009 mVerify Corporation 19
  • 20. Automated Run Evaluation  Oracle accepts output of simulator  About 500 unique rules  Verification  Splainer – result/rule backtracking tool  Rule/Run coverage analyzer  Comparator  Extract transaction log  Post run database state  end-to-end invariant  Stealth requirements engineering © 2009 mVerify Corporation 20
  • 21. Test Process  Six development increments  3 to 5 months  Test design/implementation parallel with app dev  Plan each day's test run  Load profile  Total volume  Configuration/operational scenarios © 2009 mVerify Corporation 21
  • 22. Daily Test Runs  Run Simulator  100,000 events per hour  FTP event files to test agents  Start SUT  Test agents automatically start at scheduled time  Extract results  Run Oracle/Comparator  Prepare bug reports © 2009 mVerify Corporation 22
  • 23. Results  Revealed about 1,500 bugs over two years  5% showstoppers  Five person team, huge productivity increase  Achieved proven high reliability  Last pre-release test run: 500,000 events in two hours, no failures detected  No production failures © 2009 mVerify Corporation 23
  • 24. International Symposium on Software Reliability Engineering (ISSRE) 2008 Trip Report © 2009 mVerify Corporation 24
  • 25. General  Nineteenth ISSRE Presentations, by type  200+ attended  ~1/3 outside US Accept’d Submit’ d  ~1/3 practitioners  ~2/3 researchers/students Research 29 116  Conference focus is broadening Industry 25 32  Incremental research progress  Practitioner community about the Posters 14 31 same Student 11 30  Student presentations impressive  No new tool support Workshops ~25 ?  2009 at Mysore India, housing included in registration  Over 100 presentations -- a few snapshots follow © 2009 mVerify Corporation 25
  • 26. Presentation Snapshots  Test results from hardware/software integration testing of space vehicle components  Composite HW/SW model  Stochastic Activity Networks  Blends Petri nets and Markov chain  Used established SRE reliability and availability models for analysis and estimation Hecht, McAdams, and Lam. Use of Test Data for Integrated Hardware/Software Reliability and Availability Modeling of a Space Vehicle © 2009 mVerify Corporation 26
  • 27. Presentation Snapshots Hecht, McAdams, and Lam. Use of Test Data for Integrated Hardware/Software Reliability and Availability Modeling of a Space Vehicle © 2009 mVerify Corporation 27
  • 28. Presentation Snapshots  Server farm reliability: Service availability is most effectively improved by server replication, but at the margin, failure prevention is more effective. Salfner & Wolter. Replication vs. Failure Prevention: How to Boost Service Availability?  Best reliabilty growth model for an “agile” process is linear – faults revealed at a steady rate. Classic S curve corresponds to waterfall process. Baumer, Seidler, Torkar & Feldt. Predicting Fault Inflow in Highly Iterative Software Development Processes: An Industrial Evaluation © 2009 mVerify Corporation 28
  • 29. Presentation Snapshots Baumer et al. Predicting Fault Inflow in Highly Iterative Software Development Processes: An Industrial Evaluation © 2009 mVerify Corporation 29
  • 30. Presentation Snapshots  Reliability model for large campus building control system  Markov model corresponds to Poisson process, i.e., reliability growth.  Abstraction necessary: “If you wants everything, you can’t have nothing.” Avritzer. Applying Markovian Models to the Reliability Estimation of Large Distributed Software Systems. © 2009 mVerify Corporation 30
  • 31. Presentation Snapshots  Improving safety in a new NASA system  Analyzed process metrics, product metrics to identify safety-related hazards  Metrics don’t tell you if its safe, but can indicate risks  You can’t measure “safety” per se, but you can measure its proxies  Approach applicable for similar characteristics, e.g., security Basili. Using Measures and Risk Indicators for Early Insight into Software Product Characteristics such as Software Safety. © 2009 mVerify Corporation 31
  • 32. Presentation Snapshots  Microsoft Crash Telemetry  How to use feedback from 800 million Windows hosts  Auto collect app crash signatures, dump, registry, etc.  100,000+ hosts for Vista Customer Experience  390,000 unique devices (drivers) in Vista SP1  Drivers extend the Windows kernel  25 new daily, 100 patches  000s changes to the windows kernel  30x reliability differences between hardware  Variation by locale: 4x between worst, best Li et al. Reliability Assessment of Mass-Market Software: Insights from Windows Vista. © 2009 mVerify Corporation 32
  • 33. SRE Concerns  Speed bumps  Operational profile can be hard to develop  Skepticism about cost/benefit  Too many models  Restrictive model assumptions  Testing with an operational profile  No integrated representation of constraints  Modes not integrated  Hand-crafted test suites too brittle  Scalability  Not much tool support  With better automation, applicable to broader class of apps/environments © 2009 mVerify Corporation 33
  • 34. What I’m Working On  Automatic generation of behavior model from an operational profile  Generate constrained test suites  Overcome monolithic model obstacle  Automatic mode partitioning  Test generation by environment simulation  Statistical accuracy needed for reliability prediction  No brittleness  Tight integration of failure capture and reliability dashboard  Actionable real-time metrics © 2009 mVerify Corporation 34
  • 35. Summary  SRE is credible, well-established  Compatible with almost any process  Quant models refined with 20 years of peer-reviewed research and application  Proven best practice, 100s of firms  Sustainable for typical IT workers  Proven ROI  Very effective in its sweet spot © 2009 mVerify Corporation 35
  • 36. Resources  ISSRE Web Site  http://www.issre.org  ISSRE Proceedings  http://ieeexplore.ieee.org/servlet/opac?punumber=4700288  Musa’s Web Site  http://softwarereliabilityengineering.com  Definitive Industry Reference  Software Reliability Engineering: More Reliable Software Faster and Cheaper. John D. Musa  http://www.authorhouse.com/BookStore/ItemDetail.aspx?bookid=26806  Survey of the State of the Art  Software Reliability Engineering: A Roadmap. Michael R. Lyu  http://csse.usc.edu/classes/cs589_2007/Reliability.pdf © 2009 mVerify Corporation 36