SlideShare une entreprise Scribd logo
1  sur  51
Télécharger pour lire hors ligne
INFRASTRUCTURE
RELIABILITY AND
RISK
ASSESSMENTS
        Steven Shapiro, P.E., ATD
        Mission Critical Practice Lead
        Morrison Hershfield
        Mission Critical



                  Morrison Hershfield Mission Critical
WHAT YOU NEED TO KNOW
AGENDA


• RISK ASSESSMENT

• INFRASTRUCTURE RELIABILITY
                 COOLING                          POWER




          Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RISK ASSESSMENTS



• WHY

• SITE EVALUATION

• METRICS



             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Causes of Critical Failures

  •    Location
  •    Design
  •    Redundancy level
  •    Construction
  •    Quality of equipment
  •    Age                                   Lurking Vulnerabilities
  •    Operations & Maintenance program
  •    Personnel training
  •    Level of operator coverage
  •    Thoroughness of the commissioning program




                                       5
                            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
 WHY
Causes of Critical Failures

• Equipment failure
• Operator error
• Natural disaster
• Design error
• Installation error
• Commissioning or test deficiency
• Maintenance oversight
• Equipment design




 WHY                      Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Causes of Critical Failures


• Root cause not always easy to ascertain
• Combination of factors (Cascading Failures)
• Latent failures
• Most occur during change of state events
• More maintenance does not necessarily mean higher availability
• Non-Fault tolerant systems




  WHY
  FILURES                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Causes of Critical Failures
                                     Commissioning or
                                      Test Deficiency
                                            4%

                 System Design                               Equipment
  Natural Disaster    20%                                      Design
        3%                                                      13%
   Maintenance
    Oversight
       4%
                                                                         Equipment Failure
                                                                               28%
    Installation Error
           10%           Human Error
                            18%




 WHY                             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessment
WHY DO RISK ASSESSMENT

• Alignment of business mission and facility performance expectation

• Quantifies the risk and exposure of the critical facilities to failure

• Identifies vulnerabilities and single points of failure

• First step in creating an action plan for site hardening

• Benchmark against the industry

• Assists in developing business case for capital expenditures




 RISK ASSESSMENT              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION

STEP 1

• Quantify reliability expectations
• Develop resiliency metrics




 RISK ASSESSMENT      Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION

STEP 2
   •   Develop PRA model     (Probabilistic Risk Assessment)




   •   Identify Single Points of Failure within critical systems
   •   Evaluate redundancy of critical systems
   •   Capacity and expendability analysis
   •   Adequacy of Engineered Systems
   •   Operation and maintenance policies, practices and procedures
   •   Adequacy of maintenance and testing programs
   •   Evaluate risks associated with site location
   •   Overall Risk Analysis
   •   Evaluate the adequacy of operations and maintenance programs


 RISK ASSESSMENT            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION


STEP 2 cont.
• Harmonics analysis
• EMF studies
• Short circuit & coordination studies
• Air flow modeling-CFD




 RISK ASSESSMENT            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION

STEP 3
   • Perform gap analysis
STEP 4
   • Recommendations for upgrade/alteration to optimize facility
      performance
   • Budget and schedule development
   • Assess risk during implementation
   • Benchmark findings with industry standards




 RISK ASSESSMENT         Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RISK ASSESSMENT METRICS

   • Probability of Failure/Reliability
   • Availability
   • MTTF
   • MTTR
   • Susceptibility to natural disasters
   • Fault tolerance
   • Single Points of Failure
   • Maintainability
   • Operational readiness
   • Maintenance program

 RISK ASSESSMENT            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
INFRASTRUCTURE RELIABILITY



 • RELIABILITY / AVAILABLITY

 • RELIABILITY MODELING

 • RELIABILITY CONSIDERATIONS




 RELIABILITY    Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY


• “Reliability” is used as an umbrella definition

• May Refer to Availability, Durability, Quality

• Five 9’s ????

• Reliability = Probability of Successful Operation




 RELIABILITY                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY AND AVAILABILITY



•     Reliability predicts how likely is the system to fail.

•     Availability is a measure (or a future prediction) of what percentage
      of the time the system will operating properly




    RELIABILITY                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
AVAILABILITY

Five 9’s refers to Availability

Availability (A) = Average fraction of time Something is in service
and performing intended function.

99.999% availability means:
    • 5.3 minutes of downtime each year
                       or
    • 1.77 hours of downtime every 20 years

Availability does not specify how often an outage occurs



 RELIABILITY                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
AVAILABILITY


Availability (A) = MTBF/(MTBF + MTTR)

  MTTF: Mean Time To Failure
  MTBF: Mean Time Between Failures
  MTTR: Mean Time to Repair or Downtime
  MTBF=MTTF+MTTR




 RELIABILITY            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY BATHTUB CURVE

      Failure Rate




                     early                                                                   wear-out
                     life                        useful life                                 period

                             0.5
                                       Time (t) Years YEARS                       12 14

 RELIABILITY                       Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING


•      Used to compare system designs and assist in the evaluation of
       risk versus the cost to mitigate the risk.

•      Failure and Repair data comes from IEEE 493, Recommended
       Practice for Design of Reliable Industrial and Commercial Power
       Systems (IEEE Gold Book)




    RELIABILITY              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING

Components used for reliability modeling of the electrical system shown
here:

•   Utility power
•   Generator
•   Circuit breakers
•   Switchboards
•   Cables
•   Automatic Transfer Switch
•   UPS module
•   Battery
•   Static Bypass Switch
•   Rack Power



 RELIABILITY              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING




                                         Reliability Block 
                                         Diagram (RBD)


 RELIABILITY   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING

Shown below are the results of the calculations




                         Hours         Hours




 RELIABILITY              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
THE TRADITIONAL CLASSIFICATION SYSTEM
           The Uptime Institute
Tier 1 – Basic Non-Redundant Data Center
         Single path for power and cooling distribution without redundant
         components

Tier 2 – Basic Redundant Data Center
         Single path for power and cooling distribution with redundant
         components

Tier 3 – Concurrently Maintainable Data Center
         Multiple paths for power and cooling distribution with only one path
         active and with redundant components

Tier 4 – Fault Tolerant Data Center
         Multiple active power and cooling distribution paths with redundant
         components and fault tolerant


RELIABILITY                   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Tier Definitions


                             TIER REQUIREMENTS
                                  Tier I Tier II                Tier III    Tier IV
                                                               1 Active
Number of Delivery Paths             1              1                      2 Active
                                                              1 Passive
Redundancy                         N             N+1             N+1     2N Minimum
Compartmentalization               No            No               No          Yes
Concurrent Maintainability         No            No              Yes          Yes
Fault Tolerance                    No            No               No          Yes
Availability                     99.67          99.75          99.982       99.95
Downtime in Hr/Yr                 28.8           22              1.6          0.4




  RELIABILITY                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Data Center Cost

From the UI

• Tier I - $10,000 US/kW of Useable UPS Power Output

• Tier II - $11,000 US/kW of Useable UPS Power Output

• Tier III - $20,000 US/kW of Useable UPS Power Output

• Tier IV - $22,000 US/kW of Useable UPS Power Output

• Plus $225 US/SF of Computer Room




 RELIABILITY                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
HOW MUCH REDUNDANCY IS ENOUGH?




RELIABILITY   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations

Assumptions

• Various configurations examined for single or dual utility feeders, UPS,
  Generators, STS’s, single or dual cords

• Compare Reliability at 2000 KW and 4000 KW Load

• 5 Year Probability of Failure




 RELIABILITY                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Single utility feeder, parallel redundant UPS and
     generators, single cord IT equipment
2N UPS, N+1 Generators, ASTSs, Dual Cord Rack
Two Utility Feeders, 2(N+1) UPS, 2(N+1) Generators,
               ASTSs, Dual Cord Rack
Distributed Redundant UPS, N+2 Generators, Two
   Utility Feeders, ASTSs and Dual Cord Rack
Reliability Considerations




RELIABILITY         Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations
Emergency Diesel Generators

                                       fail to start


                                       fail after ½ hour



                                        fail after 8 hours



                                        fail after 24 hours


Study Performed by Idaho National Engineering Laboratory – February 1996 at Nuclear Power Plants



  RELIABILITY                                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations


• 2(N+1) UPS/Generator with dual utility feeders - most reliable
  topology
• 2(N+1) UPS > 2N UPS by small margin
• 2N > Distributed Redundant by small margin
• Significant improvement if a second utility feeder
  is provided
• N+2 and/or 2N generator systems are more reliable than N+1
• Hybrid configuration in a hybrid facility is sometimes the best solution




 RELIABILITY               Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations


•   Assess the condition of the mechanical plant in conjunction with the
    electrical system
•   The facility reliability will be driven by the least reliable component
    (typically the electrical infrastructure)




 RELIABILITY                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
System Reliability Block




                Electrical System                                      Electrical          Mechanical




         Electrical systempow    ering the                          Mechanical systemsupporting critical
                   critical load                                                    load




 RELIABILITY                                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
System Reliability Block
                                                  MTBF                 Availability              Pf (3 years)
Electrical system
alone                                           330,184                  0.99999                       8.10%
Mechanical system
alone                                           178,611                 0.999943                       11.70%
Electrical system
supporting mechanical                           108,500                 0.999985                       21.40%
Overall mechanical
system                                           70,087                 0.999931                       29.20%
Combined electrical
mechanical system                                57,819                 0.999922                       36.90%


                  Electrical System                               Electrical           Mechanical




            Electrical system powering the                     Mechanical system supporting critical
                      critical load                                            load


  RELIABILITY                                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
The Cost of Reliability
 Reliability


 99.9999

  99.999


  99.99

  99.9


  99.0

  .9
               $   $$   $$$      $$$$      $$$$$

 RELIABILITY             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Key Takeaways – Risk Assessment

 • What Reliability Level Do you Really Need Based on Your Business
   Case?

 • Minimize Single Points of Failure

 • Concurrent Maintainability?

 • Fault Tolerance?

 • Ensure Adequacy of Operations, Maintenance and Testing Programs

 • How to justify the cost to upgrade from present state?




 RISK ASSESSMENT             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Key Takeaways – Reliability

•    Design objective – find optimum compromise between cost and reliability
•    Size matters – larger facilities yield lower reliability
•    System architecture and design implementation is more important role
     than equipment selection
•    Segregate system in independent blocks
•    Eliminate common source components to minimize fault propagation (i.e.
     LBS, hot-tie, manual bus ties)
•    Move single points of failures as close to the load as possible
•    Always maintain two independent sources of power to the critical load
•    Optimize the design of monitoring and controls circuits
•    Keep it simple/minimize human intervention/Utilize Automation


    RELIABILITY                    Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Thank you and please feel
QUESTIONS?                                            free to contact me

Steven Shapiro, PE, ATD
SShapiro@MorrisonHershfield.com
914.420.3213
http://www.linkedin.com/in/stevenshapirope
References:
Uptime Institute White Papers:
Tier Myths and Misconceptions
Data Center Site Infrastructure Tier Standard: Topology
Building Areas/Systems Reviewed

‫׀‬   General Construction
‫׀‬   Electrical
‫׀‬   Mechanical
‫׀‬   Plumbing And Fire Protection
‫׀‬   Operation and Maintenance
‫׀‬   Security 
‫׀‬   Load Density

                                 48
                      Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Site Reliability
•   Is Project Compatible With Zoning
•   Natural Environment Issues
‫׀‬   Seismic Zone
‫׀‬   Geo Technical Reports
‫׀‬   Sub Surface Conditions
‫׀‬   Tornado/hurricane Risk
‫׀‬   Site Flood Potential
‫׀‬   Fire Potential
‫׀‬   Site Topography
‫׀‬   Weather Extremes
•   Man‐Made Environment Issues
‫׀‬   Power/Data and Communication/Water Supply/Sanitary Sewer Availability
‫׀‬   ISP Connectivity to Mirror and DR Sites
‫׀‬   Proximity of Hazardous Operational Facilities, i.e. Nuclear Power Plants, Military Bases, 
    Chemical Plants, Tank Farms, Water/Sewage Treatment Plants, Dams/Reservoirs, Gas 
    Stations, etc.
‫׀‬   Distance to Airports & Freeways
‫׀‬   Distance to Emergency Services, i.e. Fire and Police Departments, Hospital 


                                                49
                                     Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Building Areas/Systems Reviewed
Building Utilities and Physical Issues
‫ ׀‬General building systems and area characteristics
‫ ׀‬Life safety and environmental
Electrical Systems
‫ ׀‬Utility feeders
‫ ׀‬Service entry
‫ ׀‬Base building electrical distribution system including busways, step‐down 
    transformers, switchgear and distribution panels
‫׀‬   Uninterruptible power supply (UPS) systems
‫׀‬   Battery systems
‫׀‬   Power Distribution System including the critical computer rooms
‫׀‬   Emergency/standby generator and fuel system
‫׀‬   Normal/standby power transfer switchgear
‫׀‬   Grounding
‫׀‬   Emergency Power Off Systems
‫׀‬   Lightning protection system
‫׀‬   Fire alarm and smoke detection systems


                                            50
                                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Building Areas/Systems Reviewed
•   Mechanical Systems
‫׀‬   Critical Systems Chilled Water Plant:  Chillers, pumps, piping distribution system, 
    controls, etc
‫׀‬   Critical Systems Condenser Water System:  Cooling towers, pumps, piping, etc
‫׀‬   Critical Systems Air Handling Systems
‫׀‬   Critical Systems Air Distribution
‫׀‬   Critical Systems Secondary Chilled Water Loop
‫׀‬   Fuel Oil Systems
‫׀‬   Boiler Systems
‫׀‬   Compressed Air Systems
•   Plumbing Systems
‫׀‬   Domestic Water Systems
‫׀‬   Natural Gas Systems
‫׀‬   Fire Suppression Systems (Water and Gaseous)
•   Operation and Maintenance of the Critical Support Systems
‫׀‬   Maintenance procedures and programs
‫׀‬   Normal operating procedures
‫׀‬   Emergency operating procedures
‫׀‬   Training programs and methods
‫׀‬   Spare parts



                                                51
                                     Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Building Areas/Systems Reviewed
•   Building Automation
‫׀‬   Building Automation Systems.
‫׀‬   Physical Security Systems.
‫׀‬   Access control
‫׀‬   Intrusion detection
‫׀‬   CCTV systems
‫׀‬   ID badging systems
‫׀‬   Intercom systems
‫׀‬   Smoke Purge Systems
•   Technology Systems
‫׀‬   Entrance Facility Feeds.
‫׀‬   Telephone Company Services.
•   Systems Integration:
‫׀‬   The integration, compatibility and interaction of the above systems with each 
    other, as well as with the other building elements will be reviewed to ensure that 
    the systems are compatible and fully integrated.
                                              52
                                   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT

Contenu connexe

Tendances

PECB Webinar: The importance of business impact analysis
PECB Webinar: The importance of business impact analysisPECB Webinar: The importance of business impact analysis
PECB Webinar: The importance of business impact analysisPECB
 
Agile Project Management
Agile Project ManagementAgile Project Management
Agile Project ManagementAbdullah Khan
 
Business Continuity Management PowerPoint Presentation Slides
Business Continuity Management PowerPoint Presentation SlidesBusiness Continuity Management PowerPoint Presentation Slides
Business Continuity Management PowerPoint Presentation SlidesSlideTeam
 
ERP Manager meets SDLC and CMMI
ERP Manager meets SDLC and CMMIERP Manager meets SDLC and CMMI
ERP Manager meets SDLC and CMMIMahesh Vallampati
 
ISO 22301 Business Continuity Management
ISO 22301 Business Continuity ManagementISO 22301 Business Continuity Management
ISO 22301 Business Continuity ManagementRamiro Cid
 
Assessing the impact of a disruption: Building an effective business impact a...
Assessing the impact of a disruption: Building an effective business impact a...Assessing the impact of a disruption: Building an effective business impact a...
Assessing the impact of a disruption: Building an effective business impact a...Bryghtpath LLC
 
Business Continuity & Disaster Recovery
Business Continuity & Disaster RecoveryBusiness Continuity & Disaster Recovery
Business Continuity & Disaster RecoveryEC-Council
 
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptxBUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptxJayLloyd8
 
Fundamentals of reliability engineering and applications part1of3
Fundamentals of reliability engineering and applications part1of3Fundamentals of reliability engineering and applications part1of3
Fundamentals of reliability engineering and applications part1of3ASQ Reliability Division
 
Business Analysis Framework
Business Analysis FrameworkBusiness Analysis Framework
Business Analysis Frameworkmarctayfl
 
Business Process Improvement
Business Process ImprovementBusiness Process Improvement
Business Process ImprovementAnand Subramaniam
 
Risk Identification Process PowerPoint Presentation Slides
Risk Identification Process PowerPoint Presentation SlidesRisk Identification Process PowerPoint Presentation Slides
Risk Identification Process PowerPoint Presentation SlidesSlideTeam
 
BUSINESS CONTINUITY MANAGEMENT system
BUSINESS CONTINUITY MANAGEMENT systemBUSINESS CONTINUITY MANAGEMENT system
BUSINESS CONTINUITY MANAGEMENT systemKuroba Kaitou
 
Impact Analysis Template - Enterprise
Impact Analysis Template - EnterpriseImpact Analysis Template - Enterprise
Impact Analysis Template - EnterpriseToby Elwin
 
Data Driven Risk Assessment
Data Driven Risk AssessmentData Driven Risk Assessment
Data Driven Risk AssessmentResolver Inc.
 

Tendances (20)

PECB Webinar: The importance of business impact analysis
PECB Webinar: The importance of business impact analysisPECB Webinar: The importance of business impact analysis
PECB Webinar: The importance of business impact analysis
 
Enterprise Asset Management
Enterprise Asset ManagementEnterprise Asset Management
Enterprise Asset Management
 
Agile Project Management
Agile Project ManagementAgile Project Management
Agile Project Management
 
Business Continuity Management PowerPoint Presentation Slides
Business Continuity Management PowerPoint Presentation SlidesBusiness Continuity Management PowerPoint Presentation Slides
Business Continuity Management PowerPoint Presentation Slides
 
ERP Manager meets SDLC and CMMI
ERP Manager meets SDLC and CMMIERP Manager meets SDLC and CMMI
ERP Manager meets SDLC and CMMI
 
ISO 22301 Business Continuity Management
ISO 22301 Business Continuity ManagementISO 22301 Business Continuity Management
ISO 22301 Business Continuity Management
 
Assessing the impact of a disruption: Building an effective business impact a...
Assessing the impact of a disruption: Building an effective business impact a...Assessing the impact of a disruption: Building an effective business impact a...
Assessing the impact of a disruption: Building an effective business impact a...
 
Business Continuity & Disaster Recovery
Business Continuity & Disaster RecoveryBusiness Continuity & Disaster Recovery
Business Continuity & Disaster Recovery
 
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptxBUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
 
Lean Project Management
Lean Project ManagementLean Project Management
Lean Project Management
 
KPI SMRP Presentation
KPI SMRP PresentationKPI SMRP Presentation
KPI SMRP Presentation
 
Fundamentals of reliability engineering and applications part1of3
Fundamentals of reliability engineering and applications part1of3Fundamentals of reliability engineering and applications part1of3
Fundamentals of reliability engineering and applications part1of3
 
Business Analysis Framework
Business Analysis FrameworkBusiness Analysis Framework
Business Analysis Framework
 
Business Process Improvement
Business Process ImprovementBusiness Process Improvement
Business Process Improvement
 
RCM
RCMRCM
RCM
 
Risk Identification Process PowerPoint Presentation Slides
Risk Identification Process PowerPoint Presentation SlidesRisk Identification Process PowerPoint Presentation Slides
Risk Identification Process PowerPoint Presentation Slides
 
BUSINESS CONTINUITY MANAGEMENT system
BUSINESS CONTINUITY MANAGEMENT systemBUSINESS CONTINUITY MANAGEMENT system
BUSINESS CONTINUITY MANAGEMENT system
 
Layered process audit
Layered process audit Layered process audit
Layered process audit
 
Impact Analysis Template - Enterprise
Impact Analysis Template - EnterpriseImpact Analysis Template - Enterprise
Impact Analysis Template - Enterprise
 
Data Driven Risk Assessment
Data Driven Risk AssessmentData Driven Risk Assessment
Data Driven Risk Assessment
 

Similaire à Risk Assessments and Reliability, What You Need To Know

Reliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability TechniquesReliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability TechniquesAccendo Reliability
 
BAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptxBAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptxQienKing
 
Reducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering MethodsReducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering MethodsWilde Analysis Ltd.
 
PyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python ApplicationsPyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python ApplicationsMichael Kehoe
 
Failure Mode and Effect Analysis
Failure Mode and Effect AnalysisFailure Mode and Effect Analysis
Failure Mode and Effect Analysistulasiva
 
Operational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss PreventionOperational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss PreventionMichael Marshall, PE
 
MERC Capabilities Briefing
MERC Capabilities BriefingMERC Capabilities Briefing
MERC Capabilities BriefingHerb MacMillan
 
Reliability Program Approval Presentation_
Reliability Program Approval Presentation_Reliability Program Approval Presentation_
Reliability Program Approval Presentation_Chad Broussard
 
Managing your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspectionsManaging your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspectionsEdwin A Merrick
 
Maintenance types
Maintenance typesMaintenance types
Maintenance typesMotasem Ash
 
Introduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBIIntroduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBIEdwin A Merrick
 
Reliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael AndrewsReliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael AndrewsWPICPE
 
Mechanical Integrity.pdf
Mechanical Integrity.pdfMechanical Integrity.pdf
Mechanical Integrity.pdfaashir14
 
Reliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex TrainingReliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex TrainingBryan Len
 
Res Technical recruitment & training profile
Res Technical recruitment & training profile Res Technical recruitment & training profile
Res Technical recruitment & training profile Alaa Thabet
 
Turner.john
Turner.johnTurner.john
Turner.johnNASAPMC
 
Turner.john
Turner.johnTurner.john
Turner.johnNASAPMC
 
Risk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the YearRisk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the YearKarl Davey
 

Similaire à Risk Assessments and Reliability, What You Need To Know (20)

Reliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability TechniquesReliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability Techniques
 
BAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptxBAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptx
 
Reducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering MethodsReducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering Methods
 
PyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python ApplicationsPyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python Applications
 
Failure Mode and Effect Analysis
Failure Mode and Effect AnalysisFailure Mode and Effect Analysis
Failure Mode and Effect Analysis
 
Operational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss PreventionOperational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss Prevention
 
MERC Capabilities Briefing
MERC Capabilities BriefingMERC Capabilities Briefing
MERC Capabilities Briefing
 
Reliability Program Approval Presentation_
Reliability Program Approval Presentation_Reliability Program Approval Presentation_
Reliability Program Approval Presentation_
 
Managing your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspectionsManaging your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspections
 
Maintenance types
Maintenance typesMaintenance types
Maintenance types
 
Introduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBIIntroduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBI
 
Reliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael AndrewsReliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael Andrews
 
Rbi
RbiRbi
Rbi
 
Mechanical Integrity.pdf
Mechanical Integrity.pdfMechanical Integrity.pdf
Mechanical Integrity.pdf
 
Reliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex TrainingReliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex Training
 
Res Technical recruitment & training profile
Res Technical recruitment & training profile Res Technical recruitment & training profile
Res Technical recruitment & training profile
 
Turner.john
Turner.johnTurner.john
Turner.john
 
Turner.john
Turner.johnTurner.john
Turner.john
 
FMEA.pptx
FMEA.pptxFMEA.pptx
FMEA.pptx
 
Risk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the YearRisk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the Year
 

Dernier

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 

Dernier (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Risk Assessments and Reliability, What You Need To Know

  • 1. INFRASTRUCTURE RELIABILITY AND RISK ASSESSMENTS Steven Shapiro, P.E., ATD Mission Critical Practice Lead Morrison Hershfield Mission Critical Morrison Hershfield Mission Critical
  • 2. WHAT YOU NEED TO KNOW AGENDA • RISK ASSESSMENT • INFRASTRUCTURE RELIABILITY COOLING POWER Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 3. RISK ASSESSMENTS • WHY • SITE EVALUATION • METRICS Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 4. Causes of Critical Failures • Location • Design • Redundancy level • Construction • Quality of equipment • Age Lurking Vulnerabilities • Operations & Maintenance program • Personnel training • Level of operator coverage • Thoroughness of the commissioning program 5 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments WHY
  • 5. Causes of Critical Failures • Equipment failure • Operator error • Natural disaster • Design error • Installation error • Commissioning or test deficiency • Maintenance oversight • Equipment design WHY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 6. Causes of Critical Failures • Root cause not always easy to ascertain • Combination of factors (Cascading Failures) • Latent failures • Most occur during change of state events • More maintenance does not necessarily mean higher availability • Non-Fault tolerant systems WHY FILURES Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 7. Causes of Critical Failures Commissioning or Test Deficiency 4% System Design Equipment Natural Disaster 20% Design 3% 13% Maintenance Oversight 4% Equipment Failure 28% Installation Error 10% Human Error 18% WHY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessment
  • 8. WHY DO RISK ASSESSMENT • Alignment of business mission and facility performance expectation • Quantifies the risk and exposure of the critical facilities to failure • Identifies vulnerabilities and single points of failure • First step in creating an action plan for site hardening • Benchmark against the industry • Assists in developing business case for capital expenditures RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 9. SITE EVALUATION STEP 1 • Quantify reliability expectations • Develop resiliency metrics RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 10. SITE EVALUATION STEP 2 • Develop PRA model (Probabilistic Risk Assessment) • Identify Single Points of Failure within critical systems • Evaluate redundancy of critical systems • Capacity and expendability analysis • Adequacy of Engineered Systems • Operation and maintenance policies, practices and procedures • Adequacy of maintenance and testing programs • Evaluate risks associated with site location • Overall Risk Analysis • Evaluate the adequacy of operations and maintenance programs RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 11. SITE EVALUATION STEP 2 cont. • Harmonics analysis • EMF studies • Short circuit & coordination studies • Air flow modeling-CFD RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 12. SITE EVALUATION STEP 3 • Perform gap analysis STEP 4 • Recommendations for upgrade/alteration to optimize facility performance • Budget and schedule development • Assess risk during implementation • Benchmark findings with industry standards RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 13. RISK ASSESSMENT METRICS • Probability of Failure/Reliability • Availability • MTTF • MTTR • Susceptibility to natural disasters • Fault tolerance • Single Points of Failure • Maintainability • Operational readiness • Maintenance program RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 14. INFRASTRUCTURE RELIABILITY • RELIABILITY / AVAILABLITY • RELIABILITY MODELING • RELIABILITY CONSIDERATIONS RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 15. RELIABILITY • “Reliability” is used as an umbrella definition • May Refer to Availability, Durability, Quality • Five 9’s ???? • Reliability = Probability of Successful Operation RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 16. RELIABILITY AND AVAILABILITY • Reliability predicts how likely is the system to fail. • Availability is a measure (or a future prediction) of what percentage of the time the system will operating properly RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 17. AVAILABILITY Five 9’s refers to Availability Availability (A) = Average fraction of time Something is in service and performing intended function. 99.999% availability means: • 5.3 minutes of downtime each year or • 1.77 hours of downtime every 20 years Availability does not specify how often an outage occurs RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 18. AVAILABILITY Availability (A) = MTBF/(MTBF + MTTR) MTTF: Mean Time To Failure MTBF: Mean Time Between Failures MTTR: Mean Time to Repair or Downtime MTBF=MTTF+MTTR RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 19. RELIABILITY BATHTUB CURVE Failure Rate early wear-out life useful life period 0.5 Time (t) Years YEARS 12 14 RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 20. RELIABILITY MODELING • Used to compare system designs and assist in the evaluation of risk versus the cost to mitigate the risk. • Failure and Repair data comes from IEEE 493, Recommended Practice for Design of Reliable Industrial and Commercial Power Systems (IEEE Gold Book) RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 21. RELIABILITY MODELING Components used for reliability modeling of the electrical system shown here: • Utility power • Generator • Circuit breakers • Switchboards • Cables • Automatic Transfer Switch • UPS module • Battery • Static Bypass Switch • Rack Power RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 22. RELIABILITY MODELING Reliability Block  Diagram (RBD) RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 23. RELIABILITY MODELING Shown below are the results of the calculations Hours Hours RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 24. THE TRADITIONAL CLASSIFICATION SYSTEM The Uptime Institute Tier 1 – Basic Non-Redundant Data Center Single path for power and cooling distribution without redundant components Tier 2 – Basic Redundant Data Center Single path for power and cooling distribution with redundant components Tier 3 – Concurrently Maintainable Data Center Multiple paths for power and cooling distribution with only one path active and with redundant components Tier 4 – Fault Tolerant Data Center Multiple active power and cooling distribution paths with redundant components and fault tolerant RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 25. Tier Definitions TIER REQUIREMENTS Tier I Tier II Tier III Tier IV 1 Active Number of Delivery Paths 1 1 2 Active 1 Passive Redundancy N N+1 N+1 2N Minimum Compartmentalization No No No Yes Concurrent Maintainability No No Yes Yes Fault Tolerance No No No Yes Availability 99.67 99.75 99.982 99.95 Downtime in Hr/Yr 28.8 22 1.6 0.4 RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 26. Data Center Cost From the UI • Tier I - $10,000 US/kW of Useable UPS Power Output • Tier II - $11,000 US/kW of Useable UPS Power Output • Tier III - $20,000 US/kW of Useable UPS Power Output • Tier IV - $22,000 US/kW of Useable UPS Power Output • Plus $225 US/SF of Computer Room RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 27. HOW MUCH REDUNDANCY IS ENOUGH? RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 28. Reliability Considerations Assumptions • Various configurations examined for single or dual utility feeders, UPS, Generators, STS’s, single or dual cords • Compare Reliability at 2000 KW and 4000 KW Load • 5 Year Probability of Failure RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 29. Single utility feeder, parallel redundant UPS and generators, single cord IT equipment
  • 30. 2N UPS, N+1 Generators, ASTSs, Dual Cord Rack
  • 31. Two Utility Feeders, 2(N+1) UPS, 2(N+1) Generators, ASTSs, Dual Cord Rack
  • 32. Distributed Redundant UPS, N+2 Generators, Two Utility Feeders, ASTSs and Dual Cord Rack
  • 33. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 34. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 35. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 36. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 37. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 38. Reliability Considerations Emergency Diesel Generators fail to start fail after ½ hour fail after 8 hours fail after 24 hours Study Performed by Idaho National Engineering Laboratory – February 1996 at Nuclear Power Plants RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 39. Reliability Considerations • 2(N+1) UPS/Generator with dual utility feeders - most reliable topology • 2(N+1) UPS > 2N UPS by small margin • 2N > Distributed Redundant by small margin • Significant improvement if a second utility feeder is provided • N+2 and/or 2N generator systems are more reliable than N+1 • Hybrid configuration in a hybrid facility is sometimes the best solution RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 40. Reliability Considerations • Assess the condition of the mechanical plant in conjunction with the electrical system • The facility reliability will be driven by the least reliable component (typically the electrical infrastructure) RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 41. System Reliability Block Electrical System Electrical Mechanical Electrical systempow ering the Mechanical systemsupporting critical critical load load RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 42. System Reliability Block MTBF Availability Pf (3 years) Electrical system alone 330,184 0.99999 8.10% Mechanical system alone 178,611 0.999943 11.70% Electrical system supporting mechanical 108,500 0.999985 21.40% Overall mechanical system 70,087 0.999931 29.20% Combined electrical mechanical system 57,819 0.999922 36.90% Electrical System Electrical Mechanical Electrical system powering the Mechanical system supporting critical critical load load RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 43. The Cost of Reliability Reliability 99.9999 99.999 99.99 99.9 99.0 .9 $ $$ $$$ $$$$ $$$$$ RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 44. Key Takeaways – Risk Assessment • What Reliability Level Do you Really Need Based on Your Business Case? • Minimize Single Points of Failure • Concurrent Maintainability? • Fault Tolerance? • Ensure Adequacy of Operations, Maintenance and Testing Programs • How to justify the cost to upgrade from present state? RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 45. Key Takeaways – Reliability • Design objective – find optimum compromise between cost and reliability • Size matters – larger facilities yield lower reliability • System architecture and design implementation is more important role than equipment selection • Segregate system in independent blocks • Eliminate common source components to minimize fault propagation (i.e. LBS, hot-tie, manual bus ties) • Move single points of failures as close to the load as possible • Always maintain two independent sources of power to the critical load • Optimize the design of monitoring and controls circuits • Keep it simple/minimize human intervention/Utilize Automation RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 46. Thank you and please feel QUESTIONS? free to contact me Steven Shapiro, PE, ATD SShapiro@MorrisonHershfield.com 914.420.3213 http://www.linkedin.com/in/stevenshapirope References: Uptime Institute White Papers: Tier Myths and Misconceptions Data Center Site Infrastructure Tier Standard: Topology
  • 47. Building Areas/Systems Reviewed ‫׀‬ General Construction ‫׀‬ Electrical ‫׀‬ Mechanical ‫׀‬ Plumbing And Fire Protection ‫׀‬ Operation and Maintenance ‫׀‬ Security  ‫׀‬ Load Density 48 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 48. Site Reliability • Is Project Compatible With Zoning • Natural Environment Issues ‫׀‬ Seismic Zone ‫׀‬ Geo Technical Reports ‫׀‬ Sub Surface Conditions ‫׀‬ Tornado/hurricane Risk ‫׀‬ Site Flood Potential ‫׀‬ Fire Potential ‫׀‬ Site Topography ‫׀‬ Weather Extremes • Man‐Made Environment Issues ‫׀‬ Power/Data and Communication/Water Supply/Sanitary Sewer Availability ‫׀‬ ISP Connectivity to Mirror and DR Sites ‫׀‬ Proximity of Hazardous Operational Facilities, i.e. Nuclear Power Plants, Military Bases,  Chemical Plants, Tank Farms, Water/Sewage Treatment Plants, Dams/Reservoirs, Gas  Stations, etc. ‫׀‬ Distance to Airports & Freeways ‫׀‬ Distance to Emergency Services, i.e. Fire and Police Departments, Hospital  49 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 49. Building Areas/Systems Reviewed Building Utilities and Physical Issues ‫ ׀‬General building systems and area characteristics ‫ ׀‬Life safety and environmental Electrical Systems ‫ ׀‬Utility feeders ‫ ׀‬Service entry ‫ ׀‬Base building electrical distribution system including busways, step‐down  transformers, switchgear and distribution panels ‫׀‬ Uninterruptible power supply (UPS) systems ‫׀‬ Battery systems ‫׀‬ Power Distribution System including the critical computer rooms ‫׀‬ Emergency/standby generator and fuel system ‫׀‬ Normal/standby power transfer switchgear ‫׀‬ Grounding ‫׀‬ Emergency Power Off Systems ‫׀‬ Lightning protection system ‫׀‬ Fire alarm and smoke detection systems 50 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 50. Building Areas/Systems Reviewed • Mechanical Systems ‫׀‬ Critical Systems Chilled Water Plant:  Chillers, pumps, piping distribution system,  controls, etc ‫׀‬ Critical Systems Condenser Water System:  Cooling towers, pumps, piping, etc ‫׀‬ Critical Systems Air Handling Systems ‫׀‬ Critical Systems Air Distribution ‫׀‬ Critical Systems Secondary Chilled Water Loop ‫׀‬ Fuel Oil Systems ‫׀‬ Boiler Systems ‫׀‬ Compressed Air Systems • Plumbing Systems ‫׀‬ Domestic Water Systems ‫׀‬ Natural Gas Systems ‫׀‬ Fire Suppression Systems (Water and Gaseous) • Operation and Maintenance of the Critical Support Systems ‫׀‬ Maintenance procedures and programs ‫׀‬ Normal operating procedures ‫׀‬ Emergency operating procedures ‫׀‬ Training programs and methods ‫׀‬ Spare parts 51 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 51. Building Areas/Systems Reviewed • Building Automation ‫׀‬ Building Automation Systems. ‫׀‬ Physical Security Systems. ‫׀‬ Access control ‫׀‬ Intrusion detection ‫׀‬ CCTV systems ‫׀‬ ID badging systems ‫׀‬ Intercom systems ‫׀‬ Smoke Purge Systems • Technology Systems ‫׀‬ Entrance Facility Feeds. ‫׀‬ Telephone Company Services. • Systems Integration: ‫׀‬ The integration, compatibility and interaction of the above systems with each  other, as well as with the other building elements will be reviewed to ensure that  the systems are compatible and fully integrated. 52 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT