SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
Gluster for Geeks:
Performance Tips &
Tricks

Jacob Shucart
August 25th, 2011
Some Housekeeping Items…

 Ask a question at any time              How To Ask a Question?

 Questions will be answered at
the end of the webinar


 Slides will be available after
the webinar


 The webinar is being
recorded


            A Better Way To Do Storage                       2
Gluster for Geeks

  The Gluster for Geeks webinar series is designed
 for technical audiences who are familiar with
 GlusterFS


  In this edition, “Performance tuning tips and tricks”
 we will discuss in detail the performance related
 considerations for a running a GlusterFS
 deployment




        A Better Way To Do Storage                    3
Topics

 Planning
 Configuration
 Implementing
 Tuning
 Benchmarking
 Top 5 Issues




         A Better Way To Do Storage   4
Planning – Key Considerations

 Performance requirements
  – What performance do you need to hit & how do you plan to get to it?
       •   Read
       •   Write
       •   Throughput
       •   Availability
 For a given performance level what type is required?
  – E.g. for a throughput of X and capacity of Y what is needed?
 Workloads
  –   What is the workload in the environment?
  –   Small files?
  –   Large files?
  –   Is throughput your only consideration?
  –   What is the application?



             A Better Way To Do Storage                              5
Planning - Sizing and Architecture

 Gluster performance relies on hardware/underlying infrastructure
  –   CPU, memory, disks, network
  –   Virtual machine & cloud infrastructure
  –   Number of systems in the cluster depends on performance and capacity requirements
  –   There are many ways to meet organizational needs
  –   For on-prem 2U & 4U DAS systems and JBODS are great building blocks
 Examples: 3 common deployment scenarios
  – Capacity-centric environments
       • 2U & 4U DAS systems with multiple JBODS
       • Lower RAM and CPU requirements
       • Lower network requirements
  – Mixed capacity and performance environments
       • 2U & 4U DAS systems with 1-2 JBODS max
       • Higher RAM and CPU requirements
       • Low to high network requirements
  – High performance environments
       • 1U or 2U systems with no JBODS
       • Highest RAM and CPU requirements
       • Fast disks and fast network




            A Better Way To Do Storage                                                    6
Configuration

 Choosing the correct volume type for a workload
 Volume options include
  – Distribute – higher performance, no redundancy
  – Replicate(or distribute+replicate) – general purpose, HA, faster
    reads
  – Stripe(or distribute+stripe) – high concurrent reads, low writes, no
    redundancy
 Protocols & performance
  – GlusterFS gives the best overall performance (pNFS like
    functionality)
  – NFS gives excellent performance given right workload
  – CIFS should only be used for Windows systems
 Data flow
  – How do supported protocols differ?



         A Better Way To Do Storage                                        7
Implementing – Cluster Hardware Configuration

 Node and cluster configurations
  – More CPU means greater parallel threads on servers
  – More RAM means more cached operations
  – More network means more throughput
  Dedicated backend network for node
 communication
  – Dedicated back end network should be used for NFS and
    CIFS
  – Recommend 10GBe minimum
  GlusterFS native only uses inter-node
 communication for management calls


        A Better Way To Do Storage                          8
Implementing Gluster - Fundamentals

 Distribute only
  • Non-redundant at the brick level
    •   Cuts hardware, software costs in half.
    •   Failure of a brick or node results in loss of access to the data on those bricks.
    •   Writes destined to the failed brick will fail.
    •   Redundant RAID, hardware is strongly recommended.




           A Better Way To Do Storage                                                       9
Implementing Gluster - Fundamentals

 Distribute with replica
  • Redundant at the brick level
    •   Failure of a brick or node does not affect I/O.
    •   Writes are written simultaneously to each replica.
    •   Any number of replicas are supported.
    •   Gluster Native, CIFS, and NFS support stateful failover. (Gluster Native only in AWS)
    •   Redundant RAID, hardware is strongly recommended.




          A Better Way To Do Storage                                                            10
Implementing Gluster - Fundamentals

 Gluster Native client data flow




       A Better Way To Do Storage     11
Implementing Gluster - Fundamentals
 NFS, CIFS dataflow




       A Better Way To Do Storage     12
Tuning

 Key tuning parameters
  –   performance.write-behind-window-size 65535 (in bytes)
  –   performance.cache-refresh-timeout 1 (in seconds)
  –   performance.cache-size 1073741824 (in bytes)
  –   performance.read-ahead off (only for 1GbE)
  –   Default settings are suitable for mixed workloads
 Tuning for different environments
  – For Amazon, m1.xlarge or greater
  – Understand hardware/firmware settings and their impact on
    performance(for example, CPU frequency scaling and IB,
    10GbE and the TCP Offload Engine)



          A Better Way To Do Storage                            13
Benchmarking

 From the Gluster Performance white paper
  – iozone –R –l 3 –u 5 –r 512k –s 256m –F /mnt/1 /mnt/2 /mnt/3
    /mnt/4 /mnt/5
  – dd if=/dev/zero of=/mnt/test bs=1M count=1

 Performance expectations
  – Get a baseline benchmark of disks on systems
  – What can you expect from your network?

 IOPS vs. throughput
  – Is your workload better measured in throughput
  – Certain operations have different impact(dir creation)
  – If IOPS is your measurement remember latency



         A Better Way To Do Storage                               14
Top 5 Causes for Performance Issues

  Straight from our professional services
 performance team
1.   Underpowered/mis-configured disks
2.   Underpowered/mis-configured network
3.   Faulty hardware(broken/bad blocks/etc)
4.   Too few servers
5.   Wrong protocol for the job




         A Better Way To Do Storage           15
Conclusion

 GlusterFS performance depends heavily on the underlying
hardware
 You should understand your workloads to guide your
hardware configuration
The default parameters work well for general workloads
Several tuning parameters are available
 When experiencing performance issues check the disks
and network first




          A Better Way To Do Storage                       16
Polling Question

What should we talk about in next months Gluster
             Geeks Only webinar?
      A.    Setting up a basic Gluster cluster
                B.    Gluster Geo-Replication
           C.    Frequently Asked Questions
                     D.    Gluster Translators
                 E.       Other technical topics




           A Better Way To Do Storage              17
Questions & Resources

What are your performance questions?
 – Ask now using the Go-to-webinar questions panel

Helpful resources
 – Performance white paper posted here:
   http://www.gluster.com/products/resources/
 – Documentation: http://gluster.com/community/documentation
 – Questions?: http://community.gluster.org/




            A Better Way To Do Storage                         18

Contenu connexe

Tendances

Tendances (20)

Disk health prediction for Ceph
Disk health prediction for CephDisk health prediction for Ceph
Disk health prediction for Ceph
 
SR-IOV Introduce
SR-IOV IntroduceSR-IOV Introduce
SR-IOV Introduce
 
Crimson: Ceph for the Age of NVMe and Persistent Memory
Crimson: Ceph for the Age of NVMe and Persistent MemoryCrimson: Ceph for the Age of NVMe and Persistent Memory
Crimson: Ceph for the Age of NVMe and Persistent Memory
 
Seastore: Next Generation Backing Store for Ceph
Seastore: Next Generation Backing Store for CephSeastore: Next Generation Backing Store for Ceph
Seastore: Next Generation Backing Store for Ceph
 
Best Practices with PostgreSQL on Solaris
Best Practices with PostgreSQL on SolarisBest Practices with PostgreSQL on Solaris
Best Practices with PostgreSQL on Solaris
 
Container Performance Analysis
Container Performance AnalysisContainer Performance Analysis
Container Performance Analysis
 
Linux memory
Linux memoryLinux memory
Linux memory
 
Futex Scaling for Multi-core Systems
Futex Scaling for Multi-core SystemsFutex Scaling for Multi-core Systems
Futex Scaling for Multi-core Systems
 
Block Storage For VMs With Ceph
Block Storage For VMs With CephBlock Storage For VMs With Ceph
Block Storage For VMs With Ceph
 
eBPF in the view of a storage developer
eBPF in the view of a storage developereBPF in the view of a storage developer
eBPF in the view of a storage developer
 
Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital Kedia
Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital KediaTuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital Kedia
Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital Kedia
 
AF Ceph: Ceph Performance Analysis and Improvement on Flash
AF Ceph: Ceph Performance Analysis and Improvement on FlashAF Ceph: Ceph Performance Analysis and Improvement on Flash
AF Ceph: Ceph Performance Analysis and Improvement on Flash
 
Using cgroups in docker container
Using cgroups in docker containerUsing cgroups in docker container
Using cgroups in docker container
 
USENIX LISA11 Tutorial: ZFS a
USENIX LISA11 Tutorial: ZFS a USENIX LISA11 Tutorial: ZFS a
USENIX LISA11 Tutorial: ZFS a
 
BlueStore, A New Storage Backend for Ceph, One Year In
BlueStore, A New Storage Backend for Ceph, One Year InBlueStore, A New Storage Backend for Ceph, One Year In
BlueStore, A New Storage Backend for Ceph, One Year In
 
Ceph Performance and Sizing Guide
Ceph Performance and Sizing GuideCeph Performance and Sizing Guide
Ceph Performance and Sizing Guide
 
Linux Network Stack
Linux Network StackLinux Network Stack
Linux Network Stack
 
HBaseCon 2015: HBase Performance Tuning @ Salesforce
HBaseCon 2015: HBase Performance Tuning @ SalesforceHBaseCon 2015: HBase Performance Tuning @ Salesforce
HBaseCon 2015: HBase Performance Tuning @ Salesforce
 
Linux kernel debugging
Linux kernel debuggingLinux kernel debugging
Linux kernel debugging
 
Feature Hashing for Scalable Machine Learning: Spark Summit East talk by Nick...
Feature Hashing for Scalable Machine Learning: Spark Summit East talk by Nick...Feature Hashing for Scalable Machine Learning: Spark Summit East talk by Nick...
Feature Hashing for Scalable Machine Learning: Spark Summit East talk by Nick...
 

En vedette

Filesystem Comparison: NFS vs GFS2 vs OCFS2
Filesystem Comparison: NFS vs GFS2 vs OCFS2Filesystem Comparison: NFS vs GFS2 vs OCFS2
Filesystem Comparison: NFS vs GFS2 vs OCFS2
Giuseppe Paterno'
 
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
Gluster.org
 

En vedette (20)

Performance comparison of Distributed File Systems on 1Gbit networks
Performance comparison of Distributed File Systems on 1Gbit networksPerformance comparison of Distributed File Systems on 1Gbit networks
Performance comparison of Distributed File Systems on 1Gbit networks
 
Erasure codes and storage tiers on gluster
Erasure codes and storage tiers on glusterErasure codes and storage tiers on gluster
Erasure codes and storage tiers on gluster
 
Filesystem Comparison: NFS vs GFS2 vs OCFS2
Filesystem Comparison: NFS vs GFS2 vs OCFS2Filesystem Comparison: NFS vs GFS2 vs OCFS2
Filesystem Comparison: NFS vs GFS2 vs OCFS2
 
Gluster Data Tiering
Gluster Data TieringGluster Data Tiering
Gluster Data Tiering
 
State of Gluster Performance
State of Gluster PerformanceState of Gluster Performance
State of Gluster Performance
 
Gluster.community.day.2013
Gluster.community.day.2013Gluster.community.day.2013
Gluster.community.day.2013
 
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
 
Gfs vs hdfs
Gfs vs hdfsGfs vs hdfs
Gfs vs hdfs
 
GlusterFS座談会テクニカルセッション
GlusterFS座談会テクニカルセッションGlusterFS座談会テクニカルセッション
GlusterFS座談会テクニカルセッション
 
The Google File System (GFS)
The Google File System (GFS)The Google File System (GFS)
The Google File System (GFS)
 
Gluster the ugly parts with Jeff Darcy
Gluster  the ugly parts with Jeff DarcyGluster  the ugly parts with Jeff Darcy
Gluster the ugly parts with Jeff Darcy
 
Hands On Gluster with Jeff Darcy
Hands On Gluster with Jeff DarcyHands On Gluster with Jeff Darcy
Hands On Gluster with Jeff Darcy
 
Gluster fs current_features_and_roadmap
Gluster fs current_features_and_roadmapGluster fs current_features_and_roadmap
Gluster fs current_features_and_roadmap
 
Deep Dive: Maximizing EC2 and EBS Performance
Deep Dive: Maximizing EC2 and EBS PerformanceDeep Dive: Maximizing EC2 and EBS Performance
Deep Dive: Maximizing EC2 and EBS Performance
 
(SDD416) Amazon EBS Deep Dive | AWS re:Invent 2014
(SDD416) Amazon EBS Deep Dive | AWS re:Invent 2014(SDD416) Amazon EBS Deep Dive | AWS re:Invent 2014
(SDD416) Amazon EBS Deep Dive | AWS re:Invent 2014
 
Red Hat Storage Server For AWS
Red Hat Storage Server For AWSRed Hat Storage Server For AWS
Red Hat Storage Server For AWS
 
멀티 티어 시스템을 위한 통합 성능 관리 방안_APM
멀티 티어 시스템을 위한 통합 성능 관리 방안_APM멀티 티어 시스템을 위한 통합 성능 관리 방안_APM
멀티 티어 시스템을 위한 통합 성능 관리 방안_APM
 
Challenges with Gluster and Persistent Memory with Dan Lambright
Challenges with Gluster and Persistent Memory with Dan LambrightChallenges with Gluster and Persistent Memory with Dan Lambright
Challenges with Gluster and Persistent Memory with Dan Lambright
 
Erasure Coding Costs and Benefits
Erasure Coding Costs and BenefitsErasure Coding Costs and Benefits
Erasure Coding Costs and Benefits
 
Trying and evaluating the new features of GlusterFS 3.5
Trying and evaluating the new features of GlusterFS 3.5Trying and evaluating the new features of GlusterFS 3.5
Trying and evaluating the new features of GlusterFS 3.5
 

Similaire à Gluster for Geeks: Performance Tuning Tips & Tricks

Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Community
 
Designing Information Structures For Performance And Reliability
Designing Information Structures For Performance And ReliabilityDesigning Information Structures For Performance And Reliability
Designing Information Structures For Performance And Reliability
bryanrandol
 
Tips and Tricks for SAP Sybase IQ
Tips and Tricks for SAP  Sybase IQTips and Tricks for SAP  Sybase IQ
Tips and Tricks for SAP Sybase IQ
Don Brizendine
 

Similaire à Gluster for Geeks: Performance Tuning Tips & Tricks (20)

Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
 
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
 
Tuning Linux Windows and Firebird for Heavy Workload
Tuning Linux Windows and Firebird for Heavy WorkloadTuning Linux Windows and Firebird for Heavy Workload
Tuning Linux Windows and Firebird for Heavy Workload
 
SUSE Storage: Sizing and Performance (Ceph)
SUSE Storage: Sizing and Performance (Ceph)SUSE Storage: Sizing and Performance (Ceph)
SUSE Storage: Sizing and Performance (Ceph)
 
Presentation db2 best practices for optimal performance
Presentation   db2 best practices for optimal performancePresentation   db2 best practices for optimal performance
Presentation db2 best practices for optimal performance
 
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
 
Capacity Planning
Capacity PlanningCapacity Planning
Capacity Planning
 
MongoDB Capacity Planning
MongoDB Capacity PlanningMongoDB Capacity Planning
MongoDB Capacity Planning
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph
 
Presentation db2 best practices for optimal performance
Presentation   db2 best practices for optimal performancePresentation   db2 best practices for optimal performance
Presentation db2 best practices for optimal performance
 
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
 
071410 sun a_1515_feldman_stephen
071410 sun a_1515_feldman_stephen071410 sun a_1515_feldman_stephen
071410 sun a_1515_feldman_stephen
 
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
 
Backup management with Ceph Storage - Camilo Echevarne, Félix Barbeira
Backup management with Ceph Storage - Camilo Echevarne, Félix BarbeiraBackup management with Ceph Storage - Camilo Echevarne, Félix Barbeira
Backup management with Ceph Storage - Camilo Echevarne, Félix Barbeira
 
Oracle real application_cluster
Oracle real application_clusterOracle real application_cluster
Oracle real application_cluster
 
Designing Information Structures For Performance And Reliability
Designing Information Structures For Performance And ReliabilityDesigning Information Structures For Performance And Reliability
Designing Information Structures For Performance And Reliability
 
Tips and Tricks for SAP Sybase IQ
Tips and Tricks for SAP  Sybase IQTips and Tricks for SAP  Sybase IQ
Tips and Tricks for SAP Sybase IQ
 
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
 
Taking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - ArchitectureTaking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - Architecture
 
Comparison of-foss-distributed-storage
Comparison of-foss-distributed-storageComparison of-foss-distributed-storage
Comparison of-foss-distributed-storage
 

Plus de GlusterFS

Red Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSRed Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFS
GlusterFS
 
GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS
 

Plus de GlusterFS (14)

Red Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSRed Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFS
 
Introduction to GlusterFS Webinar - September 2011
Introduction to GlusterFS Webinar - September 2011Introduction to GlusterFS Webinar - September 2011
Introduction to GlusterFS Webinar - September 2011
 
Webinar Sept 22: Gluster Partners with Redapt to Deliver Scale-Out NAS Storage
Webinar Sept 22: Gluster Partners with Redapt to Deliver Scale-Out NAS StorageWebinar Sept 22: Gluster Partners with Redapt to Deliver Scale-Out NAS Storage
Webinar Sept 22: Gluster Partners with Redapt to Deliver Scale-Out NAS Storage
 
Intro to GlusterFS Webinar - August 2011
Intro to GlusterFS Webinar - August 2011Intro to GlusterFS Webinar - August 2011
Intro to GlusterFS Webinar - August 2011
 
Gluster Webinar: Introduction to GlusterFS v3.3
Gluster Webinar: Introduction to GlusterFS v3.3Gluster Webinar: Introduction to GlusterFS v3.3
Gluster Webinar: Introduction to GlusterFS v3.3
 
Cloud Storage Adoption, Practice, and Deployment
Cloud Storage Adoption, Practice, and DeploymentCloud Storage Adoption, Practice, and Deployment
Cloud Storage Adoption, Practice, and Deployment
 
GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 Meetup
 
Gluster Webinar: Introduction to GlusterFS
Gluster Webinar: Introduction to GlusterFSGluster Webinar: Introduction to GlusterFS
Gluster Webinar: Introduction to GlusterFS
 
Award winning scale-up and scale-out storage for Xen
Award winning scale-up and scale-out storage for XenAward winning scale-up and scale-out storage for Xen
Award winning scale-up and scale-out storage for Xen
 
Future of cloud storage
Future of cloud storageFuture of cloud storage
Future of cloud storage
 
Gluster Webinar May 25: Whats New in GlusterFS 3.2
Gluster Webinar May 25: Whats New in GlusterFS 3.2Gluster Webinar May 25: Whats New in GlusterFS 3.2
Gluster Webinar May 25: Whats New in GlusterFS 3.2
 
Gluster Blog 11.15.2010
Gluster Blog 11.15.2010Gluster Blog 11.15.2010
Gluster Blog 11.15.2010
 
Gluster Storage Platform Installation Guide
Gluster Storage Platform Installation GuideGluster Storage Platform Installation Guide
Gluster Storage Platform Installation Guide
 
How to Install Gluster Storage Platform
How to Install Gluster Storage PlatformHow to Install Gluster Storage Platform
How to Install Gluster Storage Platform
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Gluster for Geeks: Performance Tuning Tips & Tricks

  • 1. Gluster for Geeks: Performance Tips & Tricks Jacob Shucart August 25th, 2011
  • 2. Some Housekeeping Items… Ask a question at any time How To Ask a Question? Questions will be answered at the end of the webinar Slides will be available after the webinar The webinar is being recorded A Better Way To Do Storage 2
  • 3. Gluster for Geeks The Gluster for Geeks webinar series is designed for technical audiences who are familiar with GlusterFS In this edition, “Performance tuning tips and tricks” we will discuss in detail the performance related considerations for a running a GlusterFS deployment A Better Way To Do Storage 3
  • 4. Topics Planning Configuration Implementing Tuning Benchmarking Top 5 Issues A Better Way To Do Storage 4
  • 5. Planning – Key Considerations Performance requirements – What performance do you need to hit & how do you plan to get to it? • Read • Write • Throughput • Availability For a given performance level what type is required? – E.g. for a throughput of X and capacity of Y what is needed? Workloads – What is the workload in the environment? – Small files? – Large files? – Is throughput your only consideration? – What is the application? A Better Way To Do Storage 5
  • 6. Planning - Sizing and Architecture Gluster performance relies on hardware/underlying infrastructure – CPU, memory, disks, network – Virtual machine & cloud infrastructure – Number of systems in the cluster depends on performance and capacity requirements – There are many ways to meet organizational needs – For on-prem 2U & 4U DAS systems and JBODS are great building blocks Examples: 3 common deployment scenarios – Capacity-centric environments • 2U & 4U DAS systems with multiple JBODS • Lower RAM and CPU requirements • Lower network requirements – Mixed capacity and performance environments • 2U & 4U DAS systems with 1-2 JBODS max • Higher RAM and CPU requirements • Low to high network requirements – High performance environments • 1U or 2U systems with no JBODS • Highest RAM and CPU requirements • Fast disks and fast network A Better Way To Do Storage 6
  • 7. Configuration Choosing the correct volume type for a workload Volume options include – Distribute – higher performance, no redundancy – Replicate(or distribute+replicate) – general purpose, HA, faster reads – Stripe(or distribute+stripe) – high concurrent reads, low writes, no redundancy Protocols & performance – GlusterFS gives the best overall performance (pNFS like functionality) – NFS gives excellent performance given right workload – CIFS should only be used for Windows systems Data flow – How do supported protocols differ? A Better Way To Do Storage 7
  • 8. Implementing – Cluster Hardware Configuration Node and cluster configurations – More CPU means greater parallel threads on servers – More RAM means more cached operations – More network means more throughput Dedicated backend network for node communication – Dedicated back end network should be used for NFS and CIFS – Recommend 10GBe minimum GlusterFS native only uses inter-node communication for management calls A Better Way To Do Storage 8
  • 9. Implementing Gluster - Fundamentals Distribute only • Non-redundant at the brick level • Cuts hardware, software costs in half. • Failure of a brick or node results in loss of access to the data on those bricks. • Writes destined to the failed brick will fail. • Redundant RAID, hardware is strongly recommended. A Better Way To Do Storage 9
  • 10. Implementing Gluster - Fundamentals Distribute with replica • Redundant at the brick level • Failure of a brick or node does not affect I/O. • Writes are written simultaneously to each replica. • Any number of replicas are supported. • Gluster Native, CIFS, and NFS support stateful failover. (Gluster Native only in AWS) • Redundant RAID, hardware is strongly recommended. A Better Way To Do Storage 10
  • 11. Implementing Gluster - Fundamentals Gluster Native client data flow A Better Way To Do Storage 11
  • 12. Implementing Gluster - Fundamentals NFS, CIFS dataflow A Better Way To Do Storage 12
  • 13. Tuning Key tuning parameters – performance.write-behind-window-size 65535 (in bytes) – performance.cache-refresh-timeout 1 (in seconds) – performance.cache-size 1073741824 (in bytes) – performance.read-ahead off (only for 1GbE) – Default settings are suitable for mixed workloads Tuning for different environments – For Amazon, m1.xlarge or greater – Understand hardware/firmware settings and their impact on performance(for example, CPU frequency scaling and IB, 10GbE and the TCP Offload Engine) A Better Way To Do Storage 13
  • 14. Benchmarking From the Gluster Performance white paper – iozone –R –l 3 –u 5 –r 512k –s 256m –F /mnt/1 /mnt/2 /mnt/3 /mnt/4 /mnt/5 – dd if=/dev/zero of=/mnt/test bs=1M count=1 Performance expectations – Get a baseline benchmark of disks on systems – What can you expect from your network? IOPS vs. throughput – Is your workload better measured in throughput – Certain operations have different impact(dir creation) – If IOPS is your measurement remember latency A Better Way To Do Storage 14
  • 15. Top 5 Causes for Performance Issues Straight from our professional services performance team 1. Underpowered/mis-configured disks 2. Underpowered/mis-configured network 3. Faulty hardware(broken/bad blocks/etc) 4. Too few servers 5. Wrong protocol for the job A Better Way To Do Storage 15
  • 16. Conclusion GlusterFS performance depends heavily on the underlying hardware You should understand your workloads to guide your hardware configuration The default parameters work well for general workloads Several tuning parameters are available When experiencing performance issues check the disks and network first A Better Way To Do Storage 16
  • 17. Polling Question What should we talk about in next months Gluster Geeks Only webinar? A. Setting up a basic Gluster cluster B. Gluster Geo-Replication C. Frequently Asked Questions D. Gluster Translators E. Other technical topics A Better Way To Do Storage 17
  • 18. Questions & Resources What are your performance questions? – Ask now using the Go-to-webinar questions panel Helpful resources – Performance white paper posted here: http://www.gluster.com/products/resources/ – Documentation: http://gluster.com/community/documentation – Questions?: http://community.gluster.org/ A Better Way To Do Storage 18