SlideShare une entreprise Scribd logo
1  sur  56
Télécharger pour lire hors ligne
DUSTIN L. BLACK, RHCA1
Dustin L. Black, RHCA
Sr. Technical Account Manager, Red Hat
2012-07-17
Demystifying Gluster
GlusterFS for the SysAdmin
DUSTIN L. BLACK, RHCA2
#whoami
● Systems and Infrastructure Geek
● Decade+ of Linux, UNIX, networking
● <notacoder/>
● Believe in Open Source Everything
● Sr. Technical Account Manager, Red
Hat GSS
● dustin@redhat.com
DUSTIN L. BLACK, RHCA3
#whatis TAM
● Premium named-resource support
● Proactive and early access
● Regular calls and on-site engagements
● Customer advocate within Red Hat and upstream
● Multi-vendor support coordinator
● High-touch access to engineering
● Influence for software enhancements
● NOT Hands-on or consulting
DUSTIN L. BLACK, RHCA4
Agenda
● Technology Overview
● A Peek at Gluster Logic
● Scaling Up and Out
● Redundancy and Fault Tolerance
● Data Access
● General Administration
● Use Cases
DUSTIN L. BLACK, RHCA5
Technology Overview
Demystifying Gluster
GlusterFS for the SysAdmin
DUSTIN L. BLACK, RHCA6
What is GlusterFS?
● POSIX-Compliant Distributed File System
● No Metadata Server
● Network Attached Storage (NAS)
● Heterogeneous Commodity Hardware
● Aggregated Storage and Memory
● Standards-Based – Clients, Applications, Networks
● Flexible and Agile Scaling
● Capacity – Petabytes and beyond
● Performance – Thousands of Clients
● Single Global Namespace
DUSTIN L. BLACK, RHCA7
What is Red Hat Storage?
● Enterprise Implementation of GlusterFS
● Software Appliance
● Bare Metal Installation
● Built on RHEL + XFS
● Subscription Model
● Storage Software Appliance
● Datacenter and Private Cloud Deployments
● Virtual Storage Appliance
● Amazon Web Services Public Cloud Deployments
DUSTIN L. BLACK, RHCA8
GlusterFS vs. Traditional Solutions
● A basic NAS has limited scalability and redundancy
● Other distributed filesystems limited by metadata
● SAN is costly & complicated but high performance &
scalable
● GlusterFS
● Linear Scaling
● Minimal Overhead
● High Redundancy
● Simple and Inexpensive Deployment
DUSTIN L. BLACK, RHCA9
Technology Stack
Demystifying Gluster
GlusterFS for the SysAdmin
DUSTIN L. BLACK, RHCA10
Terminology
● Brick
● A filesystem mountpoint
● A unit of storage used as a GlusterFS building block
● Translator
● Logic between the bits and the Global Namespace
● Layered to provide GlusterFS functionality
● Volume
● Bricks combined and passed through translators
● Node
● Server running the gluster daemon and sharing volumes
DUSTIN L. BLACK, RHCA11
Foundation Components
● Private Cloud (Datacenter)
● Common Commodity x86_64 Servers
● RHS: Hardware Compatibility List (HCL)
● Public Cloud
● Amazon Web Services (AWS)
● EC2 + EBS
DUSTIN L. BLACK, RHCA12
Disk, LVM, and Filesystems
● Direct-Attached Storage (DAS)
● Just a Bunch Of Disks (JBOD)
● Hardware RAID
● RHS: RAID 6 required
● Logical Volume Management (LVM)
● XFS, EXT3/4, BTRFS
● Extended attributes support required
DUSTIN L. BLACK, RHCA13
Data Access Overview
● Gluster Native Client
● Filesystem in Userspace (FUSE)
● NFS
● Built-in Service
● SMB/CIFS
● Samba server required
DUSTIN L. BLACK, RHCA14
Gluster Components
● glusterd
● Elastic volume management daemon
● Runs on all export servers
● Interfaced through gluster CLI
● glusterfsd
● GlusterFS brick daemon
● One process for each brick
● Managed by glusterd
DUSTIN L. BLACK, RHCA15
Gluster Components
● glusterfs
● NFS server daemon
● FUSE client daemon
● mount.glusterfs
● FUSE native mount tool
● gluster
● Gluster Console Manager (CLI)
DUSTIN L. BLACK, RHCA16
Putting it All Together
DUSTIN L. BLACK, RHCA17
Scaling
Demystifying Gluster
GlusterFS for the SysAdmin
DUSTIN L. BLACK, RHCA18
Scaling Up
● Add disks and filesystems to a node
● Expand a gluster volume by adding bricks
XFS
DUSTIN L. BLACK, RHCA19
Scaling Out
● Add gluster nodes to trusted pool
● Add filesystems as new bricks
DUSTIN L. BLACK, RHCA20
Under the Hood
Demystifying Gluster
GlusterFS for the SysAdmin
DUSTIN L. BLACK, RHCA21
Elastic Hash Algorithm
● No central metadata
● No Performance Bottleneck
● Eliminates risk scenarios
● Location hashed intelligently on path and filename
● Unique identifiers, similar to md5sum
● The “Elastic” Part
● Files assigned to virtual volumes
● Virtual volumes assigned to multiple bricks
● Volumes easily reassigned on the fly
DUSTIN L. BLACK, RHCA22
Translators
DUSTIN L. BLACK, RHCA23
Distribution and Replication
Demystifying Gluster
GlusterFS for the SysAdmin
DUSTIN L. BLACK, RHCA24
Distributed Volume
● Files “evenly” spread across bricks
● File-level RAID 0
● Server/Disk failure could be catastrophic
DUSTIN L. BLACK, RHCA25
Replicated Volume
● Copies files to multiple bricks
● File-level RAID 1
DUSTIN L. BLACK, RHCA26
Distributed Replicated Volume
● Distributes files across replicated bricks
● RAID 1 plus improved read performance
DUSTIN L. BLACK, RHCA27
Geo Replication
● Asynchronous across LAN, WAN, or Internet
● Master-Slave model -- Cascading possible
● Continuous and incremental
● Time should be synchronized on all master nodes
DUSTIN L. BLACK, RHCA28
Replicated Volumes vs Geo-replication
Replicated Volumes Geo-replication
Mirrors data across clusters Mirrors data across geographically
distributed clusters
Provides high-availability Ensures backing up of data for disaster
recovery
Synchronous replication (each and every
file operation is sent across all the bricks)
Asynchronous replication (checks for the
changes in files periodically and syncs
them on detecting differences)
DUSTIN L. BLACK, RHCA29
Layered Functionality
Demystifying Gluster
GlusterFS for the SysAdmin
DUSTIN L. BLACK, RHCA30
Striped Volumes
● Individual files split among bricks
● Similar to RAID 0
● Limited Use Cases – HPC Pre/Post Processing
DUSTIN L. BLACK, RHCA31
Distributed Striped Volume
● Files striped across two or more nodes
● Striping plus scalability
DUSTIN L. BLACK, RHCA32
Striped Replicated Volume
● RHS 2.0 / GlusterFS 3.3+
● Similar to RAID 10 (1+0)
Replicated Volume
Striped Striped
DUSTIN L. BLACK, RHCA33
Distributed Striped Replicated Volume
● GlusterFS 3.3+
● Limited Use Cases – Map Reduce
DUSTIN L. BLACK, RHCA34
Data Access
Demystifying Gluster
GlusterFS for the SysAdmin
DUSTIN L. BLACK, RHCA35
Gluster Native Client (FUSE)
● FUSE kernel module allows the filesystem to be built
and operated entirely in userspace
● Specify mount to any GlusterFS node
● Native Client fetches volfile from mount server, then
communicates directly with all nodes to access data
● Recommended for high concurrency and high write
performance
DUSTIN L. BLACK, RHCA36
NFS
● Standard NFS v3 clients
● Mount with vers=3 option
● Standard automounter is supported
● Mount to any node, or use a load balancer
● Gluster NFS server includes Network Lock Manager
(NLM) to synchronize locks across clients
● Better performance for reading many small files
DUSTIN L. BLACK, RHCA37
SMB/CIFS
● GlusterFS volume is first redundantly mounted with the
Native Client on localhost
● Native mount point is then shared via Samba
● Must be setup on each node you wish to connect to via
CIFS
DUSTIN L. BLACK, RHCA38
General Administration
Demystifying Gluster
GlusterFS for the SysAdmin
DUSTIN L. BLACK, RHCA39
Preparing a Brick
# lvcreate -L 100G -n lv_brick1 vg_server1
# mkfs -t xfs -i size=512 /dev/vg_server1/lv_brick1
# mkdir /brick1
# mount /dev/vg_server1/lv_brick1 /brick1
# echo '/dev/vg_server1/lv_brick1 /brick1 xfs defaults 1 2' >> /etc/fstab
DUSTIN L. BLACK, RHCA40
Adding Nodes (peers) and Volumes
gluster> peer probe server3
gluster> peer status
Number of Peers: 2
Hostname: server2
Uuid: 5e987bda-16dd-43c2-835b-08b7d55e94e5
State: Peer in Cluster (Connected)
Hostname: server3
Uuid: 1e0ca3aa-9ef7-4f66-8f15-cbc348f29ff7
State: Peer in Cluster (Connected)
gluster> volume create my-dist-vol server2:/brick2 server3:/brick3
gluster> volume info my-dist-vol
Volume Name: my-dist-vol
Type: Distribute
Status: Created
Number of Bricks: 2
Transport-type: tcp
Bricks:
Brick1: server2:/brick2
Brick2: server3:/brick3
gluster> volume start my-dist-vol
Distributed Volume
DUSTIN L. BLACK, RHCA41
Distributed Striped Replicated Volume
gluster> volume create test-volume replica 2 stripe 2 transport tcp 
server1:/exp1 server1:/exp2 server2:/exp3 server2:/exp4 
server3:/exp5 server3:/exp6 server4:/exp7 server4:/exp8
Multiple bricks of a replicate volume are present on the same server. This setup is not
optimal.
Do you still want to continue creating the volume? (y/n) y
Creation of volume test-volume has been successful. Please start the volume to access
data.
<- test-volume<- test-volume
<- distributed files -><- distributed files ->
stripe 2stripe 2
replica 2replica 2
DUSTIN L. BLACK, RHCA42
Distributed Striped Replicated Volume
gluster> volume info test-volume
Volume Name: test-volume
Type: Distributed-Striped-Replicate
Volume ID: 8f8b8b59-d1a1-42fe-ae05-abe2537d0e2d
Status: Created
Number of Bricks: 2 x 2 x 2 = 8
Transport-type: tcp
Bricks:
Brick1: server1:/exp1
Brick2: server2:/exp3
Brick3: server1:/exp2
Brick4: server2:/exp4
Brick5: server3:/exp5
Brick6: server4:/exp7
Brick7: server3:/exp6
Brick8: server4:/exp8
gluster> volume create test-volume stripe 2 replica 2 transport tcp 
server1:/exp1 server2:/exp3 server1:/exp2 server2:/exp4 
server3:/exp5 server4:/exp7 server3:/exp6 server4:/exp8
Creation of volume test-volume has been successful. Please start the volume to access
data.
DUSTIN L. BLACK, RHCA43
Manipulating Bricks in a Volume
gluster> volume add-brick my-dist-vol server4:/brick4
gluster> volume remove-brick my-dist-vol server2:/brick2 start
gluster> volume remove-brick my-dist-vol server2:/brick2 status
Node Rebalanced-files size scanned failures status
--------- ----------- ----------- ----------- ----------- ------------
localhost 16 16777216 52 0 in progress
192.168.1.1 13 16723211 47 0 in progress
gluster> volume remove-brick my-dist-vol server2:/brick2 commit
gluster> volume rebalance my-dist-vol fix-layout start
gluster> volume rebalance my-dist-vol start
gluster> volume rebalance my-dist-vol status
Node Rebalanced-files size scanned failures status
--------- ----------- ----------- ----------- ----------- ------------
localhost 112 15674 170 0 completed
10.16.156.72 140 3423 321 2 completed
DUSTIN L. BLACK, RHCA44
Migrating Data / Replacing Bricks
gluster> volume replace-brick my-dist-vol server3:/brick3 server5:/brick5 start
gluster> volume replace-brick my-dist-vol server3:/brick3 server5:/brick5 status
Current File = /usr/src/linux-headers-2.6.31-14/block/Makefile
Number of files migrated = 10567
Migration complete
gluster> volume replace-brick my-dist-vol server3:/brick3 server5:/brick5 commit
DUSTIN L. BLACK, RHCA45
Volume Options
gluster> volume set my-dist-vol auth.allow 192.168.1.*
gluster> volume set my-dist-vol auth.reject 10.*
gluster> volume set my-dist-vol nfs.volume-access read-only
gluster> volume set my-dist-vol nfs.disable on
gluster> volume set my-dist-vol features.read-only on
gluster> volume set my-dist-vol performance.cache-size 67108864
gluster> volume set my-dist-vol auth.allow 192.168.1.*
gluster> volume set my-dist-vol auth.reject 10.*
NFS
Auth
Other advanced options
DUSTIN L. BLACK, RHCA46
Volume Top Command
gluster> volume set my-dist-vol auth.allow 192.168.1.*
gluster> volume set my-dist-vol auth.reject 10.*
gluster> volume top my-dist-vol read brick server3:/brick3 list-cnt 3
Brick: server:/export/dir1
==========Read file stats========
read filename
call count
116 /clients/client0/~dmtmp/SEED/LARGE.FIL
64 /clients/client0/~dmtmp/SEED/MEDIUM.FIL
54 /clients/client2/~dmtmp/SEED/LARGE.FIL
● Many top commands are available for analysis of
files, directories, and bricks
● Read and write performance test commands available
● Perform active dd tests and measure throughput
DUSTIN L. BLACK, RHCA47
Volume Profiling
gluster> volume set my-dist-vol auth.allow 192.168.1.*
gluster> volume set my-dist-vol auth.reject 10.*
gluster> volume profile my-dist-vol start
gluster> volume profile my-dist-vol info
Brick: Test:/export/2
Cumulative Stats:
Block 1b+ 32b+ 64b+
Size:
Read: 0 0 0
Write: 908 28 8
...
%-latency Avg- Min- Max- calls Fop
latency Latency Latency
___________________________________________________________
4.82 1132.28 21.00 800970.00 4575 WRITE
5.70 156.47 9.00 665085.00 39163 READDIRP
11.35 315.02 9.00 1433947.00 38698 LOOKUP
11.88 1729.34 21.00 2569638.00 7382 FXATTROP
47.35 104235.02 2485.00 7789367.00 488 FSYNC
------------------
Duration : 335
BytesRead : 94505058
BytesWritten : 195571980
DUSTIN L. BLACK, RHCA48
Geo-Replication
gluster> volume set my-dist-vol auth.allow 192.168.1.*
gluster> volume set my-dist-vol auth.reject 10.*
gluster> volume geo-replication my-dist-vol slavehost1:my-dist-repl start
Starting geo-replication session between my-dist-vol & slavehost1:my-dist-repl has been
successful
gluster> volume geo-replication my-dist-vol status
MASTER SLAVE STATUS
--------------------------------------------------------------------------------
my-dist-vol gluster://slavehost1:my-dist-repl OK
# ssh-keygen -f /var/lib/glusterd/geo-replication/secret.pem
# ssh-copy-id -i /var/lib/glusterd/geo-replication/secret.pem repluser@slavehost1
gluster> volume geo-replication my-dist-vol repluser@slavehost1:/repl_dir start
Starting geo-replication session between my-dist-vol & slavehost1:/repl_dir has been
successful
gluster> volume geo-replication my-dist-vol status
MASTER SLAVE STATUS
--------------------------------------------------------------------------------
my-dist-vol ssh://repluser@slavehost1:/repl_dir OK
Remote GlusterFS Volume
Remote SSH
gluster> volume info my-dist-vol
...
Options Reconfigured:
geo-replication.indexing: on
DUSTIN L. BLACK, RHCA49
Use Cases
Demystifying Gluster
GlusterFS for the SysAdmin
DUSTIN L. BLACK, RHCA50
Common Solutions
● Media / Content Distribution Network (CDN)
● Backup / Archive / Disaster Recovery (DR)
● Large Scale File Server
● Home directories
● High Performance Computing (HPC)
● Infrastructure as a Service (IaaS) storage layer
DUSTIN L. BLACK, RHCA51
Hadoop – Map Reduce
● Access data within and outside of Hadoop
● No HDFS name node single point of failure / bottleneck
● Seamless replacement for HDFS
● Scales with the massive growth of big data
DUSTIN L. BLACK, RHCA52
CIC Electronic Signature Solutions
● Challenge
● Must leverage economics of the cloud
● Storage performance in the cloud too slow
● Need to meet demanding client SLA’s
● Solution
● Red Hat Storage Software Appliance
● Amazon EC2 and Elastic Block Storage (EBS)
● Benefits
● Faster development and delivery of new products
● SLA’s met with headroom to spare
● Accelerated cloud migration
● Scale-out for rapid and simple expansion
● Data is highly available for 24/7 client access
Hybrid Cloud: Electronic Signature Solutions
● Reduced time-to-
market for new
products
● Meeting all client SLAs
● Accelerating move to
the cloud
DUSTIN L. BLACK, RHCA53
Pandora Internet Radio
● Challenge
● Explosive user & title growth
● As many as 12 file formats for each song
● ‘Hot’ content and long tail
● Solution
● Three data centers, each with a six-node GlusterFS
cluster
● Replication for high availability
● 250+ TB total capacity
● Benefits
● Easily scale capacity
● Centralized management; one administrator to manage
day-to-day operations
● No changes to application
● Higher reliability
Private Cloud: Media Serving
● 1.2 PB of audio served
per week
● 13 million files
● Over 50 GB/sec peak
traffic
DUSTIN L. BLACK, RHCA54
Brightcove
• Over 1 PB currently in
Gluster
• Separate 4 PB project
in the works
Private Cloud: Media Serving
● Challenge
● Explosive customer & title growth
● Massive video in multiple locations
● Costs rising, esp. with HD formats
● Solution
● Complete scale-out based on commodity DAS/JBOD
and GlusterFS
● Replication for high availability
● 1PB total capacity
● Benefits
● Easily scale capacity
● Centralized management; one administrator to manage
day-to-day operations
● Higher reliability
● Path to multi-site
DUSTIN L. BLACK, RHCA55
Pattern Energy
• Rapid and advance
weather predictions
• Maximizing energy
assets
• Cost savings and
avoidance
High Performance Computing for Weather Prediction
●
Challenge
●
Need to deliver rapid advance weather predictions
●
Identify wind and solar abundance in advance
●
More effectively perform preventative maintenance and
repair
●
Solution
●
32 HP compute nodes
●
Red Hat SSA for high throughput and availability
●
20TB+ total capacity
●
Benefits
●
Predicts solar and wind patterns 3 to 5 days in advance
●
Maximize energy production and repair times
●
Avoid costs of outsourcing weather predictions
●
Solution has paid for itself many times over
Thank You!
● dustin@redhat.com
● storage-sales@redhat.com
● RHS:
www.redhat.com/storage/
● GlusterFS:
www.gluster.org
● TAM:
access.redhat.com/support/offerings/tam/
@Glusterorg
@RedHatStorage
Gluster
Red Hat Storage
Demystifying Gluster
GlusterFS for the SysAdmin

Contenu connexe

Tendances

Red Hat Gluster Storage, Container Storage and CephFS Plans
Red Hat Gluster Storage, Container Storage and CephFS PlansRed Hat Gluster Storage, Container Storage and CephFS Plans
Red Hat Gluster Storage, Container Storage and CephFS PlansRed_Hat_Storage
 
Storage as a Service with Gluster
Storage as a Service with GlusterStorage as a Service with Gluster
Storage as a Service with GlusterVijay Bellur
 
Performance characterization in large distributed file system with gluster fs
Performance characterization in large distributed file system with gluster fsPerformance characterization in large distributed file system with gluster fs
Performance characterization in large distributed file system with gluster fsNeependra Khare
 
Gluster fs architecture_future_directions_tlv
Gluster fs architecture_future_directions_tlvGluster fs architecture_future_directions_tlv
Gluster fs architecture_future_directions_tlvSahina Bose
 
State of Gluster Performance
State of Gluster PerformanceState of Gluster Performance
State of Gluster PerformanceGluster.org
 
GlusterFs Architecture & Roadmap - LinuxCon EU 2013
GlusterFs Architecture & Roadmap - LinuxCon EU 2013GlusterFs Architecture & Roadmap - LinuxCon EU 2013
GlusterFs Architecture & Roadmap - LinuxCon EU 2013Gluster.org
 
Gluster for Geeks: Performance Tuning Tips & Tricks
Gluster for Geeks: Performance Tuning Tips & TricksGluster for Geeks: Performance Tuning Tips & Tricks
Gluster for Geeks: Performance Tuning Tips & TricksGlusterFS
 
Gluster fs current_features_and_roadmap
Gluster fs current_features_and_roadmapGluster fs current_features_and_roadmap
Gluster fs current_features_and_roadmapGluster.org
 
Red Hat Gluster Storage - Direction, Roadmap and Use-Cases
Red Hat Gluster Storage - Direction, Roadmap and Use-CasesRed Hat Gluster Storage - Direction, Roadmap and Use-Cases
Red Hat Gluster Storage - Direction, Roadmap and Use-CasesRed_Hat_Storage
 
Dustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveDustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveGluster.org
 
Red Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSRed Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSGlusterFS
 
Tiering barcelona
Tiering barcelonaTiering barcelona
Tiering barcelonaGluster.org
 
Ceph Block Devices: A Deep Dive
Ceph Block Devices: A Deep DiveCeph Block Devices: A Deep Dive
Ceph Block Devices: A Deep Divejoshdurgin
 
Gluster for sysadmins
Gluster for sysadminsGluster for sysadmins
Gluster for sysadminsGluster.org
 
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed_Hat_Storage
 
Disperse xlator ramon_datalab
Disperse xlator ramon_datalabDisperse xlator ramon_datalab
Disperse xlator ramon_datalabGluster.org
 
GlusterFS CTDB Integration
GlusterFS CTDB IntegrationGlusterFS CTDB Integration
GlusterFS CTDB IntegrationEtsuji Nakai
 
Gluster fs hadoop_fifth-elephant
Gluster fs hadoop_fifth-elephantGluster fs hadoop_fifth-elephant
Gluster fs hadoop_fifth-elephantGluster.org
 
Improvements in GlusterFS for Virtualization usecase
Improvements in GlusterFS for Virtualization usecaseImprovements in GlusterFS for Virtualization usecase
Improvements in GlusterFS for Virtualization usecaseDeepak Shetty
 

Tendances (20)

Red Hat Gluster Storage, Container Storage and CephFS Plans
Red Hat Gluster Storage, Container Storage and CephFS PlansRed Hat Gluster Storage, Container Storage and CephFS Plans
Red Hat Gluster Storage, Container Storage and CephFS Plans
 
Storage as a Service with Gluster
Storage as a Service with GlusterStorage as a Service with Gluster
Storage as a Service with Gluster
 
Performance characterization in large distributed file system with gluster fs
Performance characterization in large distributed file system with gluster fsPerformance characterization in large distributed file system with gluster fs
Performance characterization in large distributed file system with gluster fs
 
Gluster fs architecture_future_directions_tlv
Gluster fs architecture_future_directions_tlvGluster fs architecture_future_directions_tlv
Gluster fs architecture_future_directions_tlv
 
State of Gluster Performance
State of Gluster PerformanceState of Gluster Performance
State of Gluster Performance
 
GlusterFs Architecture & Roadmap - LinuxCon EU 2013
GlusterFs Architecture & Roadmap - LinuxCon EU 2013GlusterFs Architecture & Roadmap - LinuxCon EU 2013
GlusterFs Architecture & Roadmap - LinuxCon EU 2013
 
Gluster for Geeks: Performance Tuning Tips & Tricks
Gluster for Geeks: Performance Tuning Tips & TricksGluster for Geeks: Performance Tuning Tips & Tricks
Gluster for Geeks: Performance Tuning Tips & Tricks
 
Gluster fs current_features_and_roadmap
Gluster fs current_features_and_roadmapGluster fs current_features_and_roadmap
Gluster fs current_features_and_roadmap
 
Red Hat Gluster Storage - Direction, Roadmap and Use-Cases
Red Hat Gluster Storage - Direction, Roadmap and Use-CasesRed Hat Gluster Storage - Direction, Roadmap and Use-Cases
Red Hat Gluster Storage - Direction, Roadmap and Use-Cases
 
Dustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveDustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep Dive
 
Red Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSRed Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFS
 
Gluster d2
Gluster d2Gluster d2
Gluster d2
 
Tiering barcelona
Tiering barcelonaTiering barcelona
Tiering barcelona
 
Ceph Block Devices: A Deep Dive
Ceph Block Devices: A Deep DiveCeph Block Devices: A Deep Dive
Ceph Block Devices: A Deep Dive
 
Gluster for sysadmins
Gluster for sysadminsGluster for sysadmins
Gluster for sysadmins
 
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
 
Disperse xlator ramon_datalab
Disperse xlator ramon_datalabDisperse xlator ramon_datalab
Disperse xlator ramon_datalab
 
GlusterFS CTDB Integration
GlusterFS CTDB IntegrationGlusterFS CTDB Integration
GlusterFS CTDB Integration
 
Gluster fs hadoop_fifth-elephant
Gluster fs hadoop_fifth-elephantGluster fs hadoop_fifth-elephant
Gluster fs hadoop_fifth-elephant
 
Improvements in GlusterFS for Virtualization usecase
Improvements in GlusterFS for Virtualization usecaseImprovements in GlusterFS for Virtualization usecase
Improvements in GlusterFS for Virtualization usecase
 

Similaire à Gluster fs tutorial part 2 gluster and big data- gluster for devs and sys admins presentation

Software defined storage
Software defined storageSoftware defined storage
Software defined storageGluster.org
 
Challenges with Gluster and Persistent Memory with Dan Lambright
Challenges with Gluster and Persistent Memory with Dan LambrightChallenges with Gluster and Persistent Memory with Dan Lambright
Challenges with Gluster and Persistent Memory with Dan LambrightGluster.org
 
High Availability Storage (susecon2016)
High Availability Storage (susecon2016)High Availability Storage (susecon2016)
High Availability Storage (susecon2016)Roger Zhou 周志强
 
Glusterfs and openstack
Glusterfs  and openstackGlusterfs  and openstack
Glusterfs and openstackopenstackindia
 
Seagate SC15 Announcements for HPC
Seagate SC15 Announcements for HPCSeagate SC15 Announcements for HPC
Seagate SC15 Announcements for HPCinside-BigData.com
 
Part 3 - Local Name Resolution in Linux, FreeBSD and macOS/iOS
Part 3 - Local Name Resolution in Linux, FreeBSD and macOS/iOSPart 3 - Local Name Resolution in Linux, FreeBSD and macOS/iOS
Part 3 - Local Name Resolution in Linux, FreeBSD and macOS/iOSMen and Mice
 
Hadoop operations basic
Hadoop operations basicHadoop operations basic
Hadoop operations basicHafizur Rahman
 
What CloudStackers Need To Know About LINSTOR/DRBD
What CloudStackers Need To Know About LINSTOR/DRBDWhat CloudStackers Need To Know About LINSTOR/DRBD
What CloudStackers Need To Know About LINSTOR/DRBDShapeBlue
 
Big Data in Container; Hadoop Spark in Docker and Mesos
Big Data in Container; Hadoop Spark in Docker and MesosBig Data in Container; Hadoop Spark in Docker and Mesos
Big Data in Container; Hadoop Spark in Docker and MesosHeiko Loewe
 
Distributed replicated block device
Distributed replicated block deviceDistributed replicated block device
Distributed replicated block deviceChanaka Lasantha
 
Greenplum Overview for Postgres Hackers - Greenplum Summit 2018
Greenplum Overview for Postgres Hackers - Greenplum Summit 2018Greenplum Overview for Postgres Hackers - Greenplum Summit 2018
Greenplum Overview for Postgres Hackers - Greenplum Summit 2018VMware Tanzu
 
Globus for System Administrators (GlobusWorld Tour - Columbia University)
Globus for System Administrators (GlobusWorld Tour - Columbia University)Globus for System Administrators (GlobusWorld Tour - Columbia University)
Globus for System Administrators (GlobusWorld Tour - Columbia University)Globus
 
ClustrixDB at Samsung Cloud
ClustrixDB at Samsung CloudClustrixDB at Samsung Cloud
ClustrixDB at Samsung CloudMariaDB plc
 
GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GR
GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GRGlusterFS Presentation FOSSCOMM2013 HUA, Athens, GR
GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GRTheophanis Kontogiannis
 
Directory Write Leases in MagFS
Directory Write Leases in MagFSDirectory Write Leases in MagFS
Directory Write Leases in MagFSMaginatics
 
Snapshots In Red Hat Storage Server Overview & Quickstart
Snapshots In Red Hat Storage Server Overview & QuickstartSnapshots In Red Hat Storage Server Overview & Quickstart
Snapshots In Red Hat Storage Server Overview & QuickstartRed_Hat_Storage
 
Community Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonCommunity Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonSage Weil
 
Gluster intro-tdose
Gluster intro-tdoseGluster intro-tdose
Gluster intro-tdoseGluster.org
 

Similaire à Gluster fs tutorial part 2 gluster and big data- gluster for devs and sys admins presentation (20)

Software defined storage
Software defined storageSoftware defined storage
Software defined storage
 
Challenges with Gluster and Persistent Memory with Dan Lambright
Challenges with Gluster and Persistent Memory with Dan LambrightChallenges with Gluster and Persistent Memory with Dan Lambright
Challenges with Gluster and Persistent Memory with Dan Lambright
 
High Availability Storage (susecon2016)
High Availability Storage (susecon2016)High Availability Storage (susecon2016)
High Availability Storage (susecon2016)
 
Glusterfs and openstack
Glusterfs  and openstackGlusterfs  and openstack
Glusterfs and openstack
 
Seagate SC15 Announcements for HPC
Seagate SC15 Announcements for HPCSeagate SC15 Announcements for HPC
Seagate SC15 Announcements for HPC
 
Part 3 - Local Name Resolution in Linux, FreeBSD and macOS/iOS
Part 3 - Local Name Resolution in Linux, FreeBSD and macOS/iOSPart 3 - Local Name Resolution in Linux, FreeBSD and macOS/iOS
Part 3 - Local Name Resolution in Linux, FreeBSD and macOS/iOS
 
Hadoop operations basic
Hadoop operations basicHadoop operations basic
Hadoop operations basic
 
What CloudStackers Need To Know About LINSTOR/DRBD
What CloudStackers Need To Know About LINSTOR/DRBDWhat CloudStackers Need To Know About LINSTOR/DRBD
What CloudStackers Need To Know About LINSTOR/DRBD
 
Big Data in Container; Hadoop Spark in Docker and Mesos
Big Data in Container; Hadoop Spark in Docker and MesosBig Data in Container; Hadoop Spark in Docker and Mesos
Big Data in Container; Hadoop Spark in Docker and Mesos
 
Distributed replicated block device
Distributed replicated block deviceDistributed replicated block device
Distributed replicated block device
 
Greenplum Overview for Postgres Hackers - Greenplum Summit 2018
Greenplum Overview for Postgres Hackers - Greenplum Summit 2018Greenplum Overview for Postgres Hackers - Greenplum Summit 2018
Greenplum Overview for Postgres Hackers - Greenplum Summit 2018
 
Globus for System Administrators (GlobusWorld Tour - Columbia University)
Globus for System Administrators (GlobusWorld Tour - Columbia University)Globus for System Administrators (GlobusWorld Tour - Columbia University)
Globus for System Administrators (GlobusWorld Tour - Columbia University)
 
ClustrixDB at Samsung Cloud
ClustrixDB at Samsung CloudClustrixDB at Samsung Cloud
ClustrixDB at Samsung Cloud
 
GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GR
GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GRGlusterFS Presentation FOSSCOMM2013 HUA, Athens, GR
GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GR
 
Directory Write Leases in MagFS
Directory Write Leases in MagFSDirectory Write Leases in MagFS
Directory Write Leases in MagFS
 
GlusterFS And Big Data
GlusterFS And Big DataGlusterFS And Big Data
GlusterFS And Big Data
 
Snapshots In Red Hat Storage Server Overview & Quickstart
Snapshots In Red Hat Storage Server Overview & QuickstartSnapshots In Red Hat Storage Server Overview & Quickstart
Snapshots In Red Hat Storage Server Overview & Quickstart
 
Community Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonCommunity Update at OpenStack Summit Boston
Community Update at OpenStack Summit Boston
 
Apache Hadoop 3
Apache Hadoop 3Apache Hadoop 3
Apache Hadoop 3
 
Gluster intro-tdose
Gluster intro-tdoseGluster intro-tdose
Gluster intro-tdose
 

Plus de Tommy Lee

새하늘과 새땅-리차드 미들턴
새하늘과 새땅-리차드 미들턴새하늘과 새땅-리차드 미들턴
새하늘과 새땅-리차드 미들턴Tommy Lee
 
하나님의 아픔의신학 20180131
하나님의 아픔의신학 20180131하나님의 아픔의신학 20180131
하나님의 아픔의신학 20180131Tommy Lee
 
그리스도인의미덕 통합
그리스도인의미덕 통합그리스도인의미덕 통합
그리스도인의미덕 통합Tommy Lee
 
그리스도인의미덕 1장-4장
그리스도인의미덕 1장-4장그리스도인의미덕 1장-4장
그리스도인의미덕 1장-4장Tommy Lee
 
예수왕의복음
예수왕의복음예수왕의복음
예수왕의복음Tommy Lee
 
Grub2 and troubleshooting_ol7_boot_problems
Grub2 and troubleshooting_ol7_boot_problemsGrub2 and troubleshooting_ol7_boot_problems
Grub2 and troubleshooting_ol7_boot_problemsTommy Lee
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-CRUI
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-CRUI제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-CRUI
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-CRUITommy Lee
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나- IBM Bluemix
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나- IBM Bluemix제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나- IBM Bluemix
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나- IBM BluemixTommy Lee
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-RanchersTommy Lee
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-AI
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-AI제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-AI
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-AITommy Lee
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Asible
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Asible제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Asible
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-AsibleTommy Lee
 
제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustre제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustreTommy Lee
 
제3회난공불락 오픈소스 인프라세미나 - Nagios
제3회난공불락 오픈소스 인프라세미나 - Nagios제3회난공불락 오픈소스 인프라세미나 - Nagios
제3회난공불락 오픈소스 인프라세미나 - NagiosTommy Lee
 
제3회난공불락 오픈소스 인프라세미나 - MySQL Performance
제3회난공불락 오픈소스 인프라세미나 - MySQL Performance제3회난공불락 오픈소스 인프라세미나 - MySQL Performance
제3회난공불락 오픈소스 인프라세미나 - MySQL PerformanceTommy Lee
 
제3회난공불락 오픈소스 인프라세미나 - MySQL
제3회난공불락 오픈소스 인프라세미나 - MySQL제3회난공불락 오픈소스 인프라세미나 - MySQL
제3회난공불락 오픈소스 인프라세미나 - MySQLTommy Lee
 
제3회난공불락 오픈소스 인프라세미나 - JuJu
제3회난공불락 오픈소스 인프라세미나 - JuJu제3회난공불락 오픈소스 인프라세미나 - JuJu
제3회난공불락 오픈소스 인프라세미나 - JuJuTommy Lee
 
제3회난공불락 오픈소스 인프라세미나 - Pacemaker
제3회난공불락 오픈소스 인프라세미나 - Pacemaker제3회난공불락 오픈소스 인프라세미나 - Pacemaker
제3회난공불락 오픈소스 인프라세미나 - PacemakerTommy Lee
 
새하늘과새땅 북톡-3부-우주적회복에대한신약의비전
새하늘과새땅 북톡-3부-우주적회복에대한신약의비전새하늘과새땅 북톡-3부-우주적회복에대한신약의비전
새하늘과새땅 북톡-3부-우주적회복에대한신약의비전Tommy Lee
 
새하늘과새땅 북톡-2부-구약에서의총체적구원
새하늘과새땅 북톡-2부-구약에서의총체적구원새하늘과새땅 북톡-2부-구약에서의총체적구원
새하늘과새땅 북톡-2부-구약에서의총체적구원Tommy Lee
 
새하늘과새땅 Part1
새하늘과새땅 Part1새하늘과새땅 Part1
새하늘과새땅 Part1Tommy Lee
 

Plus de Tommy Lee (20)

새하늘과 새땅-리차드 미들턴
새하늘과 새땅-리차드 미들턴새하늘과 새땅-리차드 미들턴
새하늘과 새땅-리차드 미들턴
 
하나님의 아픔의신학 20180131
하나님의 아픔의신학 20180131하나님의 아픔의신학 20180131
하나님의 아픔의신학 20180131
 
그리스도인의미덕 통합
그리스도인의미덕 통합그리스도인의미덕 통합
그리스도인의미덕 통합
 
그리스도인의미덕 1장-4장
그리스도인의미덕 1장-4장그리스도인의미덕 1장-4장
그리스도인의미덕 1장-4장
 
예수왕의복음
예수왕의복음예수왕의복음
예수왕의복음
 
Grub2 and troubleshooting_ol7_boot_problems
Grub2 and troubleshooting_ol7_boot_problemsGrub2 and troubleshooting_ol7_boot_problems
Grub2 and troubleshooting_ol7_boot_problems
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-CRUI
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-CRUI제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-CRUI
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-CRUI
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나- IBM Bluemix
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나- IBM Bluemix제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나- IBM Bluemix
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나- IBM Bluemix
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-AI
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-AI제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-AI
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-AI
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Asible
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Asible제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Asible
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Asible
 
제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustre제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustre
 
제3회난공불락 오픈소스 인프라세미나 - Nagios
제3회난공불락 오픈소스 인프라세미나 - Nagios제3회난공불락 오픈소스 인프라세미나 - Nagios
제3회난공불락 오픈소스 인프라세미나 - Nagios
 
제3회난공불락 오픈소스 인프라세미나 - MySQL Performance
제3회난공불락 오픈소스 인프라세미나 - MySQL Performance제3회난공불락 오픈소스 인프라세미나 - MySQL Performance
제3회난공불락 오픈소스 인프라세미나 - MySQL Performance
 
제3회난공불락 오픈소스 인프라세미나 - MySQL
제3회난공불락 오픈소스 인프라세미나 - MySQL제3회난공불락 오픈소스 인프라세미나 - MySQL
제3회난공불락 오픈소스 인프라세미나 - MySQL
 
제3회난공불락 오픈소스 인프라세미나 - JuJu
제3회난공불락 오픈소스 인프라세미나 - JuJu제3회난공불락 오픈소스 인프라세미나 - JuJu
제3회난공불락 오픈소스 인프라세미나 - JuJu
 
제3회난공불락 오픈소스 인프라세미나 - Pacemaker
제3회난공불락 오픈소스 인프라세미나 - Pacemaker제3회난공불락 오픈소스 인프라세미나 - Pacemaker
제3회난공불락 오픈소스 인프라세미나 - Pacemaker
 
새하늘과새땅 북톡-3부-우주적회복에대한신약의비전
새하늘과새땅 북톡-3부-우주적회복에대한신약의비전새하늘과새땅 북톡-3부-우주적회복에대한신약의비전
새하늘과새땅 북톡-3부-우주적회복에대한신약의비전
 
새하늘과새땅 북톡-2부-구약에서의총체적구원
새하늘과새땅 북톡-2부-구약에서의총체적구원새하늘과새땅 북톡-2부-구약에서의총체적구원
새하늘과새땅 북톡-2부-구약에서의총체적구원
 
새하늘과새땅 Part1
새하늘과새땅 Part1새하늘과새땅 Part1
새하늘과새땅 Part1
 

Gluster fs tutorial part 2 gluster and big data- gluster for devs and sys admins presentation

  • 1. DUSTIN L. BLACK, RHCA1 Dustin L. Black, RHCA Sr. Technical Account Manager, Red Hat 2012-07-17 Demystifying Gluster GlusterFS for the SysAdmin
  • 2. DUSTIN L. BLACK, RHCA2 #whoami ● Systems and Infrastructure Geek ● Decade+ of Linux, UNIX, networking ● <notacoder/> ● Believe in Open Source Everything ● Sr. Technical Account Manager, Red Hat GSS ● dustin@redhat.com
  • 3. DUSTIN L. BLACK, RHCA3 #whatis TAM ● Premium named-resource support ● Proactive and early access ● Regular calls and on-site engagements ● Customer advocate within Red Hat and upstream ● Multi-vendor support coordinator ● High-touch access to engineering ● Influence for software enhancements ● NOT Hands-on or consulting
  • 4. DUSTIN L. BLACK, RHCA4 Agenda ● Technology Overview ● A Peek at Gluster Logic ● Scaling Up and Out ● Redundancy and Fault Tolerance ● Data Access ● General Administration ● Use Cases
  • 5. DUSTIN L. BLACK, RHCA5 Technology Overview Demystifying Gluster GlusterFS for the SysAdmin
  • 6. DUSTIN L. BLACK, RHCA6 What is GlusterFS? ● POSIX-Compliant Distributed File System ● No Metadata Server ● Network Attached Storage (NAS) ● Heterogeneous Commodity Hardware ● Aggregated Storage and Memory ● Standards-Based – Clients, Applications, Networks ● Flexible and Agile Scaling ● Capacity – Petabytes and beyond ● Performance – Thousands of Clients ● Single Global Namespace
  • 7. DUSTIN L. BLACK, RHCA7 What is Red Hat Storage? ● Enterprise Implementation of GlusterFS ● Software Appliance ● Bare Metal Installation ● Built on RHEL + XFS ● Subscription Model ● Storage Software Appliance ● Datacenter and Private Cloud Deployments ● Virtual Storage Appliance ● Amazon Web Services Public Cloud Deployments
  • 8. DUSTIN L. BLACK, RHCA8 GlusterFS vs. Traditional Solutions ● A basic NAS has limited scalability and redundancy ● Other distributed filesystems limited by metadata ● SAN is costly & complicated but high performance & scalable ● GlusterFS ● Linear Scaling ● Minimal Overhead ● High Redundancy ● Simple and Inexpensive Deployment
  • 9. DUSTIN L. BLACK, RHCA9 Technology Stack Demystifying Gluster GlusterFS for the SysAdmin
  • 10. DUSTIN L. BLACK, RHCA10 Terminology ● Brick ● A filesystem mountpoint ● A unit of storage used as a GlusterFS building block ● Translator ● Logic between the bits and the Global Namespace ● Layered to provide GlusterFS functionality ● Volume ● Bricks combined and passed through translators ● Node ● Server running the gluster daemon and sharing volumes
  • 11. DUSTIN L. BLACK, RHCA11 Foundation Components ● Private Cloud (Datacenter) ● Common Commodity x86_64 Servers ● RHS: Hardware Compatibility List (HCL) ● Public Cloud ● Amazon Web Services (AWS) ● EC2 + EBS
  • 12. DUSTIN L. BLACK, RHCA12 Disk, LVM, and Filesystems ● Direct-Attached Storage (DAS) ● Just a Bunch Of Disks (JBOD) ● Hardware RAID ● RHS: RAID 6 required ● Logical Volume Management (LVM) ● XFS, EXT3/4, BTRFS ● Extended attributes support required
  • 13. DUSTIN L. BLACK, RHCA13 Data Access Overview ● Gluster Native Client ● Filesystem in Userspace (FUSE) ● NFS ● Built-in Service ● SMB/CIFS ● Samba server required
  • 14. DUSTIN L. BLACK, RHCA14 Gluster Components ● glusterd ● Elastic volume management daemon ● Runs on all export servers ● Interfaced through gluster CLI ● glusterfsd ● GlusterFS brick daemon ● One process for each brick ● Managed by glusterd
  • 15. DUSTIN L. BLACK, RHCA15 Gluster Components ● glusterfs ● NFS server daemon ● FUSE client daemon ● mount.glusterfs ● FUSE native mount tool ● gluster ● Gluster Console Manager (CLI)
  • 16. DUSTIN L. BLACK, RHCA16 Putting it All Together
  • 17. DUSTIN L. BLACK, RHCA17 Scaling Demystifying Gluster GlusterFS for the SysAdmin
  • 18. DUSTIN L. BLACK, RHCA18 Scaling Up ● Add disks and filesystems to a node ● Expand a gluster volume by adding bricks XFS
  • 19. DUSTIN L. BLACK, RHCA19 Scaling Out ● Add gluster nodes to trusted pool ● Add filesystems as new bricks
  • 20. DUSTIN L. BLACK, RHCA20 Under the Hood Demystifying Gluster GlusterFS for the SysAdmin
  • 21. DUSTIN L. BLACK, RHCA21 Elastic Hash Algorithm ● No central metadata ● No Performance Bottleneck ● Eliminates risk scenarios ● Location hashed intelligently on path and filename ● Unique identifiers, similar to md5sum ● The “Elastic” Part ● Files assigned to virtual volumes ● Virtual volumes assigned to multiple bricks ● Volumes easily reassigned on the fly
  • 22. DUSTIN L. BLACK, RHCA22 Translators
  • 23. DUSTIN L. BLACK, RHCA23 Distribution and Replication Demystifying Gluster GlusterFS for the SysAdmin
  • 24. DUSTIN L. BLACK, RHCA24 Distributed Volume ● Files “evenly” spread across bricks ● File-level RAID 0 ● Server/Disk failure could be catastrophic
  • 25. DUSTIN L. BLACK, RHCA25 Replicated Volume ● Copies files to multiple bricks ● File-level RAID 1
  • 26. DUSTIN L. BLACK, RHCA26 Distributed Replicated Volume ● Distributes files across replicated bricks ● RAID 1 plus improved read performance
  • 27. DUSTIN L. BLACK, RHCA27 Geo Replication ● Asynchronous across LAN, WAN, or Internet ● Master-Slave model -- Cascading possible ● Continuous and incremental ● Time should be synchronized on all master nodes
  • 28. DUSTIN L. BLACK, RHCA28 Replicated Volumes vs Geo-replication Replicated Volumes Geo-replication Mirrors data across clusters Mirrors data across geographically distributed clusters Provides high-availability Ensures backing up of data for disaster recovery Synchronous replication (each and every file operation is sent across all the bricks) Asynchronous replication (checks for the changes in files periodically and syncs them on detecting differences)
  • 29. DUSTIN L. BLACK, RHCA29 Layered Functionality Demystifying Gluster GlusterFS for the SysAdmin
  • 30. DUSTIN L. BLACK, RHCA30 Striped Volumes ● Individual files split among bricks ● Similar to RAID 0 ● Limited Use Cases – HPC Pre/Post Processing
  • 31. DUSTIN L. BLACK, RHCA31 Distributed Striped Volume ● Files striped across two or more nodes ● Striping plus scalability
  • 32. DUSTIN L. BLACK, RHCA32 Striped Replicated Volume ● RHS 2.0 / GlusterFS 3.3+ ● Similar to RAID 10 (1+0) Replicated Volume Striped Striped
  • 33. DUSTIN L. BLACK, RHCA33 Distributed Striped Replicated Volume ● GlusterFS 3.3+ ● Limited Use Cases – Map Reduce
  • 34. DUSTIN L. BLACK, RHCA34 Data Access Demystifying Gluster GlusterFS for the SysAdmin
  • 35. DUSTIN L. BLACK, RHCA35 Gluster Native Client (FUSE) ● FUSE kernel module allows the filesystem to be built and operated entirely in userspace ● Specify mount to any GlusterFS node ● Native Client fetches volfile from mount server, then communicates directly with all nodes to access data ● Recommended for high concurrency and high write performance
  • 36. DUSTIN L. BLACK, RHCA36 NFS ● Standard NFS v3 clients ● Mount with vers=3 option ● Standard automounter is supported ● Mount to any node, or use a load balancer ● Gluster NFS server includes Network Lock Manager (NLM) to synchronize locks across clients ● Better performance for reading many small files
  • 37. DUSTIN L. BLACK, RHCA37 SMB/CIFS ● GlusterFS volume is first redundantly mounted with the Native Client on localhost ● Native mount point is then shared via Samba ● Must be setup on each node you wish to connect to via CIFS
  • 38. DUSTIN L. BLACK, RHCA38 General Administration Demystifying Gluster GlusterFS for the SysAdmin
  • 39. DUSTIN L. BLACK, RHCA39 Preparing a Brick # lvcreate -L 100G -n lv_brick1 vg_server1 # mkfs -t xfs -i size=512 /dev/vg_server1/lv_brick1 # mkdir /brick1 # mount /dev/vg_server1/lv_brick1 /brick1 # echo '/dev/vg_server1/lv_brick1 /brick1 xfs defaults 1 2' >> /etc/fstab
  • 40. DUSTIN L. BLACK, RHCA40 Adding Nodes (peers) and Volumes gluster> peer probe server3 gluster> peer status Number of Peers: 2 Hostname: server2 Uuid: 5e987bda-16dd-43c2-835b-08b7d55e94e5 State: Peer in Cluster (Connected) Hostname: server3 Uuid: 1e0ca3aa-9ef7-4f66-8f15-cbc348f29ff7 State: Peer in Cluster (Connected) gluster> volume create my-dist-vol server2:/brick2 server3:/brick3 gluster> volume info my-dist-vol Volume Name: my-dist-vol Type: Distribute Status: Created Number of Bricks: 2 Transport-type: tcp Bricks: Brick1: server2:/brick2 Brick2: server3:/brick3 gluster> volume start my-dist-vol Distributed Volume
  • 41. DUSTIN L. BLACK, RHCA41 Distributed Striped Replicated Volume gluster> volume create test-volume replica 2 stripe 2 transport tcp server1:/exp1 server1:/exp2 server2:/exp3 server2:/exp4 server3:/exp5 server3:/exp6 server4:/exp7 server4:/exp8 Multiple bricks of a replicate volume are present on the same server. This setup is not optimal. Do you still want to continue creating the volume? (y/n) y Creation of volume test-volume has been successful. Please start the volume to access data. <- test-volume<- test-volume <- distributed files -><- distributed files -> stripe 2stripe 2 replica 2replica 2
  • 42. DUSTIN L. BLACK, RHCA42 Distributed Striped Replicated Volume gluster> volume info test-volume Volume Name: test-volume Type: Distributed-Striped-Replicate Volume ID: 8f8b8b59-d1a1-42fe-ae05-abe2537d0e2d Status: Created Number of Bricks: 2 x 2 x 2 = 8 Transport-type: tcp Bricks: Brick1: server1:/exp1 Brick2: server2:/exp3 Brick3: server1:/exp2 Brick4: server2:/exp4 Brick5: server3:/exp5 Brick6: server4:/exp7 Brick7: server3:/exp6 Brick8: server4:/exp8 gluster> volume create test-volume stripe 2 replica 2 transport tcp server1:/exp1 server2:/exp3 server1:/exp2 server2:/exp4 server3:/exp5 server4:/exp7 server3:/exp6 server4:/exp8 Creation of volume test-volume has been successful. Please start the volume to access data.
  • 43. DUSTIN L. BLACK, RHCA43 Manipulating Bricks in a Volume gluster> volume add-brick my-dist-vol server4:/brick4 gluster> volume remove-brick my-dist-vol server2:/brick2 start gluster> volume remove-brick my-dist-vol server2:/brick2 status Node Rebalanced-files size scanned failures status --------- ----------- ----------- ----------- ----------- ------------ localhost 16 16777216 52 0 in progress 192.168.1.1 13 16723211 47 0 in progress gluster> volume remove-brick my-dist-vol server2:/brick2 commit gluster> volume rebalance my-dist-vol fix-layout start gluster> volume rebalance my-dist-vol start gluster> volume rebalance my-dist-vol status Node Rebalanced-files size scanned failures status --------- ----------- ----------- ----------- ----------- ------------ localhost 112 15674 170 0 completed 10.16.156.72 140 3423 321 2 completed
  • 44. DUSTIN L. BLACK, RHCA44 Migrating Data / Replacing Bricks gluster> volume replace-brick my-dist-vol server3:/brick3 server5:/brick5 start gluster> volume replace-brick my-dist-vol server3:/brick3 server5:/brick5 status Current File = /usr/src/linux-headers-2.6.31-14/block/Makefile Number of files migrated = 10567 Migration complete gluster> volume replace-brick my-dist-vol server3:/brick3 server5:/brick5 commit
  • 45. DUSTIN L. BLACK, RHCA45 Volume Options gluster> volume set my-dist-vol auth.allow 192.168.1.* gluster> volume set my-dist-vol auth.reject 10.* gluster> volume set my-dist-vol nfs.volume-access read-only gluster> volume set my-dist-vol nfs.disable on gluster> volume set my-dist-vol features.read-only on gluster> volume set my-dist-vol performance.cache-size 67108864 gluster> volume set my-dist-vol auth.allow 192.168.1.* gluster> volume set my-dist-vol auth.reject 10.* NFS Auth Other advanced options
  • 46. DUSTIN L. BLACK, RHCA46 Volume Top Command gluster> volume set my-dist-vol auth.allow 192.168.1.* gluster> volume set my-dist-vol auth.reject 10.* gluster> volume top my-dist-vol read brick server3:/brick3 list-cnt 3 Brick: server:/export/dir1 ==========Read file stats======== read filename call count 116 /clients/client0/~dmtmp/SEED/LARGE.FIL 64 /clients/client0/~dmtmp/SEED/MEDIUM.FIL 54 /clients/client2/~dmtmp/SEED/LARGE.FIL ● Many top commands are available for analysis of files, directories, and bricks ● Read and write performance test commands available ● Perform active dd tests and measure throughput
  • 47. DUSTIN L. BLACK, RHCA47 Volume Profiling gluster> volume set my-dist-vol auth.allow 192.168.1.* gluster> volume set my-dist-vol auth.reject 10.* gluster> volume profile my-dist-vol start gluster> volume profile my-dist-vol info Brick: Test:/export/2 Cumulative Stats: Block 1b+ 32b+ 64b+ Size: Read: 0 0 0 Write: 908 28 8 ... %-latency Avg- Min- Max- calls Fop latency Latency Latency ___________________________________________________________ 4.82 1132.28 21.00 800970.00 4575 WRITE 5.70 156.47 9.00 665085.00 39163 READDIRP 11.35 315.02 9.00 1433947.00 38698 LOOKUP 11.88 1729.34 21.00 2569638.00 7382 FXATTROP 47.35 104235.02 2485.00 7789367.00 488 FSYNC ------------------ Duration : 335 BytesRead : 94505058 BytesWritten : 195571980
  • 48. DUSTIN L. BLACK, RHCA48 Geo-Replication gluster> volume set my-dist-vol auth.allow 192.168.1.* gluster> volume set my-dist-vol auth.reject 10.* gluster> volume geo-replication my-dist-vol slavehost1:my-dist-repl start Starting geo-replication session between my-dist-vol & slavehost1:my-dist-repl has been successful gluster> volume geo-replication my-dist-vol status MASTER SLAVE STATUS -------------------------------------------------------------------------------- my-dist-vol gluster://slavehost1:my-dist-repl OK # ssh-keygen -f /var/lib/glusterd/geo-replication/secret.pem # ssh-copy-id -i /var/lib/glusterd/geo-replication/secret.pem repluser@slavehost1 gluster> volume geo-replication my-dist-vol repluser@slavehost1:/repl_dir start Starting geo-replication session between my-dist-vol & slavehost1:/repl_dir has been successful gluster> volume geo-replication my-dist-vol status MASTER SLAVE STATUS -------------------------------------------------------------------------------- my-dist-vol ssh://repluser@slavehost1:/repl_dir OK Remote GlusterFS Volume Remote SSH gluster> volume info my-dist-vol ... Options Reconfigured: geo-replication.indexing: on
  • 49. DUSTIN L. BLACK, RHCA49 Use Cases Demystifying Gluster GlusterFS for the SysAdmin
  • 50. DUSTIN L. BLACK, RHCA50 Common Solutions ● Media / Content Distribution Network (CDN) ● Backup / Archive / Disaster Recovery (DR) ● Large Scale File Server ● Home directories ● High Performance Computing (HPC) ● Infrastructure as a Service (IaaS) storage layer
  • 51. DUSTIN L. BLACK, RHCA51 Hadoop – Map Reduce ● Access data within and outside of Hadoop ● No HDFS name node single point of failure / bottleneck ● Seamless replacement for HDFS ● Scales with the massive growth of big data
  • 52. DUSTIN L. BLACK, RHCA52 CIC Electronic Signature Solutions ● Challenge ● Must leverage economics of the cloud ● Storage performance in the cloud too slow ● Need to meet demanding client SLA’s ● Solution ● Red Hat Storage Software Appliance ● Amazon EC2 and Elastic Block Storage (EBS) ● Benefits ● Faster development and delivery of new products ● SLA’s met with headroom to spare ● Accelerated cloud migration ● Scale-out for rapid and simple expansion ● Data is highly available for 24/7 client access Hybrid Cloud: Electronic Signature Solutions ● Reduced time-to- market for new products ● Meeting all client SLAs ● Accelerating move to the cloud
  • 53. DUSTIN L. BLACK, RHCA53 Pandora Internet Radio ● Challenge ● Explosive user & title growth ● As many as 12 file formats for each song ● ‘Hot’ content and long tail ● Solution ● Three data centers, each with a six-node GlusterFS cluster ● Replication for high availability ● 250+ TB total capacity ● Benefits ● Easily scale capacity ● Centralized management; one administrator to manage day-to-day operations ● No changes to application ● Higher reliability Private Cloud: Media Serving ● 1.2 PB of audio served per week ● 13 million files ● Over 50 GB/sec peak traffic
  • 54. DUSTIN L. BLACK, RHCA54 Brightcove • Over 1 PB currently in Gluster • Separate 4 PB project in the works Private Cloud: Media Serving ● Challenge ● Explosive customer & title growth ● Massive video in multiple locations ● Costs rising, esp. with HD formats ● Solution ● Complete scale-out based on commodity DAS/JBOD and GlusterFS ● Replication for high availability ● 1PB total capacity ● Benefits ● Easily scale capacity ● Centralized management; one administrator to manage day-to-day operations ● Higher reliability ● Path to multi-site
  • 55. DUSTIN L. BLACK, RHCA55 Pattern Energy • Rapid and advance weather predictions • Maximizing energy assets • Cost savings and avoidance High Performance Computing for Weather Prediction ● Challenge ● Need to deliver rapid advance weather predictions ● Identify wind and solar abundance in advance ● More effectively perform preventative maintenance and repair ● Solution ● 32 HP compute nodes ● Red Hat SSA for high throughput and availability ● 20TB+ total capacity ● Benefits ● Predicts solar and wind patterns 3 to 5 days in advance ● Maximize energy production and repair times ● Avoid costs of outsourcing weather predictions ● Solution has paid for itself many times over
  • 56. Thank You! ● dustin@redhat.com ● storage-sales@redhat.com ● RHS: www.redhat.com/storage/ ● GlusterFS: www.gluster.org ● TAM: access.redhat.com/support/offerings/tam/ @Glusterorg @RedHatStorage Gluster Red Hat Storage Demystifying Gluster GlusterFS for the SysAdmin