4. History
- ESE/JET Blue: IOPS-bound, random-IO application
- Why? Small, expensive drives: a 1.6 GB SCSI disk cost ~$400 in 1996; 2 GB and 4 GB drives delivered ~100 IOPS
- Single Instance Storage
- Clustering with shared storage; backup an issue; single point of failure
- 32-bit: not enough RAM; RAM limited the number of users per server
5. History - Exchange 2007
Big improvements in Exchange Server 2007:
- Reduced storage input/output (I/O) by ~70%
- Uses large amounts of memory (64-bit)
- Increased page size (4 kilobytes (KB) -> 8 KB)
- Lower storage costs
- Support for large mailboxes (> 1 gigabyte (GB))
- Fast search (CI)
- Continuous replication (log shipping)
- High availability (HA) + fast recovery
- Eliminated single points of failure
7. Email Usage
- Radicati sees 165 mails per user per day, growing to 230 over the next couple of years
- Users are used to large free storage (25 GB, 5 GB): three years of mail
- Triage once per year to archive, not once per day!
- Mail available through all clients; Cached Mode/performance issues
- High item counts: 5,000, 20,000, 100,000
8. Disk Technology
- Currently 2 TB, moving to 8 TB
- Random IO not getting quicker: 15K RPM, 10K RPM, 7.2K RPM
- Density is getting better, so more data can be read in the same time
- Flash (SSD): didn't take that bet; optimised for spinning media for E14
- Expensive, so used as cache in a SAN
10. Exchange Server 2010 HA Storage Design Flexibility
More options to reduce storage cost:
- SAN: HA = shared storage clustering; 1.0 IOPS/mailbox; 3.5" 15K 146 GB FC disks; RAID10 for DB & logs; dedicated spindles; multi-path (HBAs, FC switches, SAN array controllers); backup = streaming off active; fast recovery = hardware VSS (snapshots/clones)
- DAS (SAS): HA = CCR; 0.33 IOPS/mailbox; 2.5" 146 GB 10K SAS disks; RAID5 for DB, RAID10 for logs; SAS array controller (w/ BBU); backup = VSS snapshot; fast recovery = CCR
- DAS (SATA): HA = DAG (2 DB copies); 0.11 IOPS/mailbox; 3.5" 2 TB 7.2K SATA/SAS disks; RAID10 for DB & logs; SAS array controller (w/ BBU); backup = optional/VSS; fast recovery = database failover
- JBOD (SATA): HA = DAG (3+ DB copies); 0.11 IOPS/mailbox; 3.5" 2 TB 7.2K SATA/SAS disks; 1 DB = 1 disk; backup = optional/VSS; fast recovery = database failover
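The per-mailbox IOPS profiles above translate directly into spindle counts. A minimal sketch of that arithmetic: the three IOPS/mailbox figures are from the slide, while the 1,000-mailbox server and the 150-IOPS disk budget are illustrative assumptions, not from the deck.

```python
import math

# IOPS/mailbox profiles from the slide: 1.0 (SAN-era shared storage clustering),
# 0.33 (Exchange 2007 CCR), 0.11 (Exchange 2010 DAG).
def disks_needed(mailboxes, iops_per_mailbox, disk_iops):
    """Spindles required to satisfy the aggregate random-IO load (ceiling)."""
    return math.ceil(mailboxes * iops_per_mailbox / disk_iops)

# Assumed example: 1,000 mailboxes on disks that each deliver 150 random IOPS.
for profile in (1.0, 0.33, 0.11):
    print(profile, disks_needed(1000, profile, 150))
```

The point of the slide in one function: each drop in the per-mailbox profile cuts the spindle count, which is what lets cheaper, slower disks replace dedicated FC spindles.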
11. JBOD/RAID-less Storage: Now an Option
- JBOD: 1 disk = 1 database (with logs)
- Requires Exchange Server 2010 high availability (3+ DB copies)
- Annual disk failure rate (AFR) = 5%
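A quick sketch of what a 5% AFR means operationally, and why 3+ database copies are required before going RAID-less. The 48-disk pool size is an illustrative assumption, not from the deck.

```python
# With a 5% annual failure rate (AFR), expected disk failures per year
# scale linearly with the disk population.
def expected_failures_per_year(num_disks, afr=0.05):
    """Expected number of disk failures per year for a pool of disks."""
    return num_disks * afr

# Assumed example: a 48-disk JBOD pool.
print(expected_failures_per_year(48))
```

In other words, a modest JBOD deployment should plan for a couple of disk (and therefore database-copy) losses every year; the extra DAG copies absorb them instead of a RAID controller.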
12. Exchange Server 2010 HA
Simplified mailbox high availability and disaster recovery with a new unified platform:
[Diagram: databases DB1-DB5 replicated across Mailbox Servers in New York and San Jose]
- Replicate databases to a remote datacenter
- Recover quickly from disk and database failures
- Evolution of continuous replication technology (database mobility)
- Easier than traditional clustering to deploy and manage
- Allows each database to have 16 replicated copies
- Provides full redundancy of Exchange roles on as few as two servers
14. Exchange 2010 Features
- Move to sequential IO: changed table structure, lazy view updates
- Page size 32 KB
- Database compression (LVC)
- Read/write coalescing; database contiguity
- Cache compression
- Storage groups gone; single point of failure gone
- Optimised for huge mailboxes
15. Random vs. Sequential Disk IO
- Random IO: the disk head has to move to process each subsequent IO; head movement = high IO latency; seek latency limits IOs per second (IOPS)
- Sequential IO: the disk head does not move to process subsequent IOs; stationary head = low IO latency; disk RPM limits IOPS
- 7.2K SATA disk (20 ms latency): random = ~50 IOPS, sequential = 300+ IOPS
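The slide's 50-IOPS figure falls straight out of the 20 ms latency: a disk that spends 20 ms per random IO can complete at most 1/0.020 of them per second. A one-line check:

```python
# A random IO on a 7.2K SATA disk takes ~20 ms (seek + rotational delay),
# so the disk tops out at the reciprocal of that latency.
def iops_from_latency(latency_s):
    """One IO completes every `latency_s` seconds, so IOPS is the reciprocal."""
    return 1.0 / latency_s

print(iops_from_latency(0.020))  # ~50 random IOPS, matching the slide
```

Sequential IO sidesteps the seek entirely, which is why the same spindle delivers 300+ IOPS once the head stops moving.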
16. IO Reduction: Store Table Architecture
- Exchange Server 2007: tables per database and per folder; secondary indexes used for views
- Exchange Server 2010: tables per database, per mailbox, and per view
- New store schema = no more single instance storage within a database
17. Store Schema Changes: Lazy View Updates
Timeline: M1 arrives, M2 arrives, M1 flagged, M3 arrives, M2 deleted; then the user opens OWA/Outlook Online and switches to this view.
- Exchange 2007, "nickel and dime" approach: many random IOs (1 per update)
- Exchange 2010, "pay to play" approach: fewer, sequential IOs (1 per view)
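The lazy-view idea can be sketched as a pending-changes queue that is only flushed when the view is actually read. This is a toy model under the assumption that each flush costs one (sequential) IO, not a description of the real store internals.

```python
class LazyView:
    """Toy sketch: defer view maintenance until the view is opened."""

    def __init__(self):
        self.rows = []        # materialized view contents
        self.pending = []     # changes not yet applied to the view
        self.io_count = 0     # modeled disk IOs spent on view maintenance

    def record_change(self, change):
        """A message arrives/is flagged/is deleted: queue it, no IO yet."""
        self.pending.append(change)

    def open_view(self):
        """User switches to this view: apply all queued changes in one batch."""
        if self.pending:
            self.io_count += 1          # one batched, sequential update
            self.rows.extend(self.pending)
            self.pending.clear()
        return list(self.rows)
```

Replaying the slide's timeline (five changes, then one view open) costs one modeled IO here, versus five per-change IOs in the eager 2007-style approach.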
18. IO Reduction: Database Page Size Increased to 32 KB
- Exchange Server 2007: reading a 20 KB message from disk into the DB cache takes 3 read IOs (8 KB pages)
- Exchange Server 2010: the same read takes 1 read IO (32 KB pages)
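The 3-vs-1 comparison is just ceiling division of the message size by the page size (assuming, as the slide's diagram does, one IO per page touched):

```python
import math

def read_ios(message_kb, page_kb):
    """Pages (and hence worst-case read IOs) needed to fetch a message."""
    return math.ceil(message_kb / page_kb)

print(read_ios(20, 8))   # Exchange 2007: 8 KB pages
print(read_ios(20, 32))  # Exchange 2010: 32 KB pages
```

With 8 KB pages a 20 KB message spans three pages; at 32 KB the header and body fit on a single page, so one IO suffices.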
19. Mitigate DB Space Growth: Database Compression
- Problem: the store schema change, space hints, B+Tree defragmentation and the 32 KB page size combine to increase DB file size by ~20%
- Solution: the growth is 100% mitigated by database compression
- Targeted compression for message headers and text/HTML bodies (7-bit/Express)
[Charts: DB space analysis and DB file size comparison. 1 database, 750 x 250 MB mailboxes; RTF = RTF Compressed; Mix = 77% HTML, 15% RTF, 8% text; average message size ~50 KB]
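A small demonstration of why targeting text/HTML bodies pays off: markup-heavy bodies are highly repetitive and compress very well. This uses zlib purely as a stand-in compressor; it is not the codec the slide names, and the sample body is invented for illustration.

```python
import zlib

# Hypothetical HTML message body: repetitive markup, as real HTML mail tends to be.
html_body = (
    "<html><body>"
    + "<p>Hello, this is a meeting update.</p>" * 50
    + "</body></html>"
).encode("ascii")

compressed = zlib.compress(html_body)
ratio = len(compressed) / len(html_body)
print(f"{len(html_body)} -> {len(compressed)} bytes ({ratio:.0%} of original)")
```

The same reasoning explains why compression alone can claw back the ~20% file-size growth for HTML-dominated databases.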
20. IO Reduction: Read IO Gap Coalescing
- Exchange Server 2007 DB read behavior: 3 separate read IOs to bring nearby pages from disk into the DB cache
- Exchange Server 2010 DB read behavior: 1 read IO spanning the gaps
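Gap coalescing can be sketched as merging reads for nearby pages into single larger sequential IOs, accepting that the unneeded gap pages get read too (and later evicted from cache). The `max_gap` threshold is an invented parameter for illustration.

```python
def coalesce(page_numbers, max_gap=2):
    """Merge reads for nearby pages into larger sequential IOs.

    Toy model: if the next wanted page is within `max_gap` pages of the
    current run, extend the run (reading the gap pages as well) instead
    of issuing a new IO. Returns a list of (first_page, last_page) IOs.
    """
    runs = []
    for p in sorted(page_numbers):
        if runs and p - runs[-1][1] <= max_gap:
            runs[-1][1] = p          # extend the current sequential read
        else:
            runs.append([p, p])      # start a new IO
    return [(a, b) for a, b in runs]

print(coalesce([10, 12, 14]))   # nearby pages: one IO covering 10-14
print(coalesce([10, 50, 90]))   # gaps too large: three separate IOs
```

One slightly longer sequential read is far cheaper than three seeks, per the random-vs-sequential numbers earlier in the deck.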
21. IO Reduction: Maintain Contiguity Over Time
New database maintenance architecture: database B+Tree defragmentation (aka OLD2), a background/throttled process that maintains the space and contiguity of database tables.
22. IO Reduction: Database Contiguity Results
[Charts of DB page numbers from production/dogfood database analysis; blue = contiguous (good), red = fragmented (bad)]
- Exchange Server 2007 message header table (aka MFT): fragmented, with random deletes at the tail
- Exchange Server 2010 message header table (aka MsgHeader): contiguous
25. Putting It All Together: Mailboxes/Disk
Exchange Server 2010 storage improvements cannot be quantified in IOPS reductions alone: 4x+ more mailboxes per disk (from 125 to 500+).
Test configuration: 250 MB mailbox size, 3 MB DB cache/user, 12 x 7.2K SATA disks (DB/logs on the same spindles), LoadGen Outlook 2007 Online Very Heavy profile, measured at < 20 ms average RPC latency.
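The IOPS profiles alone predict roughly a 3x gain (0.33 -> 0.11 IOPS/mailbox), so the measured 4x+ jump supports the slide's point that IOPS reduction is not the whole story. A back-of-envelope sketch, reusing the 50-IOPS SATA figure and the per-mailbox profiles quoted earlier in the deck (the function itself is illustrative):

```python
# How many mailboxes one disk can serve within its random-IO budget.
def mailboxes_per_disk(disk_iops, iops_per_mailbox):
    """Mailboxes a single spindle can support at a given IOPS profile."""
    return int(disk_iops / iops_per_mailbox)

print(mailboxes_per_disk(50, 0.33))  # Exchange 2007 profile on a 7.2K SATA disk
print(mailboxes_per_disk(50, 0.11))  # Exchange 2010 profile on the same disk
```

Contiguity, coalescing, and the larger page size make each remaining IO more productive, which is where the rest of the measured gain comes from.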
26. Summary
The Exchange Server 2010 store has:
- Reduced DB IOPS by 70%+... again!
- Been optimized for large mailboxes (10 GB+) and 100K item counts
- Been optimized for large/slow/low-cost disks (SATA/tier 2)
- Made JBOD/RAID-less storage a viable option
- Enabled unmatched storage flexibility to push storage capex costs down
- Provided many more backup/DR options
Editor's notes
JBOD, i.e. 1 disk per database and log set. RAID-less: disks will fail, hence the requirement for 3+ copies.
2007 is roughly the same as Exchange 4.0: one database and then a couple of really large tables. The message table and attachments hold all messages per database, and a message folder table per mailbox drives all the views. This gives the benefit of single instance storage: one copy in the message table, with pointers from the message folder table. Random IO! The 2010 schema is changed massively: data is now specific to the mailbox rather than pooled per database, so it can be kept sequential for quick retrieval from the same area of disk, and message view tables replace the secondary indexes.
Really important in the reduction of IO: update the view only when the user actually views it!
Page size is the smallest unit of IO, so a bigger page means fewer small IOs for a single message read. In 2007, the random layout of data on disk means 3 IOs for a 20 KB message. In 2010, pulling the same message gets the header and body on one page, which makes a huge difference to IO. The page size will be fine for handling large messages: 12-15 KB is the mean message size currently.
As you add larger page sizes and lay things out for sequential IO, the DB grows by 20% (much as the OST grew in SP2 for Office 2007). Message headers and text/HTML bodies are now compressed; compression is limited to these for speed. This can bring the database back to the same size as 2007, or even smaller if the bulk of messages are HTML. There are now many more tables and fewer, bigger pages. There is also cache compression: when you pull a 32 KB page (the smallest element of Exchange data) but that page only holds 16 KB of data, the free space is compressed so that only 16 KB of cache is used.
Coalescing can be done when pages are not next to each other. 2007 needs 3 IOs to get the pages off disk: random IO. 2010 brings all five pages up in one stream of IO, then evicts the unneeded middle pages.
Cleanup was done using online maintenance and defrag. This has changed: cleanup now happens when tombstone or dumpster cleanup happens, and page zeroing happens automatically because it occurs when the write is being done anyway, so there is no additional IO. 2003 and 2007 are great at compaction; 2007 SP1 changed this slightly to reduce IO during the maintenance window. In 2010 this has changed a lot: it is done at run time as space is seen. Contiguity has never been a concern until now; compaction never worried about contiguity, only about keeping the database small. 2010 makes trade-offs on size, as mentioned, to ensure contiguity, and analysis happens continuously. DB checksumming is also part of the new maintenance architecture.
This is a utility MSFT built to track the contiguity of the new DB. It shows the message folder table of an inbox on 2007: massively random. 2010 is contiguous, as pages are laid out sequentially, so reading a huge folder of 10,000 items is quick and easy!