Sizing Your Content Databases:
 Understanding the New Limits

        Randy Williams
          AvePoint
Randy Williams
         • Enterprise Trainer & Evangelist – AvePoint
         • 20+ years in IT
            ●   developer, consultant, trainer, author
         • Three-time SharePoint MVP
         • Speaker at many global conferences

         randy.williams@avepoint.com
         http://linkd.in/plEEb1
         @tweetraw
Agenda

  1. Understanding new limits
  2. Achieving larger capacities
  3. Remote BLOB storage (RBS)
  4. Summary and Q&A
Agenda

  1. Understanding new limits
The SharePoint storage dilemma

• Documents, databases, and BLOBs
• Storage growth

[Diagram: SharePoint on SQL Server 2008/R2 with multiple content databases; active content compared with actual content]
Previously supported limits

• 200 GB: general use scenarios
• 1 TB: large, single-site repositories and archives (records center)
• 100 GB site collection *

* A larger site collection is supported if it
is the only site collection in the database
Revised limits (July '11)

• 200 GB: general use scenarios
• 4 TB: all scenarios (caveats apply)
• No explicit limit: document archive scenario (caveats apply)
• Site collection: no explicit size; limited by scenario, database size, and item count
Understanding scenarios

• SharePoint is multi-purpose
• Scenario primarily refers to needs and
  usage patterns
  ●   Read/write centric
  ●   Concurrent users
  ●   Average/peak loads
  ●   Recovery objectives
• Isolate different usage patterns to
  separate databases
Common scenarios

Record Center
• Long term retention
• Low volatility – very few write operations
• Limited reads
Larger databases

Team Site
• Day to day collaboration w/ shorter retention
• Higher volatility
• Higher reads
Smaller databases
What are the 4TB-level caveats?

   • A larger db requires faster storage
        ●   Between 0.25 – 2.0 IOPS/GB
        ●   4TB DB : 1000 IOPS minimum
   • Plans developed for DR/HA
   • Capacity planning/perf testing
   • Recognize added complexity
        ●   Skilled architects and proactive admins
   • 60M total item limit per db
http://technet.microsoft.com/en-us/library/cc262787.aspx
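
The IOPS range above can be turned into a quick sizing check. This is a minimal sketch in Python, assuming 1 TB is taken as 1,000 GB to match the slide's rounded 1000 IOPS figure; the function name `required_iops` is hypothetical and not part of any SharePoint or SQL Server tooling.

```python
def required_iops(db_size_gb: float, iops_per_gb: float = 0.25) -> float:
    """Return the disk IOPS needed to sustain iops_per_gb for a database of db_size_gb."""
    if db_size_gb < 0 or iops_per_gb < 0:
        raise ValueError("size and IOPS density must be non-negative")
    return db_size_gb * iops_per_gb


# A 4 TB database (assumed to be 4,000 GB) at the 0.25 IOPS/GB floor
minimum = required_iops(4000, 0.25)
# The same database at the 2.0 IOPS/GB upper end of the range
upper = required_iops(4000, 2.0)
print(minimum, upper)
```

The gap between the two results (1,000 versus 8,000 IOPS) shows why the storage subsystem needs to be tested rather than assumed.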
What are the >4TB caveats?

   • All 4TB caveats, plus
   • Document Center or Record Center only
   • In any given month
        ●   <5% of content accessed
        ●   <1% of content modified
   • No alerts, user workflow, item-level
     security, etc.


http://technet.microsoft.com/en-us/library/cc262787.aspx
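
The monthly access and modification thresholds above can be checked with simple ratios. This is a rough sketch, assuming "content" is counted in items (it could equally be measured in bytes); `meets_archive_thresholds` is a hypothetical helper, and the counts would have to come from your own usage reporting.

```python
def meets_archive_thresholds(total: int, accessed: int, modified: int) -> bool:
    """True when monthly access is under 5% and modification under 1% of content."""
    if total <= 0:
        raise ValueError("total must be positive")
    return accessed / total < 0.05 and modified / total < 0.01


# 1,000,000 items, 40,000 accessed and 5,000 modified in the month
print(meets_archive_thresholds(1_000_000, 40_000, 5_000))
```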
Why is 200GB still a good number?

• Support operations are much easier
• Better performance
  ●   The larger the db, the slower it gets
• Easier to meet backup and recovery
  objectives
  ●   Most recoveries begin with a db restore
  ●   Can you meet your recovery objectives?
• Patching / upgrading is faster
Why are larger DBs slower?

• Select queries take longer
  ●   More rows to filter, group and sort
• Write queries take longer
• Locking escalation
  ●   More blocking
• More data, but data cache same size
• DB maintenance takes longer
  ●   reindex
  ●   dbcc checkdb
What happens as size increases?




http://technet.microsoft.com/en-us/library/hh395916.aspx
Demo

SIZE AFFECTS PERFORMANCE
Agenda

  2. Achieving larger capacities
Achieving storage performance

• Storage array (RAID 1+0)
  ●   10 300GB SAS drives, 15k RPM
  ●   1.5 TB effective space
  ●   ~1500 IOPS = 1.0 IOPS/GB
• Set of drives (RAID 1+0)
  ●   4 750GB SATA drives, 10k RPM
  ●   1.5 TB effective space
  ●   ~300 IOPS = 0.2 IOPS/GB
• Go with higher quality storage
  ●   SAS > SATA ; SAN > DAS
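
The arithmetic behind the two arrays above can be sketched as follows. This assumes RAID 1+0 yields half the raw capacity and takes the total IOPS figures (~1500 and ~300) directly from the slide as inputs rather than deriving them; the function names are hypothetical.

```python
def raid10_usable_gb(drive_count: int, drive_size_gb: float) -> float:
    """Usable capacity of a RAID 1+0 set: half the raw capacity, since every drive is mirrored."""
    if drive_count < 4 or drive_count % 2:
        raise ValueError("RAID 1+0 needs an even number of drives, at least 4")
    return drive_count * drive_size_gb / 2


def iops_density(total_iops: float, usable_gb: float) -> float:
    """IOPS available per GB of usable space."""
    return total_iops / usable_gb


sas_gb = raid10_usable_gb(10, 300)    # ten 300 GB SAS drives
sata_gb = raid10_usable_gb(4, 750)    # four 750 GB SATA drives
print(sas_gb, iops_density(1500, sas_gb))
print(sata_gb, iops_density(300, sata_gb))
```

Both sets offer the same 1.5 TB of usable space, but the ten-drive SAS array delivers five times the IOPS per GB, which is the point of the comparison.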
Scaling storage

• Multiple storage arrays (RAID 1+0)
• Break out into multiple LUNs
• Add additional data files to DB, one per
  array
• Advice
   ●   Many smaller drives > fewer larger ones
   ●   RAID 1+0 > RAID 5

Example file layout
   ●   Data: F: SP_DocCenter_1.mdf
   ●   Data: G: SP_DocCenter_2.ndf
   ●   Data: H: SP_DocCenter_3.ndf
   ●   Data: I: SP_DocCenter_4.ndf
   ●   Log: J: SP_DocCenter.ldf
Additional performance guidance
    • How many data files?
        ●   Advice varies – between 0.25 and 1 per physical CPU
        ●   Each on a different spindle/LUN
    • Adjust database growth settings
        ●   Use 50-100MB for each data file
        ●   Use 20-40MB for log
    • Enable instant file initialization
    • Optimize tempdb
        ●   Use multiple data files
        ●   Pre-size to 25% of largest db
        ●   RAID 1+0
http://slidesha.re/pwVlJM
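
The data file and tempdb rules of thumb above can be expressed as a small calculation. This is a sketch only: rounding the low end up to a whole file is an assumption, and both function names are hypothetical.

```python
import math
from typing import Tuple


def data_file_range(physical_cpus: int) -> Tuple[int, int]:
    """Low and high data-file counts from the 0.25-to-1-per-CPU guidance (low end rounded up)."""
    if physical_cpus < 1:
        raise ValueError("need at least one CPU")
    low = max(1, math.ceil(physical_cpus * 0.25))
    return low, physical_cpus


def tempdb_presize_gb(largest_db_gb: float) -> float:
    """Pre-size tempdb to 25% of the largest database."""
    return largest_db_gb * 0.25


print(data_file_range(8))        # file count range for an 8-CPU server
print(tempdb_presize_gb(200))    # tempdb size for a 200 GB largest database
```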
Demo (if time permits)

DB SETTINGS AFFECT PERFORMANCE
Achieving Disaster Recovery

   • Built-in SharePoint backup is incapable of
     working with large capacities
        ●   Site collection backup limit : 15GB
        ●   Practical database backup limit : 200GB
   • Look at your backup/recovery objectives
        ●   Most recoveries involve a database restore
   • Look for third-party solutions
   • Deploy SP1 – site recycle bin

http://slidesha.re/rlv3u1
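
Since most recoveries involve a database restore, a back-of-the-envelope estimate helps when looking at recovery objectives. The sketch below assumes a sustained restore throughput that you would have to measure in your own environment; the 100 MB/s figure is purely illustrative, and the function names are hypothetical.

```python
def restore_hours(db_size_gb: float, throughput_mb_per_s: float) -> float:
    """Estimated hours to restore a database at a sustained throughput (1 GB = 1024 MB)."""
    if throughput_mb_per_s <= 0:
        raise ValueError("throughput must be positive")
    return db_size_gb * 1024 / throughput_mb_per_s / 3600


def meets_rto(db_size_gb: float, throughput_mb_per_s: float, rto_hours: float) -> bool:
    """True when the estimated restore time fits within the recovery time objective."""
    return restore_hours(db_size_gb, throughput_mb_per_s) <= rto_hours


# Assumed 100 MB/s restore rate: a 200 GB database against a 1-hour objective,
# and a 4 TB (4096 GB) database against a 4-hour objective
print(meets_rto(200, 100, 1))
print(meets_rto(4096, 100, 4))
```

The estimate covers only the data transfer; verification, log replay, and reattaching the database to the farm add to the real figure.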
Agenda

  3. Remote BLOB storage (RBS)
Remote BLOB Storage (RBS)

• Stores documents (BLOBs) outside the
  database
  ●   Reduces database size
• Cannot be used to scale beyond database
  limits
  ●   Effective size = DB size + BLOB store
• Can externalize based on document size
• Built-in RBS support with SQL Server
  2008 (FILESTREAM provider)
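
Because the supported limit applies to the effective size, externalized BLOBs still count. A minimal sketch of that check, assuming 4 TB is taken as 4,000 GB; the function names are hypothetical.

```python
def effective_size_gb(db_size_gb: float, blob_store_gb: float) -> float:
    """Effective size = database size + externalized BLOB store."""
    return db_size_gb + blob_store_gb


def within_limit(db_size_gb: float, blob_store_gb: float, limit_gb: float) -> bool:
    """True when the effective size stays at or under the supported limit."""
    return effective_size_gb(db_size_gb, blob_store_gb) <= limit_gb


# A 500 GB database with 3,800 GB externalized, against a 4 TB (4,000 GB) limit
print(effective_size_gb(500, 3800))
print(within_limit(500, 3800, 4000))
```

The database file itself is small here, yet the content database as a whole is over the limit.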
Overview of BLOB externalization

[Diagram: Upload, Web Front-end, RBS, SQL Server holding a pointer (stub), and File System]

Externalized BLOB is transparent to both
SharePoint and its users
Advantages of externalizing BLOBs

• Reduce storage costs
• Increase performance
  ●   Read & write
  ●   All other activity by users of the DB and SQL server
• Access to features of BLOB storage
  platform
• Efficient content restructure
  ●   Shallow copy in SP1
Advantages of keeping BLOBs in SQL
• One storage container to
   ●   Maintain
   ●   Monitor
   ●   Recover
• Tier I storage
   ●   Performance relative to lower tiers of storage
       benefits all content access
• SQL caching
   ●   Performance of reads/writes of small documents
   ●   SQL caching benefits reads
RBS Guidance

• Consider using in document-heavy databases
• Trade off
  ●   Storage cost & performance benefits versus
  ●   More complex architecture (support, DR, HA)
• Consider third party providers
  ●   More full-featured solutions
• In general
  ●   Do not externalize <1MB documents
  ●   Ideal number varies widely
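
The size-based guidance above amounts to a threshold rule. The sketch below is illustrative only: it is not the RBS provider API, the 1 MB default simply mirrors the slide, and `should_externalize` is a hypothetical name. Since the ideal number varies widely, the threshold is a parameter.

```python
ONE_MB = 1024 * 1024


def should_externalize(doc_size_bytes: int, threshold_bytes: int = ONE_MB) -> bool:
    """Externalize only documents at or above the threshold; smaller ones stay in SQL."""
    if doc_size_bytes < 0:
        raise ValueError("size must be non-negative")
    return doc_size_bytes >= threshold_bytes


print(should_externalize(512 * 1024))    # 512 KB document
print(should_externalize(5 * ONE_MB))    # 5 MB document
```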
Agenda

  4. Summary and Q&A
In review

• 4TB is the new supported limit for all
  scenarios
• No limit for record/document centers
• Keys to achieving larger sizes
  ●   Storage performance planning/testing
  ●   DR/HA planning/testing
• RBS offers benefits but does not extend
  these limits
Your Feedback is Important

 Please fill out a session evaluation form and
  drop it off at the conference registration
                      desk.

                Thank you!
Questions?
  randy.williams@avepoint.com
  http://linkd.in/plEEb1
  @tweetraw
Editor's notes

  1. Introduce the concept of documents being stored as BLOBs in the content database (CDB). BUILD: diagram of architecture. Discuss storage growth. BUILD: bloat of data, mostly inactive. BUILD: burden on CDBs. Discuss the need to think about storage holistically: lifecycle, compliance, SLAs, cost.