SlideShare une entreprise Scribd logo
1  sur  54
Considerations for large-scale SharePoint deployments on SQL Server Name:		Joel Oleson Title:		Sr. Tech Prod Mgr Company:	Quest Software
Audience Poll New to SharePoint? SQL Admins? Large-scale Implementation (>1TB) experience? Scalability or performance issues in SharePoint deployments?
Session Overview  Lightweight Understanding SharePoint databases SQL Performance SQL Server 2008 with SharePoint Heavyweight Architectural Design Considerations Real-world scenarios Business Requirements Logical and Physical Architecture Architectural Design Statistical Results Appendix: DB Sizes, Content Distribution…
= Lightweight
Real World Examples Information based on real-world, large-scale SharePoint Implementations. Large software company (Microsoft) Intranet Portal for 120K users Global Enterprise Collaboration Solution (~20TB) Scalable Hosting Solution (SharePoint Online) Large automotive manufacturer Loan Origination Application / Document Repository ~50 Million content items (~6 TB)
Understanding the SharePoint Databases
Disk I/O Demand Most Demand Medium  Demand Low  Demand *Content.. Search Config Temp Model +SSP Master Tlogs * Except during backup and Indexing  + Except during Profile Import
Top Performance Killers Indexing/Crawling Backup (SQL & Tape) Profile Import Misc Timer Jobs – User Sync for large #s of Users STSADM Backup/Restore Large List Operations Heavy User Operation List Import/Write
Content Db
Config
SQL Server 2008 with Windows Server 2008 Transactional Performance with SQL Server 2008 Dramatically outperformed SQL 2005 on Win 2003. Compressed backup in the box Support for SQL External Blob Storage Increased resiliency Transparent Encryption See Performance Gains athttp://msdn.microsoft.com/en-us/library/dd263442.aspx
= heavyweight
Architectural Design Considerations Database Volumes Separate database volumes into unique LUN’s consisting of unique physical disk spindles. In a heavily read-oriented internet (portal) site, prioritize data over logs. Separate out Search database transaction log from content database transaction logs.
Architectural Design Considerations SQL TempDB Data Files Optimal TempDB data file sizes can be calculated using the following formula: [MAX DB SIZE (KB)] X [.25] / [# CORES] = DATA FILE SIZE (KB) Calculation result (starting size) should be roughly equal to 25% of the largest content or search DB. Use RAID 10; separate LUN from other database objects (content, search, etc…). “Autogrow” feature set to a fixed amount; if auto grow occurs, permanently increase TempDB size. TempDB Log file separated to unique LUN.
Architectural Design Considerations Content Databases 100 content databases per Web application 100GB per content database CAUTION: DB locking issues reported in collaborative DM scenarios above 100GB Need to ensure that you understand the issues based on number of users, usage profiles, etc… Service Level Agreement (SLA) requirements for backup and restore will also have an impact on this decision.
Architectural Design Considerations Content Databases - Continued Pre-construct and pre-size Use RAID 5 or RAID 10 logical units RAID 10 is the best choice when cost is not a concern.  RAID 5 will be sufficient and will save on costs, since content databases tend to be more read intensive than write intensive. Multi-core computer running SQL Server Primary file group could consist of a data file for each CPU core present in SQL Server.
Architectural Design Considerations Database Maintenance SQL Server SP2 is needed if using the DB maintenance wizard (KB930887). Plan regular defrag of databases Performance - Average Disk Queue Length Single Digit values are optimal. Occasional double-digit values aren’t a large concern. Sustained triple-digit values require attention.
Architectural Design Considerations Performance The recommended practice for separating the database volume types for the transaction log files to unique LUN’s follows. Content Database Log Files. Search Database Log Files. Consider filegroups for search database
Architectural Design Considerations Topology A single list should not have more than 2,000 items per list container. A container represents the root of the list, as well as any folders within the list; a folder is a container because other list items are stored within it. Whitepaper: Working with large lists in Office SharePoint Server 2007 (Steve Peschka) http://go.microsoft.com/fwlink/?LinkId=95450 Disk Drive Speed 15K RPM recommended. IIS Application Pools Ensure “Max Used Memory” setting utilizes all the available RAM in your WFE’s.
Architectural Design Considerations STSADM Command-line Tool and CreateSiteInNewDBOperation Gary Lapoint STSADM Extensions for Site Collection DB maintenance Codeplex.com/governance tools for archive & delete capture
Large Scale Manufacture
Real-world Scenarios Automotive Mfgr. Business Requirements (Phase I) Loan Origination Application built on Office SharePoint Server 2007 Ability to manage10.5 million images. System performance with a “normal” input load defined as receipt of 27,000 images per business day = 10 hours. Simulate user load to represent 200 users for search, view & update with 2x peak
Real-world Scenarios Data Load Process (Phase I) Used KnowledgeLake Document Release Engine Loaded 9.17 documents/second per server  Employs a high-volume, storage-based folder architecture within SharePoint to ensure UI responsiveness. Executed on 4 servers. Using this application, we were able to achieve: An average document load throughput of 36.6 documents per second! An average daily input of 3.17 million documents! 10.5 million documents with only 28% utilization!
Real-world Scenarios Data Load Process (Phase II) 15 million documents consisting of Word (.docx), Excel (.xlsx), PowerPoint (.pptx) and Adobe PDF. Five Web Front-Ends were used for the load process. Peak Load Rate: 24.3 docs per second/2.1 million documents per day. Average Load Rate: ~1.9 million documents per day. Load Time: 8 days.NOTE: Load rates included  automation process that created the PDF files.
What does the logical architecture look like?!
What does the physical architecture look like?! Scale OUT… Scale UP…
What does the site topology look like?! Phase I 17 Divisional Site Collections / DB’s Phase II 10 Departmental Site Collections / DB’s
What does the storage architecture look like?
Database SizesPhase I
Architectural Design Statistical ResultsPhase I Designed Once / Built Once No architecture OR configuration changes were required after the initial build was completed. 10.5+ million documents loaded into the system in approximately 60 hours! Full Crawl indexed 10 Million items in 32 hours! Average content database size for divisional breakouts was 60GB
Architectural Design Statistical ResultsPhase II Search database size was 539GB. Lesson Learned: Large search database caused disk I/O contention; break this out into multiple data file allocations matching the number of core processors on SQL Server, and spread them over unique LUN’s. Total Index size was 162GB! Average Content database size for Divisional breakouts was 200.65GB! Average Content database size for Departmental breakouts was 137.60GB!
Large Scale Pharma
Real-world Scenarios Pharmaceutical Business Requirements Collaboration Portal built on Office SharePoint Server 2007 Validate ~40TB of content storage. Identify performance characteristics and provide guidance around content database sizing FAST search integration
Real-world Scenarios Data Load Process 71,524,357 documents loaded across two SharePoint Farms 10.92 days! Content was spread across the farms into 165 unique content databases. 6,240 Site Collections, each containing 10 sub-sites for a total of 62,400 sites. Database sizes were pre-configured to vary in size from 100GB to 350GB to determine performance and/or SLA impacts.
What does the logical architecture look like?!
What does the physical architecture look like?!
What does the site topology look like?! 165 Content DB’s 6,240 Site Collections 10 Sub-Sites in each collection: 62,400 Sites!
What does the storage architecture look like?
Architectural Design Statistical ResultsConclusion User Loads Stress tests included 2 - 3,000 concurrent users. Based on the 10% rule, testing completed equated to an environment representing 300,000 users! RAWnumber of RPS during peak times is 1,469 at Pharma. 773 RPS, which equates to 346.59 ACTUAL RPS! FAST Search Integration Successfully integrated FAST search capabilities, indexed content corpus and served search results as expected.
Large-Scale Case Study Available SharePoint Scalability and Performance Whitepaper Contains majority of content you will see here, along with test results you won’t see here. TechNet topic: http://go.microsoft.com/fwlink/?LinkId=120901 Word 2007 format: http://go.microsoft.com/fwlink/?LinkId=120881 Word 2000-2003 format: http://go.microsoft.com/fwlink/?LinkId=120890 PDF format: http://go.microsoft.com/fwlink/?LinkId=120891
question & answer
Appendix
Database SizesMPSC/Nissan Phase I
Database SizesMPSC/Nissan Phase II
Performance of Components Over Time MPSC/Nissan Phase I 14 individual performance tests were run to simulate various load scenarios.
How do we pull all this together?! PharmaContent Database Distribution Substitute “F1” with SQL Server number to generate unique DB’s Farm 1: 2 SQL Farm 2: 1 SQL 165 Content Databases!
How do we pull all this together?! PharmaData Load Statistics
Architectural Design Statistical ResultsTesting Results – 300GB Content Databases
Architectural Design Statistical ResultsTesting Results – 350GB Content Databases
Architectural Design Statistical ResultsTesting Results – 250GB Content Databases
Architectural Design Statistical ResultsTesting Results – 150GB Content Databases
Required Slide © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation.  Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation.  MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Contenu connexe

Tendances

SharePoint 2010 – Installation and maintenance – best practices
SharePoint 2010 – Installation and maintenance – best practicesSharePoint 2010 – Installation and maintenance – best practices
SharePoint 2010 – Installation and maintenance – best practices
Toni Frankola
 
Ultimate SharePoint 2013 Infrastructure Best Practices Session - SPKSLO 2012
Ultimate SharePoint 2013 Infrastructure Best Practices Session - SPKSLO 2012Ultimate SharePoint 2013 Infrastructure Best Practices Session - SPKSLO 2012
Ultimate SharePoint 2013 Infrastructure Best Practices Session - SPKSLO 2012
Michael Noel
 

Tendances (20)

What SharePoint Admins need to know about SQL-Cinncinati
What SharePoint Admins need to know about SQL-CinncinatiWhat SharePoint Admins need to know about SQL-Cinncinati
What SharePoint Admins need to know about SQL-Cinncinati
 
SharePoint 2010 database maintenance
SharePoint 2010 database maintenanceSharePoint 2010 database maintenance
SharePoint 2010 database maintenance
 
SharePoint 2010 – Installation and maintenance – best practices
SharePoint 2010 – Installation and maintenance – best practicesSharePoint 2010 – Installation and maintenance – best practices
SharePoint 2010 – Installation and maintenance – best practices
 
No Data Left Behind: A SharePoint 2013 Migration
No Data Left Behind: A SharePoint 2013 MigrationNo Data Left Behind: A SharePoint 2013 Migration
No Data Left Behind: A SharePoint 2013 Migration
 
Find a needle in Haystack: Facebook's storage system
Find a needle in Haystack: Facebook's storage systemFind a needle in Haystack: Facebook's storage system
Find a needle in Haystack: Facebook's storage system
 
Auditing and Monitoring PostgreSQL/EPAS
Auditing and Monitoring PostgreSQL/EPASAuditing and Monitoring PostgreSQL/EPAS
Auditing and Monitoring PostgreSQL/EPAS
 
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
 
Mongodb
MongodbMongodb
Mongodb
 
Application Development & Database Choices: Postgres Support for non Relation...
Application Development & Database Choices: Postgres Support for non Relation...Application Development & Database Choices: Postgres Support for non Relation...
Application Development & Database Choices: Postgres Support for non Relation...
 
Big data with HDFS and Mapreduce
Big data  with HDFS and MapreduceBig data  with HDFS and Mapreduce
Big data with HDFS and Mapreduce
 
Ein Expertenleitfaden für die Migration von Legacy-Datenbanken zu PostgreSQL
Ein Expertenleitfaden für die Migration von Legacy-Datenbanken zu PostgreSQLEin Expertenleitfaden für die Migration von Legacy-Datenbanken zu PostgreSQL
Ein Expertenleitfaden für die Migration von Legacy-Datenbanken zu PostgreSQL
 
Ultimate SharePoint 2013 Infrastructure Best Practices Session - SPKSLO 2012
Ultimate SharePoint 2013 Infrastructure Best Practices Session - SPKSLO 2012Ultimate SharePoint 2013 Infrastructure Best Practices Session - SPKSLO 2012
Ultimate SharePoint 2013 Infrastructure Best Practices Session - SPKSLO 2012
 
ГАННА КАПЛУН «noSQL vs SQL: порівняння використання реляційних та нереляційни...
ГАННА КАПЛУН «noSQL vs SQL: порівняння використання реляційних та нереляційни...ГАННА КАПЛУН «noSQL vs SQL: порівняння використання реляційних та нереляційни...
ГАННА КАПЛУН «noSQL vs SQL: порівняння використання реляційних та нереляційни...
 
Highlights from SharePoint Conference 2011
Highlights from SharePoint Conference 2011Highlights from SharePoint Conference 2011
Highlights from SharePoint Conference 2011
 
Responsive Web Design ~ Best Practices for Maximizing ROI
Responsive Web Design ~ Best Practices for Maximizing ROIResponsive Web Design ~ Best Practices for Maximizing ROI
Responsive Web Design ~ Best Practices for Maximizing ROI
 
Situation
SituationSituation
Situation
 
Preparing for Upgrade to SharePoint 2010 Today
Preparing for Upgrade to SharePoint 2010 TodayPreparing for Upgrade to SharePoint 2010 Today
Preparing for Upgrade to SharePoint 2010 Today
 
HDFS Analysis for Small Files
HDFS Analysis for Small FilesHDFS Analysis for Small Files
HDFS Analysis for Small Files
 
Connected at the hip for MS BI: SharePoint and SQL
Connected at the hip for MS BI: SharePoint and SQLConnected at the hip for MS BI: SharePoint and SQL
Connected at the hip for MS BI: SharePoint and SQL
 
GSA Webinar - June 2, 2011
GSA Webinar - June 2, 2011GSA Webinar - June 2, 2011
GSA Webinar - June 2, 2011
 

En vedette (8)

UTPL_EDUCACION-AMIGOS COMO TU
UTPL_EDUCACION-AMIGOS COMO TUUTPL_EDUCACION-AMIGOS COMO TU
UTPL_EDUCACION-AMIGOS COMO TU
 
W I S P
W I S PW I S P
W I S P
 
ERI Tutorial
ERI TutorialERI Tutorial
ERI Tutorial
 
Mike Vaillancourt Art Direction/Graphic Design/Illustration
Mike Vaillancourt Art Direction/Graphic Design/IllustrationMike Vaillancourt Art Direction/Graphic Design/Illustration
Mike Vaillancourt Art Direction/Graphic Design/Illustration
 
Sample SEO Best Practice Guide
Sample SEO Best Practice GuideSample SEO Best Practice Guide
Sample SEO Best Practice Guide
 
Cards For Cancer
Cards For CancerCards For Cancer
Cards For Cancer
 
Guia para Menores en Internet
Guia para Menores en InternetGuia para Menores en Internet
Guia para Menores en Internet
 
Best Practices to SharePoint Architecture Fundamentals NZ & AUS
Best Practices to SharePoint Architecture Fundamentals NZ & AUSBest Practices to SharePoint Architecture Fundamentals NZ & AUS
Best Practices to SharePoint Architecture Fundamentals NZ & AUS
 

Similaire à SharePoint and Large Scale SQL Deployments - NZSPC

Sql And Storage Considerations For Share Point Server 2010
Sql And Storage Considerations For Share Point Server 2010Sql And Storage Considerations For Share Point Server 2010
Sql And Storage Considerations For Share Point Server 2010
Mike Watson
 
MOSS 2007 Deployment Fundamentals -Part2
MOSS 2007 Deployment Fundamentals -Part2MOSS 2007 Deployment Fundamentals -Part2
MOSS 2007 Deployment Fundamentals -Part2
Information Technology
 
AX2012 Technical Track - Infrastructure, Davy Vliegen
AX2012 Technical Track - Infrastructure, Davy VliegenAX2012 Technical Track - Infrastructure, Davy Vliegen
AX2012 Technical Track - Infrastructure, Davy Vliegen
dynamicscom
 
NZSPC 2013 - Ultimate SharePoint Infrastructure Best Practices Session
NZSPC 2013 - Ultimate SharePoint Infrastructure Best Practices SessionNZSPC 2013 - Ultimate SharePoint Infrastructure Best Practices Session
NZSPC 2013 - Ultimate SharePoint Infrastructure Best Practices Session
Michael Noel
 

Similaire à SharePoint and Large Scale SQL Deployments - NZSPC (20)

SharePoint Intelligence Real World Business Workflow With Share Point Designe...
SharePoint Intelligence Real World Business Workflow With Share Point Designe...SharePoint Intelligence Real World Business Workflow With Share Point Designe...
SharePoint Intelligence Real World Business Workflow With Share Point Designe...
 
Real world business workflow with SharePoint designer 2013
Real world business workflow with SharePoint designer 2013Real world business workflow with SharePoint designer 2013
Real world business workflow with SharePoint designer 2013
 
Large Scale SQL Considerations for SharePoint Deployments
Large Scale SQL Considerations for SharePoint DeploymentsLarge Scale SQL Considerations for SharePoint Deployments
Large Scale SQL Considerations for SharePoint Deployments
 
Sql Health in a SharePoint environment
Sql Health in a SharePoint environmentSql Health in a SharePoint environment
Sql Health in a SharePoint environment
 
Sql And Storage Considerations For Share Point Server 2010
Sql And Storage Considerations For Share Point Server 2010Sql And Storage Considerations For Share Point Server 2010
Sql And Storage Considerations For Share Point Server 2010
 
SharePoint 2010 Boost your farm performance!
SharePoint 2010 Boost your farm performance!SharePoint 2010 Boost your farm performance!
SharePoint 2010 Boost your farm performance!
 
MOSS 2007 Deployment Fundamentals -Part2
MOSS 2007 Deployment Fundamentals -Part2MOSS 2007 Deployment Fundamentals -Part2
MOSS 2007 Deployment Fundamentals -Part2
 
Building the Perfect SharePoint 2010 Farm
Building the Perfect SharePoint 2010 FarmBuilding the Perfect SharePoint 2010 Farm
Building the Perfect SharePoint 2010 Farm
 
Unity Connect - Getting SQL Spinning with SharePoint - Best Practices for the...
Unity Connect - Getting SQL Spinning with SharePoint - Best Practices for the...Unity Connect - Getting SQL Spinning with SharePoint - Best Practices for the...
Unity Connect - Getting SQL Spinning with SharePoint - Best Practices for the...
 
Optimize SQL server performance for SharePoint
Optimize SQL server performance for SharePointOptimize SQL server performance for SharePoint
Optimize SQL server performance for SharePoint
 
Database Performance Management in Cloud
Database Performance Management in CloudDatabase Performance Management in Cloud
Database Performance Management in Cloud
 
In-memory ColumnStore Index
In-memory ColumnStore IndexIn-memory ColumnStore Index
In-memory ColumnStore Index
 
AX2012 Technical Track - Infrastructure, Davy Vliegen
AX2012 Technical Track - Infrastructure, Davy VliegenAX2012 Technical Track - Infrastructure, Davy Vliegen
AX2012 Technical Track - Infrastructure, Davy Vliegen
 
SPSMadrid Get sql spinning with SharePoint. Best practice for the back end
SPSMadrid Get sql spinning with SharePoint. Best practice for the back endSPSMadrid Get sql spinning with SharePoint. Best practice for the back end
SPSMadrid Get sql spinning with SharePoint. Best practice for the back end
 
NZSPC 2013 - Ultimate SharePoint Infrastructure Best Practices Session
NZSPC 2013 - Ultimate SharePoint Infrastructure Best Practices SessionNZSPC 2013 - Ultimate SharePoint Infrastructure Best Practices Session
NZSPC 2013 - Ultimate SharePoint Infrastructure Best Practices Session
 
Big Data Analytics from Azure Cloud to Power BI Mobile
Big Data Analytics from Azure Cloud to Power BI MobileBig Data Analytics from Azure Cloud to Power BI Mobile
Big Data Analytics from Azure Cloud to Power BI Mobile
 
SharePoint Intelligence Extending Share Point Designer 2010 Workflows With Cu...
SharePoint Intelligence Extending Share Point Designer 2010 Workflows With Cu...SharePoint Intelligence Extending Share Point Designer 2010 Workflows With Cu...
SharePoint Intelligence Extending Share Point Designer 2010 Workflows With Cu...
 
Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27
 
Oracle
OracleOracle
Oracle
 
Ibm db2 big sql
Ibm db2 big sqlIbm db2 big sql
Ibm db2 big sql
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Dernier (20)

🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

SharePoint and Large Scale SQL Deployments - NZSPC

  • 1.
  • 2. Considerations for large-scale SharePoint deployments on SQL Server Name: Joel Oleson Title: Sr. Tech Prod Mgr Company: Quest Software
  • 3.
  • 4. Audience Poll New to SharePoint? SQL Admins? Large-scale Implementation (>1TB) experience? Scalability or performance issues in SharePoint deployments?
  • 5. Session Overview Lightweight Understanding SharePoint databases SQL Performance SQL Server 2008 with SharePoint Heavyweight Architectural Design Considerations Real-world scenarios Business Requirements Logical and Physical Architecture Architectural Design Statistical Results Appendix: DB Sizes, Content Distribution…
  • 7. Real World Examples Information based on real-world, large-scale SharePoint Implementations. Large software company (Microsoft) Intranet Portal for 120K users Global Enterprise Collaboration Solution (~20TB) Scalable Hosting Solution (SharePoint Online) Large automotive manufacturer Loan Origination Application / Document Repository ~50 Million content items (~6 TB)
  • 9. Disk I/O Demand Most Demand Medium Demand Low Demand *Content.. Search Config Temp Model +SSP Master Tlogs * Except during backup and Indexing + Except during Profile Import
  • 10. Top Performance Killers Indexing/Crawling Backup (SQL & Tape) Profile Import Misc Timer Jobs – User Sync for large #s of Users STSADM Backup/Restore Large List Operations Heavy User Operation List Import/Write
  • 13. SQL Server 2008 with Windows Server 2008 Transactional Performance with SQL Server 2008 Dramatically outperformed SQL 2005 on Win 2003. Compressed backup in the box Support for SQL External Blob Storage Increased resiliency Transparent Encryption See Performance Gains athttp://msdn.microsoft.com/en-us/library/dd263442.aspx
  • 15. Architectural Design Considerations Database Volumes Separate database volumes into unique LUN’s consisting of unique physical disk spindles. In a heavily read-oriented internet (portal) site, prioritize data over logs. Separate out Search database transaction log from content database transaction logs.
  • 16. Architectural Design Considerations SQL TempDB Data Files Optimal TempDB data file sizes can be calculated using the following formula: [MAX DB SIZE (KB)] X [.25] / [# CORES] = DATA FILE SIZE (KB) Calculation result (starting size) should be roughly equal to 25% of the largest content or search DB. Use RAID 10; separate LUN from other database objects (content, search, etc…). “Autogrow” feature set to a fixed amount; if auto grow occurs, permanently increase TempDB size. TempDB Log file separated to unique LUN.
  • 17. Architectural Design Considerations Content Databases 100 content databases per Web application 100GB per content database CAUTION: DB locking issues reported in collaborative DM scenarios above 100GB Need to ensure that you understand the issues based on number of users, usage profiles, etc… Service Level Agreement (SLA) requirements for backup and restore will also have an impact on this decision.
  • 18. Architectural Design Considerations Content Databases - Continued Pre-construct and pre-size Use RAID 5 or RAID 10 logical units RAID 10 is the best choice when cost is not a concern. RAID 5 will be sufficient and will save on costs, since content databases tend to be more read intensive than write intensive. Multi-core computer running SQL Server Primary file group could consist of a data file for each CPU core present in SQL Server.
  • 19. Architectural Design Considerations Database Maintenance SQL Server SP2 is needed if using the DB maintenance wizard (KB930887). Plan regular defrag of databases Performance - Average Disk Queue Length Single Digit values are optimal. Occasional double-digit values aren’t a large concern. Sustained triple-digit values require attention.
  • 20. Architectural Design Considerations Performance The recommended practice for separating the database volume types for the transaction log files to unique LUN’s follows. Content Database Log Files. Search Database Log Files. Consider filegroups for search database
  • 21. Architectural Design Considerations Topology A single list should not have more than 2,000 items per list container. A container represents the root of the list, as well as any folders within the list; a folder is a container because other list items are stored within it. Whitepaper: Working with large lists in Office SharePoint Server 2007 (Steve Peschka) http://go.microsoft.com/fwlink/?LinkId=95450 Disk Drive Speed 15K RPM recommended. IIS Application Pools Ensure “Max Used Memory” setting utilizes all the available RAM in your WFE’s.
  • 22. Architectural Design Considerations STSADM Command-line Tool and CreateSiteInNewDBOperation Gary Lapoint STSADM Extensions for Site Collection DB maintenance Codeplex.com/governance tools for archive & delete capture
  • 24. Real-world Scenarios Automotive Mfgr. Business Requirements (Phase I) Loan Origination Application built on Office SharePoint Server 2007 Ability to manage10.5 million images. System performance with a “normal” input load defined as receipt of 27,000 images per business day = 10 hours. Simulate user load to represent 200 users for search, view & update with 2x peak
  • 25. Real-world Scenarios Data Load Process (Phase I) Used KnowledgeLake Document Release Engine Loaded 9.17 documents/second per server Employs a high-volume, storage-based folder architecture within SharePoint to ensure UI responsiveness. Executed on 4 servers. Using this application, we were able to achieve: An average document load throughput of 36.6 documents per second! An average daily input of 3.17 million documents! 10.5 million documents with only 28% utilization!
  • 26. Real-world Scenarios Data Load Process (Phase II) 15 million documents consisting of Word (.docx), Excel (.xlsx), PowerPoint (.pptx) and Adobe PDF. Five Web Front-Ends were used for the load process. Peak Load Rate: 24.3 docs per second/2.1 million documents per day. Average Load Rate: ~1.9 million documents per day. Load Time: 8 days.NOTE: Load rates included automation process that created the PDF files.
  • 27. What does the logical architecture look like?!
  • 28. What does the physical architecture look like?! Scale OUT… Scale UP…
  • 29. What does the site topology look like?! Phase I 17 Divisional Site Collections / DB’s Phase II 10 Departmental Site Collections / DB’s
  • 30. What does the storage architecture look like?
  • 32. Architectural Design Statistical ResultsPhase I Designed Once / Built Once No architecture OR configuration changes were required after the initial build was completed. 10.5+ million documents loaded into the system in approximately 60 hours! Full Crawl indexed 10 Million items in 32 hours! Average content database size for divisional breakouts was 60GB
  • 33. Architectural Design Statistical ResultsPhase II Search database size was 539GB. Lesson Learned: Large search database caused disk I/O contention; break this out into multiple data file allocations matching the number of core processors on SQL Server, and spread them over unique LUN’s. Total Index size was 162GB! Average Content database size for Divisional breakouts was 200.65GB! Average Content database size for Departmental breakouts was 137.60GB!
  • 35. Real-world Scenarios Pharmaceutical Business Requirements Collaboration Portal built on Office SharePoint Server 2007 Validate ~40TB of content storage. Identify performance characteristics and provide guidance around content database sizing FAST search integration
  • 36. Real-world Scenarios Data Load Process 71,524,357 documents loaded across two SharePoint Farms 10.92 days! Content was spread across the farms into 165 unique content databases. 6,240 Site Collections, each containing 10 sub-sites for a total of 62,400 sites. Database sizes were pre-configured to vary in size from 100GB to 350GB to determine performance and/or SLA impacts.
  • 37. What does the logical architecture look like?!
  • 38. What does the physical architecture look like?!
  • 39. What does the site topology look like?! 165 Content DB’s 6,240 Site Collections 10 Sub-Sites in each collection: 62,400 Sites!
  • 40. What does the storage architecture look like?
  • 41. Architectural Design Statistical ResultsConclusion User Loads Stress tests included 2 - 3,000 concurrent users. Based on the 10% rule, testing completed equated to an environment representing 300,000 users! RAWnumber of RPS during peak times is 1,469 at Pharma. 773 RPS, which equates to 346.59 ACTUAL RPS! FAST Search Integration Successfully integrated FAST search capabilities, indexed content corpus and served search results as expected.
  • 42. Large-Scale Case Study Available SharePoint Scalability and Performance Whitepaper Contains majority of content you will see here, along with test results you won’t see here. TechNet topic: http://go.microsoft.com/fwlink/?LinkId=120901 Word 2007 format: http://go.microsoft.com/fwlink/?LinkId=120881 Word 2000-2003 format: http://go.microsoft.com/fwlink/?LinkId=120890 PDF format: http://go.microsoft.com/fwlink/?LinkId=120891
  • 47. Performance of Components Over Time MPSC/Nissan Phase I 14 individual performance tests were run to simulate various load scenarios.
  • 48. How do we pull all this together?! PharmaContent Database Distribution Substitute “F1” with SQL Server number to generate unique DB’s Farm 1: 2 SQL Farm 2: 1 SQL 165 Content Databases!
  • 49. How do we pull all this together?! PharmaData Load Statistics
  • 50. Architectural Design Statistical ResultsTesting Results – 300GB Content Databases
  • 51. Architectural Design Statistical ResultsTesting Results – 350GB Content Databases
  • 52. Architectural Design Statistical ResultsTesting Results – 250GB Content Databases
  • 53. Architectural Design Statistical ResultsTesting Results – 150GB Content Databases
  • 54. Required Slide © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Notes de l'éditeur

  1. Pre-construct and pre-size your content databases. Once the content database size has been specified, it is recommended that the database be created using a script that appropriately generates the empty database. Note that the “Autogrow” feature should be left on to prevent any future issues.Place the content database file or files on RAID 5 or RAID 10 logical units. RAID 10 is the best choice when cost is not a concern. RAID 5 will be sufficient and will save on costs, since content databases tend to be more read intensive than write intensive.For a large-scale document management solution, with a multi-core computer running SQL Server, the primary file group for the content database could potentially consist of a data file for each CPU core present in SQL Server. If possible, move each data file to separate logical units consisting of unique physical disk spindles.Database storage for content items will be between 1.2 and 1.5 time the raw file size when stored in SharePoint.
  2. If you would like to host your demo on the Virtual Server, please use the myVPC demo slide, not this slide.