SlideShare une entreprise Scribd logo
1  sur  42
November 23rd, 2013

Search Topology and Optimization

Mike Maadarani
SharePoint Architect
mike@maadarani.com
Thank you to all of our Sponsors!!
Bio..

Mike Maadarani
App Dev and Architecture for over 18 years (15 Years Microsoft, 3
Years with the “Other Guys”)
Business focused on Enterprise Content Management & Publishing
Sites

Technology focused on SharePoint, SQL Server and SharePoint
Integration
Architect, trainer, and presenter
Blog: www.maadarani.com

mike@maadarani.com; @mikemaadarani
Architecture and Resource
Utilization
Configuring SSA
and PS

Agenda
SharePoint 2013
Search Overview

Topology
Scenarios

Relevancy, Query Builder, &
Optimization
Closing and Q&A
Search in 2010

SharePoint 2010 Search Service Application
Query Component
Query
Engine
Property
Store
(SQL)

Crawl Component
FAST Search for SharePoint 2010

FAST
Content
SSA

FAST back-end components
(managed separately)

Extensibility:
• Sandbox
• Entity
Extraction

FAST
Query
SSA
… In SharePoint 2013

SharePoint 2013 Search Service Application
Extensibility:
• Web
callout
• Entity
Extraction
Crawl
Component

Crawl

Index
Component
Query
Engine

Content
Processing
Component
Content
Pipeline

Analytics
Processing
Component

Separate crawl
and indexing

Query
Pipeline

Query
Processing
Component

Property
Store
(SQL)

Indexing
Engine
Admin
Component
Analysis
Engine

Search
Admin

Link/query analysis
& recommendations

Entire index
on local disk
SharePoint 2013 Search Architecture

Public API

Search topology components

HTTP
File shares
SharePoint
User profiles
Lotus Notes
Documentum
Exchange folders
Custom - BCS

SharePoint
SP Apps
Devices
Non-SP UX
Why Search is so important?

FAST

I just uploaded a
document.
Make it searchable, quick!
Why Search is so important?

EASY
Why Search is so important?

EASY
Why Search is so important?

Search Driven Applications
Why Search is so important?

Search Everything

I can find ALL of Rob
Ford’s hidden videos!
Where does Search live in the farm?

Windows services

Processes

SharePoint Search Host Controller
service

Noderunner.exe
Runtime environment for search
components (except crawler)

Runtime/lifecycle control of search
components (except crawler)
 hostcontrollerservice.exe

SharePoint Server Search service

SharePoint App Server

Crawl Component
 mssearch.exe
 mssdmn.exe

Still there, but only
Crawl Component

Search Service Instance: Provisioning of
the search service on each box
Search Service Application: SharePoint
Configuration entity
hostcontrollerservice.exe

Host Controller

Search Runtime Environment

msseearch.exe
mssdmn.exe
Crawl
Component

noderunner.exe

noderunner.exe

noderunner.exe

noderunner.exe

noderunner.exe

Admin
Component

Query
Processing
Component

Content
Processing
Component

Index
Component

Analytics
Processing
Component
Where do I host my
components?
Query processing component (QPC)

CPU load

Load impact (relative)

Driving factors
QPS
Query transformations

Network load
Driving factors
Number of index partitions

Size of queries and results

Example:
20 index partitions @ 20 qps => 200/100 Mbit/s
in/outbound

Item
count
CPU

DPS
Network

QPS
Disk
Index component

CPU load

Load impact (relative)

Driving factors
QPS and item count

Guidelines per index component @ 2 GHz CPU
1M items: 5 QPS per CPU core
5M items: 2 QPS per CPU core
10M items: 1 QPS per CPU core

Disk load
Driving factors
QPS and item count
New content invalidates caches

Disk size: 500GB @ 10M items per index
component

Item
count
CPU

DPS
Network

QPS
Disk
Crawl component

CPU load

Load impact (relative)

Driving factors
Documents per second
Link discovery

Crawl management

Network load
Driving factors
Downloading items from content sources
Passing items on to CPC

Disk load
All documents are temporarily stored in data folder

Item
count
CPU

DPS
Network

QPS
Disk
Content processing component (CPC)

CPU load

Load impact (relative)

Driving factors
Documents per second
Document size and complexity

Feature extraction

Estimate: 5-10 DPS per CPU core

Network load
Driving factors
Documents per second
Document size

Item
count
CPU

DPS
Network

QPS
Disk
Analytics processing component (APC)

CPU load

Load impact (relative)

Driving factors
Number of items
Site activity

Disk load
Local disk used for temporary storage
Bulk load, primacy concern is load isolation

Network load
Same as for CPU load
PLUS: Network traffic increases when distributing APC
across multiple machines

Item
count
CPU

DPS
Network

QPS
Disk
Search administration component

Low CPU and network load

Load impact (relative)

Load increase with more components in the
search topology

Item
count
CPU

DPS
Network

QPS
Disk
Create your SSA
Small Search Topology
Fault tolerant small search topology

Admin

CPC

Crawl

Index

APC

VM
Host

Admin

CPC

Crawl

VM

QPC

Index

APC

VM

QPC

VM
Host
Small search farm (up to 10M items)

Other
SharePoint
applications

Web front
end

Admin

CPC

Index

Crawl

APC

QPC
Scaling from small to medium search topology

Adm

Adm
Extend your SSA
Medium Search Topology
Tweaking Your results
Challenges: Intent

Infrastructure
Project

Where is my talk
Project Plan?

Are Documents held at
the same place?

There is rarely a single right
answer
Different people have different
intents
Query Rules help you handle
intents

I wonder if there are
references from
previous projects?
Configuration in the Conceptual Relevance Flow

Query:
HR Employment
quarterly
report

Search
Web Part

(WORDS
(WORDS
(WORDS
(WORDS

HR, Human Resources) AND
employees, employed) AND
quarterly, quarterlies) AND
report, reports, reported)

Query Processing
Thesaurus: HR  Human Resources
Best bets: HR Employment /HR/employment
Dynamic Reordering Rules:
Quarterly Report 
{prefer docs from http://reports}
Query Rule:
{Terms} Quarterly Report 
{Terms} ContentType=“reports”

Mixed Results for:
• HR Employment best bet
• HR Employment quarterly
report
• HR Employment
ContentType=reports

Engine

Document
Collection
For all queries:
Authorities:

Level 1: http://employment

Ranking model: {incorporate user ratings}
Authorities: SSA-level configuration

Takes ~24hrs to
propagate

Sites that are important

Sites with low intrinsic relevance
Authorities: Connected
Authorities: Connected

Setting an authority affects all sites connected through hyperlinks

Sites are weighted
by distance to
the authority
Query Rules
Tune Search Results
Created at the SSA, Tenant, Site Collection or Site
SSA

Site Collection

Site
Query Rules

Condition
When Do I apply the rule?
Action
What to do when the rule is matched?
Publishing
When should the rule be active?
Query Rules

Exact match, beginning or end
Ad-hoc or term store dictionary
Match a regex (advanced)
Is this query more likely aimed at
the following source…?
 Do people mostly click on result of
the following type…?





 Show a promoted result
 Show a block of results
 Replace the core results

with a different query
Query Builder

Dynamically Ranking Change
Part of the query

Results Ranking
Query Builder
Session Objective and Takeaways

High Availability and Performance

Better Search Quality

Better management

Friendly results and tools
Q&A
Thank You / Merci
www.maadarani.com, mike@maadarani.com , @mikemaadarani
Remember to fill out your evaluation forms to win some
great prizes!

&

Join us for SharePint today!
Date & Time: Nov 23rd, 2013 @6:00 pm
Location:

The Observatory Pub,
Algonquin Student’s Association

Address:

A-170 on Algonquin Campus

Parking:

No need to move your car!*

Site:

http://www.algonquinsa.com/ob.aspx

*Please drive responsibly! We are happy to call you a cab 

Contenu connexe

Tendances

SPUnite17 IT Pros Guide to Managing SharePoint Search
SPUnite17 IT Pros Guide to Managing SharePoint SearchSPUnite17 IT Pros Guide to Managing SharePoint Search
SPUnite17 IT Pros Guide to Managing SharePoint SearchNCCOMMS
 
SharePoint 2013 – the upgrade story
SharePoint 2013 – the upgrade storySharePoint 2013 – the upgrade story
SharePoint 2013 – the upgrade storySPC Adriatics
 
Pharos - Face of the KMC
Pharos - Face of the KMCPharos - Face of the KMC
Pharos - Face of the KMCRajarshi Guha
 
Improve Performance in Fast Search for SharePoint - Comperio
Improve Performance in Fast Search for SharePoint - ComperioImprove Performance in Fast Search for SharePoint - Comperio
Improve Performance in Fast Search for SharePoint - ComperioComperio - Search Matters.
 
Take Cloud Hybrid Search to the Next Level
Take Cloud Hybrid Search to the Next LevelTake Cloud Hybrid Search to the Next Level
Take Cloud Hybrid Search to the Next LevelJeff Fried
 
Understanding and Applying Cloud Hybrid Search
Understanding and Applying Cloud Hybrid SearchUnderstanding and Applying Cloud Hybrid Search
Understanding and Applying Cloud Hybrid SearchJeff Fried
 
SharePoint 2013 'Search': What you need to Know!
SharePoint 2013 'Search': What you need to Know!SharePoint 2013 'Search': What you need to Know!
SharePoint 2013 'Search': What you need to Know!WinWire Technologies Inc
 
Cloud Hybrid Search with SharePoint
Cloud Hybrid Search with SharePointCloud Hybrid Search with SharePoint
Cloud Hybrid Search with SharePointJeff Fried
 
Kql and the content search web part
Kql and the content search web part Kql and the content search web part
Kql and the content search web part InnoTech
 
SharePoint Saturday Perth 2013 - Overview of Search in SharePoint Server 201...
SharePoint Saturday Perth 2013  - Overview of Search in SharePoint Server 201...SharePoint Saturday Perth 2013  - Overview of Search in SharePoint Server 201...
SharePoint Saturday Perth 2013 - Overview of Search in SharePoint Server 201...Sezai Komur
 
Bpc10 119 agnes-molnar_implementingsearch
Bpc10 119 agnes-molnar_implementingsearchBpc10 119 agnes-molnar_implementingsearch
Bpc10 119 agnes-molnar_implementingsearchAgnes Molnar
 
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global Lucidworks
 
Solving Real World Challenges with Enterprise Search
Solving Real World Challenges with Enterprise SearchSolving Real World Challenges with Enterprise Search
Solving Real World Challenges with Enterprise SearchSPC Adriatics
 
Fried sp techcon hybrid search deeper dive
Fried sp techcon hybrid search deeper diveFried sp techcon hybrid search deeper dive
Fried sp techcon hybrid search deeper diveJeff Fried
 
Data Con LA 2018 - Standing on shoulders of giants by Sooraj Akkammadam
Data Con LA 2018 - Standing on shoulders of giants by Sooraj AkkammadamData Con LA 2018 - Standing on shoulders of giants by Sooraj Akkammadam
Data Con LA 2018 - Standing on shoulders of giants by Sooraj AkkammadamData Con LA
 
Ms. Search and Mr. Project
Ms. Search and Mr. ProjectMs. Search and Mr. Project
Ms. Search and Mr. ProjectSPC Adriatics
 

Tendances (20)

SPUnite17 IT Pros Guide to Managing SharePoint Search
SPUnite17 IT Pros Guide to Managing SharePoint SearchSPUnite17 IT Pros Guide to Managing SharePoint Search
SPUnite17 IT Pros Guide to Managing SharePoint Search
 
SharePoint 2013 – the upgrade story
SharePoint 2013 – the upgrade storySharePoint 2013 – the upgrade story
SharePoint 2013 – the upgrade story
 
Pharos - Face of the KMC
Pharos - Face of the KMCPharos - Face of the KMC
Pharos - Face of the KMC
 
Improve Performance in Fast Search for SharePoint - Comperio
Improve Performance in Fast Search for SharePoint - ComperioImprove Performance in Fast Search for SharePoint - Comperio
Improve Performance in Fast Search for SharePoint - Comperio
 
Take Cloud Hybrid Search to the Next Level
Take Cloud Hybrid Search to the Next LevelTake Cloud Hybrid Search to the Next Level
Take Cloud Hybrid Search to the Next Level
 
Understanding and Applying Cloud Hybrid Search
Understanding and Applying Cloud Hybrid SearchUnderstanding and Applying Cloud Hybrid Search
Understanding and Applying Cloud Hybrid Search
 
SharePoint 2013 'Search': What you need to Know!
SharePoint 2013 'Search': What you need to Know!SharePoint 2013 'Search': What you need to Know!
SharePoint 2013 'Search': What you need to Know!
 
Search
SearchSearch
Search
 
Cloud Hybrid Search with SharePoint
Cloud Hybrid Search with SharePointCloud Hybrid Search with SharePoint
Cloud Hybrid Search with SharePoint
 
Kql and the content search web part
Kql and the content search web part Kql and the content search web part
Kql and the content search web part
 
SharePoint Saturday Perth 2013 - Overview of Search in SharePoint Server 201...
SharePoint Saturday Perth 2013  - Overview of Search in SharePoint Server 201...SharePoint Saturday Perth 2013  - Overview of Search in SharePoint Server 201...
SharePoint Saturday Perth 2013 - Overview of Search in SharePoint Server 201...
 
IN106 Performance with MongoDB
IN106 Performance with MongoDBIN106 Performance with MongoDB
IN106 Performance with MongoDB
 
Bpc10 119 agnes-molnar_implementingsearch
Bpc10 119 agnes-molnar_implementingsearchBpc10 119 agnes-molnar_implementingsearch
Bpc10 119 agnes-molnar_implementingsearch
 
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
 
Solving Real World Challenges with Enterprise Search
Solving Real World Challenges with Enterprise SearchSolving Real World Challenges with Enterprise Search
Solving Real World Challenges with Enterprise Search
 
Fried sp techcon hybrid search deeper dive
Fried sp techcon hybrid search deeper diveFried sp techcon hybrid search deeper dive
Fried sp techcon hybrid search deeper dive
 
SPS Toronto 2015
SPS Toronto 2015SPS Toronto 2015
SPS Toronto 2015
 
Mythbusters
MythbustersMythbusters
Mythbusters
 
Data Con LA 2018 - Standing on shoulders of giants by Sooraj Akkammadam
Data Con LA 2018 - Standing on shoulders of giants by Sooraj AkkammadamData Con LA 2018 - Standing on shoulders of giants by Sooraj Akkammadam
Data Con LA 2018 - Standing on shoulders of giants by Sooraj Akkammadam
 
Ms. Search and Mr. Project
Ms. Search and Mr. ProjectMs. Search and Mr. Project
Ms. Search and Mr. Project
 

Similaire à SharePoint Search Topology and Optimization

B365 saturday practical guide to building a scalable search architecture in s...
B365 saturday practical guide to building a scalable search architecture in s...B365 saturday practical guide to building a scalable search architecture in s...
B365 saturday practical guide to building a scalable search architecture in s...Thuan Ng
 
Search Topology and Optimization
Search Topology and OptimizationSearch Topology and Optimization
Search Topology and OptimizationMike Maadarani
 
page ranking web crawling
page ranking web crawlingpage ranking web crawling
page ranking web crawlingpradiprahul
 
Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013Terrence Nguyen
 
SharePoint 2010 Global Deployment
SharePoint 2010 Global DeploymentSharePoint 2010 Global Deployment
SharePoint 2010 Global DeploymentJoel Oleson
 
Deep Dive into SharePoint Topologies and Server Architecture for SharePoint 2013
Deep Dive into SharePoint Topologies and Server Architecture for SharePoint 2013Deep Dive into SharePoint Topologies and Server Architecture for SharePoint 2013
Deep Dive into SharePoint Topologies and Server Architecture for SharePoint 2013K.Mohamed Faizal
 
SPTECHCON - Rev Your Engines - SharePoint 2013 Performance Enhancements
SPTECHCON - Rev Your Engines - SharePoint 2013 Performance EnhancementsSPTECHCON - Rev Your Engines - SharePoint 2013 Performance Enhancements
SPTECHCON - Rev Your Engines - SharePoint 2013 Performance EnhancementsEric Shupps
 
Effective Searching by Dominik Kornas
Effective Searching by Dominik KornasEffective Searching by Dominik Kornas
Effective Searching by Dominik KornasAEM HUB
 
SharePoint 2013 Search Architecture with Russ Houberg
SharePoint 2013  Search Architecture with Russ HoubergSharePoint 2013  Search Architecture with Russ Houberg
SharePoint 2013 Search Architecture with Russ Houbergknowledgelakemarketing
 
Spark Summit EU talk by Bas Geerdink
Spark Summit EU talk by Bas GeerdinkSpark Summit EU talk by Bas Geerdink
Spark Summit EU talk by Bas GeerdinkSpark Summit
 
Highly available and scalable architectures
Highly available and scalable architecturesHighly available and scalable architectures
Highly available and scalable architecturesPhil Wicklund
 
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...EPC Group
 
Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010bgerman
 
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...Databricks
 
Outsourcing your share point hosting the cloud's fine print magnified
Outsourcing your share point hosting   the cloud's fine print magnifiedOutsourcing your share point hosting   the cloud's fine print magnified
Outsourcing your share point hosting the cloud's fine print magnifiedSherWeb
 
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.iosrjce
 
Build Your Own Search Engine
Build Your Own Search EngineBuild Your Own Search Engine
Build Your Own Search Enginegoodfriday
 

Similaire à SharePoint Search Topology and Optimization (20)

B365 saturday practical guide to building a scalable search architecture in s...
B365 saturday practical guide to building a scalable search architecture in s...B365 saturday practical guide to building a scalable search architecture in s...
B365 saturday practical guide to building a scalable search architecture in s...
 
Search Topology and Optimization
Search Topology and OptimizationSearch Topology and Optimization
Search Topology and Optimization
 
page ranking web crawling
page ranking web crawlingpage ranking web crawling
page ranking web crawling
 
PAGE RANKING
PAGE RANKING PAGE RANKING
PAGE RANKING
 
Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013
 
SharePoint 2010 Global Deployment
SharePoint 2010 Global DeploymentSharePoint 2010 Global Deployment
SharePoint 2010 Global Deployment
 
Deep Dive into SharePoint Topologies and Server Architecture for SharePoint 2013
Deep Dive into SharePoint Topologies and Server Architecture for SharePoint 2013Deep Dive into SharePoint Topologies and Server Architecture for SharePoint 2013
Deep Dive into SharePoint Topologies and Server Architecture for SharePoint 2013
 
SPTECHCON - Rev Your Engines - SharePoint 2013 Performance Enhancements
SPTECHCON - Rev Your Engines - SharePoint 2013 Performance EnhancementsSPTECHCON - Rev Your Engines - SharePoint 2013 Performance Enhancements
SPTECHCON - Rev Your Engines - SharePoint 2013 Performance Enhancements
 
Effective Searching by Dominik Kornas
Effective Searching by Dominik KornasEffective Searching by Dominik Kornas
Effective Searching by Dominik Kornas
 
SharePoint 2013 Search Architecture with Russ Houberg
SharePoint 2013  Search Architecture with Russ HoubergSharePoint 2013  Search Architecture with Russ Houberg
SharePoint 2013 Search Architecture with Russ Houberg
 
Spark Summit EU talk by Bas Geerdink
Spark Summit EU talk by Bas GeerdinkSpark Summit EU talk by Bas Geerdink
Spark Summit EU talk by Bas Geerdink
 
Web Servers
Web Servers Web Servers
Web Servers
 
Highly available and scalable architectures
Highly available and scalable architecturesHighly available and scalable architectures
Highly available and scalable architectures
 
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...
 
Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010
 
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
 
Outsourcing your share point hosting the cloud's fine print magnified
Outsourcing your share point hosting   the cloud's fine print magnifiedOutsourcing your share point hosting   the cloud's fine print magnified
Outsourcing your share point hosting the cloud's fine print magnified
 
E017624043
E017624043E017624043
E017624043
 
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
 
Build Your Own Search Engine
Build Your Own Search EngineBuild Your Own Search Engine
Build Your Own Search Engine
 

Plus de Mike Maadarani

Discover private channels in microsoft teams mike maadarani
Discover private channels in microsoft teams   mike maadaraniDiscover private channels in microsoft teams   mike maadarani
Discover private channels in microsoft teams mike maadaraniMike Maadarani
 
Increase your Collaboration with Azure Automation
Increase your Collaboration with Azure AutomationIncrease your Collaboration with Azure Automation
Increase your Collaboration with Azure AutomationMike Maadarani
 
Increase your collaboration with Azure Automation
Increase your collaboration with Azure AutomationIncrease your collaboration with Azure Automation
Increase your collaboration with Azure AutomationMike Maadarani
 
Tips in migrating to SharePoint 2016 or O365, to avoid a migration headache
Tips in migrating to SharePoint 2016 or O365, to avoid a migration headacheTips in migrating to SharePoint 2016 or O365, to avoid a migration headache
Tips in migrating to SharePoint 2016 or O365, to avoid a migration headacheMike Maadarani
 
aOS Canadian Tour Share point migration tips
aOS Canadian Tour Share point migration tipsaOS Canadian Tour Share point migration tips
aOS Canadian Tour Share point migration tipsMike Maadarani
 
What's new in sharepoint 2016
What's new in sharepoint 2016What's new in sharepoint 2016
What's new in sharepoint 2016Mike Maadarani
 
SharePoint 2016 Search
SharePoint 2016 SearchSharePoint 2016 Search
SharePoint 2016 SearchMike Maadarani
 
SPS Reston SharePoint Alwyays On
SPS Reston SharePoint Alwyays OnSPS Reston SharePoint Alwyays On
SPS Reston SharePoint Alwyays OnMike Maadarani
 
SPS Chevy Chase Tips on migrating to Office 365
SPS Chevy Chase Tips on migrating to Office 365SPS Chevy Chase Tips on migrating to Office 365
SPS Chevy Chase Tips on migrating to Office 365Mike Maadarani
 
Tricks and Tips in Migrating to Office 365 and On-Premises to acoid migration...
Tricks and Tips in Migrating to Office 365 and On-Premises to acoid migration...Tricks and Tips in Migrating to Office 365 and On-Premises to acoid migration...
Tricks and Tips in Migrating to Office 365 and On-Premises to acoid migration...Mike Maadarani
 
SharePoint Tips and Tricks to avoid migration headaches
SharePoint Tips and Tricks to avoid migration headachesSharePoint Tips and Tricks to avoid migration headaches
SharePoint Tips and Tricks to avoid migration headachesMike Maadarani
 
SharePoint Always-On Deployment
SharePoint Always-On DeploymentSharePoint Always-On Deployment
SharePoint Always-On DeploymentMike Maadarani
 

Plus de Mike Maadarani (12)

Discover private channels in microsoft teams mike maadarani
Discover private channels in microsoft teams   mike maadaraniDiscover private channels in microsoft teams   mike maadarani
Discover private channels in microsoft teams mike maadarani
 
Increase your Collaboration with Azure Automation
Increase your Collaboration with Azure AutomationIncrease your Collaboration with Azure Automation
Increase your Collaboration with Azure Automation
 
Increase your collaboration with Azure Automation
Increase your collaboration with Azure AutomationIncrease your collaboration with Azure Automation
Increase your collaboration with Azure Automation
 
Tips in migrating to SharePoint 2016 or O365, to avoid a migration headache
Tips in migrating to SharePoint 2016 or O365, to avoid a migration headacheTips in migrating to SharePoint 2016 or O365, to avoid a migration headache
Tips in migrating to SharePoint 2016 or O365, to avoid a migration headache
 
aOS Canadian Tour Share point migration tips
aOS Canadian Tour Share point migration tipsaOS Canadian Tour Share point migration tips
aOS Canadian Tour Share point migration tips
 
What's new in sharepoint 2016
What's new in sharepoint 2016What's new in sharepoint 2016
What's new in sharepoint 2016
 
SharePoint 2016 Search
SharePoint 2016 SearchSharePoint 2016 Search
SharePoint 2016 Search
 
SPS Reston SharePoint Alwyays On
SPS Reston SharePoint Alwyays OnSPS Reston SharePoint Alwyays On
SPS Reston SharePoint Alwyays On
 
SPS Chevy Chase Tips on migrating to Office 365
SPS Chevy Chase Tips on migrating to Office 365SPS Chevy Chase Tips on migrating to Office 365
SPS Chevy Chase Tips on migrating to Office 365
 
Tricks and Tips in Migrating to Office 365 and On-Premises to acoid migration...
Tricks and Tips in Migrating to Office 365 and On-Premises to acoid migration...Tricks and Tips in Migrating to Office 365 and On-Premises to acoid migration...
Tricks and Tips in Migrating to Office 365 and On-Premises to acoid migration...
 
SharePoint Tips and Tricks to avoid migration headaches
SharePoint Tips and Tricks to avoid migration headachesSharePoint Tips and Tricks to avoid migration headaches
SharePoint Tips and Tricks to avoid migration headaches
 
SharePoint Always-On Deployment
SharePoint Always-On DeploymentSharePoint Always-On Deployment
SharePoint Always-On Deployment
 

Dernier

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 

Dernier (20)

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 

SharePoint Search Topology and Optimization

  • 1. November 23rd, 2013 Search Topology and Optimization Mike Maadarani SharePoint Architect mike@maadarani.com
  • 2. Thank you to all of our Sponsors!!
  • 3. Bio.. Mike Maadarani App Dev and Architecture for over 18 years (15 Years Microsoft, 3 Years with the “Other Guys”) Business focused on Enterprise Content Management & Publishing Sites Technology focused on SharePoint, SQL Server and SharePoint Integration Architect, trainer, and presenter Blog: www.maadarani.com mike@maadarani.com; @mikemaadarani
  • 4. Architecture and Resource Utilization Configuring SSA and PS Agenda SharePoint 2013 Search Overview Topology Scenarios Relevancy, Query Builder, & Optimization Closing and Q&A
  • 5. Search in 2010 SharePoint 2010 Search Service Application Query Component Query Engine Property Store (SQL) Crawl Component
  • 6. FAST Search for SharePoint 2010 FAST Content SSA FAST back-end components (managed separately) Extensibility: • Sandbox • Entity Extraction FAST Query SSA
  • 7. … In SharePoint 2013 SharePoint 2013 Search Service Application Extensibility: • Web callout • Entity Extraction Crawl Component Crawl Index Component Query Engine Content Processing Component Content Pipeline Analytics Processing Component Separate crawl and indexing Query Pipeline Query Processing Component Property Store (SQL) Indexing Engine Admin Component Analysis Engine Search Admin Link/query analysis & recommendations Entire index on local disk
  • 8. SharePoint 2013 Search Architecture Public API Search topology components HTTP File shares SharePoint User profiles Lotus Notes Documentum Exchange folders Custom - BCS SharePoint SP Apps Devices Non-SP UX
  • 9. Why Search is so important? FAST I just uploaded a document. Make it searchable, quick!
  • 10. Why Search is so important? EASY
  • 11. Why Search is so important? EASY
  • 12. Why Search is so important? Search Driven Applications
  • 13. Why Search is so important? Search Everything I can find ALL of Rob Ford’s hidden videos!
  • 14. Where does Search live in the farm? Windows services Processes SharePoint Search Host Controller service Noderunner.exe Runtime environment for search components (except crawler) Runtime/lifecycle control of search components (except crawler)  hostcontrollerservice.exe SharePoint Server Search service SharePoint App Server Crawl Component  mssearch.exe  mssdmn.exe Still there, but only Crawl Component Search Service Instance: Provisioning of the search service on each box Search Service Application: SharePoint Configuration entity hostcontrollerservice.exe Host Controller Search Runtime Environment msseearch.exe mssdmn.exe Crawl Component noderunner.exe noderunner.exe noderunner.exe noderunner.exe noderunner.exe Admin Component Query Processing Component Content Processing Component Index Component Analytics Processing Component
  • 15. Where do I host my components?
  • 16. Query processing component (QPC) CPU load Load impact (relative) Driving factors QPS Query transformations Network load Driving factors Number of index partitions Size of queries and results Example: 20 index partitions @ 20 qps => 200/100 Mbit/s in/outbound Item count CPU DPS Network QPS Disk
  • 17. Index component CPU load Load impact (relative) Driving factors QPS and item count Guidelines per index component @ 2 GHz CPU 1M items: 5 QPS per CPU core 5M items: 2 QPS per CPU core 10M items: 1 QPS per CPU core Disk load Driving factors QPS and item count New content invalidates caches Disk size: 500GB @ 10M items per index component Item count CPU DPS Network QPS Disk
  • 18. Crawl component CPU load Load impact (relative) Driving factors Documents per second Link discovery Crawl management Network load Driving factors Downloading items from content sources Passing items on to CPC Disk load All documents are temporarily stored in data folder Item count CPU DPS Network QPS Disk
  • 19. Content processing component (CPC) CPU load Load impact (relative) Driving factors Documents per second Document size and complexity Feature extraction Estimate: 5-10 DPS per CPU core Network load Driving factors Documents per second Document size Item count CPU DPS Network QPS Disk
  • 20. Analytics processing component (APC) CPU load Load impact (relative) Driving factors Number of items Site activity Disk load Local disk used for temporary storage Bulk load, primacy concern is load isolation Network load Same as for CPU load PLUS: Network traffic increases when distributing APC across multiple machines Item count CPU DPS Network QPS Disk
  • 21. Search administration component Low CPU and network load Load impact (relative) Load increase with more components in the search topology Item count CPU DPS Network QPS Disk
  • 24. Fault tolerant small search topology Admin CPC Crawl Index APC VM Host Admin CPC Crawl VM QPC Index APC VM QPC VM Host
  • 25. Small search farm (up to 10M items) Other SharePoint applications Web front end Admin CPC Index Crawl APC QPC
  • 26. Scaling from small to medium search topology Adm Adm
  • 30. Challenges: Intent Infrastructure Project Where is my talk Project Plan? Are Documents held at the same place? There is rarely a single right answer Different people have different intents Query Rules help you handle intents I wonder if there are references from previous projects?
  • 31. Configuration in the Conceptual Relevance Flow Query: HR Employment quarterly report Search Web Part (WORDS (WORDS (WORDS (WORDS HR, Human Resources) AND employees, employed) AND quarterly, quarterlies) AND report, reports, reported) Query Processing Thesaurus: HR  Human Resources Best bets: HR Employment /HR/employment Dynamic Reordering Rules: Quarterly Report  {prefer docs from http://reports} Query Rule: {Terms} Quarterly Report  {Terms} ContentType=“reports” Mixed Results for: • HR Employment best bet • HR Employment quarterly report • HR Employment ContentType=reports Engine Document Collection For all queries: Authorities: Level 1: http://employment Ranking model: {incorporate user ratings}
  • 32. Authorities: SSA-level configuration Takes ~24hrs to propagate Sites that are important Sites with low intrinsic relevance
  • 34. Authorities: Connected Setting an authority affects all sites connected through hyperlinks Sites are weighted by distance to the authority
  • 35. Query Rules Tune Search Results Created at the SSA, Tenant, Site Collection or Site SSA Site Collection Site
  • 36. Query Rules Condition When Do I apply the rule? Action What to do when the rule is matched? Publishing When should the rule be active?
  • 37. Query Rules Exact match, beginning or end Ad-hoc or term store dictionary Match a regex (advanced) Is this query more likely aimed at the following source…?  Do people mostly click on result of the following type…?      Show a promoted result  Show a block of results  Replace the core results with a different query
  • 38. Query Builder Dynamically Ranking Change Part of the query Results Ranking
  • 40. Session Objective and Takeaways High Availability and Performance Better Search Quality Better management Friendly results and tools
  • 41. Q&A Thank You / Merci www.maadarani.com, mike@maadarani.com , @mikemaadarani
  • 42. Remember to fill out your evaluation forms to win some great prizes! & Join us for SharePint today! Date & Time: Nov 23rd, 2013 @6:00 pm Location: The Observatory Pub, Algonquin Student’s Association Address: A-170 on Algonquin Campus Parking: No need to move your car!* Site: http://www.algonquinsa.com/ob.aspx *Please drive responsibly! We are happy to call you a cab 