SlideShare une entreprise Scribd logo
1  sur  15
Optimizing Solr for Performance  & User Experience SFBay Apache Lucene/Solr Meetup - Jan 2010                        Brian Doll :@briandoll
[object Object]
12 year old online business
600k+ SKUs
1.5M Songs
Faceted navigation with Solr
On-site search with Solr
Lots of needles in a huge haystackSFBay Apache Lucene/Solr Meetup - Jan 2010                        Brian Doll : @briandoll
[object Object]
2k app requests per minute
MySQL database
5 memcache instancesSFBay Apache Lucene/Solr Meetup - Jan 2010                        Brian Doll :@briandoll
[object Object]
~100ms average response time
3 Solr hosts on VMWare

Contenu connexe

Plus de Brian Doll

The Ultimate Productivity Hack, Having Kids
The Ultimate Productivity Hack, Having KidsThe Ultimate Productivity Hack, Having Kids
The Ultimate Productivity Hack, Having KidsBrian Doll
 
The Performance and Scalability Mindset
The Performance and Scalability MindsetThe Performance and Scalability Mindset
The Performance and Scalability MindsetBrian Doll
 
Finding and fixing top performance issues with new relic rpm
Finding and fixing top performance issues with new relic rpmFinding and fixing top performance issues with new relic rpm
Finding and fixing top performance issues with new relic rpmBrian Doll
 
New Relic .NET Agent Overview
New Relic .NET Agent OverviewNew Relic .NET Agent Overview
New Relic .NET Agent OverviewBrian Doll
 
New Relic: Optimizing The Database SQL and NoSQL Alike
New Relic: Optimizing The Database SQL and NoSQL AlikeNew Relic: Optimizing The Database SQL and NoSQL Alike
New Relic: Optimizing The Database SQL and NoSQL AlikeBrian Doll
 
Rit. the anti-cms
Rit. the anti-cmsRit. the anti-cms
Rit. the anti-cmsBrian Doll
 
What should we work on next?
What should we work on next?What should we work on next?
What should we work on next?Brian Doll
 
Get Fit to Win
Get Fit to WinGet Fit to Win
Get Fit to WinBrian Doll
 
Optimizing Your Online Store for the Holidays
Optimizing Your Online Store for the HolidaysOptimizing Your Online Store for the Holidays
Optimizing Your Online Store for the HolidaysBrian Doll
 

Plus de Brian Doll (9)

The Ultimate Productivity Hack, Having Kids
The Ultimate Productivity Hack, Having KidsThe Ultimate Productivity Hack, Having Kids
The Ultimate Productivity Hack, Having Kids
 
The Performance and Scalability Mindset
The Performance and Scalability MindsetThe Performance and Scalability Mindset
The Performance and Scalability Mindset
 
Finding and fixing top performance issues with new relic rpm
Finding and fixing top performance issues with new relic rpmFinding and fixing top performance issues with new relic rpm
Finding and fixing top performance issues with new relic rpm
 
New Relic .NET Agent Overview
New Relic .NET Agent OverviewNew Relic .NET Agent Overview
New Relic .NET Agent Overview
 
New Relic: Optimizing The Database SQL and NoSQL Alike
New Relic: Optimizing The Database SQL and NoSQL AlikeNew Relic: Optimizing The Database SQL and NoSQL Alike
New Relic: Optimizing The Database SQL and NoSQL Alike
 
Rit. the anti-cms
Rit. the anti-cmsRit. the anti-cms
Rit. the anti-cms
 
What should we work on next?
What should we work on next?What should we work on next?
What should we work on next?
 
Get Fit to Win
Get Fit to WinGet Fit to Win
Get Fit to Win
 
Optimizing Your Online Store for the Holidays
Optimizing Your Online Store for the HolidaysOptimizing Your Online Store for the Holidays
Optimizing Your Online Store for the Holidays
 

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 

Dernier (20)

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 

Optimizing Solr for Performance & User Experience

  • 1. Optimizing Solr for Performance & User Experience SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 2.
  • 3. 12 year old online business
  • 8. Lots of needles in a huge haystackSFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll : @briandoll
  • 9.
  • 10. 2k app requests per minute
  • 12. 5 memcache instancesSFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 13.
  • 15. 3 Solr hosts on VMWare
  • 16. 50% CPU load, 2GB memory each
  • 17. 1.00 Apdex ScoreSFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 18. We love that the structure of your data is part of your Solr environment (schema.xml) And the way you query against that data is part of your app SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 19. on-site search is not as varied as we had thought SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 20. Surprisingly, lots of people search like this: Piano SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 21. What if we applied a strategy pattern to our search queries? SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 22. “Instrument” searches could get their own strategy. “Artist” searches could get their own. … SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 23. Make sure your default search is working well first! Subtle changes to our algorithm provided a 32% increase in per-search value, with some phrases seeing an increase of over 3000%! SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 24. We construct various strategies: ... DEFAULT_SEARCH_HASH = { :id_codes => { :search_type => :phrase, :boost => 1024, :slop => 2 }, :bucket_proper_names => { :search_type => :phrase, :boost => 500, :slop => 2 }, :bucket_titles => { :search_type => :phrase, :boost => 450, :slop => 100 }, :bucket_songs => { :search_type => :phrase, :boost => 64, :slop => 5 }, :bucket_primary_cats => { :search_type => :phrase, :boost => 32, :slop => 100}, :bucket_secondary_cats => { :search_type => :phrase, :boost => 16, :slop => 100 }, :bucket_binding => { :search_type => :phrase, :boost => 2, :slop => 100}, :bucket_lead_time => { :search_type => :phrase, :boost => 1, :slop => 100}, :bucket_blurbs => { :search_type => :phrase, :boost => 0.05, :slop => 7 }, :text => { :search_type => :term, :boost => 0.005 } } ... def self.popular_instrument_strategy(search_phrase) params = {:instrument_text => {:search_type => :phrase, :boost => 600, :slop => 2 } } Strategy.new("default-instrument", search_phrase, params , :top_selling) end ... SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 25. And then we have an algorithm to help determine which strategy we should apply to an incoming search request: [redacted] SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 26. Implementing these strategies provided an 86% increase in per-search value. We can now fine-tune individual categories of searches, as well as specific phrases. SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 27. Rule of thumb: Don't try to please everybody. Please the people who make you money. Tuning your search algorithm is an endless game, but if you focus on pleasing customer X, it'll cost you. SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll
  • 28. In order to make all this happen, we relied heavily on two tools: SFBay Apache Lucene/Solr Meetup - Jan 2010 Brian Doll :@briandoll