SlideShare une entreprise Scribd logo
1  sur  16
Télécharger pour lire hors ligne
Using LWE/Solr/Lucene for eCom

                 Grant Ingersoll, Lucid Imagination
                            @gsingers




  Apache Solr and Lucene and their logos are trademarks of the Apache Software Foundation
Difference Makers
§  Case Study 1: Relevance Matters
   •  Large Electronics Manufacturer
   •  Top selling product on page 10 for a search by
      product name
§  Case Study 2: Don’t Overthink it
   •  Large Online Retailer
   •  Simply adding auto-suggest added millions to
      bottom line at very little cost
§  Case Study 3: Test, Test, Test
   •  Amazon Recommendation System
   •  http://glinden.blogspot.com/2006/04/early-
      amazon-shopping-cart.html
                                                       3
Topics
§  The Stack
§  Knowing Users
§  Search Features in LWE/
    Solr/Lucene
  §  Minimum Features for
      eCom
  §  Extended Features
§  Not Just Search
§  What’s Missing?
§  What’s Next?

                              4
eCom Stack Choices

              Lucid Works Enterprise
              •  Power of Apache Solr
              •  Easy Admin, Click Scoring, Reports

              Apache Solr
              •  Faceting, Easy Config
              •  Scale

              Apache Lucene
              •  Powerful Search Library
              •  Total Flexibility

Apache Solr and Lucene and their logos are trademarks of the Apache Software Foundation   5
Users: Get to Know Them!
§  Audience Poll:
   •  How many of you are developers?
   •  How many of the developers know what the top
      10 queries are on your site?
   •  How many of the non-developers know?


§  Your users represent 100% of your opportunity
    to sell your products ;-)
   •  Shouldn’t you know what they are searching for?



                                                        6
Search Analytics
§  “If you can’t measure it, you can’t manage it”
   •  Attributed to Peter Drucker, however, see *


§  Ultimately, it’s all about conversion
   •  May not be the best measure for judging search


§  Is there One Right Answer on your Site or
    Multiple?
   •  Known Item search vs Keyword/Category

   *http://edkless.com/2009/06/peter-drucker-and-time-sheets/
                                                                7
Useful Metrics
•  Mean Reciprocal Rank or Precision @ 10
   §  Known Item vs. Keyword/Category
•  “Show me the money” -- Top Product Analysis
   §  Identity Search - If your top product is named X and
       someone searches for X, is X on the first page? Is it
       number 1?
   §  Is a top product underperforming as it relates to search?
•  Top X Queries and Query Terms
•  Zero Results and % of Zero Results
•  Avg. # of facets/filters/spellchecks clicked per
   session
•  Avg # of searches per user session
•  Auto-suggest usage
                                                                   8
Minimum Search Features
§  High Quality
    Relevance for
    keyword and known
    item search
  •  P@10 or MRR close
     to 1
§  Sub-second
    response time under
    load
§  All achievable in
    LWE/Solr/Lucene
                               9
Faceting
§  LWE/Solr support faceting by:
   •    Field
   •    Date/Number Ranges
   •    Pivot (“what if” faceting)
   •    Hierarchical
         §  (via domain modeling)
   •  Dynamic (via Carrot^2)
§  Single and multi-select faceting supported
§  Facet by Function In Development
   •  https://issues.apache.org/jira/browse/SOLR-1581
§  http://wiki.apache.org/solr/SimpleFacetParameters
                                                        10
More Features
§  Extensible Language Analysis
  •    Multilingual Support
  •    Synonyms
  •    Overrides on a per-word basis
  •    Pluggable Framework


§  Frequent/Incremental Updates
  •  How often do you update your index?
  •  Near Real Time (IndexReader.open() )
  •  Column Stride Fields (4.0)


                                            11
Relevance Controls
§  Function Queries
   •    Ratings/Reviews
   •    Margin/Inventory/Price/Location
   •    Can Sort by Functions
   •    …/solr/browse?q=ipod&bf=price
§  Editorial Controls (QueryElevationComponent)
   •  Fine grained controls
   •  …/solr/elevate?q=YYYY&enableElevation=true
§  Landing Pages (if done in search…)
   •  Implement: Docs with field that is filtered on or a separate
      index/core
   •  Editorial Controls
§  Click Scoring (LWE only)
   •  Popularity based ranking
                                                                     12
Beyond the Search Box
§  Many eCom sites actually power all navigation
    by the search engine

§  Many other tools in the Stack to help users
    discover content
   •    Auto Suggest
   •    Spell Checking
   •    More Like This
   •    Spatial



                                                    13
Complementary Tools
§  Apache Mahout
   •  Recommendation Systems
      §  Crude Solr/Mahout Rec Integration at
          https://github.com/gsingers/ApacheCon2010
   •  Classifiers/Clustering
      §  User Analysis, Content Analysis, etc.
§  Social
   •  BazaarVoice, etc.
§  Business Rules Engine
   •  Drools or others



                                                      14
What’s Missing?
§  UI Controls for non-devs:
   •    Synonyms (LWE has UI/REST support)
   •    Facets (Field support in LWE)
   •    Relevance Control (LWE REST API Support)
   •    Business Rules Integration
   •    Deeper Taxonomy Support
   •    More performance reports (LWE has some)
§  Facet Management tools
   •  Labels
   •  Sort order other than Count or Alphabetical
   •  Editorial facet control

                                                    15
What’s Next?
§  Some sample code and more discussion at
    http://www.lucidimagination.com/blog/
    2011/01/25/implementing-the-ecommerce-
    checklist-with-apache-solr-and-lucidworks/




                                                 16
Resources
§  Principles for
    Effective Search in E- §  grant@lucidimagination.com
    Commerce Design
   •  http://lucene.li/2T   §  @gsingers

§  http://
    www.lucidimagination
    .com/search/?
    q=ecommerce

          http://www.lucidimagination.com

                                                        17

Contenu connexe

En vedette

A haiti
A haitiA haiti
A haiti
tanica
 
презентация по книге дуг де карло "экстримальное управление проектами"
презентация по книге дуг де карло "экстримальное управление проектами"презентация по книге дуг де карло "экстримальное управление проектами"
презентация по книге дуг де карло "экстримальное управление проектами"
tarodnova
 
Artist Update8 11
Artist Update8 11Artist Update8 11
Artist Update8 11
LaRue
 
Lucene rev preso busch realtime search lr1010
Lucene rev preso busch realtime search lr1010Lucene rev preso busch realtime search lr1010
Lucene rev preso busch realtime search lr1010
Lucidworks (Archived)
 
Spanish bombss
Spanish bombssSpanish bombss
Spanish bombss
tanica
 
Civil War
Civil WarCivil War
Civil War
tanica
 
IAMAS 2010 First presentation
IAMAS 2010 First presentationIAMAS 2010 First presentation
IAMAS 2010 First presentation
ocrock
 

En vedette (20)

Mains aux fleurs
Mains aux fleursMains aux fleurs
Mains aux fleurs
 
A haiti
A haitiA haiti
A haiti
 
Van gogh
Van goghVan gogh
Van gogh
 
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
 SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
 
презентация по книге дуг де карло "экстримальное управление проектами"
презентация по книге дуг де карло "экстримальное управление проектами"презентация по книге дуг де карло "экстримальное управление проектами"
презентация по книге дуг де карло "экстримальное управление проектами"
 
Artist Update8 11
Artist Update8 11Artist Update8 11
Artist Update8 11
 
How The Guardian Embraced the Internet using Content, Search, and Open Source
How The Guardian Embraced the Internet using Content, Search, and Open SourceHow The Guardian Embraced the Internet using Content, Search, and Open Source
How The Guardian Embraced the Internet using Content, Search, and Open Source
 
Davis mark advanced search analytics in 20 minutes
Davis mark   advanced search analytics in 20 minutesDavis mark   advanced search analytics in 20 minutes
Davis mark advanced search analytics in 20 minutes
 
Tate Tyler - Designing the Search Experience
Tate Tyler - Designing the Search ExperienceTate Tyler - Designing the Search Experience
Tate Tyler - Designing the Search Experience
 
Understanding Lucene Search Performance
Understanding Lucene Search PerformanceUnderstanding Lucene Search Performance
Understanding Lucene Search Performance
 
Ecma 262 5th Edition を読む #5 第9条
Ecma 262 5th Edition を読む #5 第9条Ecma 262 5th Edition を読む #5 第9条
Ecma 262 5th Edition を読む #5 第9条
 
第4回「ブラウザー勉強会」オープニング トーク
第4回「ブラウザー勉強会」オープニング トーク第4回「ブラウザー勉強会」オープニング トーク
第4回「ブラウザー勉強会」オープニング トーク
 
Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks
Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks
Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks
 
Lucene rev preso busch realtime search lr1010
Lucene rev preso busch realtime search lr1010Lucene rev preso busch realtime search lr1010
Lucene rev preso busch realtime search lr1010
 
"Search, APIs,Capability Management and the Sensis Journey"
"Search, APIs,Capability Management and the Sensis Journey""Search, APIs,Capability Management and the Sensis Journey"
"Search, APIs,Capability Management and the Sensis Journey"
 
Spanish bombss
Spanish bombssSpanish bombss
Spanish bombss
 
Network Forensics Puzzle Contest に挑戦 #1
Network Forensics Puzzle Contest に挑戦 #1Network Forensics Puzzle Contest に挑戦 #1
Network Forensics Puzzle Contest に挑戦 #1
 
Civil War
Civil WarCivil War
Civil War
 
IAMAS 2010 First presentation
IAMAS 2010 First presentationIAMAS 2010 First presentation
IAMAS 2010 First presentation
 
E learning At The Library
E learning At The LibraryE learning At The Library
E learning At The Library
 

Similaire à Using LWE/Solr/Lucene for eCom

SPLive Orlando - 10 Things I Like in SharePoint 2013 Search
SPLive Orlando - 10 Things I Like in SharePoint 2013 SearchSPLive Orlando - 10 Things I Like in SharePoint 2013 Search
SPLive Orlando - 10 Things I Like in SharePoint 2013 Search
Agnes Molnar
 
SPConnections - Search Administration in SharePoint 2013
SPConnections - Search Administration in SharePoint 2013SPConnections - Search Administration in SharePoint 2013
SPConnections - Search Administration in SharePoint 2013
Agnes Molnar
 
SPConnections - What's new in SharePoint 2013 Search
SPConnections - What's new in SharePoint 2013 SearchSPConnections - What's new in SharePoint 2013 Search
SPConnections - What's new in SharePoint 2013 Search
Agnes Molnar
 
Webinar: Inside Apache Solr 5
Webinar: Inside Apache Solr 5 Webinar: Inside Apache Solr 5
Webinar: Inside Apache Solr 5
Lucidworks
 

Similaire à Using LWE/Solr/Lucene for eCom (20)

Apache Solr Search Course Drupal 7 Acquia
Apache Solr Search Course Drupal 7 AcquiaApache Solr Search Course Drupal 7 Acquia
Apache Solr Search Course Drupal 7 Acquia
 
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...
 
EnterpriseSearch
EnterpriseSearchEnterpriseSearch
EnterpriseSearch
 
Solr 101
Solr 101Solr 101
Solr 101
 
Solr: 4 big features
Solr: 4 big featuresSolr: 4 big features
Solr: 4 big features
 
SPLive Orlando - 10 Things I Like in SharePoint 2013 Search
SPLive Orlando - 10 Things I Like in SharePoint 2013 SearchSPLive Orlando - 10 Things I Like in SharePoint 2013 Search
SPLive Orlando - 10 Things I Like in SharePoint 2013 Search
 
Webinar: Personalized Retail Search & Recommendations with Fusion
Webinar: Personalized Retail Search & Recommendations with FusionWebinar: Personalized Retail Search & Recommendations with Fusion
Webinar: Personalized Retail Search & Recommendations with Fusion
 
SPConnections - Search Administration in SharePoint 2013
SPConnections - Search Administration in SharePoint 2013SPConnections - Search Administration in SharePoint 2013
SPConnections - Search Administration in SharePoint 2013
 
Rapid Prototyping with Solr
Rapid Prototyping with SolrRapid Prototyping with Solr
Rapid Prototyping with Solr
 
Rapid prototyping with solr - By Erik Hatcher
Rapid prototyping with solr -  By Erik Hatcher Rapid prototyping with solr -  By Erik Hatcher
Rapid prototyping with solr - By Erik Hatcher
 
Usability and Salesforce - Dallas Salesforce.com User Group September 2011
Usability and Salesforce - Dallas Salesforce.com User Group September 2011Usability and Salesforce - Dallas Salesforce.com User Group September 2011
Usability and Salesforce - Dallas Salesforce.com User Group September 2011
 
Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...
Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...
Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...
 
Solr site search makes shopping simple
Solr site search makes shopping simpleSolr site search makes shopping simple
Solr site search makes shopping simple
 
Inside Solr 5 - Bangalore Solr/Lucene Meetup
Inside Solr 5 - Bangalore Solr/Lucene MeetupInside Solr 5 - Bangalore Solr/Lucene Meetup
Inside Solr 5 - Bangalore Solr/Lucene Meetup
 
Designing for the Salesforce Platform
Designing for the Salesforce Platform Designing for the Salesforce Platform
Designing for the Salesforce Platform
 
SPConnections - What's new in SharePoint 2013 Search
SPConnections - What's new in SharePoint 2013 SearchSPConnections - What's new in SharePoint 2013 Search
SPConnections - What's new in SharePoint 2013 Search
 
API Design Tour: Digital River
API Design Tour: Digital RiverAPI Design Tour: Digital River
API Design Tour: Digital River
 
Webinar: Inside Apache Solr 5
Webinar: Inside Apache Solr 5 Webinar: Inside Apache Solr 5
Webinar: Inside Apache Solr 5
 
EVOLVE'16 | Enhance | Gordon Pike | Rev Up Your Marketing Engine
EVOLVE'16 | Enhance | Gordon Pike | Rev Up Your Marketing EngineEVOLVE'16 | Enhance | Gordon Pike | Rev Up Your Marketing Engine
EVOLVE'16 | Enhance | Gordon Pike | Rev Up Your Marketing Engine
 
Webinar: Fusion 3.1 - What's New
Webinar: Fusion 3.1 - What's NewWebinar: Fusion 3.1 - What's New
Webinar: Fusion 3.1 - What's New
 

Plus de Lucidworks (Archived)

Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search EngineChicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Lucidworks (Archived)
 
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
Chicago Solr Meetup - June 10th: Exploring Hadoop with SearchChicago Solr Meetup - June 10th: Exploring Hadoop with Search
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
Lucidworks (Archived)
 
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
Minneapolis Solr Meetup - May 28, 2014: Target.com SearchMinneapolis Solr Meetup - May 28, 2014: Target.com Search
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
Lucidworks (Archived)
 
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Lucidworks (Archived)
 
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Lucidworks (Archived)
 
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
Lucidworks (Archived)
 
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DCBig Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Lucidworks (Archived)
 
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DCWhat's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
Lucidworks (Archived)
 
Solr At AOL, Presented by Sean Timm at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DCSolr At AOL, Presented by Sean Timm at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DC
Lucidworks (Archived)
 
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DCIntro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Lucidworks (Archived)
 
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DCTest Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Lucidworks (Archived)
 
Introducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinarIntroducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinar
Lucidworks (Archived)
 

Plus de Lucidworks (Archived) (20)

Integrating Hadoop & Solr
Integrating Hadoop & SolrIntegrating Hadoop & Solr
Integrating Hadoop & Solr
 
The Data-Driven Paradigm
The Data-Driven ParadigmThe Data-Driven Paradigm
The Data-Driven Paradigm
 
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
 
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for BusinessSFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
 
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
SFBay Area Solr Meetup - June 18th: Benchmarking Solr PerformanceSFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
 
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search EngineChicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
 
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
Chicago Solr Meetup - June 10th: Exploring Hadoop with SearchChicago Solr Meetup - June 10th: Exploring Hadoop with Search
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
 
What's new in solr june 2014
What's new in solr june 2014What's new in solr june 2014
What's new in solr june 2014
 
Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr
Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache SolrMinneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr
Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr
 
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
Minneapolis Solr Meetup - May 28, 2014: Target.com SearchMinneapolis Solr Meetup - May 28, 2014: Target.com Search
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
 
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
 
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
 
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
 
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DCBig Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
 
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DCWhat's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
 
Solr At AOL, Presented by Sean Timm at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DCSolr At AOL, Presented by Sean Timm at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DC
 
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DCIntro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
 
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DCTest Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
 
Building a data driven search application with LucidWorks SiLK
Building a data driven search application with LucidWorks SiLKBuilding a data driven search application with LucidWorks SiLK
Building a data driven search application with LucidWorks SiLK
 
Introducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinarIntroducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinar
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Dernier (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 

Using LWE/Solr/Lucene for eCom

  • 1. Using LWE/Solr/Lucene for eCom Grant Ingersoll, Lucid Imagination @gsingers Apache Solr and Lucene and their logos are trademarks of the Apache Software Foundation
  • 2. Difference Makers §  Case Study 1: Relevance Matters •  Large Electronics Manufacturer •  Top selling product on page 10 for a search by product name §  Case Study 2: Don’t Overthink it •  Large Online Retailer •  Simply adding auto-suggest added millions to bottom line at very little cost §  Case Study 3: Test, Test, Test •  Amazon Recommendation System •  http://glinden.blogspot.com/2006/04/early- amazon-shopping-cart.html 3
  • 3. Topics §  The Stack §  Knowing Users §  Search Features in LWE/ Solr/Lucene §  Minimum Features for eCom §  Extended Features §  Not Just Search §  What’s Missing? §  What’s Next? 4
  • 4. eCom Stack Choices Lucid Works Enterprise •  Power of Apache Solr •  Easy Admin, Click Scoring, Reports Apache Solr •  Faceting, Easy Config •  Scale Apache Lucene •  Powerful Search Library •  Total Flexibility Apache Solr and Lucene and their logos are trademarks of the Apache Software Foundation 5
  • 5. Users: Get to Know Them! §  Audience Poll: •  How many of you are developers? •  How many of the developers know what the top 10 queries are on your site? •  How many of the non-developers know? §  Your users represent 100% of your opportunity to sell your products ;-) •  Shouldn’t you know what they are searching for? 6
  • 6. Search Analytics §  “If you can’t measure it, you can’t manage it” •  Attributed to Peter Drucker, however, see * §  Ultimately, it’s all about conversion •  May not be the best measure for judging search §  Is there One Right Answer on your Site or Multiple? •  Known Item search vs Keyword/Category *http://edkless.com/2009/06/peter-drucker-and-time-sheets/ 7
  • 7. Useful Metrics •  Mean Reciprocal Rank or Precision @ 10 §  Known Item vs. Keyword/Category •  “Show me the money” -- Top Product Analysis §  Identity Search - If your top product is named X and someone searches for X, is X on the first page? Is it number 1? §  Is a top product underperforming as it relates to search? •  Top X Queries and Query Terms •  Zero Results and % of Zero Results •  Avg. # of facets/filters/spellchecks clicked per session •  Avg # of searches per user session •  Auto-suggest usage 8
  • 8. Minimum Search Features §  High Quality Relevance for keyword and known item search •  P@10 or MRR close to 1 §  Sub-second response time under load §  All achievable in LWE/Solr/Lucene 9
  • 9. Faceting §  LWE/Solr support faceting by: •  Field •  Date/Number Ranges •  Pivot (“what if” faceting) •  Hierarchical §  (via domain modeling) •  Dynamic (via Carrot^2) §  Single and multi-select faceting supported §  Facet by Function In Development •  https://issues.apache.org/jira/browse/SOLR-1581 §  http://wiki.apache.org/solr/SimpleFacetParameters 10
  • 10. More Features §  Extensible Language Analysis •  Multilingual Support •  Synonyms •  Overrides on a per-word basis •  Pluggable Framework §  Frequent/Incremental Updates •  How often do you update your index? •  Near Real Time (IndexReader.open() ) •  Column Stride Fields (4.0) 11
  • 11. Relevance Controls §  Function Queries •  Ratings/Reviews •  Margin/Inventory/Price/Location •  Can Sort by Functions •  …/solr/browse?q=ipod&bf=price §  Editorial Controls (QueryElevationComponent) •  Fine grained controls •  …/solr/elevate?q=YYYY&enableElevation=true §  Landing Pages (if done in search…) •  Implement: Docs with field that is filtered on or a separate index/core •  Editorial Controls §  Click Scoring (LWE only) •  Popularity based ranking 12
  • 12. Beyond the Search Box §  Many eCom sites actually power all navigation by the search engine §  Many other tools in the Stack to help users discover content •  Auto Suggest •  Spell Checking •  More Like This •  Spatial 13
  • 13. Complementary Tools §  Apache Mahout •  Recommendation Systems §  Crude Solr/Mahout Rec Integration at https://github.com/gsingers/ApacheCon2010 •  Classifiers/Clustering §  User Analysis, Content Analysis, etc. §  Social •  BazaarVoice, etc. §  Business Rules Engine •  Drools or others 14
  • 14. What’s Missing? §  UI Controls for non-devs: •  Synonyms (LWE has UI/REST support) •  Facets (Field support in LWE) •  Relevance Control (LWE REST API Support) •  Business Rules Integration •  Deeper Taxonomy Support •  More performance reports (LWE has some) §  Facet Management tools •  Labels •  Sort order other than Count or Alphabetical •  Editorial facet control 15
  • 15. What’s Next? §  Some sample code and more discussion at http://www.lucidimagination.com/blog/ 2011/01/25/implementing-the-ecommerce- checklist-with-apache-solr-and-lucidworks/ 16
  • 16. Resources §  Principles for Effective Search in E- §  grant@lucidimagination.com Commerce Design •  http://lucene.li/2T §  @gsingers §  http:// www.lucidimagination .com/search/? q=ecommerce http://www.lucidimagination.com 17