From Search to Found

Grant Ingersoll ‐ Eran Yaniv
Thursday, August 6, 2009
Agenda
     Introductions
     Apache Solr background
     LucidWorks for Solr
     Installing LucidWorks for Solr
     Searching your domain with Solr
     Putting Solr into production
     Questions




                              Lucid Imagination, Inc.
Introductions
      Grant Ingersoll
        Lucene/Solr committer
        Co‐founder Apache Mahout project
        Co‐author of upcoming “Taming Text”
      Eran Yaniv
        Lucid Solutions Manager
        Background
        • Product management
        • Enterprise Development/IT
        • Information Retrieval



Apache Solr Background
     Lucene‐based Search server plus many enterprise tools
       REST‐like API
       Faceting
       Distributed/Replication
       Easy configuration
       Many other features: 
       http://lucene.apache.org/solr/features.html
     Created at CNET by Yonik Seeley (Lucid co‐founder)
       Donated to the Apache Software Foundation in 2006
       Solr 1.4 release coming soon



Solr Basics
       Content is modeled via Documents and Fields
         Content can be text, integers, floats, dates, custom
         Analysis can be employed to alter content before indexing
         Controlled via schema.xml
       Searches are supported through a wide range of Query options
         Keyword
         Terms
         Phrases
         Wildcards, other
       Many clients available: HTTP, Java, Ruby, PHP, .NET, etc.

Solr Basics
      Schema
        Define Field Types, Fields, field metadata and Analysis
        <field name="name" type="text" indexed="true" stored="true"/>
        Copy Fields, Dynamic Fields, Similarity overrides
      Solr Config
        Define low‐level Lucene controls
        Specify how clients interact with Solr via Request Handlers (“mini servlets”)
        Configure highlighting, spell checking, admin, etc. 




LucidWorks for Solr
      Based on Apache Solr 1.3 plus
        Installer for Linux and Windows
        Specific patches from Solr 
        • faceting improvements, other
        30‐day free “Get Started” program
        Bundled:
        • JRE
        • Apache Tomcat
        • Optimized KStemmer implementation
        • Luke
        • Lucid Gaze for Solr


Getting Started
    1.   Install LucidWorks
    2.   Model your domain
    3.   Index your content
    4.   Test
    5.   Deploy




Install LucidWorks
      Free certified distribution
        Introduced to many new users
        New users frequently use “Get Started”
        Over 50% of “Get Started” cases are “How to install” questions
      Installer
        Simple
        Plugins and enhancements
        Updateable
        Support for Linux, Windows (Mac?)
        UI and headless



Installer Overview

      Solr installer service
        Hosted on lucidimagination.com
        Manages repositories
        • Public repository
        • Beta (password protected, early adopters)
        • Dev - Internal
      Solr installer client
        Install/Uninstall certified v.
        Check/install updates
        Install/update components
        Upgrade to platform
Starting LucidWorks
      cd <INSTALL_PATH>/lucidworks


      ./lucidworks.sh start (*NIX) 


      .\lucidworks.bat start (Windows)


      Point your browser at http://localhost:8983/solr/




Master Your Domain with Solr


     Get to know your content


     Get to know your users


     Model in Solr




Modeling your Content
     Collection/Aggregate
       Examine collection level stats, like:
       • MIME Types
       • Number of Docs
       • Update rates
       • Languages present
       • Much, much more
       Look for patterns and relationships
       Identify helpful resources




Modeling your Content
      Randomly sample a set of your documents
      Look for:
        Common structures like titles, tables, columns, etc.
        Important metadata
        Tokenization issues
        • Try out in http://localhost:8983/solr/admin/analysis.jsp
        Importance Indicators
        May also look at paragraph, sentence, word and character issues
      Often useful to run docs through the indexing process iteratively



Understanding your Users
  UI Expectations


  Speed and Relevance


  Search and Discovery
    Search
    Faceting
    Did you mean?
    Similar Pages (More Like This)
    Highlighting
    Document/Results Clustering
Build your Application
  Map your content into Documents and Fields via the Solr schema


  Set up your Solr access patterns in solrconfig.xml


  Index your content 


  Search




Indexing
  Many Clients
     Java, PHP, Ruby, etc.
     See example/exampledocs
  Pull from DB, others
  Upload CSV, Solr XML
  <add><doc>
    <field name="id">EN7800GTX/2DHTV/256M</field>
    <field name="manu">ASUS Computer Inc.</field>
    <field name="cat">electronics</field>
  </doc></add>
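An add message like the one above can also be generated rather than written by hand. The following is a minimal sketch using only the Python standard library; the helper name `build_add_xml` is hypothetical, and the field values are the ones from the slide. Sending the result to Solr is left out.

```python
import xml.etree.ElementTree as ET


def build_add_xml(docs):
    """Serialize a list of {field: value} dicts as a Solr <add> message."""
    add = ET.Element("add")
    for doc in docs:
        doc_el = ET.SubElement(add, "doc")
        for name, value in doc.items():
            # One <field name="..."> element per document field.
            field = ET.SubElement(doc_el, "field", name=name)
            field.text = str(value)
    return ET.tostring(add, encoding="unicode")


example_xml = build_add_xml([
    {"id": "EN7800GTX/2DHTV/256M", "manu": "ASUS Computer Inc.", "cat": "electronics"},
])
print(example_xml)
```

Building the message with an XML library rather than string concatenation means special characters in field values are escaped automatically.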
Search

   Clients also support search through API calls
   HTTP support by definition:
     http://localhost:8983/solr/select/?q=*:*&fl=score,id
     http://localhost:8983/solr/select/?q=name:iPod&fl=score,id
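The two select URLs above differ only in their q parameter, so they are easy to build in code. This is a small sketch with the Python standard library; `select_url` is a hypothetical helper, and the base URL is the one shown on the slide. Note that `urlencode` percent-encodes characters such as `:` and `,`, which Solr decodes back on receipt.

```python
from urllib.parse import urlencode

SOLR_SELECT = "http://localhost:8983/solr/select/"  # base URL from the slide


def select_url(query, fields=("score", "id")):
    """Build a select URL with the q and fl parameters used above."""
    params = {"q": query, "fl": ",".join(fields)}
    return SOLR_SELECT + "?" + urlencode(params)


print(select_url("*:*"))
print(select_url("name:iPod"))
```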
Load Testing
      Solr scales quite well, but you should still load test to establish performance specs for your application
         Apache JMeter can be a good start


      Ideally, play back old logs at the rate they occurred


      As with any Java application, keep an eye on JVM factors like heap size and garbage collection
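Playing back logs "at the rate they occurred" comes down to preserving the gaps between requests. The sketch below shows one way to derive those gaps in Python; the function name `replay_schedule` and the sample log entries are made up for illustration, and actually sending the queries (for example with JMeter or an HTTP client) is not shown.

```python
from datetime import datetime


def replay_schedule(log_entries):
    """Turn (timestamp, query) pairs into (delay_seconds, query) pairs.

    Each delay is the gap since the previous request, so sleeping for the
    delay before sending each query reproduces the original request rate.
    """
    schedule = []
    previous = None
    for stamp, query in sorted(log_entries):
        delay = 0.0 if previous is None else (stamp - previous).total_seconds()
        schedule.append((delay, query))
        previous = stamp
    return schedule


# Hypothetical entries; a real run would parse these from the query log.
log = [
    (datetime(2009, 8, 6, 10, 0, 0), "name:iPod"),
    (datetime(2009, 8, 6, 10, 0, 2), "cat:electronics"),
    (datetime(2009, 8, 6, 10, 0, 5), "*:*"),
]
schedule = replay_schedule(log)
print(schedule)
```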




Improving Performance
     Search
        Avoid wildcards, or at least require prefix
        Catch‐all field for “generic” search
        Choose proper faceting method for the situation
        Replicate/Shard
     Indexing
        Minimal analysis to achieve results (speeds indexing)
        Multi‐threaded, batch submission
     Usual Suspects:  CPU, Memory, Disk, JVM
      http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Scaling-Lucene-and-Solr/
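The "multi-threaded, batch submission" advice for indexing can be sketched as follows. This is only an outline under stated assumptions: `send_batch` is a placeholder that stands in for posting one batch to Solr and merely reports the batch size, and the batch size and worker count are arbitrary values, not recommendations from the slides.

```python
from concurrent.futures import ThreadPoolExecutor


def batches(docs, size):
    """Yield successive lists of at most `size` documents."""
    batch = []
    for doc in docs:
        batch.append(doc)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch


def send_batch(batch):
    """Placeholder for submitting one batch; here it only returns the size."""
    return len(batch)


def index_all(docs, batch_size=100, workers=4):
    """Submit batches from several threads and collect the results in order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(send_batch, batches(docs, batch_size)))


sizes = index_all([{"id": str(i)} for i in range(250)])
print(sizes)
```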

Relevance Testing
      Often overlooked until there is a problem; instead, plan for it up front


      Types:
         Ad hoc
         Log based/ QA driven
         Standard Collections and Queries (TREC)


      Best Practice: Take the top 50 or so queries by volume, plus ~20 random queries, and rate the top ten results as relevant, somewhat relevant, not relevant, or embarrassing
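The best practice above can be sketched in a few lines of Python. Everything named here is an assumption for illustration: `pick_queries` draws the random queries uniformly from the distinct queries outside the top set (one possible reading of "random"), and the sample log is invented.

```python
import random
from collections import Counter

RATINGS = ("relevant", "somewhat relevant", "not relevant", "embarrassing")


def pick_queries(query_log, top_n=50, random_n=20, seed=0):
    """Return the top_n queries by volume plus random_n others, without repeats."""
    counts = Counter(query_log)
    top = [query for query, _ in counts.most_common(top_n)]
    rest = sorted(set(counts) - set(top))
    rng = random.Random(seed)  # fixed seed so the sample is repeatable
    extra = rng.sample(rest, min(random_n, len(rest)))
    return top, extra


def tally(judgments):
    """Count how many rated results fall into each of the four ratings."""
    counts = Counter(judgments)
    return {rating: counts.get(rating, 0) for rating in RATINGS}


# Hypothetical log: one entry per query issued.
log = ["ipod"] * 5 + ["solr"] * 3 + ["lucene"] * 2 + ["tomcat", "jmeter", "luke"]
top, extra = pick_queries(log, top_n=2, random_n=2)
print(top, extra)
print(tally(["relevant", "relevant", "not relevant", "embarrassing"]))
```

Tracking the tallies over time gives a simple baseline to compare against after each relevance change.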

Troubleshooting Relevance in LucidWorks for Solr

    Add &debugQuery=true to any Query:
      Provides info on why a doc scored the way it did, plus other info about the Query
      http://localhost:8983/solr/select/?q=*:*&debugQuery=true
    Solr’s built-in LukeRequestHandler
    Luke, the Lucene index browser
      lucidworks/luke.(sh|bat)
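When troubleshooting, it can be handy to turn any existing query URL into its debug variant. A minimal sketch using the Python standard library follows; `with_debug` is a hypothetical helper, and the example URL reuses the select endpoint from earlier slides.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_debug(url):
    """Return the URL with debugQuery=true set, keeping the other parameters."""
    parts = urlsplit(url)
    # Drop any existing debugQuery value so the parameter appears only once.
    params = [(k, v) for k, v in parse_qsl(parts.query) if k != "debugQuery"]
    params.append(("debugQuery", "true"))
    return urlunsplit(parts._replace(query=urlencode(params)))


debug_url = with_debug("http://localhost:8983/solr/select/?q=name:iPod&fl=score,id")
print(debug_url)
```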
Improving your Search

Common Techniques
   Analysis:
     Lowercase, stemming, synonyms, stopwords, compound analysis (e.g. STR-AV220 -> STR AV 220)
   Boosts (query and index)
   Faceting and other navigational aids
   Spell Checking
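To make the analysis techniques above concrete, here is a toy analyzer in plain Python. It is not Solr's analysis chain, only an illustration of lowercasing, stopword removal, and the compound split from the slide (STR-AV220 -> STR AV 220); the stop list is a tiny made-up example.

```python
import re

STOPWORDS = {"the", "a", "an", "of"}  # hypothetical, deliberately tiny stop list


def analyze(text):
    """Split at delimiters and letter/digit boundaries, lowercase, drop stopwords."""
    # Runs of letters and runs of digits become separate tokens, so
    # "STR-AV220" yields "STR", "AV" and "220".
    tokens = re.findall(r"[A-Za-z]+|[0-9]+", text)
    return [t.lower() for t in tokens if t.lower() not in STOPWORDS]


print(analyze("STR-AV220"))
print(analyze("The STR AV 220 receiver"))
```

Because both spellings produce the same leading tokens, a query for either form can match a document indexed with the other.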
Improving your Queries
      Disjunction Max Query (more in a minute)
      Better stop word handling
      Phrase Queries and other Position‐based Queries
        “quick red fox”~3
      Recency/Freshness
      Invisible Queries
        Relevance Feedback and “More Like This”
      Fake Queries




Disjunction Max Query

   Useful when searching across multiple fields
   Example (thanks to Chuck Williams)
     • Query: t:elephant d:elephant t:albino d:albino
     • Doc1: t: elephant, d: elephant
     • Doc2: t: elephant, d: albino
   Each Doc scores the same for BooleanQuery
   DisjunctionMaxQuery scores Doc2 higher
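The difference between the two query types can be checked with a toy scoring model. The sketch below is not Lucene's scoring: every matching clause simply scores 1.0 (no tf/idf, norms, or coordination factor), and the disjunction-max part is written as "best field score plus tie-breaker times the rest" for each term. All function names are hypothetical.

```python
QUERY_TERMS = ["elephant", "albino"]
FIELDS = ["t", "d"]

DOCS = {
    "Doc1": {"t": "elephant", "d": "elephant"},
    "Doc2": {"t": "elephant", "d": "albino"},
}


def clause_score(doc, field, term):
    """Toy clause score: 1.0 if the field contains the term, else 0.0."""
    return 1.0 if term in doc[field].split() else 0.0


def boolean_score(doc):
    """Sum every field:term clause, as in the flat query above."""
    return sum(clause_score(doc, f, t) for t in QUERY_TERMS for f in FIELDS)


def dismax_score(doc, tie=0.0):
    """Per term, keep the best field score plus tie * the other scores; sum over terms."""
    total = 0.0
    for term in QUERY_TERMS:
        scores = [clause_score(doc, f, term) for f in FIELDS]
        best = max(scores)
        total += best + tie * (sum(scores) - best)
    return total


for name, doc in DOCS.items():
    print(name, boolean_score(doc), dismax_score(doc))
```

Under this model both documents get a Boolean score of 2.0, while the disjunction-max score rewards Doc2 for matching both query terms rather than one term twice.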
Advanced Techniques
     Payloads
       http://www.lucidimagination.com/blog/2009/08/05/getting-started-with-payloads/
       DelimitedPayloadTokenFilter (better name?)
       • Add payloads inline:  foo|2.3 bar|5.4
       BoostingFunctionTermQuery (Lucene 2.9, Solr 1.4)
     Natural Language Processing
       Named Entity Extraction (OpenNLP, Stanford NER, Commercial)
       Sentiment Analysis
       Event Detection
       Relationship Identification
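The inline payload syntax listed under Payloads above (foo|2.3 bar|5.4) pairs each token with a number after a delimiter. The sketch below only illustrates that text format in plain Python; it is not the DelimitedPayloadTokenFilter implementation, and the `parse_payloads` name and the default payload of 1.0 are assumptions.

```python
def parse_payloads(text, delimiter="|", default=1.0):
    """Split 'token|payload' items into (token, float payload) tuples."""
    pairs = []
    for item in text.split():
        token, _, payload = item.partition(delimiter)
        # Tokens without a payload fall back to the assumed default weight.
        pairs.append((token, float(payload) if payload else default))
    return pairs


pairs = parse_payloads("foo|2.3 bar|5.4")
print(pairs)
```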

Solr in Production
      Hardware
      Monitoring
        Lucid Gaze for Solr
        Nagios, Hyperic, Port monitoring
      Troubleshooting
        Solr Community – ad hoc support
        Lucid Support – Commercial support with SLAs
      Growth
        Query Volume
        Index Size




Lucid Gaze for Solr
      Monitor Solr Request Handlers


      Comes with LucidWorks for Solr


      http://localhost:8983/gaze




Resources
   Websites
      http://www.lucidimagination.com
      http://search.lucidimagination.com
      http://lucene.apache.org/solr
   Solr Support and Training
      http://www.lucidimagination.com/How-We-Can-Help
      SLAs, Public, Private and Online Training for Solr and Lucene
   Mailing Lists
      solr-user@lucene.apache.org



 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 

Dernier (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Getting started faster with LucidWorks for Solr

  • 2. Agenda Introductions Apache Solr background LucidWorks for Solr Installing LucidWorks for Solr Searching your domain with Solr Putting Solr into production Questions Lucid Imagination, Inc.
  • 3. Introductions. Grant Ingersoll: Lucene/Solr committer; co-founder of the Apache Mahout project; co-author of the upcoming “Taming Text”. Eran Yaniv: Lucid Solutions Manager; background in product management, enterprise development/IT, and information retrieval.
  • 4. Apache Solr Background. Lucene-based search server plus many enterprise tools: REST-like API; faceting; distributed search and replication; easy configuration; many other features (http://lucene.apache.org/solr/features.html). Created at CNET by Yonik Seeley (Lucid co-founder). Donated to the Apache Software Foundation in 2006. Solr 1.4 release coming soon.
  • 5. Solr Basics. Content is modeled via Documents and Fields: content can be text, integers, floats, dates, or custom types; analysis can be employed to alter content before indexing; controlled via schema.xml. Searches are supported through a wide range of query options: keyword, terms, phrases, wildcards, and others. Many clients available: HTTP, Java, Ruby, PHP, .NET, etc.
  • 6. Solr Basics. Schema: define Field Types, Fields, field metadata, and Analysis, e.g. <field name="name" type="text" indexed="true" stored="true"/>; Copy Fields, Dynamic Fields, Similarity overrides. Solr Config: define low-level Lucene controls; specify how clients interact with Solr via Request Handlers (“mini servlets”); configure highlighting, spell checking, admin, etc.
  • 7. LucidWorks for Solr. Based on Apache Solr 1.3, plus: installer for Linux and Windows; specific patches from Solr (faceting improvements, others); 30-day free “Get Started” program. Bundled: JRE; Apache Tomcat; optimized KStemmer implementation; Luke; Lucid Gaze for Solr.
  • 8. Getting Started. 1. Install Lucid Works. 2. Model your domain. 3. Index your content. 4. Test. 5. Deploy.
  • 9. Install Lucid Works. Free certified distribution: introduced to many new users; new users frequently use “Get Started”; over 50% of the cases are “How to install”. Installer: simple; plugins and enhancements; updateable; support for Linux, Windows (Mac?); UI and headless.
  • 10. Installer Overview. Solr installer service: hosted on lucidimagination.com; public repository; manages repositories. Solr installer client: install/uninstall certified vs. beta; check/install updates; password-protected install/update components; upgrade to platform. Early adopters. Dev: internal.
  • 11.
  • 12. Starting Lucid Works. cd <INSTALL_PATH>/lucidworks, then ./lucidworks.sh start (*NIX) or .\lucidworks.bat start (Windows). Point your browser at http://localhost:8983/solr/
  • 13. Master Your Domain with Solr. Get to know your content. Get to know your users. Model in Solr.
  • 14. Modeling your Content. Collection/aggregate: examine collection-level stats such as MIME types, number of docs, update rates, languages present, and much more. Look for patterns and relationships. Identify helpful resources.
  • 15. Modeling your Content. Randomly sample a set of your documents. Look for: common structures like titles, tables, columns, etc.; important metadata; tokenization issues (try them out in http://localhost:8983/solr/admin/analysis.jsp); importance indicators. May also look at paragraph, sentence, word, and character issues. Often useful to run docs through the indexing process iteratively.
  • 16. Understanding your Users. UI expectations. Speed and relevance. Search and discovery: search; faceting; “Did you mean?”; similar pages (More Like This); highlighting; document/results clustering.
  • 17. Build your Application. Map your content into Documents and Fields via the Solr schema. Set up your Solr access patterns in solrconfig.xml. Index your content. Search.
  • 18. Indexing. Many clients: Java, PHP, Ruby, etc. See example/exampledocs. Pull from DB, others. Upload CSV or Solr XML: <add><doc> <field name="id">EN7800GTX/2DHTV/256M</field> <field name="manu">ASUS Computer Inc.</field> <field name="cat">electronics</field> </doc></add>
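
The Solr XML message on the indexing slide can be generated rather than hand-written. The following is a minimal sketch using only the Python standard library; the helper name build_add_xml is hypothetical, and sending the result to Solr (typically an HTTP POST to the update handler, followed by a commit) is left out.

```python
import xml.etree.ElementTree as ET


def build_add_xml(docs):
    """Serialize a list of {field_name: value} dicts as a Solr <add> message.

    build_add_xml is a hypothetical helper for illustration, not a Solr API.
    """
    add = ET.Element("add")
    for doc in docs:
        doc_el = ET.SubElement(add, "doc")
        for name, value in doc.items():
            field = ET.SubElement(doc_el, "field", name=name)
            field.text = str(value)  # ElementTree escapes &, <, > on output
    return ET.tostring(add, encoding="unicode")


# The document from the slide, expressed as a plain dict.
payload = build_add_xml([
    {
        "id": "EN7800GTX/2DHTV/256M",
        "manu": "ASUS Computer Inc.",
        "cat": "electronics",
    },
])
print(payload)
```

Building the message with an XML library instead of string concatenation means characters such as & in field values are escaped automatically.
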
  • 19. Search. Clients also support search through API calls. HTTP support by definition: http://localhost:8983/solr/select/?q=*:*&fl=score,id and http://localhost:8983/solr/select/?q=name:iPod&fl=score,id
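
The select URLs above can also be assembled programmatically, which avoids quoting mistakes once queries contain spaces or punctuation. This is a sketch assuming the default localhost URL from the slides; the function name select_url is hypothetical, and the optional debugQuery flag is the one described on the troubleshooting slide.

```python
from urllib.parse import urlencode

# Base URL taken from the slides; adjust host and port for your install.
SOLR_SELECT = "http://localhost:8983/solr/select/"


def select_url(q, fl="score,id", debug=False, base=SOLR_SELECT):
    """Build a Solr select URL with percent-encoded parameters."""
    params = [("q", q), ("fl", fl)]
    if debug:
        params.append(("debugQuery", "true"))
    return base + "?" + urlencode(params)


print(select_url("*:*"))
print(select_url("name:iPod"))
print(select_url("name:iPod", debug=True))
```

The printed URLs are percent-encoded (for example, the colon becomes %3A), which the server decodes back to the same query string shown on the slide.
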
  • 20. Load Testing. Solr scales quite well, but you should still load test to establish performance specs for your application. Apache JMeter can be a good start. Ideally, play back old logs at the rate they occurred. As with any Java application, keep an eye on JVM factors like heap size and garbage collection.
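
Playing back old logs "at the rate they occurred" comes down to preserving the gaps between logged requests. The sketch below computes those gaps from timestamps; the log entries and the replay_schedule name are made up for illustration, and a real driver would sleep for each delay and then issue the query (for instance through JMeter or an HTTP client).

```python
from datetime import datetime


def replay_schedule(entries, speedup=1.0):
    """Yield (delay_seconds, query) pairs that keep the original spacing.

    entries: iterable of (datetime, query) tuples in chronological order.
    speedup: values above 1.0 compress the gaps to simulate heavier load.
    """
    previous = None
    for stamp, query in entries:
        if previous is None:
            delay = 0.0
        else:
            delay = (stamp - previous).total_seconds() / speedup
        previous = stamp
        yield delay, query


# Hypothetical log excerpt; a real one would be parsed from server logs.
log = [
    (datetime(2009, 8, 6, 10, 0, 0), "name:iPod"),
    (datetime(2009, 8, 6, 10, 0, 2), "cat:electronics"),
    (datetime(2009, 8, 6, 10, 0, 3), "*:*"),
]
schedule = list(replay_schedule(log))
print(schedule)
```
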
  • 21. Improving Performance. Search: avoid wildcards, or at least require a prefix; use a catch-all field for “generic” search; choose the proper faceting method for the situation; replicate/shard. Indexing: minimal analysis to achieve results (speeds indexing); multi-threaded, batch submission. Usual suspects: CPU, memory, disk, JVM. http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Scaling-Lucene-and-Solr/
  • 22. Relevance Testing. Often overlooked until there is a problem; instead, plan for it up front. Types: ad hoc; log based/QA driven; standard collections and queries (TREC). Best practice: take the top 50 or so queries by volume, plus ~20 random queries, and rate the top ten results as relevant, somewhat relevant, not relevant, or embarrassing.
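
The best practice above has two mechanical parts: choosing the queries to judge, and tallying the ratings. A rough sketch of both follows; the function names, the fixed random seed, and the shape of the judgments dict are assumptions for illustration, and the rating of each result is still done by a person.

```python
import random
from collections import Counter

# The four-level scale named on the slide.
RATINGS = ("relevant", "somewhat relevant", "not relevant", "embarrassing")


def pick_queries(logged_queries, top_n=50, random_n=20, seed=0):
    """Return the top_n queries by volume plus random_n sampled from the rest."""
    counts = Counter(logged_queries)
    top = [query for query, _ in counts.most_common(top_n)]
    rest = sorted(set(counts) - set(top))
    rng = random.Random(seed)  # fixed seed keeps the sample repeatable
    return top + rng.sample(rest, min(random_n, len(rest)))


def summarize(judgments):
    """Count ratings across all judged results.

    judgments: {query: [rating of each of the top ten results]}.
    """
    totals = Counter()
    for ratings in judgments.values():
        for rating in ratings:
            if rating not in RATINGS:
                raise ValueError(f"unknown rating: {rating!r}")
            totals[rating] += 1
    return {rating: totals[rating] for rating in RATINGS}


picked = pick_queries(["ipod", "ipod", "zune", "nano"], top_n=1, random_n=5)
summary = summarize({"ipod": ["relevant", "relevant", "embarrassing"]})
print(picked, summary)
```

Tracking the same summary before and after a configuration change gives a simple way to see whether relevance moved in the right direction.
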
  • 23. Troubleshooting Relevance in LucidWorks for Solr. Add &debugQuery=true to any query (e.g. http://localhost:8983/solr/select/?q=*:*&debugQuery=true): provides info on why a doc scored the way it did, plus other info about the query. Solr’s built-in LukeRequestHandler. Luke, the Lucene index browser: lucidworks/luke.(sh|bat)
  • 24. Improving your Search. Common techniques. Analysis: lowercasing, stemming, synonyms, stopwords, compound analysis (e.g. STR-AV220 -> STR AV 220). Boosts (query and index). Faceting and other navigational aids. Spell checking.
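
To make the analysis steps concrete, here is a toy analysis chain in plain Python covering compound splitting, lowercasing, stopword removal, and synonym expansion. It only illustrates the idea: in Solr these steps are configured as tokenizers and filters in schema.xml, the stopword list and synonym map below are invented, and stemming is omitted.

```python
import re

STOPWORDS = {"a", "an", "the", "of", "and"}  # tiny illustrative list
SYNONYMS = {"tv": ["television"]}            # hypothetical synonym map


def split_compound(token):
    """Split on punctuation and letter/digit boundaries: STR-AV220 -> STR, AV, 220."""
    return re.findall(r"[A-Za-z]+|[0-9]+", token)


def analyze(text):
    """Whitespace-tokenize, split compounds, lowercase, drop stopwords, add synonyms."""
    terms = []
    for raw in text.split():
        for part in split_compound(raw):
            term = part.lower()
            if term in STOPWORDS:
                continue
            terms.append(term)
            terms.extend(SYNONYMS.get(term, []))
    return terms


print(analyze("The Sony STR-AV220 TV"))
```

Applying the same chain at index time and at query time is what lets a search for "str av 220" match a document that contains "STR-AV220".
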
  • 25. Improving your Queries. Disjunction Max Query (more in a minute). Better stop word handling. Phrase queries and other position-based queries, e.g. “quick red fox”~3. Recency/freshness. Invisible queries. Relevance feedback and “More Like This”. Fake queries.
  • 26. Disjunction Max Query. Useful when searching across multiple fields. Example (thanks to Chuck Williams): Query: t:elephant d:elephant t:albino d:albino. Doc1: t: elephant, d: elephant. Doc2: t: elephant, d: albino. Each doc scores the same for BooleanQuery. DisjunctionMaxQuery scores Doc2 higher.
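
The elephant/albino example can be worked through numerically. The sketch below assumes a toy scoring model in which a field scores 1.0 when it contains the term and 0.0 otherwise; real Lucene scores also involve term frequency, document frequency, and norms, so only the relative ordering is meaningful. The disjunction-max score per term is the best field score plus a tie-breaker multiple of the remaining field scores.

```python
QUERY_TERMS = ["elephant", "albino"]
FIELDS = ["t", "d"]

# The two documents from the slide.
DOCS = {
    "Doc1": {"t": "elephant", "d": "elephant"},
    "Doc2": {"t": "elephant", "d": "albino"},
}


def field_score(doc, field, term):
    """Toy score (an assumption): 1.0 if the field contains the term, else 0.0."""
    return 1.0 if term in doc[field].split() else 0.0


def boolean_score(doc):
    """Sum over every term/field clause, as a flat boolean OR query would."""
    return sum(field_score(doc, f, t) for t in QUERY_TERMS for f in FIELDS)


def dismax_score(doc, tie=0.0):
    """Per term, take the best field score plus tie * the other field scores."""
    total = 0.0
    for term in QUERY_TERMS:
        scores = [field_score(doc, f, term) for f in FIELDS]
        best = max(scores)
        total += best + tie * (sum(scores) - best)
    return total


for name, doc in DOCS.items():
    print(name, boolean_score(doc), dismax_score(doc), dismax_score(doc, tie=0.1))
```

Under this model both documents match two clauses and tie on the boolean sum, while the disjunction-max score rewards Doc2 for covering both query terms rather than repeating one term in two fields.
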
  • 27. Advanced Techniques. Payloads (http://www.lucidimagination.com/blog/2009/08/05/getting-started-with-payloads/): DelimitedPayloadTokenFilter (better name?), which adds payloads inline, e.g. foo|2.3 bar|5.4; BoostingFunctionTermQuery (Lucene 2.9, Solr 1.4). Natural Language Processing: Named Entity Extraction (OpenNLP, Stanford NER, commercial); sentiment analysis; event detection; relationship identification.
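
The inline payload notation (foo|2.3 bar|5.4) pairs each term with a number that can later influence scoring. The following sketch only parses that notation in plain Python to show the data shape; it is not the Lucene filter, and choices such as splitting at the first delimiter and returning None for tokens without a payload are assumptions of the sketch.

```python
def parse_payloads(text, delimiter="|"):
    """Return (term, payload) pairs from text like 'foo|2.3 bar|5.4'.

    Tokens without a delimiter get a payload of None.
    """
    pairs = []
    for token in text.split():
        term, sep, raw = token.partition(delimiter)
        pairs.append((term, float(raw) if sep else None))
    return pairs


print(parse_payloads("foo|2.3 bar|5.4"))
```
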
  • 28. Solr in Production. Hardware. Monitoring: Lucid Gaze for Solr; Nagios, Hyperic, port monitoring. Troubleshooting: Solr community (ad hoc support); Lucid Support (commercial support with SLAs). Growth: query volume; index size.
  • 29. Lucid Gaze for Solr. Monitor Solr Request Handlers. Comes with LucidWorks for Solr. http://localhost:8983/gaze
  • 31. Resources. Websites: http://www.lucidimagination.com http://search.lucidimagination.com http://lucene.apache.org/solr Solr support and training: http://www.lucidimagination.com/How-We-Can-Help (SLAs; public, private, and online training for Solr and Lucene). Mailing lists: solr-user@lucene.apache.org