Ce diaporama a bien été signalé.
Nous utilisons votre profil LinkedIn et vos données d’activité pour vous proposer des publicités personnalisées et pertinentes. Vous pouvez changer vos préférences de publicités à tout moment.
This Ain’t Your Parents’ Search Engine 
Confidential and Proprietary © Copyright 2013 
Grant Ingersoll 
CTO, LucidWorks 
T...
Confidential and Proprietary © Copyright 2013 
Search is dead.
Long live search 
Confidential and Proprietary © Copyright 2013
Search is good for… 
• Traditional: Fast, fuzzy text matching 
across a large document collection 
• De-normalized data 
-...
Foundational Changes in Lucene/Solr 4 
•Reduced Memory usage 
•Pluggable Codecs/similarity 
•FS(A|T) 
•Doc Values (column ...
Search + Hadoop 
• What’s Old is New Again 
• “Traditional” Use Cases: 
- Build/Store indexes 
- https://cwiki.apache.org/...
LucidWorks + Hadoop 
• Ingestion Help 
- Flexible Map-Reduce content ingestion supporting: 
»Directory of files 
»CSV, Wri...
LucidWorks SiLK 
Connectors 
Confidential and Proprietary © Copyright 2013 
LucidWorks Search 
JDBC 
Connector 
Web/File 
...
Search Analytics—Data Ingestion & Visualization 
Solr/Solr Cloud 
Confidential and Proprietary © Copyright 2013 
Gateway 
...
LucidWorks Open Source 
• Logstash for Solr: https://github.com/LucidWorks/solrlogmanager 
• Banana (Kibana for Solr): htt...
Demos 
Confidential and Proprietary © Copyright 2013
Fly the friendly skies 
12 
http://www.ibm.com/developerworks/library/j-solr-lucene/index.html 
Confidential and Proprieta...
Make $$$ 
• Leverage time series 
data and visualization 
using LucidWorks SiLK 
• Monitor Social 
• Traditional Research ...
Cure what ails you 
Confidential and Proprietary © Copyright 2013
Space-Time Continuum 
15 
• Leverage Solr’s spatial 
capabilities to index non-spatial 
data, such as time 
ranges 
- Usef...
Signal Processing for Search and Discovery 
• Signals power modern relevance 
– Clicks, conversions, sharing, history, sig...
Solr Powered Signal Processing 
• Use Case: eCommerce 
• Data: 
– Product catalog (~1.2m items) 
– Click data (~3.9M click...
Meta 
• http://www.lucidworks.com 
– grant@lucidworks.com 
– @gsingers 
• Sales 
– Steve Drane (based here in Chicago) 
– ...
Prochain SlideShare
Chargement dans…5
×
Prochain SlideShare
The Latest in Spatial & Temporal Search: Presented by David Smiley
Suivant
Télécharger pour lire hors ligne et voir en mode plein écran

0

Partager

Télécharger pour lire hors ligne

This Ain't Your Parents' Search Engine

Télécharger pour lire hors ligne

Chicago Solr Meetup - June 10th.

  • Soyez le premier à aimer ceci

This Ain't Your Parents' Search Engine

  1. 1. This Ain’t Your Parents’ Search Engine Confidential and Proprietary © Copyright 2013 Grant Ingersoll CTO, LucidWorks Twitter: @gsingers
  2. 2. Confidential and Proprietary © Copyright 2013 Search is dead.
  3. 3. Long live search Confidential and Proprietary © Copyright 2013
  4. 4. Search is good for… • Traditional: Fast, fuzzy text matching across a large document collection • De-normalized data - “light” relational • Top N problems - Key-value (n=1) - Recommendations - “Good enough” classification, clustering • Faceting, aggregations, analytical slicing and dicing of data • Spatial, record/event linkage, alerting Confidential and Proprietary © Copyright 2013 http://cheezburger.com/5243950080
  5. 5. Foundational Changes in Lucene/Solr 4 •Reduced Memory usage •Pluggable Codecs/similarity •FS(A|T) •Doc Values (column oriented) •Spatial upgrade •New facets and functions •Cursors (deep paging) •Distributed capabilities •Joins/Grouping Confidential and Proprietary © Copyright 2013
  6. 6. Search + Hadoop • What’s Old is New Again • “Traditional” Use Cases: - Build/Store indexes - https://cwiki.apache.org/confluence/display/solr/ Running+Solr+on+HDFS •Enrichment and Signal processing - PageRank, Statistically Interesting Phrases, etc. Confidential and Proprietary © Copyright 2013
  7. 7. LucidWorks + Hadoop • Ingestion Help - Flexible Map-Reduce content ingestion supporting: »Directory of files »CSV, Writable, etc. »LogStash »Build Your Own • Pig Load/Store and UDFs • Hive 2-way support •http://www.lucidworks.com/search-for-hadoop/ - Open source this summer Confidential and Proprietary © Copyright 2013
  8. 8. LucidWorks SiLK Connectors Confidential and Proprietary © Copyright 2013 LucidWorks Search JDBC Connector Web/File System Crawl Data Warehouse Hadoop Connectors Clickstream Networking Data Sources Servers
  9. 9. Search Analytics—Data Ingestion & Visualization Solr/Solr Cloud Confidential and Proprietary © Copyright 2013 Gateway (Reverse Proxy) Solr Output Writer for LogStash (Http) Search Logs Visualization Configurable Dashboards Hadoop Connector LogStash GrokIngestMapper
  10. 10. LucidWorks Open Source • Logstash for Solr: https://github.com/LucidWorks/solrlogmanager • Banana (Kibana for Solr): https://github.com/LucidWorks/banana • Effortless AWS deployment and monitoring: http://www.github.com/lucidworks/solr-scale-tk • Data Quality Toolkit: https://github.com/LucidWorks/data-quality Confidential and Proprietary © Copyright 2013
  11. 11. Demos Confidential and Proprietary © Copyright 2013
  12. 12. Fly the friendly skies 12 http://www.ibm.com/developerworks/library/j-solr-lucene/index.html Confidential and Proprietary © Copyright 2013
  13. 13. Make $$$ • Leverage time series data and visualization using LucidWorks SiLK • Monitor Social • Traditional Research https://github.com/lucidworks/lws-financial-demo Confidential and Proprietary © Copyright 2013
  14. 14. Cure what ails you Confidential and Proprietary © Copyright 2013
  15. 15. Space-Time Continuum 15 • Leverage Solr’s spatial capabilities to index non-spatial data, such as time ranges - Useful for Open Hours, Shifts, etc. • Query using rectangle intersections - q = shift:"Intersects(0 19 23 365)” https://people.apache.org/~hossman/spatial-for-non-spatial-meetup-20130117/ Confidential and Proprietary © Copyright 2013
  16. 16. Signal Processing for Search and Discovery • Signals power modern relevance – Clicks, conversions, sharing, history, signatures • LucidWorks 5 makes it easy to capture and leverage signals – Recommendations, analytics, discovery • Simplifies your data workflow • Simplify your operational footprint Confidential and Proprietary © Copyright 2013
  17. 17. Solr Powered Signal Processing • Use Case: eCommerce • Data: – Product catalog (~1.2m items) – Click data (~3.9M clicks) Confidential and Proprietary © Copyright 2013
  18. 18. Meta • http://www.lucidworks.com – grant@lucidworks.com – @gsingers • Sales – Steve Drane (based here in Chicago) – steve.drane@lucidworks.com • Lucene/Solr Revolution – Washington DC, Nov 11-14 – http://www.lucenerevolution.org Confidential and Proprietary © Copyright 2013

Chicago Solr Meetup - June 10th.

Vues

Nombre de vues

447

Sur Slideshare

0

À partir des intégrations

0

Nombre d'intégrations

9

Actions

Téléchargements

8

Partages

0

Commentaires

0

Mentions J'aime

0

×