4. Search Tech has evolved
• Traditional fuzzy keyword lookup is faster than ever at
ever increasing scale
• Richer data modeling capabilities
• “light relational”
• Advanced types
• Faceting, Aggregations, Analytics
• Spatial, Record linkage, alerting
• Top N problems
6. Search + Hadoop
• What’s Old is New Again
!
• “Traditional” Use Cases:
• Build/Store indexes
• https://cwiki.apache.org/confluence/display/solr/
Running+Solr+on+HDFS
!
• Enrichment and Signal processing
• PageRank, Statistically Interesting Phrases, etc.
7. LucidWorks + Hadoop
•Ingestion Help
• Flexible Map-Reduce content ingestion supporting:
• Directory of files
• CSV, Writable, etc.
• LogStash
• Build Your Own
•Pig Load/Store and UDFs
•Hive 2-way support
•http://www.lucidworks.com/search-for-hadoop/
11. Signal Processing
• Signals power modern relevance!
• Clicks, conversions, sharing, history, signatures and more
• Make it easy to capture and leverage signals
• Power recommendations, analytics, discovery
• Simplify:
• Data workflow
• Operational footprint