SlideShare une entreprise Scribd logo
1  sur  19
Amazon CloudSearch
& Future Search Architectures
Iain Fletcher
ifletcher@Searchtechnologies.com
The expert in the search space
Agenda
A brief introduction to Search Technologies
Search Application Architectures
Common challenges, emerging architectures

Our thoughts about Amazon CloudSearch

The expert in the search space
Search Technologies Overview
Ascot, UK
Karlsruhe, DE

Cincinnati, OH
Herndon, VA
San Diego, CA
San Jose, CR

• The leading IT Services company dedicated to
Enterprise Search & Search-based Applications
• Implementation, Consulting, Managed Services
• 120 employees and growing
• Independent, working with all of the leading
software vendors and open source alternatives
500+ Customers
Future Search Architectures

The expert in the search space
A Traditional Integrated Architecture
Integrated Search Engine

Content
Sources
Employee
Directory
CMS

File Share

Etc.

Connectors

Index Pipeline

Search
Index

UI

The year is 2013, but we still have:
Dissatisfaction with search results
Scalability issues (various)
High initial investment
Inflexibility in the model
Data integration issues…
The expert in the search space
Reasons for Change
Today’s leading “indexing engines” are very capable
The challenges to success almost always lie in
peripheral tasks. On the indexing side of the house,
this means:
Gathering and feeding data to the search engine
Preparing data for indexing (normalization, metadata
enrichment. Etc.)

The expert in the search space
Reasons for Change
The need to cope with continuous and rapid data
growth
Things usually break first on the data gathering side of
the house
E.g. The Government department and the 7TB File Server

The expert in the search space
A Traditional Integrated Architecture
Integrated Search Engine

Content
Sources
Employee
Directory
CMS

Connectors

Index Pipeline

Search
Index

UI

RE-INDEX

File Share

Etc.

The expert in the search space
A Better Search Architecture
Iterative
Development

Content
Sources
Employee
Directory
CMS

Etc.

Content
Processing

Connectors

Secure
xml
Cache

Search Engine
Index
Pipeline

Search
Index

RE-INDEX

Better operational agility

The expert in the search space
A Better Search Architecture
Iterative
Development

Content
Sources
Employee
Directory
CMS

Etc.

Content
Processing

Connectors

Secure
xml
Cache

Amazon CloudSearch
Index
Pipeline

Search
Index

RE-INDEX

Better operational agility
Much improved cost of
ownership
The expert in the search space
Let’s take a look at Amazon CloudSearch
If you’ve ever switched on an EC2 server, then
you can switch on a CloudSearch server
And of course, switch it off just as easily

Xml or JSON in, for indexing
RESTful / xml out, for searching
Has the bases covered in terms of core search
functionality

The expert in the search space
Amazon CloudSearch: Important Search Functions

Full-on Search Navigators
Results sorting by property
Plenty of relevancy tuning controls
Indexing speed is impressive
We’ve tested up to 500 docs/sec

Very long results lists can be served

The expert in the search space
Cost Effective & Flexible
The usual benefits of AWS
Only use what you need
Elastic scalability

Maintenance inclusive
Compare that to the cost of a Service Level Agreement for
an on-premise alternative
Then add the labour costs….

From less than £50 per month

The expert in the search space
What do you need to get started
Repository Connectors
Capture content
Convert to xml
XSLT. Etc.

Content Processing
Java code, Groovy. Etc.
A framework product, E.g. Aspire
Commercial software to suit the
purpose

Amazon S3
Atlassian Confluence
Documentum
eRoom
Secure File Systems
IBM Connections
JIVE
Lotus Notes
RDBMS
SharePoint 2010, 2013
SocialCast
TeamForge
Web Crawling

A User Interface
The expert in the search space
Search-specific UI/UX builders are available
The expert in the search space
What Applications Suit Amazon CloudSearch?

In principle any
CloudSearch is an option that everyone should
consider for search-based applications
Especially where elastic scalability is an advantage
Or where the data already lives in the cloud,
including S3 buckets

The expert in the search space
To conclude
If you like Amazon Web Services, then you’ll like
CloudSearch for the same reasons
It is a full-function indexing engine, ideally suited to
agile, flexible search architectures
If you are looking for a search-based application, it
should be on your shortlist
For many applications, CloudSearch will prove to be
highly cost effective
The expert in the search space
My email:

ifletcher@searchtechnologies.com

Useful URLs:
Connectors for Cloudsearch
www.searchtechnologies.com/cloudsearch-data-connectors.html

Aspire Content Processing Framework
www.searchtechnologies.com/aspire.html

A Java API for Amazon Cloudsearch
www.searchtechnologies.com/java-api-amazon-cloudsearch.html

Enterprise Search Surveys
www.searchtechnologies.com/enterprise-search-surveys.html

Blog: Searching Wikipedia with Amazon CloudSearch
www.searchtechnologies.com/cloudsearch-data-connectors.html

White Paper: A Big Data Architecture for Search
www.searchtechnologies.com/big-data-enterprise-search-architecture.html
The expert in the search space

Contenu connexe

Plus de Search Technologies

The Evolution of Search and Big Data
The Evolution of Search and Big DataThe Evolution of Search and Big Data
The Evolution of Search and Big DataSearch Technologies
 
Enterprise Search Summit Keynote: A Big Data Architecture for Search
Enterprise Search Summit Keynote: A Big Data Architecture for SearchEnterprise Search Summit Keynote: A Big Data Architecture for Search
Enterprise Search Summit Keynote: A Big Data Architecture for SearchSearch Technologies
 
Advanced Query Parsing Techniques
Advanced Query Parsing TechniquesAdvanced Query Parsing Techniques
Advanced Query Parsing TechniquesSearch Technologies
 
The things you need to know about SharePoint 2013 Search
The things you need to know about SharePoint 2013 SearchThe things you need to know about SharePoint 2013 Search
The things you need to know about SharePoint 2013 SearchSearch Technologies
 
Enterprise Search Best Practices Webinar 4.2013
Enterprise Search Best Practices Webinar 4.2013Enterprise Search Best Practices Webinar 4.2013
Enterprise Search Best Practices Webinar 4.2013Search Technologies
 

Plus de Search Technologies (7)

The Evolution of Search and Big Data
The Evolution of Search and Big DataThe Evolution of Search and Big Data
The Evolution of Search and Big Data
 
Enterprise Search Summit Keynote: A Big Data Architecture for Search
Enterprise Search Summit Keynote: A Big Data Architecture for SearchEnterprise Search Summit Keynote: A Big Data Architecture for Search
Enterprise Search Summit Keynote: A Big Data Architecture for Search
 
Advanced Query Parsing Techniques
Advanced Query Parsing TechniquesAdvanced Query Parsing Techniques
Advanced Query Parsing Techniques
 
Wikipedia Cloud Search Webinar
Wikipedia Cloud Search WebinarWikipedia Cloud Search Webinar
Wikipedia Cloud Search Webinar
 
The things you need to know about SharePoint 2013 Search
The things you need to know about SharePoint 2013 SearchThe things you need to know about SharePoint 2013 Search
The things you need to know about SharePoint 2013 Search
 
Enterprise Search Best Practices Webinar 4.2013
Enterprise Search Best Practices Webinar 4.2013Enterprise Search Best Practices Webinar 4.2013
Enterprise Search Best Practices Webinar 4.2013
 
Advanced Relevancy Ranking
Advanced Relevancy RankingAdvanced Relevancy Ranking
Advanced Relevancy Ranking
 

Dernier

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 

Dernier (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 

Amazon CloudSearch & Future Search Architectures

  • 1. Amazon CloudSearch & Future Search Architectures Iain Fletcher ifletcher@Searchtechnologies.com The expert in the search space
  • 2. Agenda A brief introduction to Search Technologies Search Application Architectures Common challenges, emerging architectures Our thoughts about Amazon CloudSearch The expert in the search space
  • 3. Search Technologies Overview Ascot, UK Karlsruhe, DE Cincinnati, OH Herndon, VA San Diego, CA San Jose, CR • The leading IT Services company dedicated to Enterprise Search & Search-based Applications • Implementation, Consulting, Managed Services • 120 employees and growing • Independent, working with all of the leading software vendors and open source alternatives
  • 5. Future Search Architectures The expert in the search space
  • 6. A Traditional Integrated Architecture Integrated Search Engine Content Sources Employee Directory CMS File Share Etc. Connectors Index Pipeline Search Index UI The year is 2013, but we still have: Dissatisfaction with search results Scalability issues (various) High initial investment Inflexibility in the model Data integration issues… The expert in the search space
  • 7. Reasons for Change Today’s leading “indexing engines” are very capable The challenges to success almost always lie in peripheral tasks. On the indexing side of the house, this means: Gathering and feeding data to the search engine Preparing data for indexing (normalization, metadata enrichment. Etc.) The expert in the search space
  • 8. Reasons for Change The need to cope with continuous and rapid data growth Things usually break first on the data gathering side of the house E.g. The Government department and the 7TB File Server The expert in the search space
  • 9. A Traditional Integrated Architecture Integrated Search Engine Content Sources Employee Directory CMS Connectors Index Pipeline Search Index UI RE-INDEX File Share Etc. The expert in the search space
  • 10. A Better Search Architecture Iterative Development Content Sources Employee Directory CMS Etc. Content Processing Connectors Secure xml Cache Search Engine Index Pipeline Search Index RE-INDEX Better operational agility The expert in the search space
  • 11. A Better Search Architecture Iterative Development Content Sources Employee Directory CMS Etc. Content Processing Connectors Secure xml Cache Amazon CloudSearch Index Pipeline Search Index RE-INDEX Better operational agility Much improved cost of ownership The expert in the search space
  • 12. Let’s take a look at Amazon CloudSearch If you’ve ever switched on an EC2 server, then you can switch on a CloudSearch server And of course, switch it off just as easily Xml or JSON in, for indexing RESTful / xml out, for searching Has the bases covered in terms of core search functionality The expert in the search space
  • 13. Amazon CloudSearch: Important Search Functions Full-on Search Navigators Results sorting by property Plenty of relevancy tuning controls Indexing speed is impressive We’ve tested up to 500 docs/sec Very long results lists can be served The expert in the search space
  • 14. Cost Effective & Flexible The usual benefits of AWS Only use what you need Elastic scalability Maintenance inclusive Compare that to the cost of a Service Level Agreement for an on-premise alternative Then add the labour costs…. From less than £50 per month The expert in the search space
  • 15. What do you need to get started Repository Connectors Capture content Convert to xml XSLT. Etc. Content Processing Java code, Groovy. Etc. A framework product, E.g. Aspire Commercial software to suit the purpose Amazon S3 Atlassian Confluence Documentum eRoom Secure File Systems IBM Connections JIVE Lotus Notes RDBMS SharePoint 2010, 2013 SocialCast TeamForge Web Crawling A User Interface The expert in the search space
  • 16. Search-specific UI/UX builders are available The expert in the search space
  • 17. What Applications Suit Amazon CloudSearch? In principle any CloudSearch is an option that everyone should consider for search-based applications Especially where elastic scalability is an advantage Or where the data already lives in the cloud, including S3 buckets The expert in the search space
  • 18. To conclude If you like Amazon Web Services, then you’ll like CloudSearch for the same reasons It is a full-function indexing engine, ideally suited to agile, flexible search architectures If you are looking for a search-based application, it should be on your shortlist For many applications, CloudSearch will prove to be highly cost effective The expert in the search space
  • 19. My email: ifletcher@searchtechnologies.com Useful URLs: Connectors for Cloudsearch www.searchtechnologies.com/cloudsearch-data-connectors.html Aspire Content Processing Framework www.searchtechnologies.com/aspire.html A Java API for Amazon Cloudsearch www.searchtechnologies.com/java-api-amazon-cloudsearch.html Enterprise Search Surveys www.searchtechnologies.com/enterprise-search-surveys.html Blog: Searching Wikipedia with Amazon CloudSearch www.searchtechnologies.com/cloudsearch-data-connectors.html White Paper: A Big Data Architecture for Search www.searchtechnologies.com/big-data-enterprise-search-architecture.html The expert in the search space

Notes de l'éditeur

  1. Here’s the agenda for my part of the presentation.After a brief introduction to Search Technologies, I have some thoughts to share with you about how search architectures are evolving, and why.That will set the scene for taking a look at Amazon CloudSearch.