ENTERPRISE SEARCH BEST PRACTICES
Iain Fletcher
ifletcher@searchtechnologies.com
Bill Fowler
bfowler@searchtechnologies.com
Greg Gomez
ggomez@google.com
Maria Lundahl
mlundahl@google.com
Agenda
• Search Technologies Overview
• Enterprise Search Best Practices
• Three pillars of enterprise search success
• Google Demo Presentation
• Q&A
Search Technologies Overview
• The largest IT services company focused on search
engines
• Consulting
• Implementation
• Managed services
• Technology independent, working with most of the leading
search engine vendors
• An increasing proportion of customers using the GSA
Search Technologies Overview
San Diego, CA
San Jose, CR
Herndon, VA
Ascot, UK
Cincinnati, OH
Some of our GSA customers
Introduction
• The GSA is used for a wide range of search solutions
• Many initial GSA implementations address “plug-in-and-go” needs
• We are working with an increasing number of GSA
customers to extend to enterprise-scale
Enterprise Search Applications
Better search improves a wide range of business processes
Internal
• Compliance & Risk
• Customer Service Support
• Research & Development
• Legal & Contracts
• Competitive Intelligence
• People/Expert Search
• General Intranet…
External
• Partner Extranet
• Online Publishing
• Ecommerce
• Customer Self-service
• Field Maintenance
• Website Search
• Tech Sales Support…
The Platform Approach
Key Advantages
• Cost sharing provides lower TCO
• Agility is improved, as it is much easier to create new
applications or repurpose content via search, to meet
emerging needs
• Improved user productivity
• Users benefit from a consistent, instantly available, repository-independent search experience
Enterprise Search Foundations
Three pillars of Enterprise Search success
• Relevancy & Scalability
• Connectivity & Security
• Metadata Capture & Creation
Relevancy & Scalability
• Relevancy is easy over small data sets
• Most modern search engines scale well in terms of
document count and query load
• Maintaining relevancy at scale is the challenge
• Poor relevancy is a common complaint with existing enterprise
search systems
• This is a sweet spot for the GSA
Connectivity & Security
• What is the #1 reason for not finding information?
• Because it hasn’t been indexed
• Data connectors enable the search engine to access
content sources
• A growing range of connectors is available for the GSA
• There is also a connector framework for custom development
• Every company has its own combination of repositories
Connectivity & Security
• Connectivity occasionally creates “issues”
• These are mostly caused by the characteristics of the repository or
the general IT environment, rather than by faults with the connector
software
• Security causes the most issues
• In larger organizations, security can be complex
• Multiple LDAP/AD servers
• Nested permissions, etc.
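As a rough illustration of why nested permissions complicate security, the sketch below expands a user's nested group memberships and then trims a result list against document ACLs. The membership table, the ACL layout and the function names are all hypothetical; a real deployment would resolve groups against LDAP/AD rather than a dictionary, and this is not how the GSA implements security trimming.

```python
from collections import deque

# Hypothetical membership table: each principal maps to the groups it
# belongs to directly. Groups can themselves be members of other groups.
MEMBER_OF = {
    "alice": {"engineering"},
    "engineering": {"all-staff"},
    "bob": {"contractors"},
}

# Hypothetical search results, each carrying the principals allowed to read it.
RESULTS = [
    {"url": "hr/policy.pdf", "acl": {"all-staff"}},
    {"url": "eng/design.doc", "acl": {"engineering"}},
    {"url": "vendor/contract.doc", "acl": {"contractors"}},
]


def effective_principals(user, member_of):
    """Return the user plus every group reachable through nested membership."""
    seen = {user}
    queue = deque([user])
    while queue:
        current = queue.popleft()
        for group in member_of.get(current, ()):
            if group not in seen:  # also guards against membership cycles
                seen.add(group)
                queue.append(group)
    return seen


def trim_results(results, user, member_of):
    """Keep only results whose ACL shares at least one principal with the user."""
    principals = effective_principals(user, member_of)
    return [doc for doc in results if principals & doc["acl"]]
```

Here "alice" can see the all-staff document only because her group is itself nested inside another group, which is the kind of indirection that tends to cause issues at scale.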
Connectivity & Security
Best Practices:
• Don’t expect connectors to always “plug-and-go”
• Usually they will, but sometimes it will take a little effort
• Secure connectivity can almost always be achieved
• Take security seriously
• Make friends, not enemies, of repository owners by
showing them that you take security seriously – you
will need their cooperation!
Metadata Capture & Creation
Metadata is the foundation of important search functions.
For example:
• It supports relevancy
• It drives dynamic navigation
• It enables infographical results display
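To make the link between metadata and dynamic navigation concrete, here is a minimal sketch of counting facet values over a result set. The document structure and the field name are assumptions chosen for illustration, not a GSA data format.

```python
from collections import Counter


def facet_counts(docs, field):
    """Count how many documents carry each value of one metadata field.

    Documents without the field are skipped, which is why sparse metadata
    leads to sparse navigation options.
    """
    counts = Counter()
    for doc in docs:
        value = doc.get("metadata", {}).get(field)
        if value is not None:
            counts[value] += 1
    return counts.most_common()


# Hypothetical result set
docs = [
    {"title": "Acme agreement", "metadata": {"department": "Legal"}},
    {"title": "NDA template", "metadata": {"department": "Legal"}},
    {"title": "Handbook", "metadata": {"department": "HR"}},
    {"title": "Untitled scan", "metadata": {}},
]
```

Calling `facet_counts(docs, "department")` yields the value/count pairs a navigation panel would display. The untagged document contributes nothing, which is the practical argument for capturing or creating metadata.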
Dynamic Navigation
Infographical Results
• Via RESTful/XML results delivery
• Holistic results display, graphical navigation…
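A custom results display typically starts by parsing the XML the search engine returns. The sketch below parses a small hand-written sample. The element names (`GSP`, `RES`, `R`, `U`, `T`, `MT`) are modeled on the GSA XML output as best understood and should be treated as assumptions to verify against the product documentation; the URLs and values are invented.

```python
import xml.etree.ElementTree as ET

# Illustrative sample only; not captured from a real appliance.
SAMPLE_XML = """<GSP>
  <RES>
    <R N="1">
      <U>http://intranet.example.com/contracts/acme.pdf</U>
      <T>Acme master agreement</T>
      <MT N="department" V="Legal"/>
      <MT N="year" V="2012"/>
    </R>
    <R N="2">
      <U>http://intranet.example.com/hr/handbook.html</U>
      <T>Employee handbook</T>
    </R>
  </RES>
</GSP>"""


def parse_results(xml_text):
    """Turn each result element into a dict with url, title and metadata."""
    root = ET.fromstring(xml_text)
    results = []
    for r in root.iter("R"):
        results.append({
            "url": r.findtext("U", default=""),
            "title": r.findtext("T", default=""),
            # Each MT element is assumed to hold one name/value metadata pair.
            "metadata": {mt.get("N"): mt.get("V") for mt in r.findall("MT")},
        })
    return results
```

The resulting list of dictionaries can feed whatever charting or navigation layer the user interface uses.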
Creating & Capturing Metadata
Capture
• Be diligent about taking what is available from the source
• This includes metadata encoded into file paths, or
available from complementary sources
Creation
• Automate the creation of metadata based on
extraction techniques
• GSA 7 provides a new capability for this:
“Entity Recognition”
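Capturing metadata encoded in file paths can be as simple as mapping folder levels to fields. This sketch assumes a hypothetical repository layout of `/<department>/<year>/<doc_type>/<file>`; the field names and the layout are illustrative and would differ for every repository.

```python
from pathlib import PurePosixPath

# Assumed folder convention: /<department>/<year>/<doc_type>/<file>
PATH_FIELDS = ("department", "year", "doc_type")


def metadata_from_path(path):
    """Derive metadata fields from the folder levels of a document path."""
    parts = PurePosixPath(path).parts
    meta = {"filename": parts[-1]}
    # Drop the root marker and the filename, leaving only folder names.
    folders = [p for p in parts[:-1] if p != "/"]
    # zip() stops at the shorter sequence, so shallow paths just yield fewer fields.
    for key, value in zip(PATH_FIELDS, folders):
        meta[key] = value
    return meta
```

For example, `metadata_from_path("/legal/2012/contracts/acme.pdf")` produces department, year and document type fields without anyone tagging the file by hand.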
New with GSA 7: Entity Recognition
• Provides both “regex” and dictionary-based approaches to
automated metadata creation
• Regex – identifies patterns in the content to match names, emails,
phone numbers, etc.
• List-based – enables a focus on industry- or company-specific
terminology
• Open & Customizable
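The two approaches can be sketched in a few lines of Python. This is a generic illustration of regex-based and dictionary-based extraction, not the GSA's configuration format; the patterns are deliberately simple and the term list is hypothetical.

```python
import re

# Pattern-based entities: simplified patterns for illustration only.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\b\d{3}-\d{3}-\d{4}\b")

# Dictionary-based entities: a hypothetical company-specific term list.
PRODUCT_TERMS = {"search appliance", "connector framework"}


def extract_entities(text):
    """Return entities found by regex matching and by dictionary lookup."""
    lowered = text.lower()
    return {
        "email": sorted(set(EMAIL_RE.findall(text))),
        "phone": sorted(set(PHONE_RE.findall(text))),
        # Plain substring lookup; a production system would match on tokens.
        "product": sorted(term for term in PRODUCT_TERMS if term in lowered),
    }


sample = "Contact jane.doe@example.com or 555-123-4567 about the Search Appliance."
entities = extract_entities(sample)
```

The extracted values become metadata on the document, which can then drive relevancy tuning and dynamic navigation.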
Summary
• The GSA 7 has a full set of capabilities to deliver enterprise
search excellence
• Key issues are:
• Relevancy at scale
• Security-compliant connectivity capabilities
• Metadata capture and auto-generation capabilities
• Plus the administrative simplicity you’d expect from a
Google application
The Google Approach
• Google receives >1B queries per
day
• Algorithm is made up of >200
signals
• >500 changes to the algorithm
each year
• Each change is analyzed through
1% testing of actual Google traffic
• Over 10,000 such experiments
each year
Evolution of the Google Search Appliance
[Timeline figure: GSA releases between 2002 and 2013, with version labels including 6.10, 6.12, 6.14 and 7.0]
50x increase in capacity between 2002 and 2013: from 2M to 100M documents
Demo – GSA 7.0
• Cross-Platform Search Results
• Security Trimming
• Dynamic Navigation
• Entity Recognition
• Document Preview
• Language Translation
• Expert Search
Thank You!

Enterprise Search Best Practices Webinar 4.2013


Editor's notes

  1. We’re an IT services company dedicated to search engines. Search is all we do. We believe that we are the largest company of our kind. We provide strategic consulting, implementation work, and a range of managed services. We are technology independent and work with all of the leading search engine vendors. Established in 2005, we now have about 110 staff and more than 400 customers. A growing proportion of our customers now use the Google Search Appliance.
  2. We have worked with more than 130 customers who have chosen to use the GSA. Here are a few of them.
  3. Why not stick with the embedded search functionality that most repositories provide? READ SLIDE. In the case of the GSA, it will look familiar too…
  4. Scale within the enterprise also implies a wide variety of document types, formats and lengths, which further complicates relevancy calculation. I’ll leave it to Google to mention the technical aspects of achieving relevance at scale during their presentation.
  5. The number 1 reason for not finding something through search is simple: it has not been indexed. To index content, of course, you need data connectors. In the GSA environment, there’s a growing range of connectors available, both from Google and from third parties. There’s also an API for developing new connectors. Every company has its own combination of repositories to connect to.
  6. Connectors almost always need configuration. Even where they will literally plug-and-play, and with the GSA this is often the case, it is still worth taking the time to understand the full range of configuration possibilities. Connectors frequently have issues. We’ve seen many companies over the years who have approached enterprise search with strong expectations of plug-and-play when it comes to connectors, only to be discouraged when things don’t run entirely smoothly. When issues occur, it is usually not because of the connector software. Most issues are caused by the characteristics of the repository, or by the IT environment more generally. The enterprise search implementation team typically has no control over the configuration of document repositories, so you have to “go with the flow” and work with what is available. Security causes the most connector issues; I’ll come back to that shortly. But the good news is that with planning, and with a willingness to configure or customize, pretty much any repository can be successfully connected to, and this gets us to base camp: getting the content indexed and made searchable.
  7. The point is, you need metadata to do this. The GSA provides a RESTful/XML results delivery service that can be used to create user interfaces such as these.
  8. Thanks for your attention. If you have any questions that you’d prefer weren’t discussed in public, please feel free to drop me an email.