Advanced Search & Indexing
In Liferay 6.1
External sources integration

Filippo Frignocca | Marcello Torriani
Ariadne




           November 16th, 2012
Intro & Agenda

 • Search in intranet portals
 • Business requirements: clustering,
   advanced indexing configuration,
   external sources integration
 • Ariadne experience
 • Liferay 6.1 indexing and searching
   solutions
 • Our solution: Liferay 6.1, Solr 4.0,
   ManifoldCF
Search in intranet portals

 • Large amount of structured and
   unstructured content
 • Content can be anywhere: DB,
   filesystem, repository…
 • Users have to find information
   quickly and efficiently
Business requirements

Indexing & Searching in an enterprise
   environment means:
 • Seamless integration in the SERP (search engine results page)
 • External sources indexing
 • Clustered environment
 • Scalability
 • Advanced indexing features
Our past work at Ariadne

 • On Liferay 5.x
 • Deep customization: a new Web CMS
 • Integration of a Content Repository
 • ElasticSearch
 • Integration of external contents: Google Search Appliance
Liferay 6.1 and search

 • OpenSearch
 • Faceted search
 • Document & Media search:
   Liferay 6.1 supports searching
   both local and remote
   repositories.
 • The search capabilities, however,
   are limited to the facilities
   provided by the protocol
Solr 4

 • Schema aware
 • Search functions
 • Pluggable handlers and functions
 • Cache
 • Replication
 • Statistics
SolrCloud

 • High-scale, fault-tolerant, distributed
   indexing and search
 • Collections (indexes) may span
   multiple SolrCores
 • ZooKeeper manages cluster configuration
   and coordination
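
A minimal sketch of how a collection spanning several cores might be requested, assuming Solr 4's Collections API (`action=CREATE` with `numShards` and `replicationFactor`). The host, port, and collection name are illustrative assumptions, and the URL is only built here, not sent.

```python
from urllib.parse import urlencode


def create_collection_url(base_url, name, num_shards, replication_factor):
    """Build a Collections API CREATE request URL (the request is not sent)."""
    params = {
        "action": "CREATE",
        "name": name,                        # collection (logical index) name
        "numShards": num_shards,             # how many shards the collection is split into
        "replicationFactor": replication_factor,  # copies of each shard
    }
    return f"{base_url}/admin/collections?{urlencode(params)}"


# Assumed values: a local Solr node and a collection called "liferay".
url = create_collection_url("http://localhost:8983/solr", "liferay", 2, 2)
print(url)
```

With two shards and a replication factor of two, the collection would be backed by four cores spread over the available nodes, with ZooKeeper holding the shared configuration.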
ManifoldCF

 • Crawler framework for connecting source
   content repositories (SharePoint,
   Documentum, web sites, shares,...) to
   indexers (Solr, Elasticsearch, …)
 • Resilient, incremental
 • Integrated with document security
 • Connectors:
    ●   Authority connectors: map a user name to
        access rights on a repository
    ●   Repository connectors: fetch documents
        from a repository
    ●   Output connectors: push documents to
        an indexer
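
The three connector roles can be pictured with a toy model. This is a sketch in plain Python, not the ManifoldCF API: every class, method, and field name below is hypothetical, the "repository" is an in-memory list, and the "indexer" is a dict standing in for Solr.

```python
from dataclasses import dataclass


@dataclass
class Document:
    doc_id: str
    version: str           # changes whenever the source document changes
    body: str
    allowed_tokens: tuple  # access tokens allowed to read the document


class RepositoryConnector:
    """Fetches documents from a source repository (here: an in-memory list)."""
    def __init__(self, documents):
        self.documents = documents

    def fetch(self):
        return list(self.documents)


class OutputConnector:
    """Pushes documents to an indexer (here: a dict standing in for Solr)."""
    def __init__(self):
        self.index = {}

    def push(self, doc):
        self.index[doc.doc_id] = doc


class AuthorityConnector:
    """Maps a user name to the access tokens that user holds."""
    def __init__(self, user_tokens):
        self.user_tokens = user_tokens

    def tokens_for(self, user):
        return set(self.user_tokens.get(user, ()))


def crawl(repository, output, seen_versions):
    """Incremental crawl: push only documents that are new or changed."""
    pushed = 0
    for doc in repository.fetch():
        if seen_versions.get(doc.doc_id) != doc.version:
            output.push(doc)
            seen_versions[doc.doc_id] = doc.version
            pushed += 1
    return pushed


def visible(output, authority, user):
    """Return the ids of indexed documents the user is allowed to see."""
    tokens = authority.tokens_for(user)
    return sorted(d.doc_id for d in output.index.values()
                  if tokens & set(d.allowed_tokens))


repo = RepositoryConnector([
    Document("a", "v1", "budget", ("finance",)),
    Document("b", "v1", "handbook", ("staff", "finance")),
])
out = OutputConnector()
auth = AuthorityConnector({"anna": ["staff"]})
seen = {}
first = crawl(repo, out, seen)          # both documents are new
second = crawl(repo, out, seen)         # nothing changed, nothing pushed
anna_docs = visible(out, auth, "anna")  # only documents carrying a "staff" token
print(first, second, anna_docs)
```

The second crawl pushes nothing because no version changed, which is the sense of "incremental" above; the token check is a simplified picture of document-level security.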
ManifoldCF input connectors
[Diagram: UNIFIED INDEX (Solr), fed by external sources through crawling (ManifoldCF) and by portal assets and external CMIS-compliant repositories]
Liferay 6.1 & Solr 4

  • Liferay integrates Solr 1.4
  • Upgrade to 4.0:
    ●   Index Schema Definition
    ●   Solr plugin fix to handle empty snippet results
    ●   Solr plugin configuration to connect to an external Index
        Engine/Cluster
Schema Definition for Solr 4.0 – Liferay 6.1 integration

primary key: uid
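
What a `uid` primary key implies for indexing can be shown with a small simulation. This sketch does not talk to Solr; it only mimics the behaviour of a unique key field, where adding a document with an existing key replaces the stored one. The function name and sample values are illustrative assumptions.

```python
def add_document(index, doc, unique_key="uid"):
    """Add a document; an existing document with the same key is replaced."""
    if unique_key not in doc:
        raise ValueError(f"document is missing the unique key field '{unique_key}'")
    index[doc[unique_key]] = doc


index = {}
add_document(index, {"uid": "ext-1", "title": "Draft"})
add_document(index, {"uid": "ext-1", "title": "Final"})  # same uid: replaces the draft
add_document(index, {"uid": "ext-2", "title": "Other"})
print(len(index), index["ext-1"]["title"])
```

This is why documents coming from the portal and from the crawler need distinct, stable `uid` values in a unified index: two sources emitting the same key would overwrite each other.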
Fix to handle results with
matches in all metadata,
not only Content or Title
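
One way to picture this fix is a fallback when choosing the snippet for a result. The sketch below is not the actual plugin change: it assumes highlighting data shaped like Solr's per-document map of field name to fragment list, and the function name, field names, and truncation length are hypothetical.

```python
def pick_snippet(highlights, doc, preferred=("content", "title"), max_len=40):
    """Choose a snippet even when the match is not in content or title."""
    # 1. Prefer a highlighted fragment from the usual fields.
    for name in preferred:
        fragments = highlights.get(name) or []
        if fragments:
            return fragments[0]
    # 2. Otherwise use a fragment from any other field that matched.
    for name in sorted(highlights):
        fragments = highlights[name] or []
        if fragments:
            return fragments[0]
    # 3. No fragment at all: fall back to the start of the stored text.
    return (doc.get("content") or doc.get("title") or "")[:max_len]


doc = {"title": "Search setup", "content": "How we configured the index"}
from_metadata = pick_snippet({"content": [], "assetTagNames": ["<em>solr</em>"]}, doc)
from_fallback = pick_snippet({}, doc)
print(from_metadata)
print(from_fallback)
```

A result that matched only on a tag or another metadata field still gets a non-empty snippet instead of a blank entry in the results page.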
External Index Configuration
ManifoldCF output connector
ManifoldCF repository connector
ManifoldCF job scheduler
ManifoldCF job monitor
ManifoldCF output configuration
Custom plugin

To show external source search results in the Liferay Faceted Search
   Portlet we need to:
 • Define one (or more) Asset Model classes
 • Configure a custom Indexer class
 • Implement a Hook to override the result page JSPs
 • Properly configure the Faceted Search Portlet
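
The role of the model class and indexer can be sketched as a field mapping. This is an illustration in Python rather than the Java plugin itself: `uid`, `entryClassName`, `entryClassPK`, `companyId`, `title`, and `content` are field names used in Liferay's index, while the external class name, the `uid` format, and the helper functions are assumptions made for the example.

```python
from collections import Counter

# Hypothetical class name for the external asset model; not from the slides.
EXTERNAL_CLASS_NAME = "com.example.external.model.ExternalDocument"


def to_index_fields(source_id, title, content, company_id):
    """Map an external document to the fields the search portlet reads."""
    return {
        "uid": f"{EXTERNAL_CLASS_NAME}_{source_id}",  # assumed uid format
        "entryClassName": EXTERNAL_CLASS_NAME,        # drives the asset type facet
        "entryClassPK": str(source_id),
        "companyId": str(company_id),
        "title": title,
        "content": content,
    }


def asset_type_facet(documents):
    """Count results per entryClassName, as an asset type facet would."""
    return Counter(doc["entryClassName"] for doc in documents)


docs = [
    to_index_fields(1, "Policy", "Travel policy text", 10154),
    to_index_fields(2, "Manual", "Operations manual text", 10154),
    {"entryClassName": "com.liferay.portlet.journal.model.JournalArticle",
     "title": "News"},
]
facet = asset_type_facet(docs)
print(facet)
```

Giving external documents their own `entryClassName` lets them appear as a separate asset type next to native portal content; the overridden JSPs then decide how such an entry is rendered.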
Indexed and Model classes
Faceted Search jsp Hook
Faceted Search Configuration
Faceted Search Results
Work in progress

 • Indexing and searching protected resources
 • Optimize External Asset Management
 • Tags and Categories consistency check
 • Optimize Index schema mapping
 • Entry detail link management for external results
Thank you!

www.ariadne.it
info@ariadne.it
@ariadnesrl


            Marcello
            Torriani
            torriani@ariadne.it

            Filippo
            Frignocca
            frignocca@ariadne.it
