SlideShare une entreprise Scribd logo
1  sur  25
Télécharger pour lire hors ligne
Magnolia Solr Module
improvements
Feature overview
•   Multi site support.
•   Solr Cloud support.
•   Asynchronous indexing.
•   Improved way to configure which pages are indexed.
•   Template based boosting modifier.
•   Flexible page type resolving mechanism.
•   Search result: page type to css mapping.
•   Various solr document field configuration enhancements:
     o Multi value flag to match solr document schema.
     o Added pluggable system for converting field data to solr
       document (Adders).
•   Facets.
•   Fake facet for period filtering.
Multi site support
• Any number of named configurations.

• Link a site to a specific configuration.

• Admin central solr page updated to
  trigger deleting all documents of a
  specific site.
Solr cloud support
Asynchronous indexing
 • Indexing is not part of the workflow.

 • Creation of solr document and publication done in
   java.util.concurrent.ExecutorService.

 • Faster activation.

 • No error when indexing fails.

 • Should be configurable.
Improved way to configure which
pages are indexed
• Previously done with parameter on template definition.

• Two disadvantages:
   o No clear overview of which templates are selected

     for indexing.
   o Not possible to configure how pages with a given

     template are indexed.

• Added template configuration for templates to Website
  Document.

• Without this configuration pages are not indexed.
Improved way to configure which
pages are indexed
Template based boosting modifier
 •   Property on template configuration.

 •   Allows you to favour pages of some type with
     equal score.

 •   Defaults to 1.0 (neutral).
Flexible page type resolving
mechanism
 •   We want all documents to have a page type
     field.
 •   Based on circumstance page type must be
     resolved differently:
      o by path.
      o by template
      o by some external consideration
 •   Introduced PageTypeResolver interface. Can be
     set on Template Configuration.
Flexible page type resolving
mechanism
Search result: page type to css
mapping




  •   Simple mapping of page types to css names.

  •   Css class names used when rendering the search
      result.
Field configuration:
Multi value flag to match solr
document schema.

  •   In Solr schema fields can be multi value or not.

  •   Inserting a document with multiple values for a
      single value field yields an error.

  •   The multi value search field configuration property
      ignores subsequent values for that field.
Field configuration:
Pluggable system for converting field
data to solr document.
  •   Standard values not a problem (String, Number,
      Date, Boolean).

  •   Need more control for special cases: Images,
      Html, categories, ..
Field configuration:
Pluggable system for converting field
data to solr document.
Facets
  • Facets: one of the coolest features in Solr.

  • Added new configuration for facet fields.

  • Maps Solr field names to display field names.

  • New paragraph that shows the facets and re-submits
    the query, narrowing the search.
Facets
Fake facet for period filtering
   • Date facets have fixed time intervals.

   • Code added for configuring a set of date
     ranges.

   • Configuration option still missing.
Fake facet for period filtering
caveats:
  Index time boosting
  • Index Time Boosting is not supported by fields that omit
    norms.

  • The template boosting modifier creates non-standard
    values even for fields with no boosting configuration.

  • Now you have to set 'omitNorms' to 'true' configuring
    those fields, so any boosting is disabled for these fields
Todo:
  Solr Server Configuration
  • The solr server instances are configured
    in the repository.

  • This is not nice when you have different
    servers for test, acceptance, production.

  • Somehow externalize at least part of the
    configuration.
Todo:
  Query time boosting
  • Currently all boosting is index time.

  • It is hard to tweak the boosting
     (reïndexing required).

  • Query time boosting should become an option.

  • Performance?
Todo:
  Facets and period filter
  • Make it part of the facet configuration

  • Probably move the facet configuration out
    of the field configuration.
Todo:
  Indexing on activation
  • Postponed activation and deactivation not
    supported.

  • Indexing should be part of the work flow.

  • That precludes asynchronous indexing.
Ready to share?
  • Create separate module that depends on
    Magnolia Solr module.

  • Remove or generalize some VPRO
    specific stuff:
     o Class and package names.

     o Custom document fields hard coded.

     o Remove obsolete code/features.



  • Documentation
Solr and Image Module Extensions of Magnolia

Contenu connexe

Similaire à Solr and Image Module Extensions of Magnolia

Implementing SharePoint: Site Customization and Branding
Implementing SharePoint: Site Customization and BrandingImplementing SharePoint: Site Customization and Branding
Implementing SharePoint: Site Customization and Branding
drudolph11
 
Extjs3.4 Migration Notes
Extjs3.4 Migration NotesExtjs3.4 Migration Notes
Extjs3.4 Migration Notes
SimoAmi
 

Similaire à Solr and Image Module Extensions of Magnolia (20)

SQL Server 2008 For Developers
SQL Server 2008 For DevelopersSQL Server 2008 For Developers
SQL Server 2008 For Developers
 
Advanced Discussion on Cloud Formation
Advanced Discussion on Cloud FormationAdvanced Discussion on Cloud Formation
Advanced Discussion on Cloud Formation
 
Rdbms
RdbmsRdbms
Rdbms
 
Oracle PIM: Phantasmal Item Descriptions in your Organization
Oracle PIM: Phantasmal Item Descriptions in your OrganizationOracle PIM: Phantasmal Item Descriptions in your Organization
Oracle PIM: Phantasmal Item Descriptions in your Organization
 
Implementing SharePoint: Site Customization and Branding
Implementing SharePoint: Site Customization and BrandingImplementing SharePoint: Site Customization and Branding
Implementing SharePoint: Site Customization and Branding
 
Spring data presentation
Spring data presentationSpring data presentation
Spring data presentation
 
Sap bods Training in Hyderabad | Sap bods Online Training
Sap bods Training in Hyderabad | Sap bods  Online Training Sap bods Training in Hyderabad | Sap bods  Online Training
Sap bods Training in Hyderabad | Sap bods Online Training
 
Sap bods training in hyderabad
Sap bods training in hyderabadSap bods training in hyderabad
Sap bods training in hyderabad
 
SFDC Introduction to Apex
SFDC Introduction to ApexSFDC Introduction to Apex
SFDC Introduction to Apex
 
What's New in Apache Solr 4.10
What's New in Apache Solr 4.10What's New in Apache Solr 4.10
What's New in Apache Solr 4.10
 
Evolutionary database design
Evolutionary database designEvolutionary database design
Evolutionary database design
 
Extjs3.4 Migration Notes
Extjs3.4 Migration NotesExtjs3.4 Migration Notes
Extjs3.4 Migration Notes
 
BrightGen's Salesforce Summer 21 release webinar
BrightGen's Salesforce Summer 21 release webinarBrightGen's Salesforce Summer 21 release webinar
BrightGen's Salesforce Summer 21 release webinar
 
Where to save my data, for devs!
Where to save my data, for devs!Where to save my data, for devs!
Where to save my data, for devs!
 
9 Months Web Development Diploma Course in North Delhi
9 Months Web Development Diploma Course in North Delhi9 Months Web Development Diploma Course in North Delhi
9 Months Web Development Diploma Course in North Delhi
 
Dont Reinvent the Wheel: Tips and Tricks for reuse in ADF
Dont Reinvent the Wheel: Tips and Tricks for reuse in ADFDont Reinvent the Wheel: Tips and Tricks for reuse in ADF
Dont Reinvent the Wheel: Tips and Tricks for reuse in ADF
 
Don't Reinvent the Wheel - Tips and tricks for rease in Oracle ADF - Luc Bors
Don't Reinvent the Wheel - Tips and tricks for rease in Oracle ADF - Luc BorsDon't Reinvent the Wheel - Tips and tricks for rease in Oracle ADF - Luc Bors
Don't Reinvent the Wheel - Tips and tricks for rease in Oracle ADF - Luc Bors
 
Build a modern data platform.pptx
Build a modern data platform.pptxBuild a modern data platform.pptx
Build a modern data platform.pptx
 
Stupid Index Block Tricks
Stupid Index Block TricksStupid Index Block Tricks
Stupid Index Block Tricks
 
Salesforce online training -GoLogica
Salesforce online training -GoLogicaSalesforce online training -GoLogica
Salesforce online training -GoLogica
 

Plus de bkraft

The Open Suite Approach: How to ride the shock waves of a changing web
The Open Suite Approach: How to ride the shock waves of a changing webThe Open Suite Approach: How to ride the shock waves of a changing web
The Open Suite Approach: How to ride the shock waves of a changing web
bkraft
 
Von der statischen Website zur virtuellen Präsenz - Vortrag für Nordwestschwe...
Von der statischen Website zur virtuellen Präsenz - Vortrag für Nordwestschwe...Von der statischen Website zur virtuellen Präsenz - Vortrag für Nordwestschwe...
Von der statischen Website zur virtuellen Präsenz - Vortrag für Nordwestschwe...
bkraft
 
End to end content managed online mobile banking
End to end content managed online mobile bankingEnd to end content managed online mobile banking
End to end content managed online mobile banking
bkraft
 
Yet Another E-Commerce Integration: Magnolia Loves Hybris
Yet Another E-Commerce Integration: Magnolia Loves Hybris Yet Another E-Commerce Integration: Magnolia Loves Hybris
Yet Another E-Commerce Integration: Magnolia Loves Hybris
bkraft
 

Plus de bkraft (20)

The Open Suite Approach: How to ride the shock waves of a changing web
The Open Suite Approach: How to ride the shock waves of a changing webThe Open Suite Approach: How to ride the shock waves of a changing web
The Open Suite Approach: How to ride the shock waves of a changing web
 
Von der statischen Website zur virtuellen Präsenz - Vortrag für Nordwestschwe...
Von der statischen Website zur virtuellen Präsenz - Vortrag für Nordwestschwe...Von der statischen Website zur virtuellen Präsenz - Vortrag für Nordwestschwe...
Von der statischen Website zur virtuellen Präsenz - Vortrag für Nordwestschwe...
 
Magnolia Conference 2013: Keynote
Magnolia Conference 2013: KeynoteMagnolia Conference 2013: Keynote
Magnolia Conference 2013: Keynote
 
Webinar slides: Orchestrate Your Digital Channels with Magnolia 5
Webinar slides: Orchestrate Your Digital Channels with Magnolia 5Webinar slides: Orchestrate Your Digital Channels with Magnolia 5
Webinar slides: Orchestrate Your Digital Channels with Magnolia 5
 
Webinar - Why Magnolia 5 Rocks For IT
Webinar - Why Magnolia 5 Rocks For ITWebinar - Why Magnolia 5 Rocks For IT
Webinar - Why Magnolia 5 Rocks For IT
 
Increase Online Sales with Magnolia CMS' Shop Module
Increase Online Sales with Magnolia CMS' Shop ModuleIncrease Online Sales with Magnolia CMS' Shop Module
Increase Online Sales with Magnolia CMS' Shop Module
 
Virtual Presence Management at Magnolia Amplify Miami 2013
Virtual Presence Management at Magnolia Amplify Miami 2013Virtual Presence Management at Magnolia Amplify Miami 2013
Virtual Presence Management at Magnolia Amplify Miami 2013
 
High performance and scalability
High performance and scalability High performance and scalability
High performance and scalability
 
Multilingual websites, microsites and landing pages
Multilingual websites, microsites and landing pagesMultilingual websites, microsites and landing pages
Multilingual websites, microsites and landing pages
 
Blossom on the web
Blossom on the webBlossom on the web
Blossom on the web
 
Work life balance
Work life balanceWork life balance
Work life balance
 
Magnolia and PHPCR
Magnolia and PHPCRMagnolia and PHPCR
Magnolia and PHPCR
 
End to end content managed online mobile banking
End to end content managed online mobile bankingEnd to end content managed online mobile banking
End to end content managed online mobile banking
 
MBC Group - Magnolia in the Media
MBC Group - Magnolia in the MediaMBC Group - Magnolia in the Media
MBC Group - Magnolia in the Media
 
Yet Another E-Commerce Integration: Magnolia Loves Hybris
Yet Another E-Commerce Integration: Magnolia Loves Hybris Yet Another E-Commerce Integration: Magnolia Loves Hybris
Yet Another E-Commerce Integration: Magnolia Loves Hybris
 
Bridging the Gap: Magnolia Modules and Spring Configured Software
Bridging the Gap: Magnolia Modules and Spring Configured SoftwareBridging the Gap: Magnolia Modules and Spring Configured Software
Bridging the Gap: Magnolia Modules and Spring Configured Software
 
User Management and SSO for Austrian Government
User Management and SSO for Austrian GovernmentUser Management and SSO for Austrian Government
User Management and SSO for Austrian Government
 
Enterprise Extensions to Magnolia's Imaging
Enterprise Extensions to Magnolia's ImagingEnterprise Extensions to Magnolia's Imaging
Enterprise Extensions to Magnolia's Imaging
 
How the STK, CSS & HTML and Rapid Prototyping Accelerate the Design Process
How the STK, CSS & HTML and Rapid Prototyping Accelerate the Design ProcessHow the STK, CSS & HTML and Rapid Prototyping Accelerate the Design Process
How the STK, CSS & HTML and Rapid Prototyping Accelerate the Design Process
 
Migros.ch - Modularizing Magnolia for Switzerland's Largest Retailer
Migros.ch - Modularizing Magnolia for Switzerland's Largest RetailerMigros.ch - Modularizing Magnolia for Switzerland's Largest Retailer
Migros.ch - Modularizing Magnolia for Switzerland's Largest Retailer
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Dernier (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

Solr and Image Module Extensions of Magnolia

  • 2. Feature overview • Multi site support. • Solr Cloud support. • Asynchronous indexing. • Improved way to configure which pages are indexed. • Template based boosting modifier. • Flexible page type resolving mechanism. • Search result: page type to css mapping. • Various solr document field configuration enhancements: o Multi value flag to match solr document schema. o Added pluggable system for converting field data to solr document (Adders). • Facets. • Fake facet for period filtering.
  • 3. Multi site support • Any number of named configurations. • Link a site to a specific configuration. • Admin central solr page updated to trigger deleting all documents of a specific site.
  • 5. Asynchronous indexing • Indexing is not part of the workflow. • Creation of solr document and publication done in java.util.concurrent.ExecutorService. • Faster activation. • No error when indexing fails. • Should be configurable.
  • 6. Improved way to configure which pages are indexed • Previously done with parameter on template definition. • Two disadvantages: o No clear overview of which templates are selected for indexing. o Not possible to configure how pages with a given template are indexed. • Added template configuration for templates to Website Document. • Without this configuration pages are not indexed.
  • 7. Improved way to configure which pages are indexed
  • 8. Template based boosting modifier • Property on template configuration. • Allows you to favour pages of some type with equal score. • Defaults to 1.0 (neutral).
  • 9. Flexible page type resolving mechanism • We want all documents to have a page type field. • Based on circumstance page type must be resolved differently: o by path. o by template o by some external consideration • Introduced PageTypeResolver interface. Can be set on Template Configuration.
  • 10. Flexible page type resolving mechanism
  • 11. Search result: page type to css mapping • Simple mapping of page types to css names. • Css class names used when rendering the search result.
  • 12. Field configuration: Multi value flag to match solr document schema. • In Solr schema fields can be multi value or not. • Inserting a document with multiple values for a single value field yields an error. • The multi value search field configuration property ignores subsequent values for that field.
  • 13. Field configuration: Pluggable system for converting field data to solr document. • Standard values not a problem (String, Number, Date, Boolean). • Need more control for special cases: Images, Html, categories, ..
  • 14. Field configuration: Pluggable system for converting field data to solr document.
  • 15. Facets • Facets: one of the coolest features in Solr. • Added new configuration for facet fields. • Maps Solr field names to display field names. • New paragraph that shows the facets and re-submits the query, narrowing the search.
  • 17. Fake facet for period filtering • Date facets have fixed time intervals. • Code added for configuring a set of date ranges. • Configuration option still missing.
  • 18. Fake facet for period filtering
  • 19. caveats: Index time boosting • Index Time Boosting is not supported by fields that omit norms. • The template boosting modifier creates non-standard values even for fields with no boosting configuration. • Now you have to set 'omitNorms' to 'true' configuring those fields, so any boosting is disabled for these fields
  • 20. Todo: Solr Server Configuration • The solr server instances are configured in the repository. • This is not nice when you have different servers for test, acceptance, production. • Somehow externalize at least part of the configuration.
  • 21. Todo: Query time boosting • Currently all boosting is index time. • It is hard to tweak the boosting (reïndexing required). • Query time boosting should become an option. • Performance?
  • 22. Todo: Facets and period filter • Make it part of the facet configuration • Probably move the facet configuration out of the field configuration.
  • 23. Todo: Indexing on activation • Postponed activation and deactivation not supported. • Indexing should be part of the work flow. • That precludes asynchronous indexing.
  • 24. Ready to share? • Create separate module that depends on Magnolia Solr module. • Remove or generalize some VPRO specific stuff: o Class and package names. o Custom document fields hard coded. o Remove obsolete code/features. • Documentation