SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
Discovery Platforms
TECHNOLOGIES, TOOLS AND ISSUES




                                   Saiful Amin
                    39th Five Laws Lecture (2011)
Evolution of Discovery Tools

 Printed catalogues
 Traditional (Web)OPAC
 Integrated OPAC portals
 Federated search services
 Discovery interfaces
 Web-scale discovery services
 Integrated discovery platform
Printed catalogues

 Author browse
 Title browse
 Series browse
 Call Number browse
 Subject browse
 Shelf list (inventory)
Traditional (Web)OPAC




    (Web)Server Application



         ILS Database
            (Bibs)
Traditional (Web)OPAC

Pros                              Cons
 Keyword search!                  Uses database queries
    Author, title, subject          „LIKE‟ statements
    ISBN/LCCN search                Exact/partial match
    Boolean queries                 Limited use of search
    Proximity search                 algorithm
 Browse index                     No relevance ranking
    Authority headings
                                   Only physical collection
    Title, Call Number
                                    and e-books
 Real-time item status!
    Copies & availability info
 Link to URL (tag 856)
Integrated OPAC Portal




                                                  Enrichment
                                   Web services    Services




          Web Server Application




Website       ILS Database         ILS Database
content          (Bibs)              (Patrons)
Integrated OPAC Portal

Pros                            Cons
 All WebOPAC features           Uses database queries
   Keyword search
                                   „LIKE‟ statements
   Headings browse
                                   Exact/partial match
   Availability info
                                   Limited use of search
 Library website integration
                                    algorithms
 Patron empowerment
   Circ/Account details         No relevance ranking
   Online renewal               Still limited to only physical
   Online hold placement
                                  collection & e-books
   SDI services
   New arrivals

 OPAC enrichment
   Book cover/reviews

 Thesaurus integration
Federated Search Service

360 Search                                                          dbWiz



Research Pro                                                        Pazpar2

                                                          Full-text links



                             Web Server Application




 Library    Digital                           Science
 Catalog   Repository
                        ProQuest   EBSCO
                                               Direct
                                                        PubMed      …       Emerald
Federated Search Service
 Muse Content Architecture                       Supports 6300+ databases!




http://www.museglobal.com/technology/contentIntegration.html
Federated Search Service

Pros                           Cons
 Single search broadcast       Not all databases are
 Real-time search results       standards compliant
 Based on standards                Requires custom search scripts
   Z39.50, SRU/W
                                    Requires metadata crosswalk
   MARC, ISO2709, XML

 Supports large set of         Network intensive
  databases                         Performance issues
     7000+ in “360 Search”
                                Mostly available as hosted
     6300+ in Muse platform
 Merging and sorting            service
 No local index                    Annual subscription
  (maintenance free!)
Discovery Interface




                                                                            Enrichment
                                                        Web services         Services




                         Web Server Application

Full-text link                                                 Availability/Holds



   Digital                     Central Index
  Repository                   (Solr/Lucene)                     ILS Database
                 DC XML data                   MARC Bib data
Discovery Interface
 Word stemming                      Phrase query
    „fishing‟, „fished‟, „fish‟,       „Did you mean?‟
     „fisher‟ => „fish‟
                                        Spell Checker
 Fuzzy search
    insertion: cot  coat           Relevance ranking
    deletion: coat  cot               TF-IDF / Term Vector
    substitution: coat  cost          Term weights
 Auto-suggest                          Lucene scores
    N-gram, Edge N-gram             Faceted browsing
     analysis                           Who are main authors and
                                         their count?
                                        What are main subjects and
                                         their count?
Discovery Interface

Pros                               Cons
 Google-like search box            Searches only locally hosted
 Advanced features                  collections
   Fuzzy searching
   Relevance ranking
   Word stemming algorithms
   Social tagging/reviews
   “Did you mean?” feature
   Auto-suggest (type ahead)
   Faceted browsing

 Availability/Hold requests
 Metadata enrichment
 Linking
   Amazon/Google/Wikipedia

 Digital repository integration
Can we combine the two?

 Modern discovery interface



     Local collections +
     Remote databases


    Unified search result
Web-scale Discovery Services



  EBSCO



 ProQuest



ABI Inform
                                          Web Server Application
 PubMed
                                                                        Availability
                                                                        Full-text link
 Science
  Direct                                                             Library

   …                                    Central Index
                                                        MARC data    Catalog

               Full-text and metadata
                                                                     Digital
Lexis-Nexis                                                         Repository
                                                         DC data
Web-scale Discovery Services
Web-scale Discovery Services
Summon Service




              Content types include:
  Library catalog records      Conference proceedings
  E-journal articles           Grey literature
  Institutional repositories   Cited references
  Newspaper articles           Reports
  E-books                      Digital library
  Dissertations                Databases and more.
Web-scale Discovery Services

Pros                              Cons
 Google-like single search box    Supports limited number of
 Pre-indexed licensed content      databases (1000-1500)
 Inclusion of local collection        Requires huge investment to
   OAI-PMH, MARC updates               maintain centralized index
 Advanced features                    Publisher partnerships
   Relevance ranking                   (Licensing/legal issues)
   “Did you mean?”                    Regular pre-publication indexing
   Auto-suggest (type ahead)
                                   Mostly hosted-only service
   Faceted navigation
                                       Content bias? (ranking)
 Availability/Full-text links
                                       Vendor lock-in?
 Mobile friendly
 Web-service APIs                 Annual subscription
 Easier off-campus access
 No installation/maintenance
Can we have best of both worlds?

Modern discovery interface                   Supports large number of
                                              databases

    Local collections +                      Based on open standards
    Remote databases                          (extensible)

                                             Can be maintained locally
   Unified search result                      (No subscription!)


                       Web Server Application


                               Remote          Remote     Remote     Remote
   Digital      ILS database   database        database   database   database
  Repository       (Bibs)
                               Remote          Remote     Remote     Remote
                               database        database   database   database
Integrated Discovery Platform
                                             Semi-commercial
                                     Supports 1000+ databases




http://www.indexdata.com/masterkey
Integrated Discovery Platform
    Pazpar2 Architecture                   Open source (GPL)
                                    Build your own connector!




https://www.indexdata.com/pazpar2
Conclusion

 Each platform has its own goals:
     Pure library catalog can provide expressive search (high precision)
     Federated search improves content coverage in single search
     Discovery interfaces are designed to improve user experience for
      local collections
     Web-scale discovery provides unified search experience for local and
      remote collections (still way short in content coverage)
     Integrated platform provides extensibility (but requires significant
      effort in development and maintenance)
 One size does not fit all. No single system is perfect.
 As content becomes more open, the focus of discovery
  solutions should be on open platforms that are extensible
  as well as affordable.
Questions and Discussions

Contenu connexe

Tendances

Web scale discovery vs google scholar
Web scale discovery vs google scholarWeb scale discovery vs google scholar
Web scale discovery vs google scholarNikesh Narayanan
 
Federated to library discovery platfoms
Federated to library discovery platfomsFederated to library discovery platfoms
Federated to library discovery platfomsNikesh Narayanan
 
Current and emerging trends in library services
Current and emerging trends in library servicesCurrent and emerging trends in library services
Current and emerging trends in library servicesNikesh Narayanan
 
Federated Search: The Good, The Bad And The Ugly
Federated Search: The Good, The Bad And The UglyFederated Search: The Good, The Bad And The Ugly
Federated Search: The Good, The Bad And The Uglydorishelfer
 
Role of libraries in accelerating research
Role of libraries in accelerating researchRole of libraries in accelerating research
Role of libraries in accelerating researchNikesh Narayanan
 
Federated Search Falls Short
Federated Search Falls ShortFederated Search Falls Short
Federated Search Falls Shortslknight
 
Federated Search in a Disparate Environment
Federated Search in a Disparate EnvironmentFederated Search in a Disparate Environment
Federated Search in a Disparate EnvironmentHelen Mitchell
 
Erl10 web scale-gb-sg
Erl10 web scale-gb-sgErl10 web scale-gb-sg
Erl10 web scale-gb-sgGeorge Boston
 
Customization For Libraries
Customization For LibrariesCustomization For Libraries
Customization For LibrariesGlenda Barahona
 
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...UKSG: connecting the knowledge community
 
Key developments in electronic delivery in LIS 2005-2008
Key developments in electronic delivery in LIS 2005-2008Key developments in electronic delivery in LIS 2005-2008
Key developments in electronic delivery in LIS 2005-2008Catherine Ebenezer
 
Preprocessing of Web Log Data for Web Usage Mining
Preprocessing of Web Log Data for Web Usage MiningPreprocessing of Web Log Data for Web Usage Mining
Preprocessing of Web Log Data for Web Usage MiningAmir Masoud Sefidian
 

Tendances (19)

NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...
NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...
NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...
 
Web scale discovery vs google scholar
Web scale discovery vs google scholarWeb scale discovery vs google scholar
Web scale discovery vs google scholar
 
Federated to library discovery platfoms
Federated to library discovery platfomsFederated to library discovery platfoms
Federated to library discovery platfoms
 
Current and emerging trends in library services
Current and emerging trends in library servicesCurrent and emerging trends in library services
Current and emerging trends in library services
 
Federated Search: The Good, The Bad And The Ugly
Federated Search: The Good, The Bad And The UglyFederated Search: The Good, The Bad And The Ugly
Federated Search: The Good, The Bad And The Ugly
 
Role of libraries in accelerating research
Role of libraries in accelerating researchRole of libraries in accelerating research
Role of libraries in accelerating research
 
Presentation federated search
Presentation federated searchPresentation federated search
Presentation federated search
 
20140220_Cooperation, Cloud, and Consumer Technologies
20140220_Cooperation, Cloud, and Consumer Technologies20140220_Cooperation, Cloud, and Consumer Technologies
20140220_Cooperation, Cloud, and Consumer Technologies
 
Federated Search Falls Short
Federated Search Falls ShortFederated Search Falls Short
Federated Search Falls Short
 
Federated Search in a Disparate Environment
Federated Search in a Disparate EnvironmentFederated Search in a Disparate Environment
Federated Search in a Disparate Environment
 
Ltr1
Ltr1Ltr1
Ltr1
 
Erl10 web scale-gb-sg
Erl10 web scale-gb-sgErl10 web scale-gb-sg
Erl10 web scale-gb-sg
 
3 - Discovery-systems
3  - Discovery-systems3  - Discovery-systems
3 - Discovery-systems
 
Customization For Libraries
Customization For LibrariesCustomization For Libraries
Customization For Libraries
 
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
 
Session5
Session5Session5
Session5
 
Key developments in electronic delivery in LIS 2005-2008
Key developments in electronic delivery in LIS 2005-2008Key developments in electronic delivery in LIS 2005-2008
Key developments in electronic delivery in LIS 2005-2008
 
Preprocessing of Web Log Data for Web Usage Mining
Preprocessing of Web Log Data for Web Usage MiningPreprocessing of Web Log Data for Web Usage Mining
Preprocessing of Web Log Data for Web Usage Mining
 
Electronic resource management system (ERM)
Electronic resource management system (ERM)Electronic resource management system (ERM)
Electronic resource management system (ERM)
 

Similaire à Discovery platforms: Technology, tools and issues

A University Library's Service Transformed Through WorldCat Local - Mazmin Ma...
A University Library's Service Transformed Through WorldCat Local - Mazmin Ma...A University Library's Service Transformed Through WorldCat Local - Mazmin Ma...
A University Library's Service Transformed Through WorldCat Local - Mazmin Ma...tulipbiru64
 
Connect Your Resources, Save Time, Save Money:: Connecting library electron...
Connect Your Resources, Save Time, Save Money:: Connecting library  electron...Connect Your Resources, Save Time, Save Money:: Connecting library  electron...
Connect Your Resources, Save Time, Save Money:: Connecting library electron...Richard Bernier
 
EPC Group - Comparing SharePoint 2010 Versions and Functionallity - SharePoin...
EPC Group - Comparing SharePoint 2010 Versions and Functionallity - SharePoin...EPC Group - Comparing SharePoint 2010 Versions and Functionallity - SharePoin...
EPC Group - Comparing SharePoint 2010 Versions and Functionallity - SharePoin...EPC Group
 
Standards for Semantic Mashups
Standards for Semantic MashupsStandards for Semantic Mashups
Standards for Semantic MashupsLaurent Lefort
 
Knowledge Base+: a Cloud-Based Community Knowledge Base
Knowledge Base+: a Cloud-Based Community Knowledge BaseKnowledge Base+: a Cloud-Based Community Knowledge Base
Knowledge Base+: a Cloud-Based Community Knowledge Basesherif user group
 
2015 02 19 platforms and discovery
2015 02 19 platforms and discovery2015 02 19 platforms and discovery
2015 02 19 platforms and discoveryStephen Abram
 
Ltr 1 Powerpoint
Ltr 1 PowerpointLtr 1 Powerpoint
Ltr 1 PowerpointLeonsagara
 
Ltr 1 Powerpoint
Ltr 1 PowerpointLtr 1 Powerpoint
Ltr 1 Powerpointryanoceros
 
Webinar: Semantic web for developers
Webinar: Semantic web for developersWebinar: Semantic web for developers
Webinar: Semantic web for developersSemantic Web Company
 
KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101Chris McNulty
 
Hello SharePoint 2007!!!
Hello SharePoint 2007!!!Hello SharePoint 2007!!!
Hello SharePoint 2007!!!Marwan Tarek
 
Ltr 1 Powerpoint
Ltr 1 PowerpointLtr 1 Powerpoint
Ltr 1 PowerpointMary Chu
 
Future Of Metadata –
Future Of Metadata –Future Of Metadata –
Future Of Metadata –Jill Strass
 
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC GroupDriving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC GroupEPC Group
 
Semantic Search Tutorial at SemTech 2012
Semantic Search Tutorial at SemTech 2012 Semantic Search Tutorial at SemTech 2012
Semantic Search Tutorial at SemTech 2012 Thanh Tran
 
Making your it skills virtual
Making your it skills virtualMaking your it skills virtual
Making your it skills virtualErik Mitchell
 

Similaire à Discovery platforms: Technology, tools and issues (20)

1530 mon lomond breeding
1530 mon lomond breeding1530 mon lomond breeding
1530 mon lomond breeding
 
A University Library's Service Transformed Through WorldCat Local - Mazmin Ma...
A University Library's Service Transformed Through WorldCat Local - Mazmin Ma...A University Library's Service Transformed Through WorldCat Local - Mazmin Ma...
A University Library's Service Transformed Through WorldCat Local - Mazmin Ma...
 
Connect Your Resources, Save Time, Save Money:: Connecting library electron...
Connect Your Resources, Save Time, Save Money:: Connecting library  electron...Connect Your Resources, Save Time, Save Money:: Connecting library  electron...
Connect Your Resources, Save Time, Save Money:: Connecting library electron...
 
EPC Group - Comparing SharePoint 2010 Versions and Functionallity - SharePoin...
EPC Group - Comparing SharePoint 2010 Versions and Functionallity - SharePoin...EPC Group - Comparing SharePoint 2010 Versions and Functionallity - SharePoin...
EPC Group - Comparing SharePoint 2010 Versions and Functionallity - SharePoin...
 
Standards for Semantic Mashups
Standards for Semantic MashupsStandards for Semantic Mashups
Standards for Semantic Mashups
 
Knowledge Base+: a Cloud-Based Community Knowledge Base
Knowledge Base+: a Cloud-Based Community Knowledge BaseKnowledge Base+: a Cloud-Based Community Knowledge Base
Knowledge Base+: a Cloud-Based Community Knowledge Base
 
2015 02 19 platforms and discovery
2015 02 19 platforms and discovery2015 02 19 platforms and discovery
2015 02 19 platforms and discovery
 
e-library management system
e-library management systeme-library management system
e-library management system
 
Ltr 1 Powerpoint
Ltr 1 PowerpointLtr 1 Powerpoint
Ltr 1 Powerpoint
 
Ltr 1 Powerpoint
Ltr 1 PowerpointLtr 1 Powerpoint
Ltr 1 Powerpoint
 
Webinar: Semantic web for developers
Webinar: Semantic web for developersWebinar: Semantic web for developers
Webinar: Semantic web for developers
 
KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101
 
Hello SharePoint 2007!!!
Hello SharePoint 2007!!!Hello SharePoint 2007!!!
Hello SharePoint 2007!!!
 
Ltr 1 Powerpoint
Ltr 1 PowerpointLtr 1 Powerpoint
Ltr 1 Powerpoint
 
Future Of Metadata –
Future Of Metadata –Future Of Metadata –
Future Of Metadata –
 
Saadallah vtls
Saadallah vtlsSaadallah vtls
Saadallah vtls
 
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC GroupDriving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
 
Semantic Search Tutorial at SemTech 2012
Semantic Search Tutorial at SemTech 2012 Semantic Search Tutorial at SemTech 2012
Semantic Search Tutorial at SemTech 2012
 
Making your it skills virtual
Making your it skills virtualMaking your it skills virtual
Making your it skills virtual
 
SIL rapid capture
SIL rapid captureSIL rapid capture
SIL rapid capture
 

Dernier

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 

Dernier (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 

Discovery platforms: Technology, tools and issues

  • 1. Discovery Platforms TECHNOLOGIES, TOOLS AND ISSUES Saiful Amin 39th Five Laws Lecture (2011)
  • 2. Evolution of Discovery Tools  Printed catalogues  Traditional (Web)OPAC  Integrated OPAC portals  Federated search services  Discovery interfaces  Web-scale discovery services  Integrated discovery platform
  • 3. Printed catalogues  Author browse  Title browse  Series browse  Call Number browse  Subject browse  Shelf list (inventory)
  • 4. Traditional (Web)OPAC (Web)Server Application ILS Database (Bibs)
  • 5.
  • 6. Traditional (Web)OPAC Pros Cons  Keyword search!  Uses database queries  Author, title, subject  „LIKE‟ statements  ISBN/LCCN search  Exact/partial match  Boolean queries  Limited use of search  Proximity search algorithm  Browse index  No relevance ranking  Authority headings  Only physical collection  Title, Call Number and e-books  Real-time item status!  Copies & availability info  Link to URL (tag 856)
  • 7. Integrated OPAC Portal Enrichment Web services Services Web Server Application Website ILS Database ILS Database content (Bibs) (Patrons)
  • 8.
  • 9. Integrated OPAC Portal Pros Cons  All WebOPAC features  Uses database queries  Keyword search  „LIKE‟ statements  Headings browse  Exact/partial match  Availability info  Limited use of search  Library website integration algorithms  Patron empowerment  Circ/Account details  No relevance ranking  Online renewal  Still limited to only physical  Online hold placement collection & e-books  SDI services  New arrivals  OPAC enrichment  Book cover/reviews  Thesaurus integration
  • 10. Federated Search Service 360 Search dbWiz Research Pro Pazpar2 Full-text links Web Server Application Library Digital Science Catalog Repository ProQuest EBSCO Direct PubMed … Emerald
  • 11. Federated Search Service Muse Content Architecture Supports 6300+ databases! http://www.museglobal.com/technology/contentIntegration.html
  • 12.
  • 13. Federated Search Service Pros Cons  Single search broadcast  Not all databases are  Real-time search results standards compliant  Based on standards  Requires custom search scripts  Z39.50, SRU/W  Requires metadata crosswalk  MARC, ISO2709, XML  Supports large set of  Network intensive databases  Performance issues  7000+ in “360 Search”  Mostly available as hosted  6300+ in Muse platform  Merging and sorting service  No local index  Annual subscription (maintenance free!)
  • 14.
  • 15. Discovery Interface Enrichment Web services Services Web Server Application Full-text link Availability/Holds Digital Central Index Repository (Solr/Lucene) ILS Database DC XML data MARC Bib data
  • 16.
  • 17. Discovery Interface  Word stemming  Phrase query  „fishing‟, „fished‟, „fish‟,  „Did you mean?‟ „fisher‟ => „fish‟  Spell Checker  Fuzzy search  insertion: cot  coat  Relevance ranking  deletion: coat  cot  TF-IDF / Term Vector  substitution: coat  cost  Term weights  Auto-suggest  Lucene scores  N-gram, Edge N-gram  Faceted browsing analysis  Who are main authors and their count?  What are main subjects and their count?
  • 18. Discovery Interface Pros Cons  Google-like search box  Searches only locally hosted  Advanced features collections  Fuzzy searching  Relevance ranking  Word stemming algorithms  Social tagging/reviews  “Did you mean?” feature  Auto-suggest (type ahead)  Faceted browsing  Availability/Hold requests  Metadata enrichment  Linking  Amazon/Google/Wikipedia  Digital repository integration
  • 19. Can we combine the two? Modern discovery interface Local collections + Remote databases Unified search result
  • 20. Web-scale Discovery Services EBSCO ProQuest ABI Inform Web Server Application PubMed Availability Full-text link Science Direct Library … Central Index MARC data Catalog Full-text and metadata Digital Lexis-Nexis Repository DC data
  • 22. Web-scale Discovery Services Summon Service Content types include: Library catalog records Conference proceedings E-journal articles Grey literature Institutional repositories Cited references Newspaper articles Reports E-books Digital library Dissertations Databases and more.
  • 23.
  • 24. Web-scale Discovery Services Pros Cons  Google-like single search box  Supports limited number of  Pre-indexed licensed content databases (1000-1500)  Inclusion of local collection  Requires huge investment to  OAI-PMH, MARC updates maintain centralized index  Advanced features  Publisher partnerships  Relevance ranking (Licensing/legal issues)  “Did you mean?”  Regular pre-publication indexing  Auto-suggest (type ahead)  Mostly hosted-only service  Faceted navigation  Content bias? (ranking)  Availability/Full-text links  Vendor lock-in?  Mobile friendly  Web-service APIs  Annual subscription  Easier off-campus access  No installation/maintenance
  • 25. Can we have best of both worlds? Modern discovery interface  Supports large number of databases Local collections +  Based on open standards Remote databases (extensible)  Can be maintained locally Unified search result (No subscription!) Web Server Application Remote Remote Remote Remote Digital ILS database database database database database Repository (Bibs) Remote Remote Remote Remote database database database database
  • 26. Integrated Discovery Platform Semi-commercial Supports 1000+ databases http://www.indexdata.com/masterkey
  • 27.
  • 28. Integrated Discovery Platform Pazpar2 Architecture Open source (GPL) Build your own connector! https://www.indexdata.com/pazpar2
  • 29. Conclusion  Each platform has its own goals:  Pure library catalog can provide expressive search (high precision)  Federated search improves content coverage in single search  Discovery interfaces are designed to improve user experience for local collections  Web-scale discovery provides unified search experience for local and remote collections (still way short in content coverage)  Integrated platform provides extensibility (but requires significant effort in development and maintenance)  One size does not fit all. No single system is perfect.  As content becomes more open, the focus of discovery solutions should be on open platforms that are extensible as well as affordable.