SlideShare une entreprise Scribd logo
1  sur  15
Project Gutenberg as an
Information Retrieval System
Kai Li
IST616 Final Assignment
2012.11
Introduction to Project Gutenberg
• The first digital library project in the
world, initiated by the late Michael Hart in
1971.
• Project Gutenberg currently offers more than
41,000 public domain eBooks (in more than
50 languages) as well as other resources (like
scientific data).
• Website: http://www.gutenberg.org/
Intended Audience and Functionalities
• Intended audience: eBook readers and general
users.
• Functionalities: portal of the project, eBook
repository and discovery system.
Mobile Site
• There are two kinds of
interfaces of this
website based on the
device one uses. Only
the traditional nonmobile interface will be
examined in this
presentation due to the
limited scope of the
assignment.
Indexing System
Issues of Indexing/Tag System
• There is a searching box as well as a tag called
“Search Catalog”;
– The searching box is too small to be noticed;
– The tag “Search Catalog” actually leads users to a
page where one cannot find the searching box,
but only some browsing selections;

• There are a number of repetitive tags on the
left-hand bar and on the top of the page;
– For example, the tag “Book Categories”.
Means To Find a Book
• Searching
• Browsing
– By categories
Searching
Issues of Searching
• The display is different from most of the
interfaces one can see on the Internet, which
may result some difficulties for new users;
• Due to a lack of navigation mechanism and
the function to refine the result by facets, it’s
extremely inconvenient to locate a resource if
the result is big.
Precision and Recall
• The retrieval method used by this website is a
string-matching method, which matches the
string inputted by the user with the full-text of all
the resources.
– “Or” relationship used for multiple words.

• Because the scope of the index is the full-text, the
recall is higher than traditional library catalogs;
however, since it is still a string-matching
method, the precision is still not very good.
Browsing
Issues of Browsing
• There are three searching tools offered on this
page, which should have been offered on the
searching page rather than this one.
• Only one standard can be used to limit the
resources at the same time. And after one
chooses a certain standard, there is no other
way to further limit the result.
Categories/Classification
• There are two tiers of the “classification” on
this website:
– Subcategories: 23
• These subcategories are called “bookshelf” too, which
is confusing.

– Bookshelves: 133
• Which can be seen as a lower level than subcategories.
However, not all bookshelves are linked to a given
subcategory.
Overall Evaluation
• Advantages:
– Mobile functionalities:
• Mobile site
• QR codes

• Disadvantages:
– Poorly organized and
designed;
– Failing to display the full
richness of the metadata
on the website:
• LoC classification and
subject headings

– The interface being lack
of communication with
the users;
Thanks!

Contenu connexe

Tendances

Anglo-American Cataloguing Rules AACR 2 ppt
Anglo-American Cataloguing Rules AACR 2 pptAnglo-American Cataloguing Rules AACR 2 ppt
Anglo-American Cataloguing Rules AACR 2 pptUniversity of Delhi
 
Subject analysis, subject heading principles
Subject analysis, subject heading principlesSubject analysis, subject heading principles
Subject analysis, subject heading principlesRichard.Sapon-White
 
Web 2.0 in Libraries
Web 2.0 in LibrariesWeb 2.0 in Libraries
Web 2.0 in LibrariesAnupama Saini
 
RESOURCE SHARING: A LIBRARY PERCEPTIVE
RESOURCE SHARING: A LIBRARY PERCEPTIVE RESOURCE SHARING: A LIBRARY PERCEPTIVE
RESOURCE SHARING: A LIBRARY PERCEPTIVE IAEME Publication
 
Ict uses in libraries
Ict uses in librariesIct uses in libraries
Ict uses in librariesLiaquat Rahoo
 
Chain indexing
Chain indexingChain indexing
Chain indexingsilambu111
 
Trabajo final historia de las bibliotecas...
Trabajo final historia de las bibliotecas...Trabajo final historia de las bibliotecas...
Trabajo final historia de las bibliotecas...Julian Valencia
 
An an overview of selection acquisition, and usage of e resources
An an overview of selection acquisition, and usage of e resourcesAn an overview of selection acquisition, and usage of e resources
An an overview of selection acquisition, and usage of e resourcesEKITI STATE UNIVERSITY LIBRARY
 
How ict used in libraries
How ict used in librariesHow ict used in libraries
How ict used in librariesjanjangammod
 
Brodt - Plan de Desarrollo de Colecciones
Brodt - Plan de Desarrollo de ColeccionesBrodt - Plan de Desarrollo de Colecciones
Brodt - Plan de Desarrollo de ColeccionesRomina Brodt
 
Canon of classification
Canon of classificationCanon of classification
Canon of classificationavid
 
Proposal otomatisasi perpustakaan
Proposal  otomatisasi perpustakaanProposal  otomatisasi perpustakaan
Proposal otomatisasi perpustakaanJusuf Nursjamsu
 
Academic libraries in new normal
Academic libraries in new normalAcademic libraries in new normal
Academic libraries in new normalDr Trivedi
 
Historia de las bibliotecas
Historia de las bibliotecasHistoria de las bibliotecas
Historia de las bibliotecasNohelia Ríos
 

Tendances (20)

Anglo-American Cataloguing Rules AACR 2 ppt
Anglo-American Cataloguing Rules AACR 2 pptAnglo-American Cataloguing Rules AACR 2 ppt
Anglo-American Cataloguing Rules AACR 2 ppt
 
Z39.50 basics
Z39.50 basicsZ39.50 basics
Z39.50 basics
 
koha PPT 23822.pptx
koha PPT 23822.pptxkoha PPT 23822.pptx
koha PPT 23822.pptx
 
Subject analysis, subject heading principles
Subject analysis, subject heading principlesSubject analysis, subject heading principles
Subject analysis, subject heading principles
 
Koha Cataloguing Module
Koha Cataloguing ModuleKoha Cataloguing Module
Koha Cataloguing Module
 
Viniti
VinitiViniti
Viniti
 
Web 2.0 in Libraries
Web 2.0 in LibrariesWeb 2.0 in Libraries
Web 2.0 in Libraries
 
RESOURCE SHARING: A LIBRARY PERCEPTIVE
RESOURCE SHARING: A LIBRARY PERCEPTIVE RESOURCE SHARING: A LIBRARY PERCEPTIVE
RESOURCE SHARING: A LIBRARY PERCEPTIVE
 
Ict uses in libraries
Ict uses in librariesIct uses in libraries
Ict uses in libraries
 
Classified Catalogue Code (ccc)
Classified Catalogue Code (ccc)Classified Catalogue Code (ccc)
Classified Catalogue Code (ccc)
 
Chain indexing
Chain indexingChain indexing
Chain indexing
 
SQL Reports in Koha
SQL Reports in KohaSQL Reports in Koha
SQL Reports in Koha
 
Trabajo final historia de las bibliotecas...
Trabajo final historia de las bibliotecas...Trabajo final historia de las bibliotecas...
Trabajo final historia de las bibliotecas...
 
An an overview of selection acquisition, and usage of e resources
An an overview of selection acquisition, and usage of e resourcesAn an overview of selection acquisition, and usage of e resources
An an overview of selection acquisition, and usage of e resources
 
How ict used in libraries
How ict used in librariesHow ict used in libraries
How ict used in libraries
 
Brodt - Plan de Desarrollo de Colecciones
Brodt - Plan de Desarrollo de ColeccionesBrodt - Plan de Desarrollo de Colecciones
Brodt - Plan de Desarrollo de Colecciones
 
Canon of classification
Canon of classificationCanon of classification
Canon of classification
 
Proposal otomatisasi perpustakaan
Proposal  otomatisasi perpustakaanProposal  otomatisasi perpustakaan
Proposal otomatisasi perpustakaan
 
Academic libraries in new normal
Academic libraries in new normalAcademic libraries in new normal
Academic libraries in new normal
 
Historia de las bibliotecas
Historia de las bibliotecasHistoria de las bibliotecas
Historia de las bibliotecas
 

Similaire à Project Gutenberg as Information Retrieval System

Lost in Translation:
Lost in Translation: Lost in Translation:
Lost in Translation: tmnewberry
 
What Public Library Users Want and How to
What Public Library Users Want and How to What Public Library Users Want and How to
What Public Library Users Want and How to Nina McHale
 
K3 edith falk_discoverytoolslibrary
K3 edith falk_discoverytoolslibraryK3 edith falk_discoverytoolslibrary
K3 edith falk_discoverytoolslibraryevaminerva
 
K3 edith falk_discoverytoolslibrary
K3 edith falk_discoverytoolslibraryK3 edith falk_discoverytoolslibrary
K3 edith falk_discoverytoolslibraryevaminerva
 
Web-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationWeb-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationRachel Vacek
 
Federated to library discovery platfoms
Federated to library discovery platfomsFederated to library discovery platfoms
Federated to library discovery platfomsNikesh Narayanan
 
WorldCat Local@Auraria
WorldCat Local@AurariaWorldCat Local@Auraria
WorldCat Local@AurariaNina McHale
 
Presentacion tics (1)
Presentacion tics (1)Presentacion tics (1)
Presentacion tics (1)87895
 
Discovery on a budget
Discovery on a budgetDiscovery on a budget
Discovery on a budgetChris Bulock
 
Discovery on a budget: Improved searching without a Web-scale discovery product
Discovery on a budget: Improved searching without a Web-scale discovery productDiscovery on a budget: Improved searching without a Web-scale discovery product
Discovery on a budget: Improved searching without a Web-scale discovery productNASIG
 
Rethinking Library Cooperatives: Prepared for the Program for Cooperative Cat...
Rethinking Library Cooperatives: Prepared for the Program for Cooperative Cat...Rethinking Library Cooperatives: Prepared for the Program for Cooperative Cat...
Rethinking Library Cooperatives: Prepared for the Program for Cooperative Cat...Karen S Calhoun
 
Web Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experienceWeb Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experienceNikesh Narayanan
 
Device agnostic discovery using drupal and bibliocommons
Device agnostic discovery using drupal and bibliocommonsDevice agnostic discovery using drupal and bibliocommons
Device agnostic discovery using drupal and bibliocommonsonlinenw
 
Creating better user interfaces for libraries catalogues: how to present and ...
Creating better user interfaces for libraries catalogues: how to present and ...Creating better user interfaces for libraries catalogues: how to present and ...
Creating better user interfaces for libraries catalogues: how to present and ...Tanja Merčun
 
Role of libraries in research and scholarly communication
Role of libraries in research and scholarly communicationRole of libraries in research and scholarly communication
Role of libraries in research and scholarly communicationNikesh Narayanan
 
Use of "NewGenLib" Open Source Software for Library Automation, Digital Libra...
Use of "NewGenLib" Open Source Software for Library Automation, Digital Libra...Use of "NewGenLib" Open Source Software for Library Automation, Digital Libra...
Use of "NewGenLib" Open Source Software for Library Automation, Digital Libra...Emmanuel E C
 

Similaire à Project Gutenberg as Information Retrieval System (20)

Lost in Translation:
Lost in Translation: Lost in Translation:
Lost in Translation:
 
Leveraging Library Thing (2009)
Leveraging Library Thing (2009)Leveraging Library Thing (2009)
Leveraging Library Thing (2009)
 
What Public Library Users Want and How to
What Public Library Users Want and How to What Public Library Users Want and How to
What Public Library Users Want and How to
 
K3 edith falk_discoverytoolslibrary
K3 edith falk_discoverytoolslibraryK3 edith falk_discoverytoolslibrary
K3 edith falk_discoverytoolslibrary
 
K3 edith falk_discoverytoolslibrary
K3 edith falk_discoverytoolslibraryK3 edith falk_discoverytoolslibrary
K3 edith falk_discoverytoolslibrary
 
Web-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationWeb-Scale Discovery: Post Implementation
Web-Scale Discovery: Post Implementation
 
web opac
 web opac  web opac
web opac
 
Federated to library discovery platfoms
Federated to library discovery platfomsFederated to library discovery platfoms
Federated to library discovery platfoms
 
WorldCat Local@Auraria
WorldCat Local@AurariaWorldCat Local@Auraria
WorldCat Local@Auraria
 
Presentacion tics (1)
Presentacion tics (1)Presentacion tics (1)
Presentacion tics (1)
 
Discovery on a budget
Discovery on a budgetDiscovery on a budget
Discovery on a budget
 
Discovery on a budget: Improved searching without a Web-scale discovery product
Discovery on a budget: Improved searching without a Web-scale discovery productDiscovery on a budget: Improved searching without a Web-scale discovery product
Discovery on a budget: Improved searching without a Web-scale discovery product
 
Rethinking Library Cooperatives: Prepared for the Program for Cooperative Cat...
Rethinking Library Cooperatives: Prepared for the Program for Cooperative Cat...Rethinking Library Cooperatives: Prepared for the Program for Cooperative Cat...
Rethinking Library Cooperatives: Prepared for the Program for Cooperative Cat...
 
Library portal by Gaurav Boudh
Library portal by Gaurav BoudhLibrary portal by Gaurav Boudh
Library portal by Gaurav Boudh
 
Web Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experienceWeb Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experience
 
Device agnostic discovery using drupal and bibliocommons
Device agnostic discovery using drupal and bibliocommonsDevice agnostic discovery using drupal and bibliocommons
Device agnostic discovery using drupal and bibliocommons
 
Creating better user interfaces for libraries catalogues: how to present and ...
Creating better user interfaces for libraries catalogues: how to present and ...Creating better user interfaces for libraries catalogues: how to present and ...
Creating better user interfaces for libraries catalogues: how to present and ...
 
Role of libraries in research and scholarly communication
Role of libraries in research and scholarly communicationRole of libraries in research and scholarly communication
Role of libraries in research and scholarly communication
 
opacs.ppt
opacs.pptopacs.ppt
opacs.ppt
 
Use of "NewGenLib" Open Source Software for Library Automation, Digital Libra...
Use of "NewGenLib" Open Source Software for Library Automation, Digital Libra...Use of "NewGenLib" Open Source Software for Library Automation, Digital Libra...
Use of "NewGenLib" Open Source Software for Library Automation, Digital Libra...
 

Plus de Kai Li

Using a keyword extraction pipeline to understand concepts in future work sec...
Using a keyword extraction pipeline to understand concepts in future work sec...Using a keyword extraction pipeline to understand concepts in future work sec...
Using a keyword extraction pipeline to understand concepts in future work sec...Kai Li
 
Knowledge production between laboratories and scientific texts: a proposal of...
Knowledge production between laboratories and scientific texts: a proposal of...Knowledge production between laboratories and scientific texts: a proposal of...
Knowledge production between laboratories and scientific texts: a proposal of...Kai Li
 
Data and Software in Scientific Activities: a Literature Review
Data and Software in Scientific Activities: a Literature ReviewData and Software in Scientific Activities: a Literature Review
Data and Software in Scientific Activities: a Literature ReviewKai Li
 
A metadata scheme of the software-data relationship: A proposal
A metadata scheme of the software-data relationship: A proposalA metadata scheme of the software-data relationship: A proposal
A metadata scheme of the software-data relationship: A proposalKai Li
 
Software Citation, Reuse and Metadata Considerations: An Exploratory Study ...
Software Citation, Reuse and Metadata Considerations:  An Exploratory Study ...Software Citation, Reuse and Metadata Considerations:  An Exploratory Study ...
Software Citation, Reuse and Metadata Considerations: An Exploratory Study ...Kai Li
 
On metaphor: a book review of Metaphors we live by
On metaphor: a book review of Metaphors we live byOn metaphor: a book review of Metaphors we live by
On metaphor: a book review of Metaphors we live byKai Li
 
Visual perception and mixed-initiative interaction for assisted visualization...
Visual perception and mixed-initiative interaction for assisted visualization...Visual perception and mixed-initiative interaction for assisted visualization...
Visual perception and mixed-initiative interaction for assisted visualization...Kai Li
 
A family tree of graph types
A family tree of graph typesA family tree of graph types
A family tree of graph typesKai Li
 
Introduction to Visualizing Uncertainties
Introduction to Visualizing UncertaintiesIntroduction to Visualizing Uncertainties
Introduction to Visualizing UncertaintiesKai Li
 
InfoVis Final Project: NBA in historical context
InfoVis Final Project: NBA in historical contextInfoVis Final Project: NBA in historical context
InfoVis Final Project: NBA in historical contextKai Li
 
Introduction to bibframe
Introduction to bibframeIntroduction to bibframe
Introduction to bibframeKai Li
 
Grassroots Read: Planning, Marketing and Assessing Plan
Grassroots Read: Planning, Marketing and Assessing PlanGrassroots Read: Planning, Marketing and Assessing Plan
Grassroots Read: Planning, Marketing and Assessing PlanKai Li
 
RDFa: an introduction
RDFa: an introductionRDFa: an introduction
RDFa: an introductionKai Li
 
Culture Classification: An Analysis
Culture Classification: An AnalysisCulture Classification: An Analysis
Culture Classification: An AnalysisKai Li
 
RDA in China
RDA in ChinaRDA in China
RDA in ChinaKai Li
 
How Americans recognize libraries
How Americans recognize librariesHow Americans recognize libraries
How Americans recognize librariesKai Li
 
How libraries use 新浪微博
How libraries use 新浪微博How libraries use 新浪微博
How libraries use 新浪微博Kai Li
 
新一代的Opac服务
新一代的Opac服务新一代的Opac服务
新一代的Opac服务Kai Li
 
Ipad and Library
Ipad and LibraryIpad and Library
Ipad and LibraryKai Li
 
Augmented reality @ libraries
Augmented reality @ librariesAugmented reality @ libraries
Augmented reality @ librariesKai Li
 

Plus de Kai Li (20)

Using a keyword extraction pipeline to understand concepts in future work sec...
Using a keyword extraction pipeline to understand concepts in future work sec...Using a keyword extraction pipeline to understand concepts in future work sec...
Using a keyword extraction pipeline to understand concepts in future work sec...
 
Knowledge production between laboratories and scientific texts: a proposal of...
Knowledge production between laboratories and scientific texts: a proposal of...Knowledge production between laboratories and scientific texts: a proposal of...
Knowledge production between laboratories and scientific texts: a proposal of...
 
Data and Software in Scientific Activities: a Literature Review
Data and Software in Scientific Activities: a Literature ReviewData and Software in Scientific Activities: a Literature Review
Data and Software in Scientific Activities: a Literature Review
 
A metadata scheme of the software-data relationship: A proposal
A metadata scheme of the software-data relationship: A proposalA metadata scheme of the software-data relationship: A proposal
A metadata scheme of the software-data relationship: A proposal
 
Software Citation, Reuse and Metadata Considerations: An Exploratory Study ...
Software Citation, Reuse and Metadata Considerations:  An Exploratory Study ...Software Citation, Reuse and Metadata Considerations:  An Exploratory Study ...
Software Citation, Reuse and Metadata Considerations: An Exploratory Study ...
 
On metaphor: a book review of Metaphors we live by
On metaphor: a book review of Metaphors we live byOn metaphor: a book review of Metaphors we live by
On metaphor: a book review of Metaphors we live by
 
Visual perception and mixed-initiative interaction for assisted visualization...
Visual perception and mixed-initiative interaction for assisted visualization...Visual perception and mixed-initiative interaction for assisted visualization...
Visual perception and mixed-initiative interaction for assisted visualization...
 
A family tree of graph types
A family tree of graph typesA family tree of graph types
A family tree of graph types
 
Introduction to Visualizing Uncertainties
Introduction to Visualizing UncertaintiesIntroduction to Visualizing Uncertainties
Introduction to Visualizing Uncertainties
 
InfoVis Final Project: NBA in historical context
InfoVis Final Project: NBA in historical contextInfoVis Final Project: NBA in historical context
InfoVis Final Project: NBA in historical context
 
Introduction to bibframe
Introduction to bibframeIntroduction to bibframe
Introduction to bibframe
 
Grassroots Read: Planning, Marketing and Assessing Plan
Grassroots Read: Planning, Marketing and Assessing PlanGrassroots Read: Planning, Marketing and Assessing Plan
Grassroots Read: Planning, Marketing and Assessing Plan
 
RDFa: an introduction
RDFa: an introductionRDFa: an introduction
RDFa: an introduction
 
Culture Classification: An Analysis
Culture Classification: An AnalysisCulture Classification: An Analysis
Culture Classification: An Analysis
 
RDA in China
RDA in ChinaRDA in China
RDA in China
 
How Americans recognize libraries
How Americans recognize librariesHow Americans recognize libraries
How Americans recognize libraries
 
How libraries use 新浪微博
How libraries use 新浪微博How libraries use 新浪微博
How libraries use 新浪微博
 
新一代的Opac服务
新一代的Opac服务新一代的Opac服务
新一代的Opac服务
 
Ipad and Library
Ipad and LibraryIpad and Library
Ipad and Library
 
Augmented reality @ libraries
Augmented reality @ librariesAugmented reality @ libraries
Augmented reality @ libraries
 

Dernier

Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 

Dernier (20)

Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

Project Gutenberg as Information Retrieval System

  • 1. Project Gutenberg as an Information Retrieval System Kai Li IST616 Final Assignment 2012.11
  • 2. Introduction to Project Gutenberg • The first digital library project in the world, initiated by the late Michael Hart in 1971. • Project Gutenberg currently offers more than 41,000 public domain eBooks (in more than 50 languages) as well as other resources (like scientific data). • Website: http://www.gutenberg.org/
  • 3. Intended Audience and Functionalities • Intended audience: eBook readers and general users. • Functionalities: portal of the project, eBook repository and discovery system.
  • 4. Mobile Site • There are two kinds of interfaces of this website based on the device one uses. Only the traditional nonmobile interface will be examined in this presentation due to the limited scope of the assignment.
  • 6. Issues of Indexing/Tag System • There is a searching box as well as a tag called “Search Catalog”; – The searching box is too small to be noticed; – The tag “Search Catalog” actually leads users to a page where one cannot find the searching box, but only some browsing selections; • There are a number of repetitive tags on the left-hand bar and on the top of the page; – For example, the tag “Book Categories”.
  • 7. Means To Find a Book • Searching • Browsing – By categories
  • 9. Issues of Searching • The display is different from most of the interfaces one can see on the Internet, which may result some difficulties for new users; • Due to a lack of navigation mechanism and the function to refine the result by facets, it’s extremely inconvenient to locate a resource if the result is big.
  • 10. Precision and Recall • The retrieval method used by this website is a string-matching method, which matches the string inputted by the user with the full-text of all the resources. – “Or” relationship used for multiple words. • Because the scope of the index is the full-text, the recall is higher than traditional library catalogs; however, since it is still a string-matching method, the precision is still not very good.
  • 12. Issues of Browsing • There are three searching tools offered on this page, which should have been offered on the searching page rather than this one. • Only one standard can be used to limit the resources at the same time. And after one chooses a certain standard, there is no other way to further limit the result.
  • 13. Categories/Classification • There are two tiers of the “classification” on this website: – Subcategories: 23 • These subcategories are called “bookshelf” too, which is confusing. – Bookshelves: 133 • Which can be seen as a lower level than subcategories. However, not all bookshelves are linked to a given subcategory.
  • 14. Overall Evaluation • Advantages: – Mobile functionalities: • Mobile site • QR codes • Disadvantages: – Poorly organized and designed; – Failing to display the full richness of the metadata on the website: • LoC classification and subject headings – The interface being lack of communication with the users;

Notes de l'éditeur

  1. The project has been accepting eBooks uploaded by members which are not protected by US copyright laws.
  2. Because this website is also the main page of the whole project, the audience include not only the people who want to get the eBooks but also people who are interested in the project itself.
  3. The indexing system is actually very confusing. This slide lists some of the problems.
  4. The searching result page: related bookshelves and subjects are displayed in front of all the books; books are ranked by popularity (times of download), but one can also choose to sort alphabetically or by released date.
  5. The interface was very unintuitive for me when I first used it.If the book is not ranked high in terms of alphabetic, popularity or released date, and if the result is big, it’s almost impossible for one to find a specific book. Like traditional library catalogs, this interface doesn’t support finding an unknown book very well.
  6. String-matching method cannot solve the issues of one words with multiple meanings or different words bearing the same meaning.
  7. Methods: by author; by title; by language; by recently added; by popularity.One can also browse the website by LC classification (as well as LCSH). However, they are not listed on this page. LC classification can be found only from the book pages.
  8. Not all bookshelves can be linked with a subcategory.Moreover, there are also some bookshelves containing materials in other languages that is not inside the above system, which indicates that the classification scheme in English may not cover all the resources on the website.
  9. Many libraries and other parties have imported the metadata of Gutenberg eBooks to the local systems, which makes the issues of this website a less important one.But this is still a problem!