SlideShare une entreprise Scribd logo
1  sur  32
A Unit of of the University System of Georgia
  A Unit the University System of Georgia
Bibliographic database integrity in a
             consortial environment




                  Evergreen International Conference
                                       May 21, 2009
                                     • Elaine Hardy
• PINES Bibliographic Projects and Metadata Manager
Twentieth Century Literary Criticism:
illustration of single record for each serial
                    volume
GPLS Intern’s statistics

                           Before    After
Alexander McCall Smith         245       172
Grace Livingston Hill         1119       549
Mary Higgins Clark             771       386
Magic School Bus (print)       554       218
Danielle Steel                1235       718
Duplicate records cause
  – “User information overload”
  – “Reduced system efficiency”
  – “Low cataloging productivity”
  – “Increased cost for database maintenance”
                                             Sitas and Kapidakis, 2008



“There is no question that merging such records is vital to
effective user services in a cooperative environment.”
                                                        Tennant, 2002
What patrons think ---
• wish that you would list the most current book first and have only
  one entry for each book instead of showing multiple entries.
  Sometimes I have to look through 50 - 100 entries to see 20 books
  and the newest book by the author is entry 80. There should be a
  way to stream line this procedure.
• Consolidate entries for the same title. There are numerous entries
  on some titles beyond the breakdown of hard cover, PB, large
  print,audio, etc.”
• Why so many listings for the same books--that's confusing
• When I look up a book, many times I get two pages all of the same
  title with the same cover. It confuses me because I see that my
  library system doesn't have it, but if I scroll down...Whoops! We do
  have it. What is that all about? It sucks.
• Creating a standard for the way an items information is entered.
  Some books only have half the title entered and this can create
  problems when searching for specific materials
Why?
• Big library does not equal good data
• A large library does not always follow rules and adhere
  to standards
• Size can they cut corners for “efficiency”
• Local notes don’t belong in subject fields
• Make the time to check your data
• Publishers are not catalogers’ friends
Examples of problem reference library records
.




                                                     .
    http://www-03.ibm.com/ibm/history/exhibits/mainframe/mainframe_2423PH3090.html
Legacy system characteristics
• All were IBM based systems
• No tags, thus no definition of fields
• All fields fixed length
   – allotted so many characters for each field
• No standards
   – Not required to enter pagination or publisher
• Extraction of data a problem
   – had to count in to find beginning of next field
   – In many cases, had to supply a pub date. One lib has 1901 as a
       pub date on most of their extracted records
Records from a nonMARC system
Phase II




http://commons.wikimedia.org/wiki/Template:Potd/2007-01
Records with corrupted headings
Lessons learned
•   Big library does not equal good data
•   Make the time to check your data
•   Publishers are not catalogers’ friends
•   Be careful about CIPs with no description and records with multiple ISBNs
•   Come up with realistic match when records are same but information differs
•   One library will not have the same good records across all their collections
     – may have good print but bad AV
•   LOTS of programming if multiple sources of records.
•   No matter -- budget, personnel, time -- is as important as concentrating on
    clean-up prior to migration
•   Be as specific as possible with vendors, test and have a penalty phase.
•   Have the right people in place from day one
Enable discover
Goodbye

Contenu connexe

En vedette (7)

Evergreen in Small Libraries
Evergreen in Small LibrariesEvergreen in Small Libraries
Evergreen in Small Libraries
 
You're Live, Now What?
You're Live, Now What?You're Live, Now What?
You're Live, Now What?
 
Evergreen Sysadmin Survival Skills
Evergreen Sysadmin Survival SkillsEvergreen Sysadmin Survival Skills
Evergreen Sysadmin Survival Skills
 
OpenSRF and Evergreen
OpenSRF and EvergreenOpenSRF and Evergreen
OpenSRF and Evergreen
 
ERM and Evergreen
ERM and EvergreenERM and Evergreen
ERM and Evergreen
 
Bibliographic Control and Oclc
Bibliographic Control and OclcBibliographic Control and Oclc
Bibliographic Control and Oclc
 
Effective search of bibliographic databases
Effective search of bibliographic databasesEffective search of bibliographic databases
Effective search of bibliographic databases
 

Similaire à Bibliographic Database Integrity

ALA 2010 -- Jane Burke
ALA 2010 -- Jane BurkeALA 2010 -- Jane Burke
ALA 2010 -- Jane Burke
bisg
 
Mechanical Librarian
Mechanical LibrarianMechanical Librarian
Mechanical Librarian
Andre Vellino
 
PLAYING PERFORMERS. IDEAS ABOUT MEDIATED NETWORK MUSIC PERFORMANCE.
 PLAYING PERFORMERS. IDEAS ABOUT MEDIATED NETWORK MUSIC PERFORMANCE. PLAYING PERFORMERS. IDEAS ABOUT MEDIATED NETWORK MUSIC PERFORMANCE.
PLAYING PERFORMERS. IDEAS ABOUT MEDIATED NETWORK MUSIC PERFORMANCE.
crysatal16
 
Summary of Trends in Cataloging
Summary of Trends in CatalogingSummary of Trends in Cataloging
Summary of Trends in Cataloging
William Worford
 
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
NASIG
 
Kampmeier ecn 2012
Kampmeier ecn 2012Kampmeier ecn 2012
Kampmeier ecn 2012
ECNOfficer
 

Similaire à Bibliographic Database Integrity (20)

The Effects of Cross-Pollination : How non-library mass market services are c...
The Effects of Cross-Pollination : How non-library mass market services are c...The Effects of Cross-Pollination : How non-library mass market services are c...
The Effects of Cross-Pollination : How non-library mass market services are c...
 
Why libraries should embrace Linked Data
Why libraries should embrace Linked DataWhy libraries should embrace Linked Data
Why libraries should embrace Linked Data
 
ALA 2010 -- Jane Burke
ALA 2010 -- Jane BurkeALA 2010 -- Jane Burke
ALA 2010 -- Jane Burke
 
Mechanical Librarian
Mechanical LibrarianMechanical Librarian
Mechanical Librarian
 
Crisis or Opportunity? Cataloging, Catalogers, RDA, and Change
Crisis or Opportunity? Cataloging, Catalogers, RDA, and ChangeCrisis or Opportunity? Cataloging, Catalogers, RDA, and Change
Crisis or Opportunity? Cataloging, Catalogers, RDA, and Change
 
PLAYING PERFORMERS. IDEAS ABOUT MEDIATED NETWORK MUSIC PERFORMANCE.
 PLAYING PERFORMERS. IDEAS ABOUT MEDIATED NETWORK MUSIC PERFORMANCE. PLAYING PERFORMERS. IDEAS ABOUT MEDIATED NETWORK MUSIC PERFORMANCE.
PLAYING PERFORMERS. IDEAS ABOUT MEDIATED NETWORK MUSIC PERFORMANCE.
 
Summary of Trends in Cataloging
Summary of Trends in CatalogingSummary of Trends in Cataloging
Summary of Trends in Cataloging
 
When?
When?When?
When?
 
Extreme Makeover: Web Site Edition
Extreme Makeover: Web Site EditionExtreme Makeover: Web Site Edition
Extreme Makeover: Web Site Edition
 
Extreme Makeover: Web Site Edition (OPLIN)
Extreme Makeover: Web Site Edition (OPLIN)Extreme Makeover: Web Site Edition (OPLIN)
Extreme Makeover: Web Site Edition (OPLIN)
 
Blogs & Wikis (and what you can do with them)
Blogs & Wikis (and what you can do with them)Blogs & Wikis (and what you can do with them)
Blogs & Wikis (and what you can do with them)
 
Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...
Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...
Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...
 
Ir1
Ir1Ir1
Ir1
 
Carpenter, McCraken, Ventimiglia, Noonan, and Walker "KBART and the OpenURL: ...
Carpenter, McCraken, Ventimiglia, Noonan, and Walker "KBART and the OpenURL: ...Carpenter, McCraken, Ventimiglia, Noonan, and Walker "KBART and the OpenURL: ...
Carpenter, McCraken, Ventimiglia, Noonan, and Walker "KBART and the OpenURL: ...
 
An Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jAn Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4j
 
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
 
Honey on the Wire KohaCon18
Honey on the Wire  KohaCon18Honey on the Wire  KohaCon18
Honey on the Wire KohaCon18
 
Organizing Infoshop Libraries and Their Collections: Bringing the Community i...
Organizing Infoshop Libraries and Their Collections: Bringing the Community i...Organizing Infoshop Libraries and Their Collections: Bringing the Community i...
Organizing Infoshop Libraries and Their Collections: Bringing the Community i...
 
Kampmeier ecn 2012
Kampmeier ecn 2012Kampmeier ecn 2012
Kampmeier ecn 2012
 
Some news about the SW
Some news about the SWSome news about the SW
Some news about the SW
 

Plus de Evergreen ILS

Plus de Evergreen ILS (7)

Jessamyn West: A Big Fan of Open
Jessamyn West: A Big Fan of OpenJessamyn West: A Big Fan of Open
Jessamyn West: A Big Fan of Open
 
Bits and Pieces from the UPEI Experience
Bits and Pieces from the UPEI ExperienceBits and Pieces from the UPEI Experience
Bits and Pieces from the UPEI Experience
 
Joe Lucia: Song Of The Open Road
Joe Lucia: Song Of The Open RoadJoe Lucia: Song Of The Open Road
Joe Lucia: Song Of The Open Road
 
Ready Fire Aim: The MLC Evergreen Experience
Ready Fire Aim: The MLC Evergreen ExperienceReady Fire Aim: The MLC Evergreen Experience
Ready Fire Aim: The MLC Evergreen Experience
 
Evergreen in Armenia
Evergreen in ArmeniaEvergreen in Armenia
Evergreen in Armenia
 
Evergreen Docs Planning Session 2009
Evergreen Docs Planning Session 2009Evergreen Docs Planning Session 2009
Evergreen Docs Planning Session 2009
 
Evergreen Documentation Lightning Talk
Evergreen Documentation Lightning TalkEvergreen Documentation Lightning Talk
Evergreen Documentation Lightning Talk
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Dernier (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 

Bibliographic Database Integrity

  • 1. A Unit of of the University System of Georgia A Unit the University System of Georgia
  • 2. Bibliographic database integrity in a consortial environment Evergreen International Conference May 21, 2009 • Elaine Hardy • PINES Bibliographic Projects and Metadata Manager
  • 3.
  • 4.
  • 5. Twentieth Century Literary Criticism: illustration of single record for each serial volume
  • 6. GPLS Intern’s statistics Before After Alexander McCall Smith 245 172 Grace Livingston Hill 1119 549 Mary Higgins Clark 771 386 Magic School Bus (print) 554 218 Danielle Steel 1235 718
  • 7. Duplicate records cause – “User information overload” – “Reduced system efficiency” – “Low cataloging productivity” – “Increased cost for database maintenance” Sitas and Kapidakis, 2008 “There is no question that merging such records is vital to effective user services in a cooperative environment.” Tennant, 2002
  • 8. What patrons think --- • wish that you would list the most current book first and have only one entry for each book instead of showing multiple entries. Sometimes I have to look through 50 - 100 entries to see 20 books and the newest book by the author is entry 80. There should be a way to stream line this procedure. • Consolidate entries for the same title. There are numerous entries on some titles beyond the breakdown of hard cover, PB, large print,audio, etc.” • Why so many listings for the same books--that's confusing • When I look up a book, many times I get two pages all of the same title with the same cover. It confuses me because I see that my library system doesn't have it, but if I scroll down...Whoops! We do have it. What is that all about? It sucks. • Creating a standard for the way an items information is entered. Some books only have half the title entered and this can create problems when searching for specific materials
  • 10.
  • 11.
  • 12.
  • 13.
  • 14. • Big library does not equal good data • A large library does not always follow rules and adhere to standards • Size can they cut corners for “efficiency” • Local notes don’t belong in subject fields • Make the time to check your data • Publishers are not catalogers’ friends
  • 15. Examples of problem reference library records
  • 16. . . http://www-03.ibm.com/ibm/history/exhibits/mainframe/mainframe_2423PH3090.html
  • 17. Legacy system characteristics • All were IBM based systems • No tags, thus no definition of fields • All fields fixed length – allotted so many characters for each field • No standards – Not required to enter pagination or publisher • Extraction of data a problem – had to count in to find beginning of next field – In many cases, had to supply a pub date. One lib has 1901 as a pub date on most of their extracted records
  • 18. Records from a nonMARC system
  • 19.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30. Lessons learned • Big library does not equal good data • Make the time to check your data • Publishers are not catalogers’ friends • Be careful about CIPs with no description and records with multiple ISBNs • Come up with realistic match when records are same but information differs • One library will not have the same good records across all their collections – may have good print but bad AV • LOTS of programming if multiple sources of records. • No matter -- budget, personnel, time -- is as important as concentrating on clean-up prior to migration • Be as specific as possible with vendors, test and have a penalty phase. • Have the right people in place from day one