BookNet Canada Bibliographic Manager and general standards person, Tom Richardson provides your yearly standards overview with a focus on metadata basics. Looking for information on where to start when it comes to participating in standards? Start here. Or, if you’re well-versed in standards but you want to know what you’ll need to know for the next couple of years, you’ll find that here, too!
Link to the presentation video: https://youtu.be/7xotzjMS0Gc
Presented by BookNet Canada on April 19, 2022 with support from the Department of Canadian Heritage.
3. We want you to participate – to contact us with an idea or problem,
participate in one of our products’ user groups (BookNet products
always strive for compliance with standards) or by joining a
committee, at BookNet or with other publishing organizations.
BookNet Canada is a not-for-profit that develops technology,
standards, and education to support the Canadian book publishing
supply chain, including booksellers, publishers, libraries, distributors,
wholesalers, media, sales and literary agents, associations…
Our staff work with other Canadian companies to solve supply chain
problems using standards. We ensure Canadian compatibility with
US and international standards.
Efficiency comes through standards
Participate with standards
7. What is metadata?
• It’s the information you need to run your business;
• It’s the information your trading partners need from
you so they can provide or support services;
• It’s the information an end user needs to know to
decide to buy books.
8. Metadata internal to a company
can come from external sources
• from their trading partners
• from services like BNC Sales and Library Data
From documentation like
• the BISG/BNC Best Practices for North American
metadata or BISAC Subjects
• or EDItEUR’s ONIX or Thema specifications or their
ONIX Best Practice Guide
9. Metadata is a feedback loop
Your needs inform other companies.
Their needs inform you.
EDItEUR, BNC, BISG, BTLF
organize and publish code lists and best practices
to help you trade it. In doing so provide stable
definitions for metadata when traded.
Your metadata, what you record and however
you record it, is
• what your business needs to know to thrive
• what you tell others to enable both you
and them to thrive
10. Workflow efficiency
While a database may seem the natural place for
metadata, the most important thing is being able to
find it quickly and accurately.
You want to avoid finding out that the correct book
title is only recorded on the freelance designer’s
offsite computer.
Plan so them-that-know update metadata
and leave it in the place it should be found.
11. Metadata is traded
There are lots of stakeholders who need it
• Production & editorial need to know what’s done &
needs doing;
• Retailers, wholesalers & distributors all need
information both in advance & ongoing;
• End buyers need:
– To know what they are buying;
– To have a reason to buy the book.
12. Metadata should be updated
Stakeholders depend on accuracy for their systems
• Production & editorial need to meet each others deadlines;
• Shipments and files need to arrive when they are expected;
• Retailers & end users need to know:
– When a book will be available;
– If it’s not available -- when it’s next available;
– And if it’s just not available at all.
Books have a lifecycle that needs to be tracked.
Dates need to be accurate and updated when changed.
13. Metadata should be both
unique & and match the book
Badly done metadata can defeat its purpose
TITLE: Bob (Jane’s Series) SUBTITLE: A Novel
SERIES NAME: Jane’s series SERIES NUMBER: 2
If a retailer is given a book title that embeds the series, who’s
responsible for fixing poor and repetitious on-line display?
BOB
A Novel
Jane’s Series 2
Bob (Jane’s Series)
A Novel
Jane’s series 2
Bonus question:
If you don’t send the series as
unique metadata how do you
expect a retailer to create a
series listing so they can cross
sell the series?
14. Metadata Granularity
Each stakeholder needs much the same information but
not always the same way.
It’s important that the metadata you aggregate about your
business and its products is granular enough -- tracked in
small enough components -- so it can be
reassembled to meet all use cases
Granular metadata allows you to meet all
the competing needs.
(But it shouldn’t be so finely
ground that it costs you
more than it’s worth.)
15. ONIX can help
Granularity gives context on how to best use ONIX.
Whether or not you directly use it or base a publishing
system in part on it, ONIX is:
• A “metadata system” created for trading book
publishing metadata accurately;
• Organized by EDItEUR a UK based not-for-profit who
are experts at creating just enough granularity to
support book metadata exchange.
• It’s code lists are a lingua franca – you don’t need to
use the codes in your systems but your systems should
be compatible with their definitions.
16. Just don’t forget
ONIX isn’t designed to support other business needs like
• Grant applications
• Tax preparation
• Mail merge functions
• And that…
There’s more to publishing metadata
than book metadata.
17. Computers talk to computers
The best metadata system allows for it’s transfer directly
from your systems into a stakeholders system.
That allows your actionable knowledge to maximize
everyone’s potential to act for your books.
For two computers to talk an agreed on
identifier is needed.
18. There is only one fully realized publishing identifier:
ISBN – needed for EDI and every other book related sale
or potential sale.
• A 13 digit number that’s a subset of the GTIN-13 used
by all businesses
• An ISBN represents a salable product, a single
physical binding, a single file type.
When using an ISBN there should be no ambiguity about
the specific product being referenced.
International Standard Book Number
19. • 979 ISBN-13 have NO ISBN-10 equivalent
• Every Canadian distributor has active
979 ISBNs in their dataset.
• They are now in use even if you don’t use any.
If you have any active reports based on or mixed with
ISBN-10 it’s critical that you update them to ISBN-13.
Using 978 ISBN-13 paired with a ISBN-10 in metadata is
discouraged but does no harm if trading partners request it.
ISBN-10 should not be used
20. No special rules apply to digital products but ISBN
guidelines provide for both digital and physical products
https://www.isbn-international.org/content/guidelines-assignment-e-books
• Each digital file format made available by a publisher should be
assigned it’s own ISBN. (How can anyone order it from the publisher
if it’s not uniquely identified?)
• A publisher is not responsible for assigning different ISBNs if retail
partners offer different file formats (convert the publisher’s file
format) or limit the file’s functionality by using DRM (add digital rights
management). The publisher can assign ISBNs for that use if it
benefits their systems, but their external responsibilities are to
identify what they can supply as files to others.
e-ISBNs do not exist
Only ISBNs exist.
21. This is the one thing – the most basic ability to “do
metadata” – that I’d like you to take from this webinar
It’s not a small thing and it’s harder than it sounds but it’s
the necessary first step.
The least you can do
Every publishing business should be able to
provide a list of ISBN-13s for all the products
they can sell. Both digital and print – together.
22. • available at the right time
• sent to the right places
• in sufficient in detail to support the needs of stakeholders
that support your business
• It needs to carry an identifier so that it can be rapidly
loadable to databases without human intervention
ISBN-13 is your only current option as an identifier so
• treat it with all due respect
• be able to provide a full list of active products by ISBN on
demand
Summary of Metadata Basics
24. NISO or the National Information Standards Organization, is the US
based equivalent to our CSA Canadian Standards Association, a
nonprofit membership organization that identifies, develops, maintains,
and publishes technical standards to manage information.
They have published an excellent best practice for digital books and we
highly recommend it
https://www.niso.org/publications/rp-29-2022-ebmd
E-Book Bibliographic Metadata Requirements
in the Sale, Publication, Discovery,
Delivery, and Preservation Supply Chain
25. Last revised in 2015 this remains the one document that clearly lays out
the 35 things that, at a minimum, need support in North America
metadata. Each chapter is supported by a clear business case and
need statement.
BISG with support from BNC is planning to update this important
document in 2022 making this is a good to ask:
• What should I expect others to support from my metadata?
• What do I owe others as metadata?
• What’s important, today?
Revision planned:
BISG / BNC
Best Practices for North American Metadata
26. ISNI.org
International Standard Name Identifier
People or groups, even companies, responsible for creation of content
can carry an international identifier An ISNI – an International Standard
Name Identifier is a simple way to link creators to the tools that support
them and the sale of their products
Other media – music in general and YouTube in particular – understand
it as an asset. If your company thinks blockchain might have value for
your products you’ll want to consider it. Rights sales and working with
literary agents can improve with it. A retailer’s author page can reliably
represent a single author if it can get implemented.
Between 50 and 70% of Canadian authors already have an ISNI
assigned, and most established publishers have one as well. These are
available by simply looking them up at the site above.
27. ISNI.org
Webinar on demand available
World Intellectual Property Organization (WIPO) held an excellent
introductory webinar on the ISNI standard, held on Febraury 16 2022
as part of WIPO’s also recommended Copyright Infrastructure series.
You can watch the ISNI episode here
https://c.connectedviews.com/01/SitePlayer/WIPOwebinars/?session=114723
And look for and request more information from BookNet Canada.
Let us know your plans
28. ONIX 3.0: Contributor Place
recommended for promotional support
Contributor Place is a repeatable composite supporting by 10
codes to association a place with an aspect of an author’s life.
• Code “08” Citizen of – A code of CA designates a Canadian Author
• Supports our long standing, industry wide, and very successful
promotion of Canadian authors through best seller lists, dedicated
websites and that opportunity for flag markers
• Code “04” Currently resides in is needed to support retailers and
libraries promotion of local authors. To work it needs
• Country code
• Region code – for province or state
• Location -- free text field for city name or regional sub-division.
29. Contributor Place
recommended for regional support
Code “00” Associated with
• To express unknown relationship types
(for use when expressing legacy ONIX 2.1 data in ONIX 3.0).
Code “07” Flourished in
Overused, likely as it’s the least defined code so seen as multipurpose.
• Recommended for dead authors where nationality isn’t relevant
– OR –
A comparative value when resides or another value doesn’t fit:
Resides here but Flourished somewhere else.
Flourished in can be used regionally to great effect:
• From Newfoundland but Flourished in the Northwest Territories;
• Born in Montreal but Flourished in Paris.
• Born in Medicine Hat but Flourished in Hollywood.
30. Regional Codes
Regional codes are important at subjects
• Bookstores and libraries buy regionally because readers want locally
defined books. ‘
• Where’ (and when) can influence identity. Identity is part of diversity.
– Winnipeg’s North End 1930 to 1950 is not the same as The Ward in Toronto
during the late 19th century.
• Thema’s superb Qualifier list support for both Geography and Time
• BISAC Regional Themes list is more limited but still important
Publishers often cite nonuse as a reason for not supplying regional
codes
Retailers cannot use what they don’t have.
Accurate regional coding can help sell books,
it is wanted, and should be used
31. ONIX 3.0
Current ONIX for Books is defined by
Code list Issue 56 (Issue 57 expected soon)
Version 3.0.8
• The primary change in 3.0.8 was the addition of
Block 8 Production Detail to carry assembly instructions
for digital packages for ebooks, POD or audiobooks.
• It’s optional and unlikely to be seen in general ONIX feeds
• It’s not intended to support ordering, only instructions.
• It has generated a lot more interest than expected as
companies need a better way to trade both ONIX and
production instructions.
32. ONIX 3.0
The overall quality of ONIX 3.0 in BiblioShare remains poor & has more
2.1-like-structured than “good” ONIX 3.0 should be. Some US firms want
to ensure better quality by using schema to ensure they receive “good”
ONIX 3.0
• If you’re unsure about a requested change, ask BNC. If it’s based on
the standard we can explain why it’s important and what the data
should mean.
• Send us a test file of 100 records or so to be run against EDItEUR’s
strict schema. It’s great feedback on ONIX best practices in 3.0 and
major issues.
• Ask for a Bibliographic Review or just request a call to understand
more. We are always trying to improve our educational output and
your problems are likely shared by others.
33. Learn to schema validate & use specifications
BISG BNC Best Practices for Product Metadata
Guide for North American Data Senders and Receivers
https://bisg.org/store/viewproduct.aspx?id=6972807
ONIX for Books: Implementation and Best Practice Guide
https://www.editeur.org/93/Release-3.0-Downloads/#Best%20practice
Use best practice documentation
ONIX 3.0 Specifications
Application Notes: Validating ONIX 3.0 files
• MacOs edition and Windows edition
• Visual application note (with transcript)
This and more available here:
https://www.editeur.org/93/Release-3.0-Downloads/
34. Subjects
Thema
current version Thema 1.4
April 2022 release for 1.5
every 2nd year cycle
https://www.editeur.org/151/Thema/
BISAC Subjects
current version 2021
next version 2022 Nov/Dec
yearly cycle
https://bisg.org/page/BISACSubjectCodes
BISAC to Thema Translator
current version BISAC 2020 to Thema 1.4.2
Next update BISAC 2021 to Thema 1.5 expected summer 2022
Updated when BISG makes a new mapping available
http://bisactothema.biblioshare.ca/
35. BISAC or Thema?
CataList and BiblioShare support both BISAC and Thema. We
ask that publishers support both to provide the best quality.
The systems compliment each other:
• BISAC groups of books that sell in North America providing
a focused market overview
• Thema provides a structured subject-concept system and
truly international and translated to over 20 languages.
In North America, both are better with the support of the other.
Retailer use comes with data availability.
36. Diversity Equity Inclusion
Identity Indigeneity & Justice
for book subjects & author promotion
A place for participation
Both BISAC and Thema are adding values to help buyers find books
that include various diversity related values.
Getting it right is hard as evidenced by the rapid decline in use of “own
voices” after it was found the association lead to potentially unsafe and
uncomfortable situations for book authors.
As much as we want a button that will make it easy to create a timely
list bird books by black authors, or just to promote well vetted list for
use during Indigenous or Black History months, getting it right takes
well thought out positions and choices.
Big problems are solved by participation
and sharing of information.
37. Many publishers are now getting certified and beginning to create
ebooks “made accessible”, but there are problems associated with the
metadata. While these often relate to lack of software support
sometimes it’s confusion about the metadata.
This BISG webinar is excellent and comprehensive
Accessibility and Metadata: The Critical Connection
https://youtu.be/5AXznbW3dVI
Get in touch if you have additional questions!! What problems are you
having?
Accessibility Metadata
38. It can support automation of messages. Amazon is using it.
• confirm receipt of the original ONIX message
• report to the original sender a summary of Product records
processed and updated into the recipient’s database
• report details of any errors encountered, or any queries about the
product data supplied, or
• return to the sender details of proprietary identifiers assigned by the
recipient
• it can also be used for two way communication
ONIX for Books
Acknowledgement Format Specification
Metadata is a feedback loop
Before we get started, I just wanted to take a moment to recognize that I am joining you from Toronto, where I live and BookNet works upon the traditional territories of the Mississaugas of the Credit First Nation, Anishnawbe [ah-nish-naw-bek], Haudenosaunee [hoodt-en-oh-show-nee], Wendat, and Huron Indigenous Peoples, the original nations of this land. We at BookNet endorse the Calls to Action from the Truth and Reconciliation Commission of Canada and support an ongoing shift from gatekeeping to spacemaking in the book industry.http://mncfn.ca/wp-content/uploads/2019/11/Map-Land-Cessions-and-Rouge-Tract-Claim.jpg
The key to efficiency is standards. We represent Canadian interests on international standards organizations that help shape the future of data in our industry, everything from which subject categories are available to how Publishers can identify an author’s hometown in metadata.
Standards work geared toward what our stakeholders need to best serve readers. More granular subject data, more regional data, etc.