Brendan Quinn of IPTC and Jennifer Parrucci of The New York Times present IPTC's NewsCodes vocabularies, describing what they are, how they are maintained and how they can be used, with a look into the future. The talk includes a focus on IPTC MediaTopics, our leading vocabulary for topics of news content. Originally presented at the EBU Metadata Developers Network workshop, held online from 25 to 27 May 2021.
IPTC NewsCodes - Controlled Vocabularies for the News Media (EBU MDN Workshop 2021)
1. IPTC NewsCodes: Controlled Vocabularies for the News Media
Brendan Quinn, Managing Director, IPTC
Jennifer Parrucci, Senior Taxonomist, The New York Times
Tuesday 25 May 2021
15. DPP Metadata Exchange for News – using NewsML-G2
[Workflow diagram. Labels: Proxy (MP4), Metadata, Planning metadata, Metadata Watch Services, XML Creation Services]
1: Connect to XDCAM air via API calls
2: Checking every 30 seconds for new assets
3: Query asset metadata via XDCAM air API
4: Pass asset metadata to G2 XML creation services
5: Get MP4 from XDCAM air S3 bucket
6: Send NewsML-G2 XML and MP4 to AWS S3 bucket for Ooyala
7: Get G2 XML and MP4 by Ooyala services
8: Send NewsML-G2 XML and MP4 to AWS S3 bucket for Reuters
9: Get G2 XML and MP4 by Reuters Connect
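As a rough illustration of steps 2 to 4 and the hand-off in steps 6 and 8, the sketch below polls for new assets and wraps each one in a minimal NewsML-G2 newsItem. It is not the code used in the demo: the asset dictionary keys, the `list_assets` and `deliver` callables and the `standardversion` value are all assumptions made for this example, and the XML is not validated against the NewsML-G2 schema.

```python
import xml.etree.ElementTree as ET

G2_NS = "http://iptc.org/std/nar/2006-10-01/"  # NewsML-G2 XML namespace
ET.register_namespace("", G2_NS)


def g2(tag):
    """Return a namespace-qualified NewsML-G2 element name."""
    return f"{{{G2_NS}}}{tag}"


def build_news_item(asset):
    """Wrap one asset's metadata in a minimal newsItem (not schema-validated).

    `asset` is a hypothetical dict with the keys guid, headline and mp4_url.
    """
    item = ET.Element(g2("newsItem"), {
        "guid": asset["guid"],
        "version": "1",
        "standard": "NewsML-G2",
        "standardversion": "2.29",  # assumed value, adjust to the version in use
        "conformance": "power",
    })
    item_meta = ET.SubElement(item, g2("itemMeta"))
    ET.SubElement(item_meta, g2("itemClass"), {"qcode": "ninat:video"})
    content_meta = ET.SubElement(item, g2("contentMeta"))
    ET.SubElement(content_meta, g2("headline")).text = asset["headline"]
    content_set = ET.SubElement(item, g2("contentSet"))
    ET.SubElement(content_set, g2("remoteContent"), {
        "href": asset["mp4_url"],
        "contenttype": "video/mp4",
    })
    return ET.tostring(item, encoding="unicode")


def poll_once(list_assets, seen, deliver):
    """Run one polling pass and return the number of newly delivered assets.

    list_assets: hypothetical callable returning the current asset dicts
    seen:        set of guids already handled (mutated in place)
    deliver:     hypothetical callable(guid, xml), e.g. an upload to a bucket
    """
    delivered = 0
    for asset in list_assets():
        if asset["guid"] in seen:
            continue  # already processed in an earlier pass
        seen.add(asset["guid"])
        deliver(asset["guid"], build_news_item(asset))
        delivered += 1
    return delivered
```

A scheduler would call `poll_once` every 30 seconds, passing a real asset-listing function and an upload function in place of the placeholders.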
59. Wikidata RDF and SPARQL interface
• Wikidata exposes interfaces in RDF and SPARQL
• Queries can inspect Wikidata entities by type or property:
# People that received both an Academy Award and a Nobel Prize
SELECT DISTINCT ?person ?personLabel WHERE {
  ?person wdt:P166/wdt:P31? wd:Q7191 .
  ?person wdt:P166/wdt:P31? wd:Q19020 .
  SERVICE wikibase:label {
    bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" .
  }
}
• “wdt:P166” is “award received”
• “/wdt:P31?” optionally follows “instance of” (P31), so an award matches if it is the named award itself or an instance of it
• “wd:Q7191” is “Nobel Prize”
• “wd:Q19020” is “Academy Awards”
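The query above can be sent to the public Wikidata Query Service from a script. The following is a minimal sketch using only the Python standard library; the function names and the User-Agent string are placeholders chosen for this example, and the result parsing assumes the standard SPARQL JSON results layout.

```python
import json
import urllib.parse
import urllib.request

WDQS_ENDPOINT = "https://query.wikidata.org/sparql"

# The wd:, wdt:, wikibase: and bd: prefixes are predefined by the query service.
QUERY = """
SELECT DISTINCT ?person ?personLabel WHERE {
  ?person wdt:P166/wdt:P31? wd:Q7191 .
  ?person wdt:P166/wdt:P31? wd:Q19020 .
  SERVICE wikibase:label {
    bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" .
  }
}
"""


def build_request(query, user_agent="newscodes-example/0.1 (placeholder contact)"):
    """Build a GET request asking the endpoint for JSON results."""
    params = urllib.parse.urlencode({"query": query, "format": "json"})
    return urllib.request.Request(
        WDQS_ENDPOINT + "?" + params,
        headers={"User-Agent": user_agent},  # replace with a real contact string
    )


def extract_people(results):
    """Return (entity URI, label) pairs from a SPARQL JSON results document."""
    return [
        (row["person"]["value"], row["personLabel"]["value"])
        for row in results["results"]["bindings"]
    ]


def run_query(query=QUERY):
    """Execute the query over the network and return the extracted pairs."""
    with urllib.request.urlopen(build_request(query), timeout=60) as response:
        return extract_people(json.load(response))
```

Calling `run_query()` needs network access; `build_request` and `extract_people` can be exercised offline.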
Editor's notes
Photo (and Video) Metadata
Used since 1995 to store descriptive, administrative and rights metadata embedded in image files
Originally based on IPTC IIM standard, now based on XMP in-file metadata platform
Video Metadata Hub
Canonical mapping between video metadata standards: IPTC Photo/Video, EBUCore, …
News Codes, Media Topics, Subject Codes
Hierarchical taxonomies of subjects for describing news content
NewsML-G2 and NewsML-1
Exchange of packages of text, images, video, audio, events and/or sports data
NewsML-G2 is the newer, extensible version, but NewsML-1 is still maintained
RightsML
ninjs, rNews
News in JSON (ninjs); news metadata embedded in HTML markup (rNews)
SportsML-G2
EventsML-G2
Historical standards
NewsML-1
IPTC7901
IIM format
Schematic diagram of what was demoed on February 6th, 2019 by vendors working together, using NewsML-G2 as a basis for the production of news content.