Linked data context strategy

•Download as PPT, PDF•

3 likes•1,052 views

fantasticlife

Presentation to open data week

Technology Entertainment & Humor

Research and Development ♥ BBC MMXIII
A Linked Data Context Strategy
for the BBC
Michael Smethurst,
BBC Internet Research and Future Services
With thanks to
Yves Raimond, Tristan Ferne, Olivier Thereaux, Paul Rissen

Research and Development ♥ BBC MMXIII
Why Linked Data?
1. On the web content needs context to be useful
2. The BBC has data on its output but not on the subjects of
its output
3. Commercial data is usually modelled at the wrong level
(saleable items)
4. Commercial data doesn’t give you the freedom to make
your own APIs on top
5. Using inference minimises workload

1. Consuming linked data
2. Managing linked data
3. Publishing linked data

1000 ~ 1500 programmes
~ 750 news articles
every day

Automated Tagging
+Speaker recognition
of a very large audio archive

How do we answer…
Which radio programmes interviewed Nelson
Mandela in 1990?
How can I find a picture of a relative in a
library’s photo archive?
Was my music used in the background of
that TV programme?

Thank you.
Questions to
michael.smethurst@bbc.co.uk
@fantasticlife

Recently uploaded

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

Histor y of HAM Radio presentation slidevu2urc

A Domino Admins Adventures (Engage 2024)Gabriella Davis

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

A Call to Action for Generative AI in 2024Results

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Finology Group – Insurtech Innovation Award 2024The Digital Insurer

IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

Artificial Intelligence: Facts and MythsJoaquim Jorge

Recently uploaded (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Boost Fertility New Invention Ups Success Rates.pdf

Powerful Google developer tools for immediate impact! (2023-24 C)

Histor y of HAM Radio presentation slide

A Domino Admins Adventures (Engage 2024)

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

A Call to Action for Generative AI in 2024

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Boost PC performance: How more available memory can improve productivity

Exploring the Future Potential of AI-Enabled Smartphone Processors

How to Troubleshoot Apps for the Modern Connected Worker

Tata AIG General Insurance Company - Insurer Innovation Award 2024

Finology Group – Insurtech Innovation Award 2024

IAC 2024 - IA Fast Track to Search Focused AI Solutions

Breaking the Kubernetes Kill Chain: Host Path Mount

Data Cloud, More than a CDP by Matt Robison

Artificial Intelligence: Facts and Myths

Linked data context strategy

1. Research and Development ♥ BBC MMXIII A Linked Data Context Strategy for the BBC Michael Smethurst, BBC Internet Research and Future Services With thanks to Yves Raimond, Tristan Ferne, Olivier Thereaux, Paul Rissen

2. Research and Development ♥ BBC MMXIII Why Linked Data? 1. On the web content needs context to be useful 2. The BBC has data on its output but not on the subjects of its output 3. Commercial data is usually modelled at the wrong level (saleable items) 4. Commercial data doesn’t give you the freedom to make your own APIs on top 5. Using inference minimises workload

3. “Inform, Educate and Entertain”

4. George Orwell, 1940s

6. On the Web since 1994 / 1995

8. Linked data

9. 1. Consuming linked data 2. Managing linked data 3. Publishing linked data

10. 1000 ~ 1500 programmes ~ 750 news articles every day

11.

12.

13. Loose coupling via shared identifiers

14. Linked data as experience prism

15.

16.

17.

18. Principle #1: The web is our CMS

19.

20.

21. 1. Consuming linked data 2. Managing linked data 3. Publishing linked data

22. Annotate once, infer and re-use

23.

24.

25. 1. Consuming linked data 2. Managing linked data 3. Publishing linked data

26. Principle #2: Our web site is our API

27.

28.

29. Generating data from content

30.

31. Automated Tagging +Speaker recognition of a very large audio archive

32.

33.

34. How do we answer… Which radio programmes interviewed Nelson Mandela in 1990? How can I find a picture of a relative in a library’s photo archive? Was my music used in the background of that TV programme?

35. Thank you. Questions to michael.smethurst@bbc.co.uk @fantasticlife

Editor's Notes

from its beginnings in the 1920s in radio. Now 10 national radio channels and more than 40 in the nations and regions
TV broadcast since the 1930s
On the Web since 1994. that's a lot of web-history too, we've been doing this for a while
The BBC Music Website has a content-rich offering. Not surprising when you have 10 major national radio stations, many more local stations, and a lot of music programmes in your TV schedule. But it doesn't mean you have to manage everything from bios to discography from scratch
The BBC Music Website has a content-rich offering. Not surprising when you have 10 major national radio stations, many more local stations, and a lot of music programmes in your TV schedule. But it doesn't mean you have to manage everything from bios to discography from scratch
The BBC Music Website has a content-rich offering. Not surprising when you have 10 major national radio stations, many more local stations, and a lot of music programmes in your TV schedule. But it doesn't mean you have to manage everything from bios to discography from scratch
Data is a first-class citizen
Working on the World Service audio archive three years of continuous audio
Speech recognition -> automated transcripts + topic identification (at scale) Kiwi is a framework aimed at automatically identifying topics in speech radio programmes, with topic identifiers being drawn from Linked Open Data sources such as DBpedia. In order to generate such topics in a reasonable time for large programme archives, we built a processing infrastructure distributing computations on cloud resources (e.g. Amazon EC2). We used this infrastructure to automatically tag the entire BBC World Service archive (70,000 programmes) in around two weeks.

Linked data context strategy

Recommended

Recommended

More Related Content

Similar to Linked data context strategy

Similar to Linked data context strategy (20)

More from fantasticlife

More from fantasticlife (9)

Recently uploaded

Recently uploaded (20)

Linked data context strategy

Editor's Notes