Crowdsourcing metadata for audiovisual collections: from free text tags to semantic concepts
7 December 2011 | DISH | Rotterdam
Session: http://www.dish2011.nl/sessions/new-models-of-interaction-glams-linked-open-data-and-user-participation
1. Crowdsourcing metadata for audiovisual collections: from free text tags to semantic concepts
Lotte Belice Baltussen – Sound and Vision
7 December 2011 | DISH
3. Waisda? What’s that?
A game that lets people annotate audiovisual archive material.
4. Added value
• Time-related metadata
• Social tagging (bridging the semantic gap)
• Interaction between the archive/broadcaster and the public
• Gathering data for further research
• Efficiency? Manual annotation of video takes up to 5× the length of the video
• New business model?
5. Project partners pilot
• Netherlands Institute for Sound and Vision
(project management, content, research)
• KRO (concept, content, PR)
• VU (research within PrestoPRIME)
• Q42 (developer)
6. Man bijt hond Woordentikkertje
After evaluation:
• Improved interface
• New scoring mechanisms (semantics)
• New content
• More feedback
9. How does it work?
Players choose from ‘channels’ with different episodes.
10. How does it work?
Scoring (which also acts as a filter):
• Basic rule: players score points when their tag exactly matches a tag entered by another player within 10 seconds
• Multiple other scoring mechanisms create various tagging incentives
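The basic scoring rule above can be sketched in a few lines. This is a minimal illustration of the idea, not the actual Waisda? implementation; the class and function names are assumptions.

```python
from dataclasses import dataclass

@dataclass
class TagEntry:
    player: str   # player name
    tag: str      # the tag text as typed
    time: float   # seconds into the video when the tag was entered

def matches(new: TagEntry, earlier: list[TagEntry], window: float = 10.0) -> list[TagEntry]:
    """Return earlier entries by *other* players whose tag exactly
    matches the new tag within the time window (the basic scoring rule)."""
    return [
        e for e in earlier
        if e.player != new.player          # no points for matching yourself
        and e.tag == new.tag               # exact tag match
        and abs(e.time - new.time) <= window  # within 10 seconds
    ]

# Two players type "dog" nine seconds apart: a match, so both score.
log = [TagEntry("anna", "dog", 42.0)]
print(matches(TagEntry("ben", "dog", 51.0), log))  # one match
print(matches(TagEntry("ben", "dog", 60.0), log))  # no match: 18 s apart
```

The additional scoring mechanisms mentioned above (rewarding rarer or more specific tags, for example) would layer extra rules on top of this basic match.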
12. Generating a constant flow of traffic is a challenge!
Important: partners, and publicity on external websites with relevant communities and large numbers of visitors.
Example (FWAW, in one week):
• Number of tags tripled to 160,000
• Number of registered players doubled to 362
13. Outcomes
• Stats
  • 340,551 tags added to 604 items; 42,068 unique tags
  • 39,134 pageviews, 555 registered players, 10,926 visits
  • Average playing time 6 min 45 s; 4,287 sessions
• [Chart: matches within Waisda? vs. matches with GTAA / Cornetto]
15. Evaluation
AV documentalists:
• Tags mostly describe short fragments and are often not very specific; they don’t describe a programme as a whole.
• BUT! This can be addressed by filtering and mapping free text tags to existing vocabularies.
• The WNW tags were the most useful and specific; content influences specificity.
• Tags can be used in different ways, and their relevance varies per user group.
• Documentalists are excited about further development!
20. Waisda? vs. Woordentikkertje

                  Waisda?          Woordentikkertje
Months            8                4.5
Videos            648              2,892
Players           2,435            689
Tags – total      428,832          392,860
Tags – unique     48,242 (11%)     43,407 (11%)
Matches:
  Players         156,546 (37%)    215,156 (55%)
  Geo. names*     6,089 (1.4%)     23,142 (5.8%)
  Persons*        107 (0.25%)      2,423 (0.6%)

* For Waisda? we looked at unique tags; for Woordentikkertje at the total number of tags.
21. Tips and lessons learned so far
• What are your success criteria?
• How do you define your target users, and
how do you reach them?
• How do you motivate your target users?
• Read existing reports and literature!
• Keep learning and improving!
25. Future work
• Open Source version of Waisda?
• Crowdsourcing Olympics
• More research into the added value of tags for
retrieval (subtitle comparison, tests with
various end users, more research on linking
semantically rich sources to tags)
26. ...recommended sources
blogs, feeds, people
• http://museumtwo.blogspot.com/
• http://80gb.wordpress.com/
• http://themuseumofthefuture.com/
• http://www.delicious.com/RuncocoProject/
• @ammeveleigh
• @archivesopen
• @digitalst
• @microtask
• @mia_out
• @museweb
• @runcoco
• @wittylama
This presentation is partly based on Oomen & Aroyo 2011:
http://www.slideshare.net/PaulaUdondek/crowdsourcing-in-het-cultureel-erfgoed-kansen-uitdagingen
27. Thanks!
@lottebelice / lbbaltussen@beeldengeluid.nl
Big thank you to:
B&G: @johanoomen / @mbrinkerink
VU: @laroyo / @McHildebrand
http://blog.waisda.nl
http://woordentikkertje.manbijthond.nl