QCon London: Mastering long-running processes in modern architectures
Audiovisual Content Exploitation at FIA 15042010 NISV
1. Audiovisual content exploitation in the networked information society Roeland Ordelman Research&Development Netherlands Institute for Sound and Vision rordelman@beeldengeluid.nl
17. A strong business model is necessary to support this kind of investment and prove that such an investment will result in long-term socio-economic returns
18. The outcome of a Cost-Benefit analysis was positive: “The total balance of costs and returns of restoring, preserving and digitising audio-visual material (excluding costs of tax payments) will be between: 20+ and 60+ million.’’
20. Direct effects of the investment are revenues from sales, access for specific user groups, the repartition of copyright for the use of the material and so on.
23. conservation of culture, reinforcement of cultural awareness, reinforcement of democracy through the accessibility of information, increase in multimedia literacy and contribution to the Lisbon goals set by the EUhttp://www.prestoprime.org/project/public.en.html
82. project goals: provide demonstrator portal to show how technology could help researchers acquire information on specific user requirements search collaboration linking privacy dedicated work space http://www.verteldverleden.org
98. Crowdsourcing 14 minutes left for annotation you score when somebody else uses the same term fill in words that describe what you see or hear Play!
The concept of technology-enhanced audiovisual indexing may have been a holy grail in earlier research, but recently it has become an urgent prerequisite in the context of our present-day information society. The effective exploitation of our Digital Libraries however, is currently impeded despite a wealth of technological progress. It becomes increasingly apparent that there may be an underlying problem, rooted in the disparity between technology and user needs. In order to respond to the demands of the information society advanced techniques and new workflow practices of use need to be explored by focusing on the needs of contemporary users, both professionals and non-professionals. At the same time, we need to foster opportunities for drawing user communities into the Digital Libraries, to involve them in enhancing content exploitability, e.g., by community tagging or capturing user generated content from the internet and aligning this with archived items. In this talk, the area of tension between the current state of technology and user needs is discussed in more detail in the context of practical use cases within the Netherlands Institute for Sound and Vision.
Let me first introduce Sound and Vision. Sound and Vision maintains and provides access to 70 per cent of the Dutch audio-visual heritage comprising approximately 700,000 hours of television, radio, music and film, making SV one of the largest audiovisual archives in Europe. The archive is growing. SV is the business archive of the national broadcasting corporationsand digitally born television and radio made by the Dutch public broadcastering companies is flowing in right after it is broadcast. Also, via the SV archive service PROARCHIVE content from Dutch cultural heritage institutes and regional broadcast institutes flows in. Finally we are currently selecting manually Dutch user generated content from the internet. More on this later.
€ 173 mln in 7 jaar (looptijd 2007-2014)wel terugverdienverplichting. Tijdens looptijd € 19 mln
Om dit soort use cases mogelijk te maken is handmatig beschrijven van materiaal niet de oplossing want te kostbaar en per definitie beperkt. Bij contentbeheerders ontbreken vaak de resources om zelfs minimale beschrijvingen zoals titel en datum te produceren
Om materiaal toch, en in meer detail te kunnen beschrijven zijn een aantal strategieen denkbaar.Zo wordt de hulp van het publiek ingeroepen om materiaal te beschrijven, het zogenaamde crowdsourcing.Een belangrijk alternatief waar we in Twente onderzoek naar gedaan hebben is het gebruik van beschikbare tekstuele bronnen zoals ondertitels, notulen of aantekeningen, als beschrijvingen van content. Met behulp van spraaktechnologie kunnen deze beschrijvingen soms ook worden gesynchroniseerd met het materiaal (denk hierbij aan notulen) of is er al een vorm van synchronisatie aanwezig (in het geval van ondertitels. Maar vaak zijn dit soort bronnen niet beschikbaar. Dan kan technologie die op basis van audiovisuele kenmerken automatisch beschrijvingen genereert uitkomst bieden.
allemaal potentieel interessante content voor verschillende typen gebruikers:producers die uit zijn op hergebruik, journalisten, onderzoekers, en natuurlijk ook het algemeen publiek.Zomaar een greep uit mogelijke use cases waar je aan kunt denken: