Tackling the digital video overload


Wesley De Neve


Context (1/2)
• Increasing consumption of online video content
  ◦ easy-to-use devices and online services
  ◦ cheap storage and bandwidth
  ◦ more and more people going online

• Increasing availability of online video content
  ◦ digitization of professional video archives
  ◦ popularity of user-generated video content

8/11/2012 2

Context (2/2)
• Some statistics
  ◦ professional video content
    - BBC Motion Gallery (as of January 2009)
      offers over 2.5 million hours of video content,
      with video content dating back 60 years
  ◦ user-generated video content
    - YouTube (as of October 2012)
      people watch 4 billion hours of video content each month
      people upload 72 hours of video content each minute
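To put the two figures side by side, the short calculation below converts the YouTube upload rate into hours per day and compares it with the size of the BBC Motion Gallery archive. It uses only the numbers quoted on the slide; the variable names are illustrative, and because the archive is described as "over" 2.5 million hours, the result is a lower bound.

```python
# Figures quoted on the slide.
UPLOAD_HOURS_PER_MINUTE = 72     # YouTube, as of October 2012
BBC_ARCHIVE_HOURS = 2_500_000    # BBC Motion Gallery, as of January 2009 ("over" this many)

# Hours of video uploaded to YouTube in one day.
uploaded_per_day = UPLOAD_HOURS_PER_MINUTE * 60 * 24

# Days of uploads needed to match the quoted archive size (lower bound).
days_to_match_archive = BBC_ARCHIVE_HOURS / uploaded_per_day

print(uploaded_per_day)
print(round(days_to_match_archive, 1))
```

At the quoted rate, less than a month of uploads matches an archive that covers 60 years of professional content.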

Digital Video Overload (1/2)
• Problem description
  ◦ our ability to manage video content cannot keep up with our ability to create it

• Cause
  ◦ to facilitate text-based video search, we need to manually annotate video content with textual labels

Digital Video Overload (2/2)
• Real cause
  ◦ people experience manual video annotation as time-consuming and cumbersome, and therefore forgo the effort

• Solution
  ◦ automatic video content understanding
  ◦ that is, the computerized translation of pixels into text

[Figure: example text label “Curiosity on Mars”]

Automatic Video Content Understanding
• Traditionally: video content analysis
  ◦ works reasonably well in highly controlled environments
  ◦ room for improvement in terms of applicability and effectiveness

• Nowadays: video content analysis, enhanced with
  ◦ unstructured knowledge from the Social Web, and/or
  ◦ structured knowledge from the Semantic Web

Two use cases follow.

Social Video Face Annotation (1/2)
• Description
  ◦ improving face annotation for personal video collections by harvesting online social network context

• Goal of video face annotation
  ◦ [Figure: faces labeled person 1, person 2, and person 3; search for people]

Social Video Face Annotation (2/2)
[Figure: contact list (contact 1 to contact 6), labeled face images, occurrence probabilities + co-occurrence probabilities]

• video face recognition using visual features
• robust video face recognition using visual and social features

[ published in IEEE ToMM, 2011 ]
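As a rough illustration of how visual and social features might be combined, the sketch below re-weights visual face-recognition scores with occurrence probabilities taken from a contact list. The function name, the multiplicative fusion rule, and all numbers are assumptions made for illustration; the slides do not say how the published method combines the two, and co-occurrence probabilities are left out here for brevity.

```python
def fuse_scores(visual_scores, occurrence_probs):
    """Combine visual match scores with social occurrence priors.

    Hypothetical fusion rule: multiply each contact's visual score by its
    occurrence probability, then normalize so the results sum to 1.
    """
    combined = {
        name: visual_scores[name] * occurrence_probs.get(name, 0.0)
        for name in visual_scores
    }
    total = sum(combined.values())
    if total == 0:
        # No usable social evidence: fall back to the visual scores alone.
        return dict(visual_scores)
    return {name: score / total for name, score in combined.items()}


# Illustrative numbers only (not taken from the slides).
visual = {"contact 1": 0.40, "contact 2": 0.45, "contact 3": 0.15}
occurrence = {"contact 1": 0.60, "contact 2": 0.10, "contact 3": 0.30}

fused = fuse_scores(visual, occurrence)
best_visual = max(visual, key=visual.get)
best_fused = max(fused, key=fused.get)
print(best_visual, "->", best_fused)
```

In this made-up example the visual scores alone slightly favor contact 2, while the fused scores favor contact 1, who appears far more often in the owner's social context.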

Annotation of Live Soccer Video (2/2)

[Figure: plot of tweets/s (0 to 6) against time (s) (0 to 10)]

• soccer event detection using visual features
• Twitter-assisted annotation of live soccer video
• What is happening? What are people saying?

[ submitted to IEEE ToMM, 2012 ]
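One simple way to turn tweet volume into candidate event markers is to flag the intervals in which the tweet rate rises above a threshold. The sketch below does only that; the fixed-threshold rule, the function name, and the sample rates are assumptions for illustration and are not taken from the submitted paper.

```python
def detect_bursts(tweets_per_second, threshold):
    """Return (start, end) index pairs where the rate stays above threshold.

    `end` is exclusive. The thresholding rule is a hypothetical stand-in
    for whatever burst detector a real system would use.
    """
    bursts = []
    start = None
    for i, rate in enumerate(tweets_per_second):
        if rate > threshold and start is None:
            start = i                      # a burst begins
        elif rate <= threshold and start is not None:
            bursts.append((start, i))      # the burst ends
            start = None
    if start is not None:
        # The series ended while still inside a burst.
        bursts.append((start, len(tweets_per_second)))
    return bursts


# Made-up tweet rates, one value per second.
rates = [1, 1, 2, 5, 6, 4, 2, 1, 1, 1, 1]
bursts = detect_bursts(rates, threshold=3)
print(bursts)
```

Each detected interval could then be matched against the output of the visual event detector, and the tweets inside it mined for a textual label.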

Other Use Cases
• Movie actor recognition
• Semantic video copy detection
• Audiovisual enrichment of text documents

Research Challenges (1/2)
• Design of techniques that jointly take advantage of unstructured and structured knowledge
  ◦ unstructured knowledge: collective knowledge
  ◦ structured knowledge: Linked Data Cloud
  ◦ cf. “Everything is Connected” for video content enrichment: http://everythingisconnected.be/

• Design of techniques for translating unstructured knowledge into structured knowledge
  ◦ velocity, volume, and variety
  ◦ sparsity, ambiguity, and complexity

Research Challenges (2/2)
• Design of effective semantic similarity metrics
  ◦ [Figure: visual distance, semantic distance]

• Design of user-oriented performance metrics
  ◦ need to go beyond the use of precision and recall
  ◦ need to better capture whether the needs of users have been met by a video content retrieval system
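For reference, precision and recall for a single query can be computed as in the sketch below. The video identifiers are made up, and the function is a generic textbook formulation rather than anything specific to the systems discussed in the slides.

```python
def precision_recall(retrieved, relevant):
    """Compute set-based precision and recall for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    # Precision: fraction of retrieved videos that are relevant.
    precision = hits / len(retrieved) if retrieved else 0.0
    # Recall: fraction of relevant videos that were retrieved.
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical query result and ground truth.
retrieved = ["v1", "v2", "v3", "v4"]
relevant = ["v2", "v4", "v7"]

p, r = precision_recall(retrieved, relevant)
print(p, round(r, 3))
```

Both numbers depend only on set overlap. Neither says whether the user actually found what they were looking for, which is the gap a user-oriented metric would need to cover.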

1. Tackling the Digital Video Overload Wesley De Neve 8/11/2012 1

9. Annotation of Live Soccer Video (1/2)
• Description
  ◦ annotation of live soccer video by harvesting collective knowledge from Twitter

• Goal of annotating soccer video
  ◦ [Figure: video segments labeled logo, attack, goal, trainer, logo; search for events]

14. Thank you! 8/11/2012 14
