SlideShare a Scribd company logo
1 of 14
Tackling the Digital
   Video Overload
   Wesley De Neve




8/11/2012                 1
Context (1/2)
 Increasing consumption of online video content
    easy-to-use devices and online services
    cheap storage and bandwidth
    more and more people going online

 Increasing availability of online video content
    digitization of professional video archives
    popularity of user-generated video content




                  8/11/2012                         2
Context (2/2)
 Some statistics
   professional video content
      BBC Motion Gallery (as of January 2009)
         offers over 2.5 million hours of video content
         with video content dating back 60 years in time

   user-generated video content
      YouTube (as of October 2012)
          people watch 4 billion hours of video content each month
          people upload 72 hours of video content each minute



                   8/11/2012                                      3
Digital Video Overload (1/2)
 Problem description
   our ability to manage video content is not able to keep
    up with our ability to create video content


 Cause
   to facilitate text-based video search, we need to
    manually annotate video content with textual labels




                 8/11/2012                                4
Digital Video Overload (2/2)
 Real cause
   people experience manual video annotation as time-
    consuming and cumbersome, thus foregoing the effort


 Solution
   automatic video content understanding
   this is, computerized translation of pixels into text


                                                      “Curiosity
                                                      on Mars”


                  8/11/2012                                    5
Automatic Video Content Understanding
 Traditionally: video content analysis
   works reasonably well in highly controlled environments
   room for improvement in terms of applicability and
    effectiveness


 Nowadays: video content analysis, enhanced with
   unstructured knowledge from the Social Web, and/or
   structured knowledge from the Semantic Web

                                            two use cases



                 8/11/2012                                  6
Social Video Face Annotation (1/2)
 Description
   improving face annotation for personal video collections
    by harvesting online social network context

 Goal of video face annotation

            person 2
    person 1
                    person 3

                                 Search for peoples




                     8/11/2012                           7
Social Video Face Annotation (2/2)
      Contact list
                                                      Labeled face images
     contact 1

     contact 2
                                                            occurrence
     contact 3
                                 +                          probabilities
     contact 4

     contact 5                                             co-occurrence
     contact 6                                              probabilities




                      video face recognition using
                             visual features


                      robust video face recognition
                     using visual and social features
                     8/11/2012         [ published in IEEE ToMM, 2011 ]     8
Annotation of Live Soccer Video (1/2)
 Description
   annotation of live soccer video by harvesting collective
    knowledge from Twitter


 Goal of annotating soccer video



  logo       attack           goal     trainer       logo


                Search for events


                  8/11/2012                                 9
Annotation of Live Soccer Video (2/2)


              6
   Tweets/s




              4

              2

              0
                  0                5   Time (s)        10



                      soccer event detection using
                             visual features

                      Twitter-assisted annotation              What is happening?
                          of live soccer video                 What are people saying?

                       8/11/2012              [ submitted to IEEE ToMM, 2012 ]   10
Other Use Cases
 Movie actor recognition



 Semantic video copy
  detection



 Audiovisual enrichment
  of text documents

               8/11/2012    11
Research Challenges (1/2)
 Design of techniques that jointly take advantage
  of unstructured and structured knowledge
   unstructured knowledge: collective knowledge
   structured knowledge: Linked Data Cloud
      cf. “Everything is Connected” for video content enrichment
      http://everythingisconnected.be/


 Design of techniques for translating unstructured
  knowledge into structured knowledge
   velocity, volume, and variety
   sparsity, ambiguity, and complexity
                   8/11/2012                                        12
Research Challenges (2/2)
 Design of effective semantic similarity metrics

                            visual distance



                       semantic distance




 Design of user-oriented performance metrics
   need to go beyond the use of precision and recall
   need to better capture whether the needs of users
    have been met by a video content retrieval system
                8/11/2012                               13
Thank you!


        8/11/2012   14

More Related Content

More from Wesley De Neve

More from Wesley De Neve (20)

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
 
Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...
 
Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
 
The 5th Aslla Symposium
The 5th Aslla SymposiumThe 5th Aslla Symposium
The 5th Aslla Symposium
 
Ghent University Global Campus 101
Ghent University Global Campus 101Ghent University Global Campus 101
Ghent University Global Campus 101
 
Booklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumBooklet for the First GUGC Research Symposium
Booklet for the First GUGC Research Symposium
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
 
Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...
 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniques
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
 
GUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsGUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and Bioinformatics
 
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
 
Ghent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesGhent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research Activities
 
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
 
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 Exploring Deep Machine Learning for Automatic Right Whale Recognition and No... Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
 
Towards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingTowards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processing
 
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Tackling the digital video overload

  • 1. Tackling the Digital Video Overload Wesley De Neve 8/11/2012 1
  • 2. Context (1/2)  Increasing consumption of online video content  easy-to-use devices and online services  cheap storage and bandwidth  more and more people going online  Increasing availability of online video content  digitization of professional video archives  popularity of user-generated video content 8/11/2012 2
  • 3. Context (2/2)  Some statistics  professional video content  BBC Motion Gallery (as of January 2009)  offers over 2.5 million hours of video content  with video content dating back 60 years in time  user-generated video content  YouTube (as of October 2012)  people watch 4 billion hours of video content each month  people upload 72 hours of video content each minute 8/11/2012 3
  • 4. Digital Video Overload (1/2)  Problem description  our ability to manage video content is not able to keep up with our ability to create video content  Cause  to facilitate text-based video search, we need to manually annotate video content with textual labels 8/11/2012 4
  • 5. Digital Video Overload (2/2)  Real cause  people experience manual video annotation as time- consuming and cumbersome, thus foregoing the effort  Solution  automatic video content understanding  this is, computerized translation of pixels into text “Curiosity on Mars” 8/11/2012 5
  • 6. Automatic Video Content Understanding  Traditionally: video content analysis  works reasonably well in highly controlled environments  room for improvement in terms of applicability and effectiveness  Nowadays: video content analysis, enhanced with  unstructured knowledge from the Social Web, and/or  structured knowledge from the Semantic Web two use cases 8/11/2012 6
  • 7. Social Video Face Annotation (1/2)  Description  improving face annotation for personal video collections by harvesting online social network context  Goal of video face annotation person 2 person 1 person 3 Search for peoples 8/11/2012 7
  • 8. Social Video Face Annotation (2/2) Contact list Labeled face images contact 1 contact 2 occurrence contact 3 + probabilities contact 4 contact 5 co-occurrence contact 6 probabilities video face recognition using visual features robust video face recognition using visual and social features 8/11/2012 [ published in IEEE ToMM, 2011 ] 8
  • 9. Annotation of Live Soccer Video (1/2)  Description  annotation of live soccer video by harvesting collective knowledge from Twitter  Goal of annotating soccer video logo attack goal trainer logo Search for events 8/11/2012 9
  • 10. Annotation of Live Soccer Video (2/2) 6 Tweets/s 4 2 0 0 5 Time (s) 10 soccer event detection using visual features Twitter-assisted annotation What is happening? of live soccer video What are people saying? 8/11/2012 [ submitted to IEEE ToMM, 2012 ] 10
  • 11. Other Use Cases  Movie actor recognition  Semantic video copy detection  Audiovisual enrichment of text documents 8/11/2012 11
  • 12. Research Challenges (1/2)  Design of techniques that jointly take advantage of unstructured and structured knowledge  unstructured knowledge: collective knowledge  structured knowledge: Linked Data Cloud  cf. “Everything is Connected” for video content enrichment  http://everythingisconnected.be/  Design of techniques for translating unstructured knowledge into structured knowledge  velocity, volume, and variety  sparsity, ambiguity, and complexity 8/11/2012 12
  • 13. Research Challenges (2/2)  Design of effective semantic similarity metrics visual distance semantic distance  Design of user-oriented performance metrics  need to go beyond the use of precision and recall  need to better capture whether the needs of users have been met by a video content retrieval system 8/11/2012 13
  • 14. Thank you! 8/11/2012 14