SlideShare a Scribd company logo
1 of 14
Download to read offline
PISA
Production, Indexing and Search
of Audio-visual Material
              Image Processing
    Tinne Tuytelaars, IBBT – PSI – K.U.Leuven
Computer Assisted Analysis


! Intelligent analysis = reverse engineering




! Shot cut detection demo
! Scene segmentation demo
! Video reuse detection demo
! Face detection
! Face recognition demo
! Audio classification demo


                                               61
Shot cut detection


= Split the video stream in atomic units, corresponding to a
  continuously moving camera
! Distinguish between abrupt and smooth shotcuts
! Experimented with different methods
   ! Using color histograms
   ! Using affine motion compensation
   ! Using motion estimation within the compressed domain




                                                          62
Scene segmentation




                     68
Video reuse detection

The same video material is often reused
! can be detected automatically
! Robust to post-processing
! Efficiency
! Based on spatio-temporal local features and locality
  sensitive hashing




                                                         69
Face Detection


            • Face candidates selection:
             quot; Candidate regions have skin color # region-based skin
               segmentation
             quot; Personalized chrominance skin boundary




            • Verification based on cues:
                 quot;   Shape of ellipse
                 quot;   Ellipse-filling percentage
                 quot;   Gray-tone smoothness
                 quot;   Corners of the facial features




                                                                   70
Face detection results

• 92% good face detections
    quot;Comparable to state-of-the-art face learning of ‘Viola & Jones’ obtains 93%
    quot;adaptation to lighting conditions and personal face looks




                                                                                   71
Face recognition


! Based on a 3D morphable model
! 3D model is fitted to 2D image
! Shape and texture parameters used as face descriptor
! Robust to
   ! viewpoint changes,
   ! illumination changes,
   ! partial occlusions.




                                                         72
Face recognition




                   73
Face recognition




                   74
Face recognition




                   75
Face recognition




                   76
Face recognition




                   77
Future work


! Include facial expressions in face recognition
! Multi-modal scene segmentation
! Feedback loop from Trouvaille
! Object or scene recognition
! Thesaurus-based speech recognition




                                                   79

More Related Content

More from vrt-medialab

Multischermenonderzoek
MultischermenonderzoekMultischermenonderzoek
Multischermenonderzoekvrt-medialab
 
Browser as a broadcast medium
Browser as a broadcast mediumBrowser as a broadcast medium
Browser as a broadcast mediumvrt-medialab
 
Taming your media chaos
Taming your media chaosTaming your media chaos
Taming your media chaosvrt-medialab
 
Presentatie iMinds MediaCRM
Presentatie iMinds MediaCRMPresentatie iMinds MediaCRM
Presentatie iMinds MediaCRMvrt-medialab
 
Evaluatiestudie VillaSquare
 Evaluatiestudie VillaSquare Evaluatiestudie VillaSquare
Evaluatiestudie VillaSquarevrt-medialab
 
iMinds VillaSquare evaluation IBBT-SMIT
iMinds VillaSquare evaluation IBBT-SMITiMinds VillaSquare evaluation IBBT-SMIT
iMinds VillaSquare evaluation IBBT-SMITvrt-medialab
 
Building second screen TV apps
Building second screen TV appsBuilding second screen TV apps
Building second screen TV appsvrt-medialab
 
Multischermenonderzoek
MultischermenonderzoekMultischermenonderzoek
Multischermenonderzoekvrt-medialab
 
Exploring your media with the Semantic Web
Exploring your media with the Semantic WebExploring your media with the Semantic Web
Exploring your media with the Semantic Webvrt-medialab
 
BDMA workshop presentation - Using the Second Screen - MediaSquare - MediaCRM
BDMA workshop presentation - Using the Second Screen - MediaSquare - MediaCRMBDMA workshop presentation - Using the Second Screen - MediaSquare - MediaCRM
BDMA workshop presentation - Using the Second Screen - MediaSquare - MediaCRMvrt-medialab
 
Champ belgian broadcast_days
Champ belgian broadcast_daysChamp belgian broadcast_days
Champ belgian broadcast_daysvrt-medialab
 
Champ Pitch Celtic-Plus Event 2011
Champ Pitch Celtic-Plus Event 2011Champ Pitch Celtic-Plus Event 2011
Champ Pitch Celtic-Plus Event 2011vrt-medialab
 
html5 an introduction
html5 an introductionhtml5 an introduction
html5 an introductionvrt-medialab
 
Boost your search with semantic technology
Boost your search with semantic technologyBoost your search with semantic technology
Boost your search with semantic technologyvrt-medialab
 
Media Square : platform for second screen experiences
Media Square : platform for second screen experiencesMedia Square : platform for second screen experiences
Media Square : platform for second screen experiencesvrt-medialab
 
MediaSquare - Check into your favourite media
MediaSquare - Check into your favourite mediaMediaSquare - Check into your favourite media
MediaSquare - Check into your favourite mediavrt-medialab
 

More from vrt-medialab (20)

Multischermenonderzoek
MultischermenonderzoekMultischermenonderzoek
Multischermenonderzoek
 
Browser as a broadcast medium
Browser as a broadcast mediumBrowser as a broadcast medium
Browser as a broadcast medium
 
Champ iMinds
Champ iMindsChamp iMinds
Champ iMinds
 
Taming your media chaos
Taming your media chaosTaming your media chaos
Taming your media chaos
 
Presentatie iMinds MediaCRM
Presentatie iMinds MediaCRMPresentatie iMinds MediaCRM
Presentatie iMinds MediaCRM
 
Evaluatiestudie VillaSquare
 Evaluatiestudie VillaSquare Evaluatiestudie VillaSquare
Evaluatiestudie VillaSquare
 
iMinds VillaSquare evaluation IBBT-SMIT
iMinds VillaSquare evaluation IBBT-SMITiMinds VillaSquare evaluation IBBT-SMIT
iMinds VillaSquare evaluation IBBT-SMIT
 
Building second screen TV apps
Building second screen TV appsBuilding second screen TV apps
Building second screen TV apps
 
Multischermenonderzoek
MultischermenonderzoekMultischermenonderzoek
Multischermenonderzoek
 
Exploring your media with the Semantic Web
Exploring your media with the Semantic WebExploring your media with the Semantic Web
Exploring your media with the Semantic Web
 
BDMA workshop presentation - Using the Second Screen - MediaSquare - MediaCRM
BDMA workshop presentation - Using the Second Screen - MediaSquare - MediaCRMBDMA workshop presentation - Using the Second Screen - MediaSquare - MediaCRM
BDMA workshop presentation - Using the Second Screen - MediaSquare - MediaCRM
 
Champ belgian broadcast_days
Champ belgian broadcast_daysChamp belgian broadcast_days
Champ belgian broadcast_days
 
Champ Pitch Celtic-Plus Event 2011
Champ Pitch Celtic-Plus Event 2011Champ Pitch Celtic-Plus Event 2011
Champ Pitch Celtic-Plus Event 2011
 
medialoep
medialoepmedialoep
medialoep
 
video for html5
video for html5video for html5
video for html5
 
html5 an introduction
html5 an introductionhtml5 an introduction
html5 an introduction
 
Boost your search with semantic technology
Boost your search with semantic technologyBoost your search with semantic technology
Boost your search with semantic technology
 
Media Square : platform for second screen experiences
Media Square : platform for second screen experiencesMedia Square : platform for second screen experiences
Media Square : platform for second screen experiences
 
MediaSquare - Check into your favourite media
MediaSquare - Check into your favourite mediaMediaSquare - Check into your favourite media
MediaSquare - Check into your favourite media
 
Transmedia
TransmediaTransmedia
Transmedia
 

Recently uploaded

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 

Recently uploaded (20)

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 

PISA Production, Indexing and Search of Audio-visual Material

  • 1. PISA Production, Indexing and Search of Audio-visual Material Image Processing Tinne Tuytelaars, IBBT – PSI – K.U.Leuven
  • 2. Computer Assisted Analysis ! Intelligent analysis = reverse engineering ! Shot cut detection demo ! Scene segmentation demo ! Video reuse detection demo ! Face detection ! Face recognition demo ! Audio classification demo 61
  • 3. Shot cut detection = Split the video stream in atomic units, corresponding to a continuously moving camera ! Distinguish between abrupt and smooth shotcuts ! Experimented with different methods ! Using color histograms ! Using affine motion compensation ! Using motion estimation within the compressed domain 62
  • 5. Video reuse detection The same video material is often reused ! can be detected automatically ! Robust to post-processing ! Efficiency ! Based on spatio-temporal local features and locality sensitive hashing 69
  • 6. Face Detection • Face candidates selection: quot; Candidate regions have skin color # region-based skin segmentation quot; Personalized chrominance skin boundary • Verification based on cues: quot; Shape of ellipse quot; Ellipse-filling percentage quot; Gray-tone smoothness quot; Corners of the facial features 70
  • 7. Face detection results • 92% good face detections quot;Comparable to state-of-the-art face learning of ‘Viola & Jones’ obtains 93% quot;adaptation to lighting conditions and personal face looks 71
  • 8. Face recognition ! Based on a 3D morphable model ! 3D model is fitted to 2D image ! Shape and texture parameters used as face descriptor ! Robust to ! viewpoint changes, ! illumination changes, ! partial occlusions. 72
  • 14. Future work ! Include facial expressions in face recognition ! Multi-modal scene segmentation ! Feedback loop from Trouvaille ! Object or scene recognition ! Thesaurus-based speech recognition 79