Video complexity analyzer (VCA) for streaming applications


Vignesh V Menon and Hadi Amirpour gave a talk on ‘Video Complexity Analyzer for Streaming Applications’ at the Video Quality Experts Group (VQEG) meeting on December 14, 2021. Our research activities on video complexity analysis were presented in the talk.

All rights reserved. ©2020
All rights reserved. ©2021
Video complexity analyzer (VCA) for streaming applications
December 14, 2021
Presenters: Vignesh V Menon & Hadi Amirpour

About Us
Vignesh V Menon and Hadi Amirpour
Researchers at the Christian Doppler Laboratory ATHENA, University of Klagenfurt, Austria

● Motivation for VCA
● Features
● Experimental Results
● Applications
● Future Roadmap

Motivation
● We aim to develop online prediction systems tailor-made for live streaming applications.
● The state-of-the-art spatial and temporal complexity features are spatial information (SI) and temporal information (TI), together referred to as SI-TI.
Fig. 1: Correlation of the SI feature with the number of bits (in kb) per frame in IDR encoding with QP 27 of x265 for 24 test sequences from the MCML [1] and SJTU [2] datasets.
Pearson correlation coefficient (PCC) of SI with bits per frame is ~0.79.
[1] M. Cheon and J.-S. Lee, “Subjective and Objective Quality Assessment of Compressed 4K UHD Videos for Immersive Experience,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 28, no. 7, pp. 1467–1480, 2018.
[2] L. Song, X. Tang, W. Zhang, X. Yang, and P. Xia, “The SJTU 4K Video Sequence Dataset,” Fifth International Workshop on Quality of Multimedia Experience (QoMEX 2013), Jul. 2013.
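For readers unfamiliar with SI-TI, the following is a minimal sketch of per-frame SI and TI in the spirit of ITU-T Rec. P.910: SI is the spatial standard deviation of the Sobel-filtered luma frame, and TI is the spatial standard deviation of the difference between consecutive frames. P.910 takes the maximum of these values over time; the sketch returns per-frame values, as in the per-frame correlation above. The function names, the use of the population standard deviation, and the plain list-of-lists frame layout are assumptions made for illustration.

```python
import math


def _std(values):
    """Population standard deviation of a flat list of numbers (assumed variant)."""
    mean = sum(values) / len(values)
    return math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))


def sobel_magnitude(frame):
    """Sobel gradient magnitude over the interior pixels of a 2-D luma frame."""
    rows, cols = len(frame), len(frame[0])
    out = []
    for y in range(1, rows - 1):
        for x in range(1, cols - 1):
            # Horizontal gradient: right column minus left column.
            gx = (frame[y - 1][x + 1] + 2 * frame[y][x + 1] + frame[y + 1][x + 1]
                  - frame[y - 1][x - 1] - 2 * frame[y][x - 1] - frame[y + 1][x - 1])
            # Vertical gradient: bottom row minus top row.
            gy = (frame[y + 1][x - 1] + 2 * frame[y + 1][x] + frame[y + 1][x + 1]
                  - frame[y - 1][x - 1] - 2 * frame[y - 1][x] - frame[y - 1][x + 1])
            out.append(math.hypot(gx, gy))
    return out


def spatial_information(frame):
    """Per-frame SI: spatial standard deviation of the Sobel magnitude."""
    return _std(sobel_magnitude(frame))


def temporal_information(frame, previous):
    """Per-frame TI: spatial standard deviation of the frame difference."""
    diff = [a - b
            for row_a, row_b in zip(frame, previous)
            for a, b in zip(row_a, row_b)]
    return _std(diff)
```

A flat frame has no gradients, so its SI is zero; two identical frames give a TI of zero.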

● Time taken to compute SI-TI features is very high!
➢ ~0.05 seconds per frame for 1080p, ~0.2 seconds per frame for 2160p
➢ Not suitable for live applications
➢ Higher computational cost in VoD applications
Fig. 2: Time taken to compute SI-TI features [3] on an Intel Xeon Gold 5218R.
[3] SITI source code: https://github.com/Telecommunication-Telemedia-Assessment/SITI
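The per-frame timings above translate directly into an analysis throughput. The short sketch below does that arithmetic; the 30 fps live-stream frame rate is a hypothetical value chosen only for illustration and does not come from the slides.

```python
# Per-frame SI-TI computation times quoted on the slide (seconds per frame).
SITI_SECONDS_PER_FRAME = {"1080p": 0.05, "2160p": 0.2}


def frames_per_second(seconds_per_frame):
    """Throughput of an analyzer that needs `seconds_per_frame` per frame."""
    return 1.0 / seconds_per_frame


def can_keep_up(seconds_per_frame, stream_fps):
    """True if the analyzer processes frames at least as fast as they arrive."""
    return frames_per_second(seconds_per_frame) >= stream_fps


if __name__ == "__main__":
    ASSUMED_STREAM_FPS = 30  # hypothetical live-stream frame rate
    for resolution, seconds in SITI_SECONDS_PER_FRAME.items():
        fps = frames_per_second(seconds)
        ok = can_keep_up(seconds, ASSUMED_STREAM_FPS)
        print(f"{resolution}: about {fps:.0f} fps, real-time at 30 fps: {ok}")
```

Under that assumption, roughly 20 fps at 1080p and 5 fps at 2160p both fall short of the stream rate, which is the sense in which SI-TI is not suitable for live applications.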

Video Complexity Analyzer (VCA) can be realized as a fast preprocessor that determines the spatial and temporal complexity of videos (segments) to aid the encoding process.
Fig. 3: The proposed framework for streaming applications.

Features

Spatial complexity feature
The texture energy H of each block is computed from its DCT coefficients, where k is the block address in the p-th frame, w×w pixels is the size of the block, and DCT(i, j) is the (i, j)-th DCT component when i + j > 1, and 0 otherwise.
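A DCT-based block energy of this kind can be sketched as a weighted sum of absolute DCT coefficients. In the sketch below, the condition i + j > 1 follows the slide text; the exponential weighting function, the zero-based index convention, the orthonormal DCT-II scaling, and all function names are assumptions made for illustration, because the slide's equation is not part of this transcript.

```python
import math


def dct2(block):
    """Orthonormal 2-D DCT-II of a square block (naive O(w^4), fine for small w)."""
    w = len(block)

    def alpha(u):
        return math.sqrt((1.0 if u == 0 else 2.0) / w)

    def basis(n, u):
        return math.cos((2 * n + 1) * u * math.pi / (2 * w))

    return [[alpha(u) * alpha(v) * sum(block[y][x] * basis(y, u) * basis(x, v)
                                       for y in range(w) for x in range(w))
             for v in range(w)]
            for u in range(w)]


def assumed_weight(i, j, w):
    """ASSUMPTION: an exponential frequency weighting, not taken from the slide."""
    return math.exp(abs((i * j / w ** 2) ** 2 - 1))


def block_texture_energy(block, weight=assumed_weight):
    """Weighted sum of |DCT(i, j)| over coefficients with i + j > 1."""
    w = len(block)
    coeffs = dct2(block)
    return sum(weight(i, j, w) * abs(coeffs[i][j])
               for i in range(w) for j in range(w)
               if i + j > 1)  # zero-based indices assumed
```

A flat block has only a DC coefficient, so its energy is numerically zero, while a checkerboard block yields a clearly positive value. Passing a different `weight` callable swaps in another weighting without touching the rest of the sketch.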

The frame-level spatial feature E is obtained from the block-wise texture energies, where C represents the number of blocks per frame.
Fig. 4: Number of bits (in kb) per frame and E feature of the Wood sequence from the SJTU dataset.
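One way to turn block energies into a single per-frame value is to average them over the C blocks and normalize by the block area. The sketch below does exactly that; the normalization by w² and the function name are assumptions, since the slide's equation for E is not part of this transcript.

```python
def spatial_complexity(block_energies, block_size):
    """Average block texture energy of one frame, normalized by block area.

    block_energies: one H value per block of the frame (length C).
    block_size: block width w in pixels (blocks are w x w).
    ASSUMPTION: E = sum(H) / (C * w^2).
    """
    c = len(block_energies)
    if c == 0:
        raise ValueError("a frame needs at least one block")
    return sum(block_energies) / (c * block_size ** 2)


if __name__ == "__main__":
    # Two hypothetical 32x32 blocks with energies 1024 and 2048.
    print(spatial_complexity([1024.0, 2048.0], 32))
```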

Temporal complexity feature
The block-wise sum of absolute differences (SAD) between the texture energy of each frame (p) and that of its previous frame (p-1) is computed; this yields the temporal feature h.
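The block-wise SAD described above can be sketched as follows. The inputs are the per-block texture energies of two consecutive frames; the normalization by the number of blocks and the block area, as well as the function name, are assumptions made for illustration.

```python
def temporal_complexity(current, previous, block_size):
    """SAD of co-located block texture energies of frames p and p-1.

    current, previous: per-block H values of frame p and frame p-1.
    block_size: block width w in pixels.
    ASSUMPTION: h = sum(|H_p,k - H_p-1,k|) / (C * w^2).
    """
    if not current or len(current) != len(previous):
        raise ValueError("both frames need the same, non-zero number of blocks")
    sad = sum(abs(a - b) for a, b in zip(current, previous))
    return sad / (len(current) * block_size ** 2)


if __name__ == "__main__":
    # Hypothetical energies for two 2x2-pixel blocks in consecutive frames.
    print(temporal_complexity([10.0, 30.0], [14.0, 22.0], 2))
```

Two identical frames give h = 0, so h grows with the amount of texture change between frames.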

Experimental Results
PCC(SI, Bits per frame) = 0.787
PCC(E, Bits per frame) = 0.856
Fig. 5: Correlation of SI and E features with number of bits (in kb) per frame.
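The PCC values above compare a per-frame feature with the per-frame bit count. A minimal sketch of the Pearson correlation coefficient is given below; the function name and the toy inputs in the usage lines are illustrative and are not data from the talk.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    feature = [1.0, 2.0, 3.0, 4.0]      # toy per-frame feature values
    bits = [10.0, 19.0, 32.0, 41.0]     # toy per-frame bit counts
    print(round(pearson(feature, bits), 3))
```

A value close to 1 means the feature rises and falls with the bit count, which is why a higher PCC makes a feature more useful for predicting encoding cost.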

Fig. 6: Average time to compute E-h for various resolutions (with x86 SIMD).
Note: Presently, E-h computation is 5 times faster than SI-TI computation.

Applications

Shot Detection
We define the gradient of h per frame p, denoted epsilon (ε).
Fig. 7: Epsilon values for the ToS sequence. Shot transitions occur at frames 107, 110, 238, 338, 465, 531, 596, 778, 850, 917, 1018, 1361, and 1437.
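A gradient of h can be sketched as the relative change between consecutive frames. The formula used below, ε_p = (h_{p-1} − h_p) / h_{p-1}, is an assumption, because the slide's equation is not part of this transcript. The fixed-threshold candidate test is likewise a hypothetical simplification and is not the authors' shot detection algorithm.

```python
def h_gradient(h_values):
    """Relative change of h between consecutive frames.

    Returns one value per frame p = 1 .. n-1 (frame 0 has no predecessor).
    ASSUMPTION: epsilon_p = (h[p-1] - h[p]) / h[p-1], with 0.0 when h[p-1] is 0.
    """
    eps = []
    for prev, cur in zip(h_values, h_values[1:]):
        eps.append((prev - cur) / prev if prev != 0 else 0.0)
    return eps


def candidate_transitions(h_values, threshold):
    """Frame indices whose |epsilon| exceeds a hypothetical fixed threshold."""
    return [p
            for p, e in enumerate(h_gradient(h_values), start=1)
            if abs(e) > threshold]


if __name__ == "__main__":
    h = [4.0, 2.0, 8.0]  # toy h values for three frames
    print(h_gradient(h))
    print(candidate_transitions(h, threshold=1.0))
```

Large swings in ε mark frames where the temporal complexity changes abruptly, which is what makes it a plausible cue for shot transitions.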

The algorithm is divided into two steps:
● Feature extraction
● Successive elimination algorithm
Source: V. V. Menon, H. Amirpour, M. Ghanbari, and C. Timmerer, “Efficient Content-Adaptive Feature-Based Shot Detection for HTTP Adaptive Streaming,” in 2021 IEEE International Conference on Image Processing (ICIP), 2021, pp. 2174–2178.
The benchmark is the default shot detection algorithm of x265.

Future Roadmap
● The initial version will be released before March 1, 2022.
● Adding multi-threading support
○ H computation for blocks in each frame can be realized concurrently.
○ ~6x speedup expected with 8 threads.
● Adding CUDA/OpenCL support
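The expected ~6x speedup with 8 threads can be related to the share of the work that runs in parallel. The sketch below assumes that Amdahl's law is an adequate model for this workload, which the slides do not state; under that assumption, a 6x speedup on 8 threads corresponds to about 95% of the computation being parallelizable.

```python
def amdahl_speedup(parallel_fraction, threads):
    """Speedup predicted by Amdahl's law for a given parallel fraction."""
    return 1.0 / ((1.0 - parallel_fraction) + parallel_fraction / threads)


def parallel_fraction_for(speedup, threads):
    """Invert Amdahl's law: parallel fraction needed to reach `speedup`."""
    return (1.0 - 1.0 / speedup) / (1.0 - 1.0 / threads)


if __name__ == "__main__":
    p = parallel_fraction_for(6.0, 8)  # figures quoted on the slide
    print(f"parallel fraction: {p:.3f}")
    print(f"check: {amdahl_speedup(p, 8):.2f}x with 8 threads")
```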

Thanks for your attention!
Vignesh V Menon (vignesh.menon@aau.at)
Hadi Amirpour (hadi.amirpour@aau.at)
