This document discusses video coding standards and techniques. It summarizes the development of standards including H.264/AVC and the newer H.265/HEVC, which can provide 50% bitrate reductions. It also covers research topics like immersive video communication and measuring human visual perception. Overall the document traces advances in video coding and compression from early standards to current research frontiers like high resolution 3D video and improved subjective quality assessment.
7. slide7
1989: Digital TV – Digital Broadcast, DVD
1999: Birth of H.26L in Berlin
Today: >3 Billion devices with H.264/AVC
50% of all bits on the Internet
Every HDTV Receiver
Every Blu-Ray Player
Most Internet Video
Countless Mobile Video
9. slide9
§ How to run the video encoder?
§ Decision between many options denoted with vector p
§ Unconstrained Lagrangian Formulation:
€
min
p
D(p)+ λ ⋅ R(p)
D - Distortion
R - Rate
RT - Target Rate
p - Parameter Vector
with λ controlling the rate-distortion trade-off
min
p
D(p) s.t. R(p)≤ RT
§ Minimization tests the various modes in video coding
[Wiegand, et al., 1996]
§ Constrained Problem:
12. slide12
Entropy
Coding
Scaling & Inv.
Transform
Motion-
Compensation
Control
Data
Quant.
Transf. coeffs
Motion
Data
Intra/Inter
Coder
Control
Decoder
Motion
Estimation
Transform/
Scal./Quant.
-
Input
Video
Signal
Split into
Macroblocks
16x16 pixels
Intra-frame
Prediction
De-blocking
Filter
Output
Video
Signal
8x8
0
4x8
0 1
0 1
2 3
4x48x4
1
08x8
Types
0
16x16
0 1
8x16
MB
Types
8x8
0 1
2 3
16x8
1
0
13. slide13
§ Division of a picture into square blocks
§ Blocks are assigned to quadtrees
§ Maximum block size is signalled (e.g. 64x64)
§ Quadtree-based
subdivision of
tree block into
prediction and
transform blocks
14. slide14
§ Transform sizes range from 4x4 to 32x32
§ Fast integer transforms specified
§ Additional new rectangular transforms proposed
21. slide21
Final approval of version 1: April 14, 2013
What comes after version 1 of H.265/MPEG-HEVC?
èThe following H.265/MPEG-HEVC extensions are
work in progress:
• Range Extensions (January 2014)
Higher bit-depths (>10bit), More chroma formats (4:4:4,
4:2:2),...
• Scalable Coding (Mid. 2014)
• 3D Multiview and Depth (January 2014 and 2015)
22.
23. slide23
§ Whole conference situation is not sufficiently natural
§ Provision of eye-contact is limited
§ Awareness of gestures and body language is not fully supported
34. slide34
§ Steady-state visual evoked
potentials (VEPs) to
objectively evaluate
Visual Cortex response
§ Event-Related Potentials
(ERPs) to objectively evaluate
subjective processes
§ ERPs are eventually leading
to judgments and evaluation
37. slide37
§ H.264/MPEG-AVC: More than 3 Billion devices
and 50% of all bits on the Internet
§ H.265/MPEG-HEVC:
§ Lagrangian approach to coder control
§ 50% bit-rate reduction relative to H.264
§ Immersive rooms
§ Seamless integration and eye contact
§ Walls are becoming displays
§ H.266 – Research Frontier
§ Even higher resolutions and 3D
§ Improved subjective measures
38. slide38
ITU-T VCEG & ISO/IEC MPEG Colleagues:
• Gary J. Sullivan
• Gisle Bjontegaard
• …
Vidyo
• Alex Eleftheriadis
• Ofer Shapiro
• …
HHI/TUB members and research associates
§ H. Schwarz, D. Marpe & D. Grois
§ P. Kauff & R. Schäfer
§ K.-R. Müller & A. Norcia
§ …