Witness the development of H.265

Transform Error Introduced by Non-orthogonality

2009-04-27 KTA View Comments

In the Geneva meeting held in Feb. 2009, a proposal with the title “Video Coding Using Extended Block Sizes” was adopted by KTA, where the MB size is extended up to 64×64 and the motion partitions are scaled accordingly. At the same time, a 2D order-16 transform was also proposed for transforming the residual blocks with the size larger than or equal to 16×16. The transformation matrix of the proposed 2D order-16 transform is given as below, which is obtained by scaling the transformation matrix of 2D order-16 DCT by the factor 128 and rounding, and is non-orthogonal.

The transformation matrix of the proposed 2D order-16 transform

  Non-orthogonality will inevitably introduce transform error. Before analyzing the transform error quantitatively, let’s recall two properties of orthogonal transforms. Firstly, signals can be reconstructed perfectly if no quantization is performed in the transform domain. Secondly, if quantization is performed in the transform domain, the average variance (or energy) of the reconstruction error equals that of the quantization error.

  When it comes to non-orthogonal transform, the two properties are not held any more. First of all, we discuss the average variance of the reconstruction error without quantization in the transform domain. Suppose the vector input to the transform is a sample of a 1D zero-mean uni-variance source with length 16. Let σr02 denote the average variance of the reconstruction error. It can be proved that the upper bound of σr02  is expressed as the following formula,

Upper Bound

where N is equal to 16, σx2 is the variance of the input, and M is calculated by (T16TT16-I)T(T16TT16-I). Then, we take the quantization into consideration and it can be proved that the relationship of the average variance of the reconstruction error, σr2, and the quantization error, σq2, can be expressed as below. All the detailed deductions can be found here.

 σr2=σq2+σr02

  Clearly, with non-orthogonal transform, the average variance of the reconstruction error σr2 is always σr02 larger than the average variance of the quantization error σq2, no matter the transform coefficients are quantized or not, whereas with an orthogonal transform, σq2 is equal to σr2.

  For the case of the proposed non-orthogonal transform, the upper bound of σr02 is equal to 0.00024σx2. Note that this figure represents the transform error for 1D input and when extended to 2D, σr02 will significantly increase, because the 1st dimension transform increases the magnitudes of the input and σx2 for the 2nd dimension transform is 256 times larger. Furthermore, in case the quantization stepsize is very small, e.g., QP<6, and the energy of the prediction error is very large, the transform error may dominate the reconstruction error.

Permanent Link: Transform Error Introduced by Non-orthogonality

5 Comments Subscribe Comments (RSS)

rong

Could you pls. tell me the title of papers which proposed this transformation matrix? Thanks.

Yu Liu

The transform matrix was proposed by Qualcomm Inc in 0901_Gen VCEG meeting. The proposal title is “Video Coding Using Extended Block Sizes”, COM16-C123. You can find more information in this post: http://www.h265.net/2009/04/kta-software-jm11kta23.html.

amiller856

Could you please direct me to understanding the DCT algorithm used in H.265 by giving some reference papers? I know it’s based on Chen’s algorithm. But the transform matrices used in the code seem to be different from whats in the original Chen paper.

Aastha

out of the 2 : sigma ro ^2 vs sigma r^2 , which one is avg variance of reconstruction error? And what is the other ? please clarify.

Aastha

Or the former is when there is no quantization and latter when there is quantization ?

Post Comment