Linear transform for simultaneous diagonalization of covariance and perceptual metric matrix in image coding

作者：

Highlights：

•

摘要

Two types of redundancies are contained in images: statistical redundancy and psychovisual redundancy. Image representation techniques for image coding should remove both redundancies in order to obtain good results. In order to establish an appropriate representation, the standard approach to transform coding only considers the statistical redundancy, whereas the psychovisual factors are introduced after the selection of the representation as a simple scalar weighting in the transform domain.In this work, we take into account the psychovisual factors in the definition of the representation together with the statistical factors, by means of the perceptual metric and the covariance matrix, respectively. In general the ellipsoids described by these matrices are not aligned. Therefore, the optimal basis for image representation should simultaneously diagonalize both matrices. This approach to the basis selection problem has several advantages in the particular application of image coding. As the transform domain is Euclidean (by definition), the quantizer design is highly simplified and at the same time, the use of scalar quantizers is truly justified. The proposed representation is compared to covariance-based representations such as the DCT and the KLT or PCA using standard JPEG-like and Max-Lloyd quantizers.

论文关键词：Image compression,Transform coding,Statistical redundancy,Psychovisual Redundancy,Perceptual metric

论文评审过程：Received 20 August 2001, Revised 5 June 2002, Accepted 9 September 2002, Available online 11 April 2003.

论文官网地址：https://doi.org/10.1016/S0031-3203(02)00325-4