Escolar Documentos
Profissional Documentos
Cultura Documentos
Abstract—A new method of estimating coding peak signal-to- et al. proposed the block-edge impairment metric (BIM) [3],
noise ratio (PSNR) without the use of reference signals is pre- taking into consideration the luminance masking effects.
sented. Although PSNR is commonly used as a measure of the Peak signal-to-noise ratio (PSNR), which is calculated as
picture degradation of digitally coded video, the calculation re-
quires source signals as a reference.Therefore, the usage of PSNR the error between the original and the processed pictures, is
is restricted to particular applications or systems. The proposed one of the common measures of coding degradation. However,
method enables PSNR estimation based on the probability density the calculation of PSNR requires source signals as a reference.
functions of quantized discrete cosine transform (DCT) coeffi- Turaga et al. proposed a PSNR estimation method without
cients extracted from an MPEG-2 bit stream. We experimented original pictures [4]. They estimated PSNR based on the statis-
with MPEG-2 video coding bit streams under varying quantiza-
tion scheme and evaluate a new method with comparing estimated tical properties of discrete cosine transform (DCT) coefficients
PSNRs with actual PSNRs. Experimental results indicate that the by supposing the coding parameters from the decoded pictures.
determination coefficients are higher than 0.9 This method can Although the metric shows good performance for estimating
apply to both SDTV and HDTV, and can evaluate PSNR of every the quality of intra-pictures under fixed quantization schemes,
frame coded by different picture types. it is not applicable for predicted pictures.
Index Terms—Discrete cosine transform (DCT) coefficients, The latter type of system utilizes several coding parameters
Laplacian distribution, MPEG-2 bit stream, peak signal-to-noise extracted from coded bit streams for estimating the picture
ratio (PSNR), quantization. quality. The coding error can be estimated from the quantiza-
tion parameters of DCT coefficients, which are derived from a
I. INTRODUCTION bit stream. Methods using coded bit streams are also suitable
for monitoring purposes when the coding scheme is fixed as in
and HDTV, and the results are presented in Section IV. The con-
clusion of this work is provided at the end.
(1)
Fig. 1. Probability density distribution of DCT coefficient distributions in a
is the standard Gamma function. Parameters and are component (u = 1, v = 0), when quantization interval SS is 32.
related to the shape and standard deviation of the distribution.
Parameter defines the kurtosis [13]. a DCT block and can be varied for each frame. The quan-
tizer_scale gives the quantizer step size for bit rate control and
(2) can be varied for each macroblock. The quantization repre-
sentative value for the th index is expressed by (5). is
Some studies have reported that the DCT coefficients of the determined by quantization interval . is quan-
ac component can be approximated as a Laplacian distribution, tizer_scale_code, which is an index of the quantizer_scale.
which is a generalized Gaussian distribution [14]–[16]. The pdf
of the Laplacian distribution is given by (1) when and
. The probability function is characterized by where
the standard deviation
(5)
(3) The intra-dc coefficients are quantized linearly and the dif-
ference values are transmitted between the adjacent blocks. The
The amplitude distribution characteristics of the intra-dc co- quantization process of intra-dc coefficients is given by:
efficients do not show conformity with those of the ac coeffi-
cients. The probability distribution of intra-dc largely depends
on the picture contents and cannot be represented as a single where
model. (6)
MPEG-2 video coding employs the motion-compensated
prediction scheme to achieve high compression ratio. The pre- is the resulting intra-dc coefficient after the inverse
diction error signal is a differential signal between the current quantization and signifies the intra-dc quantization repre-
and the prediction signals. The non-intra-DCT coefficients also sentative value. is an intra-dc quantization index.
show the Laplacian distribution statistically [17]. dct_dc_pred is the predictor. dct_diff is information about the
dc coefficient extracted from bit streams. intra_dc_mult is
B. Statistical Characteristics of Quantized DCT Coefficients the quantized value determined by intra_dc_precision. Fig. 2
The DCT coefficients are rounded off through a quantization shows (a) the distribution of decoded dc coefficients and (b)
process, producing a quantization representative value. The dis- the distribution of dct_diff values. Although the shape distri-
tribution of quantized coefficients shows comb-like shapes as bution of decoded dc coefficients is not constant and does not
depicted in Fig. 1. In the MPEG-2 video standard, the inverse have common properties for each sequence, the distribution of
quantization process is applied to all DCT coefficients, which, dct_diff, which is calculated as an adjacent difference value of
except for the intra-dc coefficient, are given by quantized dc coefficients, can be assumed to be the Laplacian
distribution [18].
• (non linear)’ Fig. 3. Relationship of kurtosis between source and quantized DCT coefficient
• test material: three HDTV test sequences [20], “Whale distributions.
Show,” “Chromakey (Sprinkling),” and “Crowded Cross-
walk” (1920 1080/59.94i, 5 s).
1) Analysis on the Kurtosis: The kurtosis, , is the degree equal. In the non-intra-coded macroblocks, the standard devia-
of peakedness of a distribution and is defined as a normalized tion of the quantized DCT coefficients is smaller than that of the
form of the fourth central moment of a distribution by source.
When the DCT coefficient values are significantly smaller
than the quantizer step size, most of the quantized coefficients
(7) become zero and the standard deviation is smaller than that of
the source signals. This often occurs since the quantization of
non-intra-coded macroblocks is a rounding-off process with a
The kurtosis is a parameter used to adjust the configuration of
dead zone around zero [21] and the DCT coefficient values of
the generalized Gaussian distribution. The larger the kurtosis,
motion-compensated picture differences are smaller than those
the sharper the shape of the distribution. The kurtosis of the
of intra-macroblocks. It shows that the standard deviation after
Laplacian distribution and the Gaussian distribution is 6 and 3,
quantization almost corresponds to the original one.
respectively.
Fig. 3 shows the relationship of the kurtosis distribution
of each component before and after quantization for (a) III. FRAMEWORK OF ESTIMATING PSNR
intra-coded macroblocks and (b) non-intra-coded macroblocks. A. Principles of Estimating PSNR
The kurtosis of the distribution of quantized coefficients be- Picture quality degradation associated with MPEG-2 coding
comes larger than that of the source coefficients. It is difficult to occurs due to the quantization of DCT coefficients. PSNR rep-
evaluate source distribution shapes from the shape of decoded resents the amount of coding errors, which is the difference be-
distributions because the kurtosis of the quantized distribu- tween the source and decoded video signals as given by
tion undergoes significant change. Table I shows the average
kurtosis of each sequence on the distribution of source and
quantized DCT coefficients. It indicates the difficulty in calcu-
lating the value of parameter from a quantized distribution.
2) Analysis of the Standard Deviation: Fig. 4 shows the re- (8)
lationship of standard deviations of the distribution before and
after quantization for (a) intra-coded macroblocks and (b) non-
intra-coded macroblocks. In both coding types, the standard de- where and are the sample values of the source signal and
viation of the source and quantized DCT coefficients are almost the corresponding decoded signal at sample number .
254 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 2, FEBRUARY 2006
TABLE I butions of DCT coefficients of the source are given, the value of
AVERAGE KURTOSIS OF EACH HDTV SEQUENCES the quantization error is obtained from
(10)
(12)
TABLE V
AVERAGE ERROR BETWEEN ESTIMATED WITH
COMPENSATION AND ACTUAL PSNR
TABLE VI
STANDARD DEVIATION OF ERROR BETWEEN ESTIMATED
WITH COMPENSATION AND ACTUAL PSNR
TABLE VII
TABLE II COEFFICIENTS OF DETERMINATION R BETWEEN ESTIMATED
AVERAGE ERROR BETWEEN ESTIMATED WITHOUT WITH COMPENSATION AND ACTUAL PSNR
COMPENSATION AND ACTUAL PSNR
TABLE VIII
VARIANCE OF RESIDUAL BETWEEN REGRESSION ESTIMATE
TABLE III WITH COMPENSATION AND ACTUAL VALUES
STANDARD DEVIATION OF ESTIMATION ERROR BETWEEN ESTIMATED
WITHOUT COMPENSATION AND ACTUAL PSNR
TABLE IX
AVERAGE ERROR BETWEEN ESTIMATED WITH COMPENSATION AND
ACTUAL VALUES OF “WHALE SHOW”, “CROWDED CROSSWALK”
TABLE IV AND “CHROMAKEY (SPRINKLING)” IN SDTV, 5 MBIT/S
COEFFICIENTS OF DETERMINATION R BETWEEN ESTIMATED
WITHOUT COMPENSATION AND ACTUAL PSNR
Fig. 7. Relationship of PSNR between estimation with compensation and actual values on I-, P-, and B-pictures (SDTV 5 Mbit/s).
Fig. 8. Relationship of PSNR between estimation with compensation and actual values on I-, P-, and B-pictures (HDTV 60 Mbit/s).
We considered that the streams of 5 Mbit/s (SDTV) and mismatch in the estimation process. This tendency also appears
18 Mbit/s (HDTV) are for broadcasting (to the home) service for high-frequency DCT components in intra-coded blocks,
and 60 Mbit/s is for contribution and distribution. The video which are quantized coarsely. In our method, the higher the bit
sequences used include a variety of criticality [23], resulting in rate, the smaller the standard deviation of estimation error.
a wide distribution of coding PSNR.
PSNR of each video frame was estimated without compen- B. Experimental Results of Estimating PSNR With
sation described in Section III A and the results are shown in Compensation
Tables II and III. These tables show the average error and the In this section, the performance of PSNR estimation
standard deviation of estimation error between estimated and with compensation is shown. We conducted tests using the
actual PSNR values for I-, P-, and B-pictures, including all the same bit streams as in the previous experiment, described in
sequences. The results indicate that this metric shows estimated Section IV-A.
PSNRs as being higher than the actual values in all picture The results are shown in Tables V and VI. These tables show
types, which demonstrates the influence of coarse quantization. the average error and the standard deviation of estimation error
Table IV show the determination coefficient in the results of between estimated and actual PSNR values for I-, P-, and B-pic-
regression analysis. tures, including all the sequences. In all cases except for HDTV
In the case of coarse quantization, almost all DCT coefficients 18 Mbit/s in B-picture, the average error is within 1.0 dB.
round to zero and the standard deviation becomes around zero, However, the case of HDTV 18 Mbit/s in B-picture is also im-
and so the quantization error is estimated lower than the actual proved in terms of the standard deviation of error. This indicates
amount and the estimated PSNR becomes higher. the effectiveness of the compensation process.
In the general coding scheme, the allocation bit rate for Tables VII and VIII show the determination coefficient
B-picture is less than that for the other pictures and coarser and the residual variance in the results of regression analysis.
quantizers are employed. This frequently causes a rounding In all cases except for HDTV 18 Mbit/s in B-picture, the coeffi-
of DCT coefficients to zero or the occurrence of noncoded cients of determination coefficient increase to more than 0.95. In
macroblocks, where the standard deviation of the distribution the case of HDTV 18 Mbit/s in B-picture, the determination co-
of quantized DCT coefficients is much smaller than that of the efficient increases to more than 0.9. This indicates that although
source distribution. Under the coding condition of HDTV at the average error of HDTV 18 Mbit/s in B-picture is large, it is
18 Mbit/s, approximately 70% of the macroblocks in a picture expected that the estimation error can be reduced by utilizing
are processed as noncoded macroblocks in the B-picture. the regression estimation.
Consequently, estimation was conducted using only 30% of Table IX shows the average error between estimated and ac-
the whole macroblocks in the B-picture, which led to a larger tual PSNR values of three sequences. The error of “Crowded
258 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 2, FEBRUARY 2006
Fig. 9. Relationship of PSNR between estimation with compensation and actual values on I-, P-, and B-pictures (HDTV 18 Mbit/s).
Crosswalk” is the largest. Compared with Table I, the kurtosis [4] D. S. Turaga, Y. Chen, and J. Caviedes, “No reference PSNR estimation
of the two sequences except “Crowded Crosswalk” is approxi- for compressed pictures,” Signal Process. Image Commun., vol. 19, pp.
173–184, 2004.
mately 6. This shows the distribution is similar to the Laplacian [5] M. Knee, “A single-ended picture quality measure for MPEG-2,” in
distribution but “Crowded Crosswalk” is inappropriate. Hence, Proc. Int. Broad., Convention (IBC 2000), Sep. 2000, pp. 95–100.
the degree of estimation error depends on the goodness of fit to [6] M. Knee, M. J. Diggins, and World Intellectual Property Bureau, “Im-
provements in Data Compression,” Int. Patent Appl. WO 00/22 834, Apr.
the Laplacian distribution. 20, 2000.
Figs. 7–9 show the relationship of PSNR between values [7] Y. Yan and K. Challapali, “A novel distortion-quantization model and
estimated with compensation and actual values on I-, P-, and its application in low bitrate video communications,” in Proc. Int. Conf.
Image Process., vol. 1, Rochester, NY, 2002, pp. 81–84.
B-pictures at each bit rate. The results show that the estimation [8] L. Karray, P. Duhamel, and O. Rioul, “Image coding with an L norm
method can be used for both SDTV and HDTV under varying and confidence interval criteria,” IEEE Trans. Image Process., vol. 7, no.
quantization schemes. 5, pp. 621–631, May 1998.
[9] “Generic Coding of Moving Pictures and Associated Audio Information:
Video,” ISO-IEC/JTC1/SC29/WG11, ISO-IEC 13 818–2, 2000.
[10] T. T. Pham and R. J. P. de Figueiredo, “Maximum likelihood estimation
V. CONCLUSION of a class of non-Gaussian densities with application to I deconvolu-
tion,” IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 1, pp.
This paper presents a method of estimating PSNR of digi- 73–82, Jan. 1989.
tally coded video using the information extracted from MPEG-2 [11] W. H. Pun and B. D. Jeffs, “Adaptive image restoration using a general-
bit streams. The estimation algorithm is based on the statistical ized Gaussian model for unknown noise,” IEEE Trans. Image Process.,
vol. 4, pp. 1451–1456, Oct. 1995.
property of the pdf of quantized DCT coefficients and assumes [12] I. Matsuda, N. Shirai, and S. Itoh, “Lossless coding using predictors
that the distribution of the DCT coefficients approximates the and VLC’s optimized for each image,” in Proc. SPIE: Visual Commun.
Laplacian distribution. Image Process.(VCIP 2003), vol. 5150, Jul. 2003, pp. 1677–1684.
[13] E. Y. Lam and J. W. Goodman, “A mathematical analysis of the DCT
We conducted some experiments to verify the performance of coefficient distributions for images,” IEEE Trans. Image Process., vol.
our method using various sequences under different coding con- 9, no. 10, pp. 1661–1666, Oct. 2000.
ditions. Experimental results indicate that the determination co- [14] T. Eude, R. Grisel, H. Cherifi, and R. Debrie, “On the distribution of the
DCT coefficients,” in Proc. IEEE ICASSP, Adelaide, Australia, 2004,
efficients between estimated PSNR and actual PSNR are higher pp. V365–V368.
than 0.9. The averages of estimation errors are within 1.0 dB [15] R. C. Reininger and J. Gibson, “Distribution of the two-dimensional
except for HDTV 18 Mbit/s in B-picture. This method is avail- DCT coefficients for images,” IEEE Trans. Commun., vol. 31, no. 6, pp.
835–839, Jun. 1993.
able for both SDTV and HDTV, and can evaluate PSNR of every [16] G. Lakhani, “Distribution-based restoration of DCT coefficients,” IEEE
frame coded by different picture types. Trans.Circuits Syst. Video Technol., vol. 10, no. 5, pp. 819–823, Aug.
While our proposed method seems to provide reasonable per- 2000.
[17] F. Bellifemine, A. Capellino, A. Chimienti, R. Picco, and R. Ponti, “Sta-
formance for monitoring systems, the factors of PSNR estima- tistical analysis of the 2D-DCT coefficients of the differential signal for
tion errors need to be estimated in order to achieve higher es- images,” Signal Process. Image Commun., vol. 4, pp. 477–488, Nov.
timation accuracy especially at high compression ratios. Future 1992.
[18] X. Wu, “Lossless compression of continuous-tone images via context
studies should include, for example, examining the Laplacian selection, quantization, and modeling,” IEEE Trans. Image Process., vol.
assumption and the estimation errors of the standard deviation 6, no. 5, pp. 656–664, May 1997.
of the quantized coefficients. [19] “Test Model 5,” The MPEG Software Simulation Group (MSSG),
Software for MPEG-2 coding and decoding, ISO.IEC JTC1/SC29/
WG11/N0400, 1993.
REFERENCES [20] “Test materials to be used in subjective assessment,” Rec. ITU-R
BT.1210–3, Software for MPEG-2 coding and decoding.
[1] “User Requirements for Objective Perceptual Video Quality Measure- [21] J. L. Mitchell, W. B. Pnnebaker, C. E. Fogg, and D. J. LeGall, “MPEG
ments in Digital Cable Television,” ITU-T Rec. J.143, May 2000. video compression standard,” ISBN 0–412–08 771–5.
[2] Z. Wang, A. C. Bovil, and B. L. Evans, “Blind measurement of blocking [22] M. A. Robertson and R. L. Stevenson, “DCT quantization noise in com-
artifacts in images,” in Int. Conf. Image Process., vol. III, Vancouver, pressed images,” in Proc. Int. Conf. Image Process., vol. 1, Thessaloniki,
Canada, Sep. 2000, pp. 981–984. Greece, 2001, pp. 185–188.
[3] H. R. Wu and M. Yuen, “A generalized block-edge impairment metric [23] E. Nakasu, K. Aoki, R. Yajima, Y. Kanatsugu, and K. Kubota, “A sta-
for video coding,” IEEE Signal Process. Lett., vol. 4, no. 1, pp. 317–320, tistical analysis of MPEG-2 picture quality for television broadcasting,”
Jan. 1997. SMPTE J., vol. 105, no. 11, pp. 702–711, 1996.
ICHIGAYA et al.: METHOD OF ESTIMATING CODING PSNR 259
Atsuro Ichigaya received the B.E. degree in en- Naohiro Hara received the B.E. degree in image
gineering systems and the M.E. degree in science engineering from Chiba University, Chiba, Japan, in
and engineering from the University of Tsukuba, 1998.
Tsukuba, Japan, in 1996 and 1998, respectively. He joined NHK, Tokyo, Japan, in 1998 and has
He joined NHK, Tokyo, Japan, in 1998 and has been with NHK Science and Technical Research Lab-
been with NHK Science and Technical Research Lab- oratories since 2000. He is in charge of research and
oratories since 2000. He is in charge of research and development of digital video processing and video
development of video monitoring systems and digital coding.
video processing. Mr. Hara is a member of IEICE and ITE of Japan
Mr. Ichigaya is a member of VRSJ, IEICE, and
the Institute of Image Information and Television
Engineers (ITE) of Japan.
Masaaki Kurozumi received the B.E. degree in elec- Eisuke Nakasu (M’89) received the B.E. and
tronics from Doshisha University, Kyoto, Japan, in M.E. degrees in electrical engineering from Keio
1993, and the M.E. degree in information processing University, Yokohama, Japan, in 1980 and 1982,
from the Nara Institute of Science and Technology, respectively.
Ikoma, Japam, in 1995. He joined NHK, Tokyo, Japan, in 1982. He is a Se-
In 1995, he joined NHK, Tokyo, Japan, and nior Research Engineer at NHK Science and Tech-
worked as a Broadcasting Engineer at the Osaka nical Research Laboratories, where he is in charge
Broadcasting Station. Since 1997, he has been with of research and development of digital broadcasting
NHK Science and Technical Research Laboratories, systems, video coding and quality evaluation.
and is engaged in research on picture coding. Mr. Nakasu is a member of IEICE, ITE, and IPS
Mr. Kurozumi is a member of ITE of Japan. of Japan.