Você está na página 1de 9

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO.

2, FEBRUARY 2006 251

A Method of Estimating Coding PSNR


Using Quantized DCT Coefficients
Atsuro Ichigaya, Masaaki Kurozumi, Naohiro Hara, Yukihiro Nishida, and Eisuke Nakasu

Abstract—A new method of estimating coding peak signal-to- et al. proposed the block-edge impairment metric (BIM) [3],
noise ratio (PSNR) without the use of reference signals is pre- taking into consideration the luminance masking effects.
sented. Although PSNR is commonly used as a measure of the Peak signal-to-noise ratio (PSNR), which is calculated as
picture degradation of digitally coded video, the calculation re-
quires source signals as a reference.Therefore, the usage of PSNR the error between the original and the processed pictures, is
is restricted to particular applications or systems. The proposed one of the common measures of coding degradation. However,
method enables PSNR estimation based on the probability density the calculation of PSNR requires source signals as a reference.
functions of quantized discrete cosine transform (DCT) coeffi- Turaga et al. proposed a PSNR estimation method without
cients extracted from an MPEG-2 bit stream. We experimented original pictures [4]. They estimated PSNR based on the statis-
with MPEG-2 video coding bit streams under varying quantiza-
tion scheme and evaluate a new method with comparing estimated tical properties of discrete cosine transform (DCT) coefficients
PSNRs with actual PSNRs. Experimental results indicate that the by supposing the coding parameters from the decoded pictures.
determination coefficients are higher than 0.9 This method can Although the metric shows good performance for estimating
apply to both SDTV and HDTV, and can evaluate PSNR of every the quality of intra-pictures under fixed quantization schemes,
frame coded by different picture types. it is not applicable for predicted pictures.
Index Terms—Discrete cosine transform (DCT) coefficients, The latter type of system utilizes several coding parameters
Laplacian distribution, MPEG-2 bit stream, peak signal-to-noise extracted from coded bit streams for estimating the picture
ratio (PSNR), quantization. quality. The coding error can be estimated from the quantiza-
tion parameters of DCT coefficients, which are derived from a
I. INTRODUCTION bit stream. Methods using coded bit streams are also suitable
for monitoring purposes when the coding scheme is fixed as in

V IDEO CODING systems are widely used for digital video


delivery and recording systems and the monitoring of
video quality has become increasingly important. The degree
the case of broadcasting.
Knee reported a method of NR-type quality measurement
based on MPEG-2 bit stream information [5], [6]. The method
of coding degradation varies depending on the criticality of
uses DCT coefficient data, the number of coefficient bits, and
the source video and careful observation of long sequences is
quantizer_scale, and shows good performance for estimating
necessary for monitoring the picture quality continuously. This
PSNR (called “PAR”) using the statistical characteristics of
is a laborious task for operators when managing the quality
video images. However, preliminary experiments are required
of services, and so a quantitative technique for automatically
to determine the calibration parameters, and a process is needed
monitoring video quality is strongly desired.
to update parameter values for estimating PSNR of P- and
For monitoring purposes, a measurement system that does
B-picture at every I-picture.
not require reference signals is preferred. This type of system is
Yan et al. proposed a rate distortion model based on a dis-
called a no-reference (NR) method [1]. Video quality measure-
tortion-quantization model for low bit rate coding [7]. Karray
ment without the original sequence is classified into two types:
et al. described the statistical properties of the reconstruction
methods using decoded video signals, and methods using infor-
error caused by quantization [8]. Both studied the new quantiza-
mation contained in a bit stream.
tion model analyzing the statistical property of source pictures.
The former type of system assumes noise models for several
In this paper, we propose a new method of estimating PSNR
spatial and/or temporal features including blockiness, edge en-
from MPEG-2 [9] bit streams under a rate control scheme. This
ergy, etc. This method can be applied to perceived video quality
method estimates DCT quantization noises based on statistical
based on human visual system (HVS) models. Wang et al. pro-
properties of quantized DCT coefficients and utilizes several
posed the blocking effect measurement algorithm by analyzing
coding parameters in a coded bit stream. The source information
the power of the blocky signal in the frequency domain [2]. Wu
required for calculating PSNR is derived by analyzing the statis-
tical properties of decoded pictures. The method supports PSNR
estimation for each frame of every picture type without prelim-
Manuscript received May 30, 2004; revised March 23, 2005. This paper was
recommended by Associate Editor L. Onural. inary experiments for calibration. Section II presents the sta-
The authors are with the NHK Science and Technical Research Lab- tistical characteristics of quantized DCT coefficients. The pro-
oratories, Tokyo 157-8510, Japan (e-mail: ichigaya.a-go@nhk.or.jp; posed method for estimating picture quality based on MPEG-2
kurozumi.m-fo@nhk.or.jp; hara.n-fy@nhk.or.jp; nishida.y-fe@nhk.or.jp;
nakasu.e-hm@nhk.or.jp). bit stream is explained in Section III. The proposed algorithms
Digital Object Identifier 10.1109/TCSVT.2005.858745 are evaluated through actual coding experiments for both SDTV
1051-8215/$20.00 © 2006 IEEE
252 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 2, FEBRUARY 2006

and HDTV, and the results are presented in Section IV. The con-
clusion of this work is provided at the end.

II. CHARACTERISTICS OF QUANTIZED DCT COEFFICIENTS


A. Statistical Characteristics of DCT Coefficients
The amplitude distribution of DCT ac coefficients of source
video signals can be modeled as the generalized Gaussian distri-
bution [10]–[12]. The probability density function (pdf) of the
generalized Gaussian distribution is defined by

(1)
Fig. 1. Probability density distribution of DCT coefficient distributions in a
is the standard Gamma function. Parameters and are component (u = 1, v = 0), when quantization interval SS is 32.
related to the shape and standard deviation of the distribution.
Parameter defines the kurtosis [13]. a DCT block and can be varied for each frame. The quan-
tizer_scale gives the quantizer step size for bit rate control and
(2) can be varied for each macroblock. The quantization repre-
sentative value for the th index is expressed by (5). is
Some studies have reported that the DCT coefficients of the determined by quantization interval . is quan-
ac component can be approximated as a Laplacian distribution, tizer_scale_code, which is an index of the quantizer_scale.
which is a generalized Gaussian distribution [14]–[16]. The pdf
of the Laplacian distribution is given by (1) when and
. The probability function is characterized by where
the standard deviation
(5)
(3) The intra-dc coefficients are quantized linearly and the dif-
ference values are transmitted between the adjacent blocks. The
The amplitude distribution characteristics of the intra-dc co- quantization process of intra-dc coefficients is given by:
efficients do not show conformity with those of the ac coeffi-
cients. The probability distribution of intra-dc largely depends
on the picture contents and cannot be represented as a single where
model. (6)
MPEG-2 video coding employs the motion-compensated
prediction scheme to achieve high compression ratio. The pre- is the resulting intra-dc coefficient after the inverse
diction error signal is a differential signal between the current quantization and signifies the intra-dc quantization repre-
and the prediction signals. The non-intra-DCT coefficients also sentative value. is an intra-dc quantization index.
show the Laplacian distribution statistically [17]. dct_dc_pred is the predictor. dct_diff is information about the
dc coefficient extracted from bit streams. intra_dc_mult is
B. Statistical Characteristics of Quantized DCT Coefficients the quantized value determined by intra_dc_precision. Fig. 2
The DCT coefficients are rounded off through a quantization shows (a) the distribution of decoded dc coefficients and (b)
process, producing a quantization representative value. The dis- the distribution of dct_diff values. Although the shape distri-
tribution of quantized coefficients shows comb-like shapes as bution of decoded dc coefficients is not constant and does not
depicted in Fig. 1. In the MPEG-2 video standard, the inverse have common properties for each sequence, the distribution of
quantization process is applied to all DCT coefficients, which, dct_diff, which is calculated as an adjacent difference value of
except for the intra-dc coefficient, are given by quantized dc coefficients, can be assumed to be the Laplacian
distribution [18].

C. Analysis of Quantized Distribution


In order to conform the distribution of quantized coefficients
where to the function models of nonquantized DCT distribution, we
(4) examined two statistical parameters: the kurtosis and the stan-
dard deviation. The coding conditions were as follows:
is the resulting coefficient after the inverse quan- • MPEG-2 MP@HL ( , );
tization and signifies the quantization representative value. • encoding scheme: Test Model 5 [19];
is the quantization index. is the • bit rate: 18 Mbit/s;
quantization weighting matrix for each DCT coefficient within • quantization matrix: default;
ICHIGAYA et al.: METHOD OF ESTIMATING CODING PSNR 253

Fig. 2. Histogram of intra-dc coefficients and dct_diff of the sequence


“Crowded Crosswalk.”

• (non linear)’ Fig. 3. Relationship of kurtosis between source and quantized DCT coefficient
• test material: three HDTV test sequences [20], “Whale distributions.
Show,” “Chromakey (Sprinkling),” and “Crowded Cross-
walk” (1920 1080/59.94i, 5 s).
1) Analysis on the Kurtosis: The kurtosis, , is the degree equal. In the non-intra-coded macroblocks, the standard devia-
of peakedness of a distribution and is defined as a normalized tion of the quantized DCT coefficients is smaller than that of the
form of the fourth central moment of a distribution by source.
When the DCT coefficient values are significantly smaller
than the quantizer step size, most of the quantized coefficients
(7) become zero and the standard deviation is smaller than that of
the source signals. This often occurs since the quantization of
non-intra-coded macroblocks is a rounding-off process with a
The kurtosis is a parameter used to adjust the configuration of
dead zone around zero [21] and the DCT coefficient values of
the generalized Gaussian distribution. The larger the kurtosis,
motion-compensated picture differences are smaller than those
the sharper the shape of the distribution. The kurtosis of the
of intra-macroblocks. It shows that the standard deviation after
Laplacian distribution and the Gaussian distribution is 6 and 3,
quantization almost corresponds to the original one.
respectively.
Fig. 3 shows the relationship of the kurtosis distribution
of each component before and after quantization for (a) III. FRAMEWORK OF ESTIMATING PSNR
intra-coded macroblocks and (b) non-intra-coded macroblocks. A. Principles of Estimating PSNR
The kurtosis of the distribution of quantized coefficients be- Picture quality degradation associated with MPEG-2 coding
comes larger than that of the source coefficients. It is difficult to occurs due to the quantization of DCT coefficients. PSNR rep-
evaluate source distribution shapes from the shape of decoded resents the amount of coding errors, which is the difference be-
distributions because the kurtosis of the quantized distribu- tween the source and decoded video signals as given by
tion undergoes significant change. Table I shows the average
kurtosis of each sequence on the distribution of source and
quantized DCT coefficients. It indicates the difficulty in calcu-
lating the value of parameter from a quantized distribution.
2) Analysis of the Standard Deviation: Fig. 4 shows the re- (8)
lationship of standard deviations of the distribution before and
after quantization for (a) intra-coded macroblocks and (b) non-
intra-coded macroblocks. In both coding types, the standard de- where and are the sample values of the source signal and
viation of the source and quantized DCT coefficients are almost the corresponding decoded signal at sample number .
254 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 2, FEBRUARY 2006

TABLE I butions of DCT coefficients of the source are given, the value of
AVERAGE KURTOSIS OF EACH HDTV SEQUENCES the quantization error is obtained from

(10)

where denotes the probability density function


(pdf) of DCT coefficient value at for each quantized
step size, and is the quantization representative value for the
quantization interval number . is the total sample
number of the DCT coefficient at for each quantized step
size. The quantization interval is expressed using the notation
[ , ).
The results of preliminary experiments for analyzing the char-
acteristics of quantized DCT coefficients indicate that the PSNR
estimating model based on the Laplacian distribution is more
suitable than the generalized Gaussian function because the kur-
tosis cannot be used for estimating distribution. Since the Lapla-
cian distribution is determined by the standard deviation, we can
estimate source distributions after quantization distribution. In
our estimation scheme, the distribution of DCT coefficients is
modeled in the form of the Laplacian distribution.
The estimation should be carried out for each quantized step
size. The whole distribution that consists of all quantized step
sizes shows the properties of whole images. On encoding with
rate control, part of the image that is quantized by the same
quantized step size has different properties from the whole im-
ages. In short, the standard deviation depends on quantizer step
Fig. 4. Relationship of standard deviations between source and quantized DCT size. This means that it is necessary to calculate MSQE for each
coefficient distributions.
quantized step size.
In the MPEG-2 coding scheme, there are three types of
picture that use different prediction methods. I-picture is coded
is a peak signal amplitude and equals 255 in the case of 8-bit using information of its own, and P-picture and B-picture are
representation. coded using motion-compensated prediction from reference
According to Parseval’s Theorem, the MSE in the pixel frame(s).
domain is equivalent to the mean square quantization error The quantization of intra-dc coefficients is linear and the step
(MSQE) in the DCT domain because DCT is a normalized size is constant in a frame. Since the shape of dc component dis-
orthogonal transformation. Therefore, it is possible to measure tribution is strongly dependent on the picture contents, intra-dc
PSNR from the quantization error in the DCT domain, using coefficients are estimated using the difference value for quan-
the following equation: tized coefficients between adjacent blocks. In the intra-coding,
the dct_diff value is analyzed instead of the dc coefficient it-
self. The distribution of the dct_diff value can be assumed to be
Laplacian distribution.
In the non-intra-coding, decoded pictures can be expressed as
, where is an input image and is a prediction
(9) image at time instance . denotes DCT, quantization and
their inverse processes. Then, the coding error is expressed
by

where and are the DCT coefficients of the source signal


and the corresponding decoded signal. When amplitude distri- (11)
ICHIGAYA et al.: METHOD OF ESTIMATING CODING PSNR 255

Equation (11) indicates that for P- and B-pictures, MSQE can


be calculated for prediction error signals, which is equivalent to
that of decoded image signals. In P- and B-pictures, there are
noncoded blocks, where no DCT coefficient data is transmitted
and there is no information available for estimating MSQE in the
coded bit stream. Therefore, MSQE is calculated using coded
blocks only

(12)

where and are ratios of intra-coded blocks and non-intra-


coded blocks among all coded blocks.
The quantization interval is not defined in the MPEG-2 video Fig. 5. Probability density distribution of DCT coefficients at DCT component
standard, and it can be arranged as a factor of the encoder de- (u = 6, v = 1), when quantization interval SS is 68.
sign. Since the coded bit stream does not include any informa-
tion on the quantization interval, quantization interval [ , ) is When a quantizer step size is sufficiently small, the standard
required to estimate coding PSNR. The quantization represen- deviation of the distribution is almost preserved irrespective of
tative value is assumed to be located at the midpoint of the the inclusion/exclusion of zero-value coefficients. On the other
quantization interval [ , ) as follows: hand, when a quantizer step size is large, the standard deviation
of quantized DCT coefficients including zero is smaller than the
source standard deviation, and the standard deviation excluding
(13) zero-value coefficients is larger than the source standard devia-
tion (Fig. 5).
Using (3), (9), (10), and (12), PSNR is obtained by the following Therefore, in order to improve the estimation accuracy, we
procedure: introduce an extended Laplacian function as follows:
Step 1) Extract quantized DCT coefficients and decode quan-
tization step size from a bit stream.
Step 2) For each DCT coefficient component and quan-
tization step size , calculate the standard
deviation . (14)
Step 3) Estimate pdf of source DCT coeffi-
cients by (3). The parameter controls the spread of the function. The
Step 4) Calculate MSQE by (10). is a Laplacian pdf scaled by the second exponential func-
Step 5) Calculate PSNR by (12) and (9). tion which is constant value under . When is small enough,
this function is closer to the standard Laplacian pdf. We apply
B. Compensation Method of Estimating the Distribution
this function to eliminate the influence of quantized zero coeffi-
In the case of coarse quantization, many quantized DCT co- cients. The value of is determined by the quantization interval
efficients are rounded off to zero by quantization and the stan- around the zero value. This function is used for estimating the
dard deviation of the distribution of quantized DCT coefficients distribution configuration as follows.
tends to become roughly zero, smaller than that of the source Step 1) Measure the standard deviation for the distribution
distribution. Where quantized DCT coefficients are around zero, of quantized DCT coefficients by frequency of
the Laplacian model works well, but other DCT coefficients are the quantization representative value and estimate
not inappropriate [22]. This situation often occurs for high-fre- pdf , as in the estimation method described in
quency components of intra-coded macroblocks because of the Section III-A
slope of a quantization matrix and all frequency components of
non-intra-macroblocks.
Fig. 5 shows an example case in which most of the coeffi-
cients become zero after quantization. The estimated standard
deviation of the distribution becomes much smaller than that of (15)
the source distribution. MSQE is then estimated smaller than
the actual value, and hence the estimation accuracy of DCT
quantization error is reduced. In these cases, the estimated stan- Step 2) Measure the standard deviation for the distribu-
dard deviation is excessively dominated by the zero coefficients. tion of quantized DCT coefficients excluding the zero-
The Laplacian distribution, which is a two-sided exponential value coefficients and estimate probability distribution
distribution, becomes roughly zero on the lower slope. There- which is a extended Laplacian distribution
fore, the estimated distribution does not admit the existence of
nonzero coefficients. In order to improve the estimation accu-
racy, nonzero coefficients should be treated carefully. where
256 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 2, FEBRUARY 2006

TABLE V
AVERAGE ERROR BETWEEN ESTIMATED WITH
COMPENSATION AND ACTUAL PSNR

TABLE VI
STANDARD DEVIATION OF ERROR BETWEEN ESTIMATED
WITH COMPENSATION AND ACTUAL PSNR

Fig. 6. Compensated probability density distribution of DCT coefficients in


DCT component (u = 6, v = 1), when quantization interval SS is 68.

TABLE VII
TABLE II COEFFICIENTS OF DETERMINATION R BETWEEN ESTIMATED
AVERAGE ERROR BETWEEN ESTIMATED WITHOUT WITH COMPENSATION AND ACTUAL PSNR
COMPENSATION AND ACTUAL PSNR

TABLE VIII
VARIANCE OF RESIDUAL BETWEEN REGRESSION ESTIMATE
TABLE III WITH COMPENSATION AND ACTUAL VALUES
STANDARD DEVIATION OF ESTIMATION ERROR BETWEEN ESTIMATED
WITHOUT COMPENSATION AND ACTUAL PSNR

TABLE IX
AVERAGE ERROR BETWEEN ESTIMATED WITH COMPENSATION AND
ACTUAL VALUES OF “WHALE SHOW”, “CROWDED CROSSWALK”
TABLE IV AND “CHROMAKEY (SPRINKLING)” IN SDTV, 5 MBIT/S
COEFFICIENTS OF DETERMINATION R BETWEEN ESTIMATED
WITHOUT COMPENSATION AND ACTUAL PSNR

IV. EXPERIMENTAL RESULTS


A. Experiment Result of Estimating PSNR Without
Compensation
(16) We have executed simulations of the estimation of PSNR
using MPEG-2 bit streams. The bit streams were produced by
where the Test Model 5 under the following conditions:
Step 3) Obtain pdf as an weighted average of and • MPEG-2 MP@ML and MP@HL ( , )’
which are the Laplacian pdfs • encoding scheme: Test Model 5 (TM5);
• quantization matrix: default;
(17) • (non linear);
• video sequences: 12 SDTV (720 480/59.94i, 15 sec)
and 12 HDTV (1920 1080/59.94i, 5 s);
As shown in Fig. 6, the compensated distribution has moved • bit rate: 5 Mbit/s (SDTV), 18 Mbit/s, and 60 Mbit/s
closer to the source distribution. (HDTV).
ICHIGAYA et al.: METHOD OF ESTIMATING CODING PSNR 257

Fig. 7. Relationship of PSNR between estimation with compensation and actual values on I-, P-, and B-pictures (SDTV 5 Mbit/s).

Fig. 8. Relationship of PSNR between estimation with compensation and actual values on I-, P-, and B-pictures (HDTV 60 Mbit/s).

We considered that the streams of 5 Mbit/s (SDTV) and mismatch in the estimation process. This tendency also appears
18 Mbit/s (HDTV) are for broadcasting (to the home) service for high-frequency DCT components in intra-coded blocks,
and 60 Mbit/s is for contribution and distribution. The video which are quantized coarsely. In our method, the higher the bit
sequences used include a variety of criticality [23], resulting in rate, the smaller the standard deviation of estimation error.
a wide distribution of coding PSNR.
PSNR of each video frame was estimated without compen- B. Experimental Results of Estimating PSNR With
sation described in Section III A and the results are shown in Compensation
Tables II and III. These tables show the average error and the In this section, the performance of PSNR estimation
standard deviation of estimation error between estimated and with compensation is shown. We conducted tests using the
actual PSNR values for I-, P-, and B-pictures, including all the same bit streams as in the previous experiment, described in
sequences. The results indicate that this metric shows estimated Section IV-A.
PSNRs as being higher than the actual values in all picture The results are shown in Tables V and VI. These tables show
types, which demonstrates the influence of coarse quantization. the average error and the standard deviation of estimation error
Table IV show the determination coefficient in the results of between estimated and actual PSNR values for I-, P-, and B-pic-
regression analysis. tures, including all the sequences. In all cases except for HDTV
In the case of coarse quantization, almost all DCT coefficients 18 Mbit/s in B-picture, the average error is within 1.0 dB.
round to zero and the standard deviation becomes around zero, However, the case of HDTV 18 Mbit/s in B-picture is also im-
and so the quantization error is estimated lower than the actual proved in terms of the standard deviation of error. This indicates
amount and the estimated PSNR becomes higher. the effectiveness of the compensation process.
In the general coding scheme, the allocation bit rate for Tables VII and VIII show the determination coefficient
B-picture is less than that for the other pictures and coarser and the residual variance in the results of regression analysis.
quantizers are employed. This frequently causes a rounding In all cases except for HDTV 18 Mbit/s in B-picture, the coeffi-
of DCT coefficients to zero or the occurrence of noncoded cients of determination coefficient increase to more than 0.95. In
macroblocks, where the standard deviation of the distribution the case of HDTV 18 Mbit/s in B-picture, the determination co-
of quantized DCT coefficients is much smaller than that of the efficient increases to more than 0.9. This indicates that although
source distribution. Under the coding condition of HDTV at the average error of HDTV 18 Mbit/s in B-picture is large, it is
18 Mbit/s, approximately 70% of the macroblocks in a picture expected that the estimation error can be reduced by utilizing
are processed as noncoded macroblocks in the B-picture. the regression estimation.
Consequently, estimation was conducted using only 30% of Table IX shows the average error between estimated and ac-
the whole macroblocks in the B-picture, which led to a larger tual PSNR values of three sequences. The error of “Crowded
258 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 2, FEBRUARY 2006

Fig. 9. Relationship of PSNR between estimation with compensation and actual values on I-, P-, and B-pictures (HDTV 18 Mbit/s).

Crosswalk” is the largest. Compared with Table I, the kurtosis [4] D. S. Turaga, Y. Chen, and J. Caviedes, “No reference PSNR estimation
of the two sequences except “Crowded Crosswalk” is approxi- for compressed pictures,” Signal Process. Image Commun., vol. 19, pp.
173–184, 2004.
mately 6. This shows the distribution is similar to the Laplacian [5] M. Knee, “A single-ended picture quality measure for MPEG-2,” in
distribution but “Crowded Crosswalk” is inappropriate. Hence, Proc. Int. Broad., Convention (IBC 2000), Sep. 2000, pp. 95–100.
the degree of estimation error depends on the goodness of fit to [6] M. Knee, M. J. Diggins, and World Intellectual Property Bureau, “Im-
provements in Data Compression,” Int. Patent Appl. WO 00/22 834, Apr.
the Laplacian distribution. 20, 2000.
Figs. 7–9 show the relationship of PSNR between values [7] Y. Yan and K. Challapali, “A novel distortion-quantization model and
estimated with compensation and actual values on I-, P-, and its application in low bitrate video communications,” in Proc. Int. Conf.
Image Process., vol. 1, Rochester, NY, 2002, pp. 81–84.
B-pictures at each bit rate. The results show that the estimation [8] L. Karray, P. Duhamel, and O. Rioul, “Image coding with an L norm
method can be used for both SDTV and HDTV under varying and confidence interval criteria,” IEEE Trans. Image Process., vol. 7, no.
quantization schemes. 5, pp. 621–631, May 1998.
[9] “Generic Coding of Moving Pictures and Associated Audio Information:
Video,” ISO-IEC/JTC1/SC29/WG11, ISO-IEC 13 818–2, 2000.
[10] T. T. Pham and R. J. P. de Figueiredo, “Maximum likelihood estimation
V. CONCLUSION of a class of non-Gaussian densities with application to I deconvolu-
tion,” IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 1, pp.
This paper presents a method of estimating PSNR of digi- 73–82, Jan. 1989.
tally coded video using the information extracted from MPEG-2 [11] W. H. Pun and B. D. Jeffs, “Adaptive image restoration using a general-
bit streams. The estimation algorithm is based on the statistical ized Gaussian model for unknown noise,” IEEE Trans. Image Process.,
vol. 4, pp. 1451–1456, Oct. 1995.
property of the pdf of quantized DCT coefficients and assumes [12] I. Matsuda, N. Shirai, and S. Itoh, “Lossless coding using predictors
that the distribution of the DCT coefficients approximates the and VLC’s optimized for each image,” in Proc. SPIE: Visual Commun.
Laplacian distribution. Image Process.(VCIP 2003), vol. 5150, Jul. 2003, pp. 1677–1684.
[13] E. Y. Lam and J. W. Goodman, “A mathematical analysis of the DCT
We conducted some experiments to verify the performance of coefficient distributions for images,” IEEE Trans. Image Process., vol.
our method using various sequences under different coding con- 9, no. 10, pp. 1661–1666, Oct. 2000.
ditions. Experimental results indicate that the determination co- [14] T. Eude, R. Grisel, H. Cherifi, and R. Debrie, “On the distribution of the
DCT coefficients,” in Proc. IEEE ICASSP, Adelaide, Australia, 2004,
efficients between estimated PSNR and actual PSNR are higher pp. V365–V368.
than 0.9. The averages of estimation errors are within 1.0 dB [15] R. C. Reininger and J. Gibson, “Distribution of the two-dimensional
except for HDTV 18 Mbit/s in B-picture. This method is avail- DCT coefficients for images,” IEEE Trans. Commun., vol. 31, no. 6, pp.
835–839, Jun. 1993.
able for both SDTV and HDTV, and can evaluate PSNR of every [16] G. Lakhani, “Distribution-based restoration of DCT coefficients,” IEEE
frame coded by different picture types. Trans.Circuits Syst. Video Technol., vol. 10, no. 5, pp. 819–823, Aug.
While our proposed method seems to provide reasonable per- 2000.
[17] F. Bellifemine, A. Capellino, A. Chimienti, R. Picco, and R. Ponti, “Sta-
formance for monitoring systems, the factors of PSNR estima- tistical analysis of the 2D-DCT coefficients of the differential signal for
tion errors need to be estimated in order to achieve higher es- images,” Signal Process. Image Commun., vol. 4, pp. 477–488, Nov.
timation accuracy especially at high compression ratios. Future 1992.
[18] X. Wu, “Lossless compression of continuous-tone images via context
studies should include, for example, examining the Laplacian selection, quantization, and modeling,” IEEE Trans. Image Process., vol.
assumption and the estimation errors of the standard deviation 6, no. 5, pp. 656–664, May 1997.
of the quantized coefficients. [19] “Test Model 5,” The MPEG Software Simulation Group (MSSG),
Software for MPEG-2 coding and decoding, ISO.IEC JTC1/SC29/
WG11/N0400, 1993.
REFERENCES [20] “Test materials to be used in subjective assessment,” Rec. ITU-R
BT.1210–3, Software for MPEG-2 coding and decoding.
[1] “User Requirements for Objective Perceptual Video Quality Measure- [21] J. L. Mitchell, W. B. Pnnebaker, C. E. Fogg, and D. J. LeGall, “MPEG
ments in Digital Cable Television,” ITU-T Rec. J.143, May 2000. video compression standard,” ISBN 0–412–08 771–5.
[2] Z. Wang, A. C. Bovil, and B. L. Evans, “Blind measurement of blocking [22] M. A. Robertson and R. L. Stevenson, “DCT quantization noise in com-
artifacts in images,” in Int. Conf. Image Process., vol. III, Vancouver, pressed images,” in Proc. Int. Conf. Image Process., vol. 1, Thessaloniki,
Canada, Sep. 2000, pp. 981–984. Greece, 2001, pp. 185–188.
[3] H. R. Wu and M. Yuen, “A generalized block-edge impairment metric [23] E. Nakasu, K. Aoki, R. Yajima, Y. Kanatsugu, and K. Kubota, “A sta-
for video coding,” IEEE Signal Process. Lett., vol. 4, no. 1, pp. 317–320, tistical analysis of MPEG-2 picture quality for television broadcasting,”
Jan. 1997. SMPTE J., vol. 105, no. 11, pp. 702–711, 1996.
ICHIGAYA et al.: METHOD OF ESTIMATING CODING PSNR 259

Atsuro Ichigaya received the B.E. degree in en- Naohiro Hara received the B.E. degree in image
gineering systems and the M.E. degree in science engineering from Chiba University, Chiba, Japan, in
and engineering from the University of Tsukuba, 1998.
Tsukuba, Japan, in 1996 and 1998, respectively. He joined NHK, Tokyo, Japan, in 1998 and has
He joined NHK, Tokyo, Japan, in 1998 and has been with NHK Science and Technical Research Lab-
been with NHK Science and Technical Research Lab- oratories since 2000. He is in charge of research and
oratories since 2000. He is in charge of research and development of digital video processing and video
development of video monitoring systems and digital coding.
video processing. Mr. Hara is a member of IEICE and ITE of Japan
Mr. Ichigaya is a member of VRSJ, IEICE, and
the Institute of Image Information and Television
Engineers (ITE) of Japan.

Yukihiro Nishida received the B.E. and M.E. de-


grees in electrical engineering from Keio University,
Yokohama, Japan, in 1983 and 1985, respectively.
He joined NHK, Tokyo, Japan, in 1985. He was a
Visiting Research Engineer at RAI Research Center,
Turin, Italy, in 1992. He is a Senior Research Engi-
neer at NHK Science and Technical Research Lab-
oratories, where he is in charge of research and de-
velopment of video coding and quality evaluation.
He participates in numerous standardization activi-
ties in the area of broadcasting.
Mr. Nishida is a member of ITE of Japan.

Masaaki Kurozumi received the B.E. degree in elec- Eisuke Nakasu (M’89) received the B.E. and
tronics from Doshisha University, Kyoto, Japan, in M.E. degrees in electrical engineering from Keio
1993, and the M.E. degree in information processing University, Yokohama, Japan, in 1980 and 1982,
from the Nara Institute of Science and Technology, respectively.
Ikoma, Japam, in 1995. He joined NHK, Tokyo, Japan, in 1982. He is a Se-
In 1995, he joined NHK, Tokyo, Japan, and nior Research Engineer at NHK Science and Tech-
worked as a Broadcasting Engineer at the Osaka nical Research Laboratories, where he is in charge
Broadcasting Station. Since 1997, he has been with of research and development of digital broadcasting
NHK Science and Technical Research Laboratories, systems, video coding and quality evaluation.
and is engaged in research on picture coding. Mr. Nakasu is a member of IEICE, ITE, and IPS
Mr. Kurozumi is a member of ITE of Japan. of Japan.

Você também pode gostar