Perceptually Optimized Coding of Color Images For Band-Limited Information Networks

InternationalJournalofCommunications(IJC)Volume2Issue2,June2013www.seipub.
org/ijc
PerceptuallyOptimizedCodingofColor
ImagesforBandLimitedInformation
Networks
EvgenyGershikov
DepartmentofElectricalEngineering,OrtBraudeAcademicCollegeof Engineering,Karmiel,Israel
andDepartmentofElectricalEngineering,TechnionIIT,Haifa,Israel
eugeny@tx.technion.ac.il
uniform DFT filter banks [18]. Still it is not always

clear that the added complexity also improves the
compression results. The recently introduced Rate
Distortion (RD) model for subband transform coders
[5] can be used in such applications to assess the
performance of the compression algorithm. This RD
model, however, approximates the MSE distortion of
the compression results, which is not always well
correlatedwithsubjectiveimagequalityasseenbythe
humaneye.Morecomplicateddistortionmeasurescan
be proposed, such as calculating the MSE distortion
between two images after an intensity transformation
and filtering for grayscale images [11] or a similar
process using a nonlinear transformation of the
primarycolorcomponents,followedbyfiltering,fora
colorimage[1].Abasicmeasurethatissimilartothe
MSE, but can incorporate perceptual weights is the
Weighted MSE (WMSE). This measure assigns a
different weight to the MSE of each subband of the
image, thus simulating the varying sensitivity of the
Human Visual System (HVS) to different horizontal
and vertical spatial frequencies. As a more realistic
tool,itcanimprovetheassessmentofthemodel.
ThegoalofthisworkistodevelopaperceptualRate
Distortion (RD) model of subband transform coders
based on the WMSE as the distortion measure. We
demonstrate the efficiency of the new model for
subbandtransformcodingbypresentinganewtypeof
compression algorithms based on perceptual
optimization of the preprocessing stage and of the
subbandratesallocation.
Abstract
The Mean Square Error (MSE) or the Peak Signal to Noise
Ratio (PSNR) are common distortion measures used to
assess image quality. Nevertheless, they are usually chosen
duetotheirsimplicityandnottheirperformanceastheyare
notalwayssuitablecomparedtothehumanobserver.Inthis
work we present a RateDistortion approach tocolorimage
compressionbasedonsubbandtransformsusingperceptual
optimization of the compression quality. This approach is
based on minimization of the Weighted Mean Square Error
(WMSE)oftheencodedimage,whichbettercorrespondsto
thequalityassessmentofthehumaneye.TheWMSEcanbe
measuredintheYCbCrcolorspace,forwhichvisualweights
are relatively easily derived. Based on the new approach,
new optimized compression algorithms are introduced
using the Discrete Cosine Transform and the Discrete
Wavelet Transform. We compare the new algorithms to
presently available algorithms such as JPEG and JPEG2000.
Our conclusion is that the new WMSE optimization
approach outperforms presently available compression
systemswhenahumanobserverisconsidered.
Keywords
ColorImageCompression;WeightedMeanSquareError;Discrete
Cosine Transform; Discrete Wavelet Transform; Perceptual Rate
DistortionModel;OptimalColorComponentTransform;Optimal
RateAllocation
Introduction
Many color image coding algorithms are based on
subband transforms for the compression process. The
complexity of such algorithms varies from systems
based on elementary block transforms like the DCT
(DiscreteCosineTransform)[14]used,forexample,in
JPEG [21] to more complicated algorithms based on
the Lapped Biorthogonal Transform (LBT), the
Discrete Wavelet Transform (DWT), wavelet packets
and filter banks, such as EZW (Embedded Zerotree
Wavelet)[19], JPEG2000 [13][15], JPEG XR [2][16] or
ObjectiveRateDistortionTheoryofSubband
TransformCoders
Considerageneralsubbandtransformcoderforcolor
images. Typically, the image samples are first pre
processed, then subband transformed and quantized
and finally postprocessed losslessly. A detailed
descriptionofthesestagesisgivenbelow.
49
www.seipub.org/ijc
International Journal of Communications (IJC) Volume 2 Issue 2, June 2013
Consider now a color image. The coding algorithm

described in the beginning of this section may be
regardedasapplyingaCCTtotheimage,followedby
monochromaticly subband coding each of the new
color components. The RateDistortion model of this
algorithmis[2]:
1) Preprocessing
HereaCCT(ColorComponentsTransform)isapplied
totheRGBcolorcomponentsoftheimage.Wedenote
the RGB components in vector form as
and the newcolor components as

. The
sizeCCTmatrixisdenotedasM.Thisstagecan
bewrittenas:
(4)
(1)
ThegoalofusingtheCCTtransformisusuallytode
correlate the highly correlated RGB components
[7][10][15][23].TheCCTtransformisoftenfollowedby
level shifting as for example is the case in JPEG2000
[13] so that the sample range of values becomes
symmetricaroundzero.
where and have the same meaning as before,

but for subband of color component (
).
Optimalratesallocationforthesubbandscanbefound
by minimizing the expression of Equation (4) under
therateconstraint:
(5)
for some total image rate . Here downsampling

factors have been used. denotes how much the
number of samples of color component has been
reducedbydownsampling.Forexample,ifthedown
samplingisbyafactorof2horizontallyandvertically
then:
2) SubbandTransformingandQuantizing
Asubbandtransform,suchastheDCTortheDWTis
applied to each color component. The subband
coefficients of each color component are then
quantized. An independent uniform scalar quantizer
foreachsubbandisused.
3) Postprocessing
Thequantizedcoefficientsareencodedlosslessly.The
goal is to reduce the number of bits required for the
coefficients without loss of information. Techniques
such as runlength encoding, zero trees, delta
modulation and entropy coding are used here. This
stagehastobeadaptedtothesubbandtransformused.
The optimal rates under the rate constraint of (5) as

wellasnonnegativityconstraintsare:
Rbi
j 1 j j
i2Gb bi2 ( MM T )1 ii
1
i
ln
kk
a
3
3 ( MM T )1 k2GM kAct j 1 j j
kk
k 1
To derive the RD behavior of the algorithm, first the

RD of a scalar uniform quantizer needs to be
considered. Assuming that a random signal with
variance isquantizedbysuchaquantizer,itsRate
Distortionbehaviorhasbeenapproximatedby[6][20]:
(2)
where R is the rate and is a constant dependent

upon thedistribution of .Then based on (2) theRD
model of a general monochromatic subband coder
with subbandscanbeexpressedas:
R
3
(6)
(3)
Here istheMSEofsubband
,
is its variance, is its energy gain [20] and is the
rate allocated to it. Also is its sample rate, i.e., the
relativepartofthenumberofcoefficientsinitfromthe
total number of samples in the signal. is a constant
equalto
.
50
InternationalJournalofCommunications(IJC)Volume2Issue2,June2013www.seipub.org/ijc
Where b Acti with Acti denoting the
(11)
denotes statistic mean. Similarly in the

where
C1C2C3domain:
setofnonzero(oractive)ratesinthecolorcomponent
,i.e.,
(7)
Also
andusing(9)wecanexpress
The structure of this work is as follows. In the next

section the perceptual RateDistortion model is
introduced. Section Perceptually Optimized
Compression presents new color compression
algorithms optimized according to this model based
on the DCT subband transform and on the DWT.
Simulationsofthenewalgorithmsandcomparisonto
presently available algorithms are provided in this
section. Our conclusions and summary are given in
thefinalsection.
(14)
where is the row of
in column form. In a
similar fashion, the diagonal elements of
can be
recognized as the MSE distortions of the , ,
color components, given by (2) and slightly rewritten
tobecome:
where
The Perceptual R-D Model
(15)
denote the
(16)
once(15)issubstitutedfor .Nowif,forthesakeof
convenience, we denote the YCbCr color components
at each pixel as a vector
, then the
WMSEofthe colorcomponent
is:
(17)
Ascanbeseen,thisexpressionincorporatestheenergy
gainsofthesubbands aswellastheirsamplerates
. Also the visual weights
are part of the
expression,providingvaryingsignificancetodifferent
subbands of the same color component as well as
between color components. Defining the total WMSE
astheaverageWMSEoftheYCbCrcolorcomponents,
weget:
WMSE
(10)
SincetheSBTcoefficientsarelossyencoded,errorsare
introducedbetweenthereconstructedcoefficients
in the YCbCr color space and the original ones. The

error covariance matrix for the subband in the
YCbCrdomainis:
(9)
where stands for the CCT matrix from the YCbCr

color space to the C1C2C3 space. If is the CCT
matrixfromRGBtoC1C2C3and
istheRGBto
YCbCrmatrix,then:
and Rbi and
rateandvarianceofsubband ofcolorcomponent ,
respectively. Note that we continue here with the
consistent notation of a tilde for the variables related
to the C1C2C3 color space. Assuming that the
quantization errors of the three color components in
eachsubbandintheC1C2C3domainareuncorrelated,
becomesadiagonalmatrixand(14)becomes
We assume here that we are given the visual

perception weights corresponding to the subbands of
a certain subband transform (SBT) in a color space.
Such a space can be, for example, YCbCr as we have
chosen in this work. We now wish to derive an
expression for the WMSE distortion of a coder based
on the subband transform. The same coder described
in Subsection Objective RateDistortion theory of
subbandtransformcodersisassumed,sothataCCT
is applied to the RGB color components of the image
priortocodingandtheactualimagedatacompression
isperformedinanothercolorspacedenotedC1C2C3.
We denote as
the vector of the SBT
coefficients at some index in subband in the YCbCr
color space. Similarly, the vector of subband
coefficients in the C1C2C3 color space is denoted
.DuetothelinearityoftheSBT,the
followingrelationshipholds:
as:
(13)
of the YCbCr color

The MSE distortions
componentsinsubband arethediagonalelementsof
andthus:
(8)
(12)
by
1 3
WMSE ( xYCbCr )i
3 i 1
1 3 B 1
bGb wbi d bi
3 i 1 b0
and after substituting (16) for

becomes:
(18)
the expression
51
www.seipub.org/ijc
is the same
coefficient of at index and
coefficientafterquantizationandreconstruction.Now
3 _ 1
2
1 3 B1
WMSE bGbwbi M k2 bk eaRbk
3 i1 b0
k 1
ik
2
_ 1
1
bGb eaRbk wbi M .
3 b0
k 1
i 1
ik
B1
2
2
k bk
consider the expression
(19)
for
, where is the number of the image pixels.

UsingEquation(23),itcanbewrittenas:
Tosimplify(19)wedenote:
(24)
(20)
sothattheWMSEexpressionbecomes:
where
. Assuming zero
correlation of the quantization errors of the different
color components in each subband and between
(21)
Clearly,ifthevisualweights areallequalto1,the
WMSE expression of (21) should become the
expression for the MSE in the YCbCr domain. This
expression is given exactly by (4) with the difference
that there is to be replaced by in our case. From
thecomparisonofequations(21)and(4)weconclude
that
in that case, which means
accordingto(20)that
subbands means that

immediately
BasicOptimizationUsingtheWMSEModel
(22)
AfterderivingtheWMSEexpression,thenaturalnext
step is to use it to find the optimal rates and optimal
CCT in the WMSE sense. First we wish to minimize
the WMSE of (21), subject to the rate constraint
, resulting in the following
Lagrangian( istheLagrangemultiplier):
Astraightforwardcheckprovesthatthisisindeedthe
case.
DecorrelationoftheQuantizationErrors
In the derivation of the WMSE expression of (21) we
haveassumedthatthequantizationerrorsofthe , ,
colorcomponentsareuncorrelatedineachsubband.
It is of interest to note that the assumption in the
derivationoftheMSEexpressionof(4)wasthelackof
correlation of the quantization errors in the image
domain [2], i.e. that
and
have zero
correlation for
,
. Note that
denotes here the reconstructed component. The

question that rises is whether the assumption of zero
correlation in each subband means also zero
correlationintheimagedomain.
(25)
whichisminimizedbytheoptimalratesgivenby:
(26)
Here
(27)
Using the vector space interpretation of subband

transforms[20],wecanwriteforthecolorcomponent
:
Notethatnoconstraintsfornonnegativityoftherates
are used here, which means that high rates are
assumed. As for the optimal CCT matrix : it can be
foundbyminimizingthetargetfunction
,thatis
actuallythedenominatorofthe
in(26)aftersome
straightforwardmanipulations:
(23)
here is the color component in vector form after

lexicographic ordering.
are the SBT synthesis
vectors. Also the sum on is on all the coefficient
indices in the subband,
denotes the subband
52
follows
according to (24). But this means exactly zero

correlationofthequantizationerrorsinimagedomain.
ThusthederivationoftheWMSEexpressionof(21)is
once again consistent with the derivation of the MSE
expressionof(4).
, hence
(28)
where b Acti with Acti defined in (7) while
We should remind here that

is a function of as
given in (20). Also the variances depend on , or
more specifically on . These variances are the
diagonalelementsofthesubband covariancematrix
intheC1C2C3imagedomain:
and i areasin(8).Asfor
YRGB E YbRGB
b
Perceptually Optimized Compression

Inthissectionwepresentageneralapproachtocolor
image compression using a subband transform with
perceptual optimization of the CCT and of the
subbandratesallocation.Thisapproachconsistsofthe
stagesdescribedinthebeginningofSectionObjective
RateDistortion theory of subband transform coders.
Thedifferenceshereisthatinthepreprocessingstage,
the perceptually optimal CCT transform is applied to
thecolorcomponentsandinthequantizationstagethe
perceptually optimal rates allocation is used. We
demonstrate this approach both for the DCT and the
DWT in the following subsections. Note that a
probabilitydistributionmodelcanbeusedfortheSBT
coefficients to improve the performance of the
algorithms with respect to runtime and compression
quality [4]. For example, the Laplacian probability
modelcanbeassumedforDCTcoefficients[9].
(30)
accordingto:
(31)
Here
is defined similarly to the
definitions in the beginning of the section of . Also
denotesthe rowofthe matrixinvectorform.
Thusthetargetfunction
canberewrittenas:
3
B 1
f ( M ) ( mk T b mk )Gb bk ,
_
k 1 b 0
bk
(32)
_ 1
wbi M .
i 1
ik
3
DCTBasedCompressionAlgorithm
Since the DCT is a subband transform, the Rate
Distortion theory of Section The Perceptual RD
Model can be applied to it. To find the DCT visual
weights we use the HVS CSF (Contrast Sensitivity
Function)curvesfortheYCbCrcolorspacethatcanbe
found, for example, in [20]. To convert the cpd (cycle
per degree) units of thesegraphs to spatialfrequency
unitsfortheDCT,theequationsproposedin[22]can
be
used.
We
consider,
for
example,
512 512
images
displayed
as
onadisplaywithdotpitchof0.25mm.
The viewingdistance isassumed to befour times the
imageheight[12],i.e.,inthisexample50cm.Similarly
we can consider
images displayed as
onabigscreenataviewingdistance
of100cm.
OptimalRateswithDownSampling
When considering potential downsampling of some
ofthecolorcomponents,therateconstraintbecomes(5)
and the Lagrangian that incorporates this constraint,
as well as constraints for the nonnegativity of the
subbandrates,is:
2
1 B 1 3
L {Rbi }, M , ,{bi } bGb bk k2e aRbk bk
3 b0 k 1
(33)
3 B 1
3 B 1
i b Rbi R bi Rbi ,
i 1 b0
i 1 b0
are the Lagrange multipliers for the new

where
constraints.Theratesthatminimize(33)are:
Rbi
Thestagesoftheproposedalgorithmareasfollows:
j j
1.
j 1
i2Gb bi2 bk
1
i
ln
k k
a
3
Act 2
Act
3 GM k k k j 1 j j
k 1
(35)
andcanalsobeexpressedusingthe matrixandthe
subband b covariance matrix in the RGB image
domain:
T
,
b E YbRGB YRGB
YbRGB YRGB
b
b
,itisgivenby:
(29)
FindtheoptimalCCT byminimizing(32).
2. Apply the CCT

to the RGB color
componentsoftheimagetoreceivethenewcolor
components , , .
(34)
3. Apply the DCT block transform to each color

component ,
.
53
www.seipub.org/ijc
4. Calculate the optimal rates according to (34)

substituting there the used CCT matrix and the
variancesoftheDCTsubbands.Tofindtheactive
subbands, the algorithm presented in [5] can be
used.
allocation and CCT, as well as another version that

uses optimal rates, but in the YCbCr color space. The
PSPNRmeasureusedhereis:
5. QuantizetheDCTcoefficientsusingauniform
scalar quantizer in each subband. The (optimal)
quantization steps are found using an iterative
algorithm[5].
where
for each color component is calculated
in the DWT domain in the YCbCr color space
accordingtothevisualweightssuggestedin[20].Then
theaveragePSPNRonthe3colorcomponentsistaken.
Based on our experience and results, this is a good
measureofsubjectiveimagequality.
6. Use postquantization coding similar to the

one used in JPEG. Adaptive Huffman coding is
employed and the codes are sent with the image
data.Thisstageislosslessanddoesnotaffectthe
imagedistortion.
(36)
Note that each image in the table was compressed at

the same compression rate (given in the last column
fromtheleft)foreachofthealgorithms,buttherateis
not the same for different images. The reason is that
the the rate was chosen to achieve the same objective
performance (PSNR) for the first algorithm (from the
left) to allow meaningful comparison of average
algorithmperformances.SeealsoTable2below.
It is of interest to compare the performance of this

algorithm to the other DCTbased compression
algorithms, such as the MSE optimized algorithm
proposed in [5] and JPEG. A comparison for several
imagesisgiveninTable1.Weconsiderheretheabove
algorithm with WMSE RD optimization of the rates
TABLE1Perceptuallybasedresults(PSPNR)for(fromlefttoright):TheDCTbasedWMSEoptimizedalgorithmintheYCbCrdomain;Thesame
algorithmwithoptimalCCT;TheMSEoptimizedalgorithm;JPEG.Thecompressionrateforeachimageisshownintherightcolumn.
54
WMSEAlg.inthe
optimaldomain
40.6
MSEAlg.
JPEG
Rate[bpp]
Lena
WMSEAlg.inthe
YCbCrdomain
39.4
38.9
37.6
0.76
Peppers
39.6
39.6
38.1
36.6
0.81
Baboon
42.0
42.5
39.2
36.1
1.76
Cat
41.3
43.1
41.3
39.9
1.30
Landscape
42.5
42.5
40.5
38.0
1.85
House
39.8
40.3
39.2
38.1
0.54
JellyBeans
38.5
38.6
38.3
37.5
0.47
Fruits
41.0
42.3
40.4
38.9
0.71
Sails
41.0
42.9
39.7
37.6
1.84
Image
Monarch
39.8
40.2
38.7
37.5
1.03
Goldhill
42.9
43.4
41.9
40.6
2.17
Mean
40.7
41.5
39.7
38.0
TABLE2SameasTable1,butforPSNRinsteadofPSNR.NotethatoptimizationofPSPNR,asinducedbythehumanobserver,doesnot
necessarilymeanoptimizationofthearbitrarilyusedPSNR(seetext).
PSNR
Image
WMSEAlg.inthe
optimaldomain
30.5
MSEAlg.
JPEG
Rate[bpp]
Lena
WMSEAlg.inthe
YCbCrdomain
30.0
30.7
29.7
0.76
Peppers
30.0
30.1
30.5
29.3
0.81
Baboon
30.0
29.0
30.5
26.5
1.76
Cat
30.0
29.6
31.3
29.5
1.30
Landscape
30.0
30.1
30.3
25.9
1.85
House
30.0
30.2
30.3
29.5
0.54
JellyBeans
30.0
30.3
30.6
29.7
0.47
Fruits
30.0
29.8
30.6
30.6
0.71
Sails
30.0
29.7
30.6
28.9
1.84
Monarch
30.0
29.6
30.6
29.4
1.03
Goldhill
30.0
30.2
31.7
29.2
2.17
Mean
30.0
29.9
30.7
28.9
It can be concluded from the table that the WMSE

optimized algorithm with the optimal CCT achieves
the highestPSPNR, which is1.8dB higher onaverage
thanthePSPNRoftheMSEoptimizedalgorithmand
3.5dB above JPEG. The use of the optimal CCT in
WMSEsenseincreasestheperformancebyalmost1dB
(0.8dB) on average when perceptually optimal rates
areemployed.Anothercomparisonofinterestisofthe
standardorobjectivedistortionsofthealgorithm,i.e.,
the PSNR as presented in Table 2. As expected, the

MSEoptimizedalgorithmissuperiorhere,butwhatis
perhaps less intuitive is the fact that the use of the
optimal CCT slightly decreases the PSNR, indicating
thatPSNRandPSPNRaredifferentmeasures.Ascan
beseenintheexamplesofFigs.12below,thehuman
observer judgement is closely related to the PSPNR,
not the PSNR. Despite this both WMSE algorithms
have MSE performance superior to JPEG with a gain
55
www.seipub.org/ijc
of 1.1dB in the PSNR without using the optimal CCT

and slightly less (1dB) with the optimal CCT. We
conclude this section by presenting a visual
comparison of the algorithms as can be seen in Fig. 1
fortheLenaimageandinFig.2fortheBaboonimage.
It can be seen that the WMSE algorithm provides the

bestresultsforbothimagesvisually,whiletheresults
of the MSE algorithm are slightly less pleasing to the
eye.YetbothalgorithmsaresuperiortoJPEG.
FIG.1COMPRESSIONRESULTSFORLENAAT0.72BPP.ORIGINALIMAGE(TOPLEFT);IMAGECOMPRESSEDBYTHEWMSE
OPTIMIZEDALGORITHM(TOPRIGHT,PSPNR=40.4DB);IMAGECOMPRESSEDBYJPEG(BOTTOMLEFT,PSPNR=37.7DB);IMAGE
COMPRESSEDBYTHEMSEOPTIMIZEDALGORITHM(BOTTOMRIGHT,PSPNR=39.3DB).ASEXPECTED,THEWMSEALGORITHM
OUTPERFORMSTHEOTHERMETHODS,ESPECIALLYINTHEMARKEDAREAS.
FIG.2COMPRESSIONRESULTSFORTHEBABOON(ZOOMEDIN)AT0.88BPP.ORIGINALIMAGE(TOPLEFT);IMAGECOMPRESSEDBY
THEWMSEOPTIMIZEDALGORITHM(TOPRIGHT,PSPNR=36.9DB);IMAGECOMPRESSEDBYJPEG(BOTTOMLEFT,PSPNR=33.6DB);
IMAGECOMPRESSEDBYTHEMSEOPTIMIZEDALGORITHM(BOTTOMRIGHT,PSPNR=35.6DB).HEREAGAIN,THEWMSE
ALGORITHMOUTPERFORMSTHEOTHERMETHODS.
56
5. Quantize the DWT coefficients by a uniform

quantizer with a central deadzone in each
subbandsimilartotheoneusedinJPEG2000PartI
[8].Useoptimalquantizationsteps.
DWTBasedCompressionAlgorithm
When the DWT is considered, there are quite a few
options for the wavelet filter bank to be used for the
decomposition. We have chosen the Daubechies 9/7
filter bank, but obviously other choices can be
consideredaswell.Notiling[13]isused.Thechoiceof
the visual weights is according to [20]. The stages of
theproposedalgorithmare:
1.
6. Use the postquantization coding of the EZW

algorithm[19] onthequantizedsubband
coefficients. This stage is lossless and includes bit
plane coding using zero trees. The bit plane
coding is split into two passes (dominant and
subordinate) and a separate arithmetic coder is
employedforeachpass.
FindtheoptimalCCT byminimizing(32).
2. Apply the CCT

to the RGB color
componentsoftheimagetoreceivethenewcolor
components , , .
Itisinterestingtocomparetheproposedalgorithmto
JPEG2000. We have considered the JPEG2000
implementationusingtheJasPersoftwarepackage[24]
andanotherversionoftheimplementationwithfixed
visual weighting at subband level using the CSF
weights of[20]. The visual results for the Lenaimage
canbeseeninFig.3.ThePSNRresultshereare29.5dB
for the proposed WMSE optimized algorithm, 28.6dB
for JPEG2000 (original JasPer implementation) and
28.5dB for JPEG2000 with CSF weights. We conclude
that the use of CSF weights, that affects the tier2
codingstageoftheJPEG2000algorithm,decreasesthe
PSNR,butslightlyimprovesthevisualperformance.
3. ApplytheDWTtreedecompositionuptothe
required depth of the tree (3, 4, 5 or higher
according toimagesize) to each colorcomponent
,
.
4. Calculate the optimal rates according to (34)
substituting there the used CCT matrix and the
variances, sample rates and energy gains of the
DWT subbands. The determination of the active
subbands is the same as for the DCTbased
algorithmoftheprevioussubsection.
FIG.3COMPRESSIONRESULTSFORLENAAT0.52BPP.ORIGINALIMAGE(TOPLEFT);IMAGECOMPRESSEDBYTHEDWTBASED
WMSEOPTIMIZEDALGORITHM(TOPRIGHT,PSPNR=19.7DB);IMAGECOMPRESSEDBYJPEG2000(BOTTOMLEFT,PSPNR=19.1DB);
IMAGECOMPRESSEDBYJPEG2000WITHCSFWEIGHTS(BOTTOMRIGHT,PSPNR=19.2DB).
ALSOHERE,THEWMSEALGORITHMISSUPERIORTOTHEREST.
57
www.seipub.org/ijc
andmustachetexturesintheCatimagearelost.Inthe
case of the Peppers image, the compression result of
JPEG2000 is less pleasing to the eye due to the color
artifacts introduced. Fig. 6 further demonstrates the
loss of spatial details in the case of JPEG2000
compression of the Sails image, the blurring of the
contoursintheMonarchimageandbotheffectsinthe
Goldhill image (see the top marked area for the
blurred contour effect and, for example, the bottom
left marked area for the loss of spatial details).
Furthermore, color artifacts are introduced by
JPEG2000 in the Goldhill image as indicated, for
instance,inthemarkedareainthecenteroftheimage.
Also the proposed algorithm produces an image that

ismuchmorepleasingtotheeyethanJPEG2000.
Similar results can be seen in the comparison of the
proposed algorithm and JPEG2000 for other example
imagesin Figs.4,5and6.In Fig.4 theloss ofspatial
details at high frequencies in the Landscape and
House images as well as the introduction of false
contoursintheJellyBeansimageshouldbenotedfor
JPEG2000. These effects can be seen in the marked
regions. In Fig. 5 once again the superiority of the
WMSEoptimizedalgorithmonJPEG2000canbeseen
in regions of high frequency details as demonstrated
bytheFruitsandCatimages.Forexample,thedetails
oftheappletextureintheFruitsimageandofthefur
FIG.4LANDSCAPE,HOUSEANDJELLYBEANSIMAGESFROMLEFTTORIGHT:ORIGINAL,COMPRESSEDBYTHEWMSE
ALGORITHM(WMSEALG.)ANDCOMPRESSEDBYJPEG2000.
PSPNRFORTHELANDSCAPEIMAGE:17.1DB(WMSEALG.)AND15.7DB(JPEG2000).
PSNR:28.7DB(WMSEALG.)AND25.3DB(JPEG2000)AT0.97BPP.
PSPNRFORTHEHOUSEIMAGE:19.4DB(WMSEALG.)AND19.0DB(JPEG2000).
PSPNRFORTHEJELLYBEANSIMAGE:18.8DB(WMSEALG.)AND18.2DB(JPEG2000).
INTHEONLYCASEWHERETHEPSNROFJPEG2000ISHIGHERTHANTHENEWALGORITHM(HOUSE),THEPSPNRRESULT
SUPPORTSTHEFACTTHATVISUALLYTHENEWALGORITHMPROVIDESSUPERIORRESULTS.
58
FIG.5FRUITS,CATANDPEPPERSIMAGESFROMLEFTTORIGHT:ORIGINAL,COMPRESSEDBYTHEWMSEALGORITHM(WMSE
ALG.)ANDCOMPRESSEDBYJPEG2000.
PSPNRFORTHEFRUITSIMAGE:22.2DB(WMSEALG.)AND21.1DB(JPEG2000).
PSPNRFORTHECATIMAGE:17.0DB(WMSEALG.)AND16.2DB(JPEG2000).
PSPNRFORTHEPEPPERSIMAGE:20.3DB(WMSEALG.)AND19.3DB(JPEG2000).
ASCANBESEEN,PSNRANDPSPNRRESULTSARESUPERIORFORTHENEWALGORITHMCOMPAREDTOJPEG2000.ITISALSO
OBSERVEDVISUALLYEXAMPLESAREINDICATEDINTHEMARKEDAREAS.
59
www.seipub.org/ijc
FIG.6SAILS(ZOOMEDIN),MONARCH(ZOOMEDIN)ANDGOLDHILLIMAGESFROMLEFTTORIGHT:ORIGINAL,COMPRESSED
BYTHEWMSEALGORITHM(WMSEALG.)ANDCOMPRESSEDBYJPEG2000.
PSPNRFORTHESAILSIMAGE:19.2DB(WMSEALG.)AND18.0DB(JPEG2000).
PSPNRFORTHEMONARCHIMAGE:19.9DB(WMSEALG.)AND19.6DB(JPEG2000).
PSPNRFORTHEGOLDHILLIMAGE:17.6DB(WMSEALG.)AND16.6DB(JPEG2000).
ONCEAGAIN,THEPSNRANDPSPNRRESULTSARESUPERIORFORTHENEWALGORITHMCOMPAREDTOJPEG2000(SEE
EXAMPLESINDICATEDINTHEMARKEDAREAS).
stageusingaCCT,followedbyasubbandtransform,
quantizationstage,andlosslessentropyencoding.The
algorithms are optimized with regard to the color
component transform in the preprocessing stage of
the compression as well as the quantization tables
usedinthecodingstage,bothwithrespecttoWMSE.
TheproposedDCTbasedalgorithmoutperformsboth
JPEG and the corresponding MSE optimized
algorithm. The DWTbased algorithm, as expected,
achieveshighercompressionratiosforthesameimage
quality than DCTbased techniques. We demonstrate
in this work that even when a relatively basic
algorithm is used in the postprocessing stage
(introducedforEZW),superiorresultsareobtainedby
Summary
A perceptuallybased model for the RateDistortion
functionofcolorsubbandcodershasbeenintroduced.
ThenewmodelapproximatestheWMSEdistortionof
an image in a given color space, such as YCbCr. This
distortion is then minimized to achieve perceptual
optimizationofthecompression.Whentheweightsin
the WMSE calculation are taken based on the CSF
curves of the human visual system, better
correspondence to image quality assessment by the
humaneyeisachieved.
Based on the RateDistortion model, new algorithms
have been introduced consisting of a preprocessing
60
the proposed algorithm when compared to other

DWTbasedalgorithms,suchasJPEG2000.Thisholds
even if the same WMSE distortion is used in both
JPEG2000andtheproposedalgorithm.Ourconclusion
is that based on the new perceptual RateDistortion
model, optimized compression algorithms can be
designed with compression results superior to
presentlyavailabletechniques.
[8] JPEG 2000 Part I: Final Draft International Standard

(ISO/IECFDIS154441),NCITSISO/IECJTC1/SC29/WG1
N1855(Aug.2000).
[9] Lam, E. Y. and Goodman, J. W. A mathematical
analysis of the DCT coefficient distributions for
images. IEEE Trans. on Image Processing 9 (2000):
16611666.
ACKNOWLEDGMENT
[10] Limb J. O. and Rubinstein C.B. Statistical Dependence
This research was supported in part by the HASSIP

Research Program HPRNCT200200285 of the
EuropeanCommission,andbytheOllendorffMinerva
Center.MinervaisfundedthroughtheBMBF.
Between Components of A Differentially Quantized

ColorSignal.IEEETrans.onCommunicationsCom20
(Oct.1971):890899.
[11] Mannos, J. and Sakrison, D. The effects of a visual
REFERENCES
fidelitycriterionoftheencodingofimages.IEEETrans.
onInformationTheory20(Jul.1974):525536.
[1] Faugeras,O.D.DigitalColorImageProcessingWithin
[12] Ngan, K. N., Leong, K. S. and Singh, H. Adaptive
theFrameworkofaHumanVisualModel.IEEETrans.
cosine transform coding of images in perceptual
on Acoustics Speech and Signal Processing ASSP27
domain. IEEE Trans. on Acoustics, Speech, and Signal
(1979):380393.
Processing37(Nov.1989):17431750.
[2] Fiorucci,F.,Baruffa,G.andFrescura,F. Objectiveand
[13] Rabbani, M. and Joshi, R. An overview of the JPEG
subjective quality assessment between JPEG XR with

overlap
and
JPEG
2000.",
Journal
of
2000 still image compression standard. Signal
Visual
Processing:ImageCommunication17(2002):348.
Communication and Image Representation 23 (Aug.
[14] Rao, K. R. and Yip, P. Discrete cosine transform:
2012):835844.
algorithms, advantages, applications. Academic Press,
[3] Gershikov, E. and Porat, M. A RateDistortion
1990.
ApproachtoOptimalColorImageCompression.Proc.
[15] Richter, T. and Simon, S. On the JPEG 2000 ultrafast
ofEUSIPCO,Florence,Italy,2006.
mode.Proc.ofICIP(Oct.2012):25012504.
[4] Gershikov, E. and Porat, M. Data Compression of
[16] Richter,T.SpatialConstantQuantizationinJPEGXRis
Color Images using a Probabilistic Linear Transform
Nearly Optimal. Proc. Of the Data Compression
Approach, in LNCS #4310, T. Boyanov, S. Dimova, K.
Conference(March2010):7988.
Georgiev, G. Nikolov (Eds.), 582589, SpringerVerlag
[17] Roterman, Y. and Porat, M. Color image coding using
BerlinHeidelberg,2007.
regional correlation of primary colors. Image and
[5] Gershikov, E. and Porat, M. On color transforms and
VisionComputing25(2007):637651.
bitallocationforoptimalsubbandimagecompression.
[18] Satt, A. and Malah, D. Design of Uniform DFT Filter
SignalProcessing:ImageCommunication22(2007):118.
BanksOptimizedforSubbandCodingofSpeech,IEEE
[6] Gersho, A. and Gray, R. M. Vector Quantization and
Trans. on Acoustics, Speech and signal Processing 37
SignalCompression,Boston,MA:Kluwer,1992,ch.2.
(Nov.1989):16721679.
[7] GoffmanVinopal, L. and Porat, M. Color image
[19] Shapiro, J. M. Embedded Image Coding Using
compression using intercolor correlation. Proc. of
Zerotrees of Wavelet Coefficients, IEEE Trans. on
IEEEICIP(2002):II353II356.
SignalProcessing41(1993):33453462.
61
www.seipub.org/ijc
[20] Taubman,D.S.andMarcellin,M.W.JPEG2000:image
[23] Yamaguchi, H. Efficient Encoding of Colored Pictures
compression, fundamentals, standards and practice,
inR,G,BComponents.Trans.onCommunications32
KluwerAcademicPublishers,2002.
(Nov.1984):12011209.
[24] http://www.ece.uvic.ca/mdadams/jasper
[21] Wallace, G. K. The JPEG still picture compression

standard.IEEETrans.ConsumerElectronics38(1992):
Evgeny Gershikov received his

Ph.D. in Electrical Engineering from
Technion Israel Institute of
Technology in Haifa, Israel in 2011.
His areas of interest are Signal and
Image Processing, Color Processing
and Vision, Computer Vision,
Pattern Recognition and Speech
xviiixxxiv.
[22] Wang, C. Y., Lee, S. M. and Chang, L. W. Designing
JPEG quantization tables based on human visual
system. Signal Processing: Image Communication 16
(2001):501506.
Recognition.
62

Perceptually Optimized Coding of Color Images For Band-Limited Information Networks

Enviado por

Dados do documento

Título original

Direitos autorais

Formatos disponíveis

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Direitos autorais:

Formatos disponíveis

Perceptually Optimized Coding of Color Images For Band-Limited Information Networks

Enviado por

Direitos autorais:

Formatos disponíveis

InternationalJournalofCommunications(IJC)Volume2Issue2,June2013www.seipub.

uniform DFT filter banks [18]. Still it is not always

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

Consider now a color image. The coding algorithm

and the newcolor components as

where and have the same meaning as before,

for some total image rate . Here downsampling

The optimal rates under the rate constraint of (5) as

To derive the RD behavior of the algorithm, first the

where R is the rate and is a constant dependent

Where b Acti with Acti denoting the

denotes statistic mean. Similarly in the

The structure of this work is as follows. In the next

The Perceptual R-D Model

in the YCbCr color space and the original ones. The

where stands for the CCT matrix from the YCbCr

and Rbi and

We assume here that we are given the visual

of the YCbCr color

and after substituting (16) for

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

consider the expression

, where is the number of the image pixels.

subbands means that

denotes here the reconstructed component. The

Using the vector space interpretation of subband

here is the color component in vector form after

according to (24). But this means exactly zero

where b Acti with Acti defined in (7) while

We should remind here that

Perceptually Optimized Compression

are the Lagrange multipliers for the new

2. Apply the CCT

3. Apply the DCT block transform to each color

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

4. Calculate the optimal rates according to (34)

allocation and CCT, as well as another version that

6. Use postquantization coding similar to the

Note that each image in the table was compressed at

It is of interest to compare the performance of this

It can be concluded from the table that the WMSE

the PSNR as presented in Table 2. As expected, the

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

of 1.1dB in the PSNR without using the optimal CCT

It can be seen that the WMSE algorithm provides the

5. Quantize the DWT coefficients by a uniform

6. Use the postquantization coding of the EZW

2. Apply the CCT

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

Also the proposed algorithm produces an image that

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

the proposed algorithm when compared to other

[8] JPEG 2000 Part I: Final Draft International Standard

[10] Limb J. O. and Rubinstein C.B. Statistical Dependence

This research was supported in part by the HASSIP

Between Components of A Differentially Quantized

[12] Ngan, K. N., Leong, K. S. and Singh, H. Adaptive

cosine transform coding of images in perceptual

on Acoustics Speech and Signal Processing ASSP27

domain. IEEE Trans. on Acoustics, Speech, and Signal

[2] Fiorucci,F.,Baruffa,G.andFrescura,F. Objectiveand

[13] Rabbani, M. and Joshi, R. An overview of the JPEG

subjective quality assessment between JPEG XR with