Você está na página 1de 14

InternationalJournalofCommunications(IJC)Volume2Issue2,June2013www.seipub.

org/ijc

PerceptuallyOptimizedCodingofColor
ImagesforBandLimitedInformation
Networks
EvgenyGershikov
DepartmentofElectricalEngineering,OrtBraudeAcademicCollegeof Engineering,Karmiel,Israel
andDepartmentofElectricalEngineering,TechnionIIT,Haifa,Israel
eugeny@tx.technion.ac.il

uniform DFT filter banks [18]. Still it is not always


clear that the added complexity also improves the
compression results. The recently introduced Rate
Distortion (RD) model for subband transform coders
[5] can be used in such applications to assess the
performance of the compression algorithm. This RD
model, however, approximates the MSE distortion of
the compression results, which is not always well
correlatedwithsubjectiveimagequalityasseenbythe
humaneye.Morecomplicateddistortionmeasurescan
be proposed, such as calculating the MSE distortion
between two images after an intensity transformation
and filtering for grayscale images [11] or a similar
process using a nonlinear transformation of the
primarycolorcomponents,followedbyfiltering,fora
colorimage[1].Abasicmeasurethatissimilartothe
MSE, but can incorporate perceptual weights is the
Weighted MSE (WMSE). This measure assigns a
different weight to the MSE of each subband of the
image, thus simulating the varying sensitivity of the
Human Visual System (HVS) to different horizontal
and vertical spatial frequencies. As a more realistic
tool,itcanimprovetheassessmentofthemodel.
ThegoalofthisworkistodevelopaperceptualRate
Distortion (RD) model of subband transform coders
based on the WMSE as the distortion measure. We
demonstrate the efficiency of the new model for
subbandtransformcodingbypresentinganewtypeof
compression algorithms based on perceptual
optimization of the preprocessing stage and of the
subbandratesallocation.

Abstract
The Mean Square Error (MSE) or the Peak Signal to Noise
Ratio (PSNR) are common distortion measures used to
assess image quality. Nevertheless, they are usually chosen
duetotheirsimplicityandnottheirperformanceastheyare
notalwayssuitablecomparedtothehumanobserver.Inthis
work we present a RateDistortion approach tocolorimage
compressionbasedonsubbandtransformsusingperceptual
optimization of the compression quality. This approach is
based on minimization of the Weighted Mean Square Error
(WMSE)oftheencodedimage,whichbettercorrespondsto
thequalityassessmentofthehumaneye.TheWMSEcanbe
measuredintheYCbCrcolorspace,forwhichvisualweights
are relatively easily derived. Based on the new approach,
new optimized compression algorithms are introduced
using the Discrete Cosine Transform and the Discrete
Wavelet Transform. We compare the new algorithms to
presently available algorithms such as JPEG and JPEG2000.
Our conclusion is that the new WMSE optimization
approach outperforms presently available compression
systemswhenahumanobserverisconsidered.
Keywords
ColorImageCompression;WeightedMeanSquareError;Discrete
Cosine Transform; Discrete Wavelet Transform; Perceptual Rate
DistortionModel;OptimalColorComponentTransform;Optimal
RateAllocation

Introduction
Many color image coding algorithms are based on
subband transforms for the compression process. The
complexity of such algorithms varies from systems
based on elementary block transforms like the DCT
(DiscreteCosineTransform)[14]used,forexample,in
JPEG [21] to more complicated algorithms based on
the Lapped Biorthogonal Transform (LBT), the
Discrete Wavelet Transform (DWT), wavelet packets
and filter banks, such as EZW (Embedded Zerotree
Wavelet)[19], JPEG2000 [13][15], JPEG XR [2][16] or

ObjectiveRateDistortionTheoryofSubband
TransformCoders
Considerageneralsubbandtransformcoderforcolor
images. Typically, the image samples are first pre
processed, then subband transformed and quantized
and finally postprocessed losslessly. A detailed
descriptionofthesestagesisgivenbelow.

49

www.seipub.org/ijc

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

Consider now a color image. The coding algorithm


described in the beginning of this section may be
regardedasapplyingaCCTtotheimage,followedby
monochromaticly subband coding each of the new
color components. The RateDistortion model of this
algorithmis[2]:

1) Preprocessing
HereaCCT(ColorComponentsTransform)isapplied
totheRGBcolorcomponentsoftheimage.Wedenote
the RGB components in vector form as

and the newcolor components as


. The
sizeCCTmatrixisdenotedasM.Thisstagecan
bewrittenas:

(4)

(1)

ThegoalofusingtheCCTtransformisusuallytode
correlate the highly correlated RGB components
[7][10][15][23].TheCCTtransformisoftenfollowedby
level shifting as for example is the case in JPEG2000
[13] so that the sample range of values becomes
symmetricaroundzero.

where and have the same meaning as before,


but for subband of color component (
).
Optimalratesallocationforthesubbandscanbefound
by minimizing the expression of Equation (4) under
therateconstraint:

(5)

for some total image rate . Here downsampling


factors have been used. denotes how much the
number of samples of color component has been
reducedbydownsampling.Forexample,ifthedown
samplingisbyafactorof2horizontallyandvertically
then:

2) SubbandTransformingandQuantizing
Asubbandtransform,suchastheDCTortheDWTis
applied to each color component. The subband
coefficients of each color component are then
quantized. An independent uniform scalar quantizer
foreachsubbandisused.
3) Postprocessing
Thequantizedcoefficientsareencodedlosslessly.The
goal is to reduce the number of bits required for the
coefficients without loss of information. Techniques
such as runlength encoding, zero trees, delta
modulation and entropy coding are used here. This
stagehastobeadaptedtothesubbandtransformused.

The optimal rates under the rate constraint of (5) as


wellasnonnegativityconstraintsare:
Rbi

j 1 j j

i2Gb bi2 ( MM T )1 ii

1
i
ln
kk
a
3
3 ( MM T )1 k2GM kAct j 1 j j
kk

k 1

To derive the RD behavior of the algorithm, first the


RD of a scalar uniform quantizer needs to be
considered. Assuming that a random signal with
variance isquantizedbysuchaquantizer,itsRate
Distortionbehaviorhasbeenapproximatedby[6][20]:

(2)

where R is the rate and is a constant dependent


upon thedistribution of .Then based on (2) theRD
model of a general monochromatic subband coder
with subbandscanbeexpressedas:

R
3

(6)

(3)

Here istheMSEofsubband
,
is its variance, is its energy gain [20] and is the
rate allocated to it. Also is its sample rate, i.e., the
relativepartofthenumberofcoefficientsinitfromthe
total number of samples in the signal. is a constant
equalto
.

50

InternationalJournalofCommunications(IJC)Volume2Issue2,June2013www.seipub.org/ijc

Where b Acti with Acti denoting the

(11)

denotes statistic mean. Similarly in the


where
C1C2C3domain:

setofnonzero(oractive)ratesinthecolorcomponent
,i.e.,

(7)

Also

andusing(9)wecanexpress

The structure of this work is as follows. In the next


section the perceptual RateDistortion model is
introduced. Section Perceptually Optimized
Compression presents new color compression
algorithms optimized according to this model based
on the DCT subband transform and on the DWT.
Simulationsofthenewalgorithmsandcomparisonto
presently available algorithms are provided in this
section. Our conclusions and summary are given in
thefinalsection.

(14)
where is the row of
in column form. In a
similar fashion, the diagonal elements of
can be
recognized as the MSE distortions of the , ,
color components, given by (2) and slightly rewritten
tobecome:

where

The Perceptual R-D Model

(15)
denote the

(16)

once(15)issubstitutedfor .Nowif,forthesakeof
convenience, we denote the YCbCr color components
at each pixel as a vector
, then the
WMSEofthe colorcomponent
is:
(17)

Ascanbeseen,thisexpressionincorporatestheenergy
gainsofthesubbands aswellastheirsamplerates
. Also the visual weights
are part of the
expression,providingvaryingsignificancetodifferent
subbands of the same color component as well as
between color components. Defining the total WMSE
astheaverageWMSEoftheYCbCrcolorcomponents,
weget:

WMSE

(10)

SincetheSBTcoefficientsarelossyencoded,errorsare
introducedbetweenthereconstructedcoefficients

in the YCbCr color space and the original ones. The


error covariance matrix for the subband in the
YCbCrdomainis:

(9)

where stands for the CCT matrix from the YCbCr


color space to the C1C2C3 space. If is the CCT
matrixfromRGBtoC1C2C3and
istheRGBto
YCbCrmatrix,then:

and Rbi and

rateandvarianceofsubband ofcolorcomponent ,
respectively. Note that we continue here with the
consistent notation of a tilde for the variables related
to the C1C2C3 color space. Assuming that the
quantization errors of the three color components in
eachsubbandintheC1C2C3domainareuncorrelated,
becomesadiagonalmatrixand(14)becomes

We assume here that we are given the visual


perception weights corresponding to the subbands of
a certain subband transform (SBT) in a color space.
Such a space can be, for example, YCbCr as we have
chosen in this work. We now wish to derive an
expression for the WMSE distortion of a coder based
on the subband transform. The same coder described
in Subsection Objective RateDistortion theory of
subbandtransformcodersisassumed,sothataCCT
is applied to the RGB color components of the image
priortocodingandtheactualimagedatacompression
isperformedinanothercolorspacedenotedC1C2C3.
We denote as
the vector of the SBT
coefficients at some index in subband in the YCbCr
color space. Similarly, the vector of subband
coefficients in the C1C2C3 color space is denoted
.DuetothelinearityoftheSBT,the
followingrelationshipholds:

as:

(13)

of the YCbCr color


The MSE distortions
componentsinsubband arethediagonalelementsof
andthus:

(8)

(12)
by

1 3
WMSE ( xYCbCr )i
3 i 1

1 3 B 1
bGb wbi d bi
3 i 1 b0

and after substituting (16) for


becomes:

(18)

the expression

51

www.seipub.org/ijc

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

is the same
coefficient of at index and
coefficientafterquantizationandreconstruction.Now

3 _ 1
2
1 3 B1
WMSE bGbwbi M k2 bk eaRbk
3 i1 b0
k 1
ik

2
_ 1

1
bGb eaRbk wbi M .

3 b0
k 1
i 1

ik
B1

2
2
k bk

consider the expression

(19)

for

, where is the number of the image pixels.


UsingEquation(23),itcanbewrittenas:

Tosimplify(19)wedenote:

(24)

(20)

sothattheWMSEexpressionbecomes:

where
. Assuming zero
correlation of the quantization errors of the different
color components in each subband and between

(21)

Clearly,ifthevisualweights areallequalto1,the
WMSE expression of (21) should become the
expression for the MSE in the YCbCr domain. This
expression is given exactly by (4) with the difference
that there is to be replaced by in our case. From
thecomparisonofequations(21)and(4)weconclude
that
in that case, which means
accordingto(20)that

subbands means that


immediately

BasicOptimizationUsingtheWMSEModel

(22)

AfterderivingtheWMSEexpression,thenaturalnext
step is to use it to find the optimal rates and optimal
CCT in the WMSE sense. First we wish to minimize
the WMSE of (21), subject to the rate constraint
, resulting in the following
Lagrangian( istheLagrangemultiplier):

Astraightforwardcheckprovesthatthisisindeedthe
case.
DecorrelationoftheQuantizationErrors
In the derivation of the WMSE expression of (21) we
haveassumedthatthequantizationerrorsofthe , ,
colorcomponentsareuncorrelatedineachsubband.
It is of interest to note that the assumption in the
derivationoftheMSEexpressionof(4)wasthelackof
correlation of the quantization errors in the image
domain [2], i.e. that
and
have zero
correlation for
,
. Note that

denotes here the reconstructed component. The


question that rises is whether the assumption of zero
correlation in each subband means also zero
correlationintheimagedomain.

(25)

whichisminimizedbytheoptimalratesgivenby:

(26)

Here
(27)

Using the vector space interpretation of subband


transforms[20],wecanwriteforthecolorcomponent
:

Notethatnoconstraintsfornonnegativityoftherates
are used here, which means that high rates are
assumed. As for the optimal CCT matrix : it can be
foundbyminimizingthetargetfunction
,thatis
actuallythedenominatorofthe
in(26)aftersome
straightforwardmanipulations:

(23)

here is the color component in vector form after


lexicographic ordering.
are the SBT synthesis
vectors. Also the sum on is on all the coefficient
indices in the subband,
denotes the subband

52

follows

according to (24). But this means exactly zero


correlationofthequantizationerrorsinimagedomain.
ThusthederivationoftheWMSEexpressionof(21)is
once again consistent with the derivation of the MSE
expressionof(4).

, hence

(28)

InternationalJournalofCommunications(IJC)Volume2Issue2,June2013www.seipub.org/ijc

where b Acti with Acti defined in (7) while

We should remind here that


is a function of as
given in (20). Also the variances depend on , or
more specifically on . These variances are the
diagonalelementsofthesubband covariancematrix
intheC1C2C3imagedomain:

and i areasin(8).Asfor

YRGB E YbRGB
b

Perceptually Optimized Compression


Inthissectionwepresentageneralapproachtocolor
image compression using a subband transform with
perceptual optimization of the CCT and of the
subbandratesallocation.Thisapproachconsistsofthe
stagesdescribedinthebeginningofSectionObjective
RateDistortion theory of subband transform coders.
Thedifferenceshereisthatinthepreprocessingstage,
the perceptually optimal CCT transform is applied to
thecolorcomponentsandinthequantizationstagethe
perceptually optimal rates allocation is used. We
demonstrate this approach both for the DCT and the
DWT in the following subsections. Note that a
probabilitydistributionmodelcanbeusedfortheSBT
coefficients to improve the performance of the
algorithms with respect to runtime and compression
quality [4]. For example, the Laplacian probability
modelcanbeassumedforDCTcoefficients[9].

(30)

accordingto:

(31)

Here
is defined similarly to the
definitions in the beginning of the section of . Also
denotesthe rowofthe matrixinvectorform.
Thusthetargetfunction
canberewrittenas:
3

B 1

f ( M ) ( mk T b mk )Gb bk ,
_

k 1 b 0

bk

(32)

_ 1
wbi M .
i 1

ik
3

DCTBasedCompressionAlgorithm
Since the DCT is a subband transform, the Rate
Distortion theory of Section The Perceptual RD
Model can be applied to it. To find the DCT visual
weights we use the HVS CSF (Contrast Sensitivity
Function)curvesfortheYCbCrcolorspacethatcanbe
found, for example, in [20]. To convert the cpd (cycle
per degree) units of thesegraphs to spatialfrequency
unitsfortheDCT,theequationsproposedin[22]can
be
used.
We
consider,
for
example,
512 512
images
displayed
as
onadisplaywithdotpitchof0.25mm.
The viewingdistance isassumed to befour times the
imageheight[12],i.e.,inthisexample50cm.Similarly
we can consider
images displayed as
onabigscreenataviewingdistance
of100cm.

OptimalRateswithDownSampling
When considering potential downsampling of some
ofthecolorcomponents,therateconstraintbecomes(5)
and the Lagrangian that incorporates this constraint,
as well as constraints for the nonnegativity of the
subbandrates,is:
2
1 B 1 3
L {Rbi }, M , ,{bi } bGb bk k2e aRbk bk
3 b0 k 1
(33)

3 B 1
3 B 1
i b Rbi R bi Rbi ,
i 1 b0
i 1 b0

are the Lagrange multipliers for the new


where
constraints.Theratesthatminimize(33)are:

Rbi

Thestagesoftheproposedalgorithmareasfollows:

j j

1.

j 1

i2Gb bi2 bk

1
i
ln
k k
a
3
Act 2
Act
3 GM k k k j 1 j j
k 1

(35)

andcanalsobeexpressedusingthe matrixandthe
subband b covariance matrix in the RGB image
domain:
T
,
b E YbRGB YRGB
YbRGB YRGB
b
b

,itisgivenby:

(29)

FindtheoptimalCCT byminimizing(32).

2. Apply the CCT


to the RGB color
componentsoftheimagetoreceivethenewcolor
components , , .

(34)

3. Apply the DCT block transform to each color


component ,
.

53

www.seipub.org/ijc

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

4. Calculate the optimal rates according to (34)


substituting there the used CCT matrix and the
variancesoftheDCTsubbands.Tofindtheactive
subbands, the algorithm presented in [5] can be
used.

allocation and CCT, as well as another version that


uses optimal rates, but in the YCbCr color space. The
PSPNRmeasureusedhereis:

5. QuantizetheDCTcoefficientsusingauniform
scalar quantizer in each subband. The (optimal)
quantization steps are found using an iterative
algorithm[5].

where
for each color component is calculated
in the DWT domain in the YCbCr color space
accordingtothevisualweightssuggestedin[20].Then
theaveragePSPNRonthe3colorcomponentsistaken.
Based on our experience and results, this is a good
measureofsubjectiveimagequality.

6. Use postquantization coding similar to the


one used in JPEG. Adaptive Huffman coding is
employed and the codes are sent with the image
data.Thisstageislosslessanddoesnotaffectthe
imagedistortion.

(36)

Note that each image in the table was compressed at


the same compression rate (given in the last column
fromtheleft)foreachofthealgorithms,buttherateis
not the same for different images. The reason is that
the the rate was chosen to achieve the same objective
performance (PSNR) for the first algorithm (from the
left) to allow meaningful comparison of average
algorithmperformances.SeealsoTable2below.

It is of interest to compare the performance of this


algorithm to the other DCTbased compression
algorithms, such as the MSE optimized algorithm
proposed in [5] and JPEG. A comparison for several
imagesisgiveninTable1.Weconsiderheretheabove
algorithm with WMSE RD optimization of the rates

TABLE1Perceptuallybasedresults(PSPNR)for(fromlefttoright):TheDCTbasedWMSEoptimizedalgorithmintheYCbCrdomain;Thesame

algorithmwithoptimalCCT;TheMSEoptimizedalgorithm;JPEG.Thecompressionrateforeachimageisshownintherightcolumn.

54

WMSEAlg.inthe
optimaldomain
40.6

MSEAlg.

JPEG

Rate[bpp]

Lena

WMSEAlg.inthe
YCbCrdomain
39.4

38.9

37.6

0.76

Peppers

39.6

39.6

38.1

36.6

0.81

Baboon

42.0

42.5

39.2

36.1

1.76

Cat

41.3

43.1

41.3

39.9

1.30

Landscape

42.5

42.5

40.5

38.0

1.85

House

39.8

40.3

39.2

38.1

0.54

JellyBeans

38.5

38.6

38.3

37.5

0.47

Fruits

41.0

42.3

40.4

38.9

0.71

Sails

41.0

42.9

39.7

37.6

1.84

Image

InternationalJournalofCommunications(IJC)Volume2Issue2,June2013www.seipub.org/ijc

Monarch

39.8

40.2

38.7

37.5

1.03

Goldhill

42.9

43.4

41.9

40.6

2.17

Mean

40.7

41.5

39.7

38.0

TABLE2SameasTable1,butforPSNRinsteadofPSNR.NotethatoptimizationofPSPNR,asinducedbythehumanobserver,doesnot

necessarilymeanoptimizationofthearbitrarilyusedPSNR(seetext).
PSNR

Image

WMSEAlg.inthe
optimaldomain
30.5

MSEAlg.

JPEG

Rate[bpp]

Lena

WMSEAlg.inthe
YCbCrdomain
30.0

30.7

29.7

0.76

Peppers

30.0

30.1

30.5

29.3

0.81

Baboon

30.0

29.0

30.5

26.5

1.76

Cat

30.0

29.6

31.3

29.5

1.30

Landscape

30.0

30.1

30.3

25.9

1.85

House

30.0

30.2

30.3

29.5

0.54

JellyBeans

30.0

30.3

30.6

29.7

0.47

Fruits

30.0

29.8

30.6

30.6

0.71

Sails

30.0

29.7

30.6

28.9

1.84

Monarch

30.0

29.6

30.6

29.4

1.03

Goldhill

30.0

30.2

31.7

29.2

2.17

Mean

30.0

29.9

30.7

28.9

It can be concluded from the table that the WMSE


optimized algorithm with the optimal CCT achieves
the highestPSPNR, which is1.8dB higher onaverage
thanthePSPNRoftheMSEoptimizedalgorithmand
3.5dB above JPEG. The use of the optimal CCT in
WMSEsenseincreasestheperformancebyalmost1dB
(0.8dB) on average when perceptually optimal rates
areemployed.Anothercomparisonofinterestisofthe
standardorobjectivedistortionsofthealgorithm,i.e.,

the PSNR as presented in Table 2. As expected, the


MSEoptimizedalgorithmissuperiorhere,butwhatis
perhaps less intuitive is the fact that the use of the
optimal CCT slightly decreases the PSNR, indicating
thatPSNRandPSPNRaredifferentmeasures.Ascan
beseenintheexamplesofFigs.12below,thehuman
observer judgement is closely related to the PSPNR,
not the PSNR. Despite this both WMSE algorithms
have MSE performance superior to JPEG with a gain

55

www.seipub.org/ijc

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

of 1.1dB in the PSNR without using the optimal CCT


and slightly less (1dB) with the optimal CCT. We
conclude this section by presenting a visual
comparison of the algorithms as can be seen in Fig. 1
fortheLenaimageandinFig.2fortheBaboonimage.

It can be seen that the WMSE algorithm provides the


bestresultsforbothimagesvisually,whiletheresults
of the MSE algorithm are slightly less pleasing to the
eye.YetbothalgorithmsaresuperiortoJPEG.

FIG.1COMPRESSIONRESULTSFORLENAAT0.72BPP.ORIGINALIMAGE(TOPLEFT);IMAGECOMPRESSEDBYTHEWMSE
OPTIMIZEDALGORITHM(TOPRIGHT,PSPNR=40.4DB);IMAGECOMPRESSEDBYJPEG(BOTTOMLEFT,PSPNR=37.7DB);IMAGE
COMPRESSEDBYTHEMSEOPTIMIZEDALGORITHM(BOTTOMRIGHT,PSPNR=39.3DB).ASEXPECTED,THEWMSEALGORITHM
OUTPERFORMSTHEOTHERMETHODS,ESPECIALLYINTHEMARKEDAREAS.

FIG.2COMPRESSIONRESULTSFORTHEBABOON(ZOOMEDIN)AT0.88BPP.ORIGINALIMAGE(TOPLEFT);IMAGECOMPRESSEDBY
THEWMSEOPTIMIZEDALGORITHM(TOPRIGHT,PSPNR=36.9DB);IMAGECOMPRESSEDBYJPEG(BOTTOMLEFT,PSPNR=33.6DB);
IMAGECOMPRESSEDBYTHEMSEOPTIMIZEDALGORITHM(BOTTOMRIGHT,PSPNR=35.6DB).HEREAGAIN,THEWMSE
ALGORITHMOUTPERFORMSTHEOTHERMETHODS.

56

InternationalJournalofCommunications(IJC)Volume2Issue2,June2013www.seipub.org/ijc

5. Quantize the DWT coefficients by a uniform


quantizer with a central deadzone in each
subbandsimilartotheoneusedinJPEG2000PartI
[8].Useoptimalquantizationsteps.

DWTBasedCompressionAlgorithm
When the DWT is considered, there are quite a few
options for the wavelet filter bank to be used for the
decomposition. We have chosen the Daubechies 9/7
filter bank, but obviously other choices can be
consideredaswell.Notiling[13]isused.Thechoiceof
the visual weights is according to [20]. The stages of
theproposedalgorithmare:
1.

6. Use the postquantization coding of the EZW


algorithm[19] onthequantizedsubband
coefficients. This stage is lossless and includes bit
plane coding using zero trees. The bit plane
coding is split into two passes (dominant and
subordinate) and a separate arithmetic coder is
employedforeachpass.

FindtheoptimalCCT byminimizing(32).

2. Apply the CCT


to the RGB color
componentsoftheimagetoreceivethenewcolor
components , , .

Itisinterestingtocomparetheproposedalgorithmto
JPEG2000. We have considered the JPEG2000
implementationusingtheJasPersoftwarepackage[24]
andanotherversionoftheimplementationwithfixed
visual weighting at subband level using the CSF
weights of[20]. The visual results for the Lenaimage
canbeseeninFig.3.ThePSNRresultshereare29.5dB
for the proposed WMSE optimized algorithm, 28.6dB
for JPEG2000 (original JasPer implementation) and
28.5dB for JPEG2000 with CSF weights. We conclude
that the use of CSF weights, that affects the tier2
codingstageoftheJPEG2000algorithm,decreasesthe
PSNR,butslightlyimprovesthevisualperformance.

3. ApplytheDWTtreedecompositionuptothe
required depth of the tree (3, 4, 5 or higher
according toimagesize) to each colorcomponent
,
.
4. Calculate the optimal rates according to (34)
substituting there the used CCT matrix and the
variances, sample rates and energy gains of the
DWT subbands. The determination of the active
subbands is the same as for the DCTbased
algorithmoftheprevioussubsection.

FIG.3COMPRESSIONRESULTSFORLENAAT0.52BPP.ORIGINALIMAGE(TOPLEFT);IMAGECOMPRESSEDBYTHEDWTBASED
WMSEOPTIMIZEDALGORITHM(TOPRIGHT,PSPNR=19.7DB);IMAGECOMPRESSEDBYJPEG2000(BOTTOMLEFT,PSPNR=19.1DB);
IMAGECOMPRESSEDBYJPEG2000WITHCSFWEIGHTS(BOTTOMRIGHT,PSPNR=19.2DB).
ALSOHERE,THEWMSEALGORITHMISSUPERIORTOTHEREST.

57

www.seipub.org/ijc

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

andmustachetexturesintheCatimagearelost.Inthe
case of the Peppers image, the compression result of
JPEG2000 is less pleasing to the eye due to the color
artifacts introduced. Fig. 6 further demonstrates the
loss of spatial details in the case of JPEG2000
compression of the Sails image, the blurring of the
contoursintheMonarchimageandbotheffectsinthe
Goldhill image (see the top marked area for the
blurred contour effect and, for example, the bottom
left marked area for the loss of spatial details).
Furthermore, color artifacts are introduced by
JPEG2000 in the Goldhill image as indicated, for
instance,inthemarkedareainthecenteroftheimage.

Also the proposed algorithm produces an image that


ismuchmorepleasingtotheeyethanJPEG2000.
Similar results can be seen in the comparison of the
proposed algorithm and JPEG2000 for other example
imagesin Figs.4,5and6.In Fig.4 theloss ofspatial
details at high frequencies in the Landscape and
House images as well as the introduction of false
contoursintheJellyBeansimageshouldbenotedfor
JPEG2000. These effects can be seen in the marked
regions. In Fig. 5 once again the superiority of the
WMSEoptimizedalgorithmonJPEG2000canbeseen
in regions of high frequency details as demonstrated
bytheFruitsandCatimages.Forexample,thedetails
oftheappletextureintheFruitsimageandofthefur

FIG.4LANDSCAPE,HOUSEANDJELLYBEANSIMAGESFROMLEFTTORIGHT:ORIGINAL,COMPRESSEDBYTHEWMSE
ALGORITHM(WMSEALG.)ANDCOMPRESSEDBYJPEG2000.
PSPNRFORTHELANDSCAPEIMAGE:17.1DB(WMSEALG.)AND15.7DB(JPEG2000).
PSNR:28.7DB(WMSEALG.)AND25.3DB(JPEG2000)AT0.97BPP.
PSPNRFORTHEHOUSEIMAGE:19.4DB(WMSEALG.)AND19.0DB(JPEG2000).
PSNR:31.2DB(WMSEALG.)AND33.1DB(JPEG2000)AT0.68BPP.
PSPNRFORTHEJELLYBEANSIMAGE:18.8DB(WMSEALG.)AND18.2DB(JPEG2000).
PSNR:32.3DB(WMSEALG.)AND32.1DB(JPEG2000)AT0.48BPP.
INTHEONLYCASEWHERETHEPSNROFJPEG2000ISHIGHERTHANTHENEWALGORITHM(HOUSE),THEPSPNRRESULT
SUPPORTSTHEFACTTHATVISUALLYTHENEWALGORITHMPROVIDESSUPERIORRESULTS.

58

InternationalJournalofCommunications(IJC)Volume2Issue2,June2013www.seipub.org/ijc

FIG.5FRUITS,CATANDPEPPERSIMAGESFROMLEFTTORIGHT:ORIGINAL,COMPRESSEDBYTHEWMSEALGORITHM(WMSE
ALG.)ANDCOMPRESSEDBYJPEG2000.
PSPNRFORTHEFRUITSIMAGE:22.2DB(WMSEALG.)AND21.1DB(JPEG2000).
PSNR:30.0DB(WMSEALG.)AND29.0DB(JPEG2000)AT1.34BPP.
PSPNRFORTHECATIMAGE:17.0DB(WMSEALG.)AND16.2DB(JPEG2000).
PSNR:28.9DB(WMSEALG.)AND26.9DB(JPEG2000)AT0.63BPP.
PSPNRFORTHEPEPPERSIMAGE:20.3DB(WMSEALG.)AND19.3DB(JPEG2000).
PSNR:30.8DB(WMSEALG.)AND30.7DB(JPEG2000)AT0.86BPP.
ASCANBESEEN,PSNRANDPSPNRRESULTSARESUPERIORFORTHENEWALGORITHMCOMPAREDTOJPEG2000.ITISALSO
OBSERVEDVISUALLYEXAMPLESAREINDICATEDINTHEMARKEDAREAS.

59

www.seipub.org/ijc

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

FIG.6SAILS(ZOOMEDIN),MONARCH(ZOOMEDIN)ANDGOLDHILLIMAGESFROMLEFTTORIGHT:ORIGINAL,COMPRESSED
BYTHEWMSEALGORITHM(WMSEALG.)ANDCOMPRESSEDBYJPEG2000.
PSPNRFORTHESAILSIMAGE:19.2DB(WMSEALG.)AND18.0DB(JPEG2000).
PSNR:28.9DB(WMSEALG.)AND26.6DB(JPEG2000)AT0.70BPP.
PSPNRFORTHEMONARCHIMAGE:19.9DB(WMSEALG.)AND19.6DB(JPEG2000).
PSNR:29.0DB(WMSEALG.)AND28.8DB(JPEG2000)AT0.56BPP.
PSPNRFORTHEGOLDHILLIMAGE:17.6DB(WMSEALG.)AND16.6DB(JPEG2000).
PSNR:27.0DB(WMSEALG.)AND24.5DB(JPEG2000)AT0.59BPP.
ONCEAGAIN,THEPSNRANDPSPNRRESULTSARESUPERIORFORTHENEWALGORITHMCOMPAREDTOJPEG2000(SEE
EXAMPLESINDICATEDINTHEMARKEDAREAS).

stageusingaCCT,followedbyasubbandtransform,
quantizationstage,andlosslessentropyencoding.The
algorithms are optimized with regard to the color
component transform in the preprocessing stage of
the compression as well as the quantization tables
usedinthecodingstage,bothwithrespecttoWMSE.
TheproposedDCTbasedalgorithmoutperformsboth
JPEG and the corresponding MSE optimized
algorithm. The DWTbased algorithm, as expected,
achieveshighercompressionratiosforthesameimage
quality than DCTbased techniques. We demonstrate
in this work that even when a relatively basic
algorithm is used in the postprocessing stage
(introducedforEZW),superiorresultsareobtainedby

Summary
A perceptuallybased model for the RateDistortion
functionofcolorsubbandcodershasbeenintroduced.
ThenewmodelapproximatestheWMSEdistortionof
an image in a given color space, such as YCbCr. This
distortion is then minimized to achieve perceptual
optimizationofthecompression.Whentheweightsin
the WMSE calculation are taken based on the CSF
curves of the human visual system, better
correspondence to image quality assessment by the
humaneyeisachieved.
Based on the RateDistortion model, new algorithms
have been introduced consisting of a preprocessing

60

InternationalJournalofCommunications(IJC)Volume2Issue2,June2013www.seipub.org/ijc

the proposed algorithm when compared to other


DWTbasedalgorithms,suchasJPEG2000.Thisholds
even if the same WMSE distortion is used in both
JPEG2000andtheproposedalgorithm.Ourconclusion
is that based on the new perceptual RateDistortion
model, optimized compression algorithms can be
designed with compression results superior to
presentlyavailabletechniques.

[8] JPEG 2000 Part I: Final Draft International Standard


(ISO/IECFDIS154441),NCITSISO/IECJTC1/SC29/WG1
N1855(Aug.2000).
[9] Lam, E. Y. and Goodman, J. W. A mathematical
analysis of the DCT coefficient distributions for
images. IEEE Trans. on Image Processing 9 (2000):
16611666.

ACKNOWLEDGMENT

[10] Limb J. O. and Rubinstein C.B. Statistical Dependence

This research was supported in part by the HASSIP


Research Program HPRNCT200200285 of the
EuropeanCommission,andbytheOllendorffMinerva
Center.MinervaisfundedthroughtheBMBF.

Between Components of A Differentially Quantized


ColorSignal.IEEETrans.onCommunicationsCom20
(Oct.1971):890899.
[11] Mannos, J. and Sakrison, D. The effects of a visual

REFERENCES

fidelitycriterionoftheencodingofimages.IEEETrans.
onInformationTheory20(Jul.1974):525536.

[1] Faugeras,O.D.DigitalColorImageProcessingWithin

[12] Ngan, K. N., Leong, K. S. and Singh, H. Adaptive

theFrameworkofaHumanVisualModel.IEEETrans.

cosine transform coding of images in perceptual

on Acoustics Speech and Signal Processing ASSP27

domain. IEEE Trans. on Acoustics, Speech, and Signal

(1979):380393.

Processing37(Nov.1989):17431750.

[2] Fiorucci,F.,Baruffa,G.andFrescura,F. Objectiveand

[13] Rabbani, M. and Joshi, R. An overview of the JPEG

subjective quality assessment between JPEG XR with


overlap

and

JPEG

2000.",

Journal

of

2000 still image compression standard. Signal

Visual

Processing:ImageCommunication17(2002):348.

Communication and Image Representation 23 (Aug.

[14] Rao, K. R. and Yip, P. Discrete cosine transform:

2012):835844.

algorithms, advantages, applications. Academic Press,

[3] Gershikov, E. and Porat, M. A RateDistortion

1990.

ApproachtoOptimalColorImageCompression.Proc.

[15] Richter, T. and Simon, S. On the JPEG 2000 ultrafast

ofEUSIPCO,Florence,Italy,2006.

mode.Proc.ofICIP(Oct.2012):25012504.

[4] Gershikov, E. and Porat, M. Data Compression of

[16] Richter,T.SpatialConstantQuantizationinJPEGXRis

Color Images using a Probabilistic Linear Transform

Nearly Optimal. Proc. Of the Data Compression

Approach, in LNCS #4310, T. Boyanov, S. Dimova, K.

Conference(March2010):7988.

Georgiev, G. Nikolov (Eds.), 582589, SpringerVerlag

[17] Roterman, Y. and Porat, M. Color image coding using

BerlinHeidelberg,2007.

regional correlation of primary colors. Image and

[5] Gershikov, E. and Porat, M. On color transforms and

VisionComputing25(2007):637651.

bitallocationforoptimalsubbandimagecompression.

[18] Satt, A. and Malah, D. Design of Uniform DFT Filter

SignalProcessing:ImageCommunication22(2007):118.

BanksOptimizedforSubbandCodingofSpeech,IEEE

[6] Gersho, A. and Gray, R. M. Vector Quantization and

Trans. on Acoustics, Speech and signal Processing 37

SignalCompression,Boston,MA:Kluwer,1992,ch.2.

(Nov.1989):16721679.

[7] GoffmanVinopal, L. and Porat, M. Color image

[19] Shapiro, J. M. Embedded Image Coding Using

compression using intercolor correlation. Proc. of

Zerotrees of Wavelet Coefficients, IEEE Trans. on

IEEEICIP(2002):II353II356.

SignalProcessing41(1993):33453462.

61

www.seipub.org/ijc

International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

[20] Taubman,D.S.andMarcellin,M.W.JPEG2000:image

[23] Yamaguchi, H. Efficient Encoding of Colored Pictures

compression, fundamentals, standards and practice,

inR,G,BComponents.Trans.onCommunications32

KluwerAcademicPublishers,2002.

(Nov.1984):12011209.
[24] http://www.ece.uvic.ca/mdadams/jasper

[21] Wallace, G. K. The JPEG still picture compression


standard.IEEETrans.ConsumerElectronics38(1992):

Evgeny Gershikov received his


Ph.D. in Electrical Engineering from
Technion Israel Institute of
Technology in Haifa, Israel in 2011.
His areas of interest are Signal and
Image Processing, Color Processing
and Vision, Computer Vision,
Pattern Recognition and Speech

xviiixxxiv.
[22] Wang, C. Y., Lee, S. M. and Chang, L. W. Designing
JPEG quantization tables based on human visual
system. Signal Processing: Image Communication 16
(2001):501506.
Recognition.

62

Você também pode gostar