Escolar Documentos
Profissional Documentos
Cultura Documentos
Research Article
An Effective Transform Unit Size Decision Method for
High Efficiency Video Coding
Chou-Chen Wang,1 Chi-Wei Tung,1 and Jing-Wein Wang2
1
2
1. Introduction
With the rapid development of electronic technology, the
panels of 4 2 (or 8 4) high-resolution will become
the main specification of large size digital TV in the future.
However, the currently state-of-the-art video coding standard H.264/AVC is difficult to support the video applications
of high definition (HD) and ultrahigh definition (UHD)
resolution [1]. Therefore, the ITU-T video coding experts
group (VCEG) and ISO/IEC moving pictures expert group
(MPEG) through their joint collaborative team on video
coding (JCT-VC) have developed the newest high efficiency
video coding (HEVC) for video compression standard to
satisfy the UHD requirement in 2010 [2], and the first version
of HEVC was approved as ITU-T H.265 and ISO/IEC 23008-2
by JCT-VC in January 2013 [35].
HEVC can achieve an average bit rate decrease of 50%
in comparison with H.264/AVC high profile while still
maintaining the same subjective video quality [6]. This is
CU
64 64 depth 0
TU
32 32 depth 1
16 16 depth 2
Depth
Inter PU
Intra PU
TU
0
64 64, 64 32, 32 64 64 64
32 32, 16 16
1
32 32, 32 16, 16 32 32 32 32 32, 16 16, 8 8
2
16 16, 16 8, 8 16
16 16
16 16, 8 8, 4 4
3
8 8, 8 4, 4 8
8 8, 4 4
8 8, 4 4
8 8 depth 3
4 4 dsepth 4
(1)
(2)
where all of the above matrices are with the size of . The
symbol stands for a scalar multiplication. PF1 and PF2 are
postscaling factors defined as follows:
PF1 : = ( + (1 ( + 10))) ( + 9) ,
PF2 : = ( + (1 ( + 5))) ( + 6) ,
(3)
(4)
LCU
CU splitting
Split flag = 1
Depth = 0, N = 32
Skip mode
2N
Intermodes
Inter N 2N
Inter 2N N
Inter N N
Split flag = 1
0
CU0
2N
2N
..
.
SCU
..
.
Depth = 3, N = 8
Intramodes
Split flag = 0
CU0
2N
Intra 2N 2N
2N
Depth = 1, N = 16
Inter 2N 2N
0
CU0
Intra N N
2N
Figure 2: Recursive CU structure, PU splitting for skip, and inter- and intramodes.
(6)
e
e
TU = 16 16
TU = 8 8
TU = 32 32
a
(7)
QP
22
27
32
37
Average
=0
98.5
94.0
96.5
97.8
96.7
= 32
=1
=2
62.2
49.1
65.0
58.2
81.0
68.9
80.8
65.9
72.3
60.5
=3
40.1
44.8
54.7
57.6
49.3
=8
=1
=2
84.8
75.4
88.8
81.0
90.7
82.0
94.3
85.8
89.7
81.1
=0
94.4
98.0
99.1
99.6
97.8
=3
71.9
77.2
79.3
80.3
77.2
=0
81.5
84.5
85.8
88.7
85.1
= 32
=1
=2
62.5
53.5
63.3
49.8
77.1
53.9
73.7
54.0
69.2
52.8
=3
40.4
38.6
40.6
41.3
40.2
=8
=1
=2
90.1
83.6
92.0
83.8
94.1
86.4
94.8
87.4
92.8
85.3
=0
92.1
95.1
97.2
98.6
95.8
=3
81.0
80.3
81.6
81.2
81.0
Depthpred = ,
(8)
=0
(a)
(b)
Figure 4: Test sequences (2560 1600). (a) Traffic and (b) people on street.
Col
Frame: t 1
Frame: t
Sequences
Traffic
People on street
Kimono
Park scene
Cactus
Basketball drive
BQ terrace
Average (%)
Normalized
Col
56.24
72.87
40.93
59.03
52.49
49.20
60.22
55.85
0.25
L
60.17
68.08
43.51
60.69
62.39
58.50
66.38
59.96
0.26
L-U
53.21
63.40
34.80
54.38
55.58
44.96
56.84
51.88
0.24
U
56.53
66.70
39.71
62.71
60.78
49.98
60.66
56.72
0.25
5. Experiment Results
For the performance evaluation, we assess the total execution
time of the proposed method in comparison to those of
the HEVC HM 8.1 [18, 19] and the NNZ-ETDA (threshold
= 3) in order to confirm the reduction in computational
complexity. The coding performance is evaluated based on
Bitrate, PSNR, and Time, which are defined as follows:
Bitrate =
Bitratemethod BitrateHM8.1
100%,
BitrateHM8.1
(9)
TimeHM8.1 Timemethod
100%.
TimeHM8.1
TU start
[depthmin , depthmax ]
4 predicted RQT?
[1, 4]
No
Yes
N1
Depthpred = i di
i=0
Depthpred 1
Classify current
TU into five types
Type 0
[1, 1]
Type 1
[1, 2]
Type 2
[1, 3]
Type 3
[2, 4]
Type 4
[3, 4]
Calculate RDO
Best TU partition
Depthpred
Depth range
[depthmin , depthmax ]
TU size
Depthpred 1
1 < depthpred 1.5
1.5 < depthpred 2.5
2.5 < depthpred 3.5
Depthpred > 3.5
[1, 1]
[1, 2]
[1, 3]
[2, 4]
[3, 4]
32 32
32 3216 16
32 328 8
16 164 4
8 84 4
RQT start
Initial:
Bool bCheckFull;
Bool bCheckSplit;
Bool flagETDA = 0;
No
bChechFull = 1
and
TU depth depthmin
Yes
Transfrom and quantization
Compute RD cost
Count NNZ
NNZ threshold
Yes
No
flagETDA = 1
flagETDA = 1
and
bchecksplit = 1
bCheckSplit = 1
and
TU depth < depthmax
No
Yes
No
uiPart = 0
Yes
bCheckSplit = 0
RQT start
uiPart ++
uiPart < 4
No
Yes
Compare RDcost
RDcost ++
Best TU partition
Test
sequences
Total frames
Quantization
parameter
Software
Scenario
78.27%
72.48%
81.48%
17.92%
15.54%
61.26%
11.99%
8.29%
22
27
32
22
37
27
32
37
QP
QP
(a)
(b)
Figure 8: Average time improving ratio using different QP values: (a) proposed versus HM 8.1 and (b) proposed versus NNZ-EDTA.
HM 8.1
Proposed
Comparison
Bit rate
(kb/s)
PSNR
(dB)
Time
(sec)
Bit rate
(kb/s)
PSNR
(dB)
Time
(sec)
Bit
rate
(%)
Traffic
People on street
Kimono
Park scene
Cactus
Basketball drive
BQ terrace
15823
34367
6174
8724
23165
13719
44713
41.75
40.29
41.88
40.08
38.49
39.82
38.09
10329
14881
7511
5465
5997
6072
6478
15644
34491
6233
8669
23195
14175
44540
41.66
40.19
41.86
40.01
38.46
39.80
38.03
3152
7201
2860
1936
2306
2134
2926
1.13
0.36
0.96
0.63
0.13
3.32
0.39
0.09
0.10
0.02
0.07
0.03
0.02
0.06
69.49
51.61
61.93
64.58
61.56
64.85
54.83
Average
20955
40.06
8105
20992
40.00
3216
0.37
0.06
61.26
PSNR
(dB)
Time
(%)
NNZ-ETDA
Proposed
Comparison
Bit rate
(kb/s)
PSNR
(dB)
Time
(sec)
Bit rate
(kb/s)
PSNR
(dB)
Time
(sec)
Bit
rate
(%)
Traffic
People on street
Kimono
Park scene
Cactus
Basketball drive
BQ terrace
15525
34076
6152
8603
22833
13667
44175
41.68
40.23
41.87
40.02
38.47
39.81
38.04
3671
9024
3848
2245
2725
2608
3571
15644
34491
6233
8669
23195
14175
44540
41.66
40.19
41.86
40.01
38.46
39.80
38.03
3152
7201
2860
1936
2306
2134
2926
0.77
1.22
1.32
0.77
1.59
3.72
0.83
0.02
0.04
0.01
0.01
0.01
0.01
0.01
14.15
20.20
25.69
13.78
15.39
18.19
18.05
Average
20719
40.02
3956
20992
40.00
3216
1.46
0.01
17.92
PSNR
(dB)
Time
(%)
12000
16000
10000
14000
12000
8000
10000
6000
8000
4000
6000
4000
2000
2000
0
22
27
32
37
Proposed
HM8.1
NNZ-ETDA
(a)
0
22
27
32
37
Proposed
HM8.1
NNZ-ETDA
(b)
Figure 9: The TU processing time with different QP: (a) traffic and (b) people on street.
We will further exploit the characteristics of high temporalspatial correlation existing in 3D video to develop an effective
3D-HEVC encoding method in our further works.
Conflict of Interests
The authors declare that there is no conflict of interests
regarding the publication of this paper.
Acknowledgments
This work was supported by the National Science Council,
Taiwan, under Contract no. NSC 100-2221-E-214-048 and IShou University, Taiwan, under Contract no. ISU 103-01-05A.
References
[1] Advanced video coding for generic audiovisual services, ITUT Rec. H. 264 and ISO/IEC, 14496 10, ITU-T and ISO/IEC, 2010.
[2] Joint call for proposals on video compression technology,
Kyoto, Japan, document VCEG-AM91 of ITU-T Q6/16 and
N1113 of JTC1/SC29/WG11, January 2010.
[3] J. Ohm, W. J. Han, and T. Wiegand, Overview of the high
efficiency video coding (HEVC) standard, IEEE Transactions
on Circuits and Systems for Video Technology, vol. 22, no. 12, pp.
16491668, 2012.
[4] B. Bross, W. J. Han, J. R. Ohm, G. J. Sullivan, and T. Weingand,
High efficiency video coding (HEVC) text specification draft
8, JCT-VC Document, JCTVC-J1003, July 2012.
[5] High Efficiency Video Coding, Rec. ITU-T H. 265 and ISO/IEC,
23008-2, January 2013.
[6] B. Bross, W. J. Han, J. R. Ohm, G. J. Sullivan, Y. K. Wang, and
T. Wiegand, High efficiency video coding (HEVC) text specification draft 10 (for FDIS & Consent), Geneva, Switzerland,
document JCTVC-L1003 of JCT-VC, July 2013.
[7] X. L. Shen, L. Yu, and J. Chen, Fast coding unit size selection
for HEVC based on Bayesian decision rule, in Proceedings of the
Picture Coding Symposium (PCS 12), pp. 453456, May 2012.
[8] Y. Zhang, H. Wang, and Z. Li, Fast coding unit depth decision
algorithm for interframe coding in HEVC, in Proceedings of the
Data Compression Conference, pp. 5362, March 2013.
[9] H. S. Lee, K. Y. Kim, T. R. Kim, and G. H. Park, Fast encoding
algorithm based on depth of coding-unit for high efficiency
video coding, Optical Engineering, vol. 51, no. 6, 2013.
[10] J. Kim, S. Jeong, S. Cho, and J. S. Choi, Adaptive Coding Unit
early termination algorithm for HEVC, in Proceedings of the
IEEE International Conference on Consumer Electronics (ICCE
12), pp. 261262, January 2012.
[11] L. Shen, Z. Liu, X. Zhang, W. Zhao, and Z. Zhang, An
effective CU size decision method for HEVC encoders, IEEE
Transactions on Multimedia, vol. 15, no. 2, pp. 465470, 2013.
[12] K. Lee, H. J. Lee, J. Kim, and Y. Choi, A novel algorithm for zero
block detection in high efficiency video coding, IEEE Journal
of Selected Topics in Signal Processing, vol. 7, no. 6, pp. 11241134,
2013.
[13] G. Tian and S. Goto, An optimization scheme for quadtreestructured prediction and residual encoding in HEVC, in
Proceedings of the IEEE Asia Pacific Conference on Circuits and
Systems, pp. 547550, December 2012.
[14] Y. Shi, Z. Gao, and X. Zhang, Early TU split termination in
HEVC based on quasi-zero-block, in Proceedings of the 3rd
International Conference on Electric and Electronics, pp. 450
454, January 2013.
[15] K. Choi and E. S. Jang, Early TU decision method for fast video
encoding in high efficiency video coding, Electronics Letters,
vol. 48, no. 12, pp. 971973, 2012.
[16] T. Wiegand, G. J. Sullivan, G. Bjntegaard, and A. Luthra,
Overview of the H.264/AVC video coding standard, IEEE
Transactions on Circuits and Systems for Video Technology, vol.
13, no. 7, pp. 560576, 2003.
[17] E. G. Richardson, H.264 and MPEG-4 Video Compression: Video
Coding For Next-Generation Multimedia, John Wiley & Sons,
2003.
[18] Reference software HM8.1, https://hevc.hhi.fraunhofer.de/svn/
svn HEVCSoftware/branches/.
[19] Common test conditions and software reference configurations, JCTVC-E700, Joint Collaborative Team on Video Coding (JCT-VC) 5th Meeting, Geneva, Switzerland, March 2011.
10
[20] G. Tech, K. Wegner, Y. Chen, and S. Yea, 3D-HEVC draft text 1,
Joint Collaborative Team on 3D Video Coding Extensions (JCT3V) Document JCT3V-E1001, 5th Meeting, Vienna, Austria,
August 2013.
[21] Q. Zhang, L. Tian, L. Huang, X. Wang, and H. Zhu, Rendering
distortion estimation model for 3D high efficiency depth coding, Mathematical Problems in Engineering, vol. 2014, Article
ID 940737, 7 pages, 2014.
[22] Y. Wu and S.-W. Ko, A simple and high performing rate control
initialization method for H.264 AVC coding based on motion
vector map and spatial complexity at low bitrate, Mathematical
Problems in Engineering, vol. 2014, Article ID 595383, 11 pages,
2014.
Advances in
Operations Research
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Advances in
Decision Sciences
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Journal of
Applied Mathematics
Algebra
Volume 2014
Journal of
The Scientific
World Journal
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
International Journal of
Differential Equations
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Volume 2014
Advances in
Combinatorics
Hindawi Publishing Corporation
http://www.hindawi.com
Mathematical Physics
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Journal of
Complex Analysis
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
International
Journal of
Mathematics and
Mathematical
Sciences
Mathematical Problems
in Engineering
Journal of
Mathematics
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Volume 2014
Volume 2014
Volume 2014
Discrete Mathematics
Journal of
Volume 2014
Discrete Dynamics in
Nature and Society
Journal of
Function Spaces
Hindawi Publishing Corporation
http://www.hindawi.com
Abstract and
Applied Analysis
Volume 2014
Volume 2014
Volume 2014
International Journal of
Journal of
Stochastic Analysis
Optimization
Volume 2014
Volume 2014