Você está na página 1de 43

Se p te m b e r 2 7 , 2 0 0 5 Pa g e 1

Jo h n G . A p o sto l o p o ul o s
Vi deo
Codi ng
Vi deo Compr essi on:
Pr i nci ples, Pr acti ce, and Standar ds
John G . A p ostolop oulos
Strea mi ng M ed i a Systems G roup
HP La b s, Pa lo A lto, CA
j a p os@hp l. hp . com
Jo h n G . A p o sto l o p o ul o s
Pa g e 2
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Goals f or Today s Talk
G oa ls:
Provi d e b ri ef overvi ew of i ma g e & vi d eo cod i ng a nd key
sta nd a rd s
A ssume no p ri or know l ed g e of comp ressi on
Exp la i n the a lp ha b ets soup of cod i ng sta nd a rd s, a cronyms,
etc
Jo h n G . A p o sto l o p o ul o s
Pa g e 3
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Outli ne
W hy d o w e need comp ressi on?
Ima g e comp ressi on
Color p rocessi ng , DCT, etc
Vi d eo comp ressi on
Exp loi t temp ora l d i mensi on of vi d eo si g na l Pred i cti on
G eneri c (M PEG -typ e) vi d eo cod er a rchi tecture
Current vi d eo comp ressi on sta nd a rd s
W ha t a re they?
W ha t d o the sta nd a rd s sp eci fy?
Bri ef overvi ew of H. 2 6 4 / M PEG -4 A VC vi d eo cod i ng sta nd a rd
Jo h n G . A p o sto l o p o ul o s
Pa g e 4
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Moti vati on f or Compr essi on:
Ex ample of HDTV Vi deo Si gnal
Prob l em:
Ra w vi d eo conta i ns a n i mmense a mount of d a ta
Communi ca ti on a nd stora g e ca p a b i li ti es a re li mi ted & exp ensi ve
Exa mp le HDTV vi d eo si g na l:
7 2 0 x1 2 8 0 p i xels/ fra me, p rog ressi ve sca nni ng a t 6 0 fra mes/ s:
2 0 M b / s HDTV cha nnel b a nd w i d th
Req ui res comp ressi on b y a fa ctor of 7 0
(eq ui va lent to 0 . 3 5 b i ts/ p i xel)
s Gb
color
bits
pixel
colors frames
frame
pixels
/ 3 . 1
8 3
sec
60 1280 720

,
_

,
_

,
_

,
_


Jo h n G . A p o sto l o p o ul o s
Pa g e 5
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Ex ample Vi deo Appli cati ons
and Requi r ed Compr essi on Rati os
~ 1 9 0
3 8 4 kb / s
(3 G )
7 3 M b / s 2 4 3 0 2 8 8 x 3 5 2 CIF
~ 1 4 0
6 4 kb / s
(3 G )
9 M b / s 2 4 1 5 1 4 4 x 1 7 6 Q CIF
~ 5 0
5 M b / s
(DVD)
2 5 0 M b / s 2 4 3 0 4 8 0 x 7 2 0 SDTV
~ 7 0
2 0 M b / s
(HDTV)
1 . 3 G b / s 2 4 6 0 7 2 0 x 1 2 8 0 HDTV
Req ui red
comp ressi on
ra ti o
Cha nnel
b i t ra te
Ra w b i t
ra te
b i ts/ p i xel Fp s Pi xels/ fra me Vi d eo
Forma t
W i reless vi d eo soon
W i reless vi d eo tod a y
Lots of comp ressi on
req ui red ! ! !
Jo h n G . A p o sto l o p o ul o s
Pa g e 6
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Achi evi ng Compr essi on
Red uce red und a nt or rep ea ted i nforma ti on
Temp ora l: A d j a cent fra mes hi g hly correla ted
Sp a ti a l: N ea rb y p i xels a re often correla ted
Color sp a ce: RG B comp onents a re correla ted
Rela ti vely stra i g htforw a rd to exp loi t
Red uce i rreleva nt i nforma ti on
Percep tua lly uni mp orta nt i nforma ti on, i . e. w ha t the
huma n vi sua l system (HVS) ca n not see
Di ffi cult to mod el a nd exp loi t
Jo h n G . A p o sto l o p o ul o s
Pa g e 7
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Spati al and Tempor al Redundancy
W hy ca n vi d eo b e comp ressed ?
Vi d eo conta i ns much sp a ti a l a nd temp ora l red und a ncy.
Sp a ti a l red und a ncy: N ei g hb ori ng p i xels a re si mi la r
Temp ora l red und a ncy: A d j a cent fra mes a re si mi la r
Comp ressi on i s a chi eved b y exp loi ti ng the sp a ti a l a nd temp ora l
red und a ncy i nherent to vi d eo
Jo h n G . A p o sto l o p o ul o s
Pa g e 8
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Outli ne
W hy d o w e need comp ressi on?
Ima g e comp ressi on
Color p rocessi ng , DCT, etc
Vi d eo comp ressi on
Exp loi t temp ora l d i mensi on of vi d eo si g na l Pred i cti on
G eneri c (M PEG -typ e) vi d eo cod er a rchi tecture
Current vi d eo comp ressi on sta nd a rd s
W ha t a re they?
W ha t d o the sta nd a rd s sp eci fy?
Bri ef overvi ew of H. 2 6 4 / M PEG -4 A VC vi d eo cod i ng sta nd a rd
Jo h n G . A p o sto l o p o ul o s
Pa g e 9
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Image Compr essi on:
Over vi ew
Cod i ng a n i ma g e (si ng le fra me):
RG B to YUV col or-sp a ce conversi on
Pa rti ti on i ma g e i nto 8 x8 -p i xel b l ocks
2 -D DCT of ea ch b lock
Q ua nti z e ea ch DCT coeffi ci ent
Runleng th a nd Huffma n cod e the nonz ero q ua nti z ed DCT
coeffi ci ents
Ba si s for the JPEG Ima g e Comp ressi on Sta nd a rd
JPEG -2 0 0 0 uses w a velet tra nsform a nd a ri thmeti c cod i ng
Q ua n ti z a ti o n
C o m p re sse d
Bi tstre a m
O ri g i n a l
Im a g e
Run l e n g th &
H uf f m a n
C o d i n g
RG B
to
YU V
Bl o c k D C T
Jo h n G . A p o sto l o p o ul o s
Pa g e 1 0
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Color Space Pr ocessi ng
Imp orta nt p rop erti es
RG B comp onents a re hi g hly correla ted
HVS p ercep ti on d i ffers for lumi na nce tha n for
chromi na nce
G oa l: Convert RG B to a d i fferent color sp a ce w here
these p rop erti es ca n b e exp loi ted
3 x3 Li nea r col or-sp a ce tra nsforma ti on:
R G B Y U V
Y: Lumi na nce
U & V: Chromi na nce
1
1
1
]
1

1
1
1
]
1



1
1
1
]
1

B
G
R
V
U
Y
100 . 515 . 615 .
436 . 289 . 147 .
114 . 587 . 299 .
Jo h n G . A p o sto l o p o ul o s
Pa g e 1 1
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Color Space Pr ocessi ng (cont.)
A d va nta g es of color sp a ce conversi on:
HVS ha s low er sp a ti a l freq uency resp onse to U a nd V
tha n to Y
Red uce sa mp li ng d ensi ty for U a nd V
HVS ha s low er sensi ti vi ty to U a nd V tha n to Y
Q ua nti z e U a nd V more coa rsely
Rea sona b le a ssump ti on: A n RG B i ma g e req ui res 3 x b i t
ra te of B& W i ma g e (si ng le-color i ma g e)
Key result: RG B i ma g e only req ui res ~ 1 . 2 5 x b i t ra te
Jo h n G . A p o sto l o p o ul o s
Pa g e 1 2
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Color Space Pr ocessi ng (cont.)
RG B color comp onents a re:
1 ) Converted to YUV
2 ) U a nd V comp onents a re sub sa mp led b y 2 x2
RG B
to
YU V
Y
U V
1 4 4 x1 7 6 p i xels
1 4 4 x1 7 6
7 2 x8 8 7 2 x8 8
Jo h n G . A p o sto l o p o ul o s
Pa g e 1 3
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Tr ansf or m Image Codi ng
G oa l: Tra nsform i ma g e so tha t most of the i nforma ti on (energ y) i s
concentra ted i nto only a sma ll fra cti on of the coeffi ci ents
Cod i ng only these sma ll fra cti on of the coeffi ci ents a nd d i sca rd i ng
the rest ca n often lea d to excellent reconstructed q ua li ty
The more energ y comp a cti on the b etter!
Ena b les p ercep tua l p rocessi ng (exp loi ti ng HVS resp onse to d i fferent
freq uency comp onents)
Fi rst fra m e o f Fo re m a n Se q ue n c e M a g n i tud e o f FFT o f Fo re m a n
Jo h n G . A p o sto l o p o ul o s
Pa g e 1 4
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Spati al Pr ocessi ng: Block DCT
Bl ock Di screte Cosi ne Tra nsform (DCT)
Sp li t i ma g e i nto 8 x8 p i xel b locks
Ea ch b lock i nd ep end ently tra nsformed a nd p rocessed
Comp ute 8 x8 2 -D DCT of ea ch b lock
Q ua nti z e a nd encod e ea ch b lock
A d va nta g es:
Ena b les si mp le, sp a ti a lly-a d a p ti ve p rocessi ng
Red uces comp uta ti on a nd memory req ui rements
Sui ta b le for p a ra llel p rocessi ng
Ba si c b ui ld i ng b lock for most current i ma g e a nd vi d eo
comp ressi on sta nd a rd s i nclud i ng :
JPEG , M PEG -1 / 2 / 4 , H. 2 6 1 / 3 / 4
Jo h n G . A p o sto l o p o ul o s
Pa g e 1 5
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Image Compr essi on:
Summar y
Cod i ng a n i ma g e (si ng le fra me):
RG B to YUV col or-sp a ce conversi on
Pa rti ti on i ma g e i nto 8 x8 -p i xel b l ocks
2 -D DCT of ea ch b lock
Q ua nti z e ea ch DCT coeffi ci ent
Runleng th a nd Huffma n cod e the nonz ero q ua nti z ed DCT
coeffi ci ents
Ba si s for the JPEG Ima g e Comp ressi on Sta nd a rd
JPEG -2 0 0 0 uses w a velet tra nsform a nd a ri thmeti c cod i ng
Q ua n ti z a ti o n
C o m p re sse d
Bi tstre a m
O ri g i n a l
Im a g e
Run l e n g th &
H uf f m a n
C o d i n g
RG B
to
YU V
Bl o c k D C T
Jo h n G . A p o sto l o p o ul o s
Pa g e 1 6
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Outli ne
W hy d o w e need comp ressi on?
Ima g e comp ressi on
Color p rocessi ng , DCT, etc
Vi d eo comp ressi on
Exp loi t temp ora l d i mensi on of vi d eo si g na l Pred i cti on
G eneri c (M PEG -typ e) vi d eo cod er a rchi tecture
Current vi d eo comp ressi on sta nd a rd s
W ha t a re they?
W ha t d o the sta nd a rd s sp eci fy?
Bri ef overvi ew of H. 2 6 4 / M PEG -4 A VC vi d eo cod i ng sta nd a rd
Jo h n G . A p o sto l o p o ul o s
Pa g e 1 7
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Ex ample Vi deo Encoder
DCT
Huffman
Coding
Motion
Estimation
Motion
Compensation
Buffer
RGB
to
YUV
Motion Vectors
Motion Vectors
MC-Prediction
Error
Input
Video
Signal
Output
Bitstream
Quantize
Inverse
DCT
Inverse
Quantize
Previous
Reconstructed
Frame
Buffer fullness
Frame Store
Jo h n G . A p o sto l o p o ul o s
Pa g e 1 8
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Vi deo Compr essi on
Vi d eo: Seq uence of fra mes (i ma g es) tha t a re rela ted
Rela ted a long the temp ora l d i mensi on
M a i n a d d i ti on over i ma g e comp ressi on
Temp ora l red und a ncy
Usua lly hi g h fra me ra te: Si g ni fi ca nt temp ora l
red und a ncy
Vi d eo cod er must exp loi t the temp ora l red und a ncy
Jo h n G . A p o sto l o p o ul o s
Pa g e 1 9
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Vi deo Compr essi on
G oa l: Exp loi t the temp ora l red und a ncy
Pred i ct current fra me b a sed on p revi ously cod ed fra mes
Three typ es of cod ed fra mes:
I-f r am e: Intra -cod ed fra me, cod ed i nd ep end ently of a ll
other fra mes
P-f r am e: Pred i cti vely cod ed fra me, cod ed b a sed on
p revi ously cod ed fra me
B-f r am e: Bi -d i recti ona lly p red i cted fra me, cod ed b a sed
on b oth p revi ous a nd future cod ed fra mes
I frame
P-frame
B-frame
Jo h n G . A p o sto l o p o ul o s
Pa g e 2 0
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Pr edi cti on f or Two Consecuti ve Fr ames:
Block-Matchi ng Moti on Esti mati on & Moti on-
Compensated Pr edi cti on
Previ ously Cod ed Fra me
(Reference Fra me)
Current Fra me
(To b e Pred i cted )
16
15
14
13
12
11
10
9
8
7
6
5
4 3
2
1
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
Reference Frame Predicted Frame
Block-ma tchi ng overvi ew :
1 ) Sp li t current fra me i nto
1 6 x1 6 -p i xel b locks
2 ) Fi nd b est ma tch for
ea ch b lock from
p ri or fra me
Jo h n G . A p o sto l o p o ul o s
Pa g e 2 1
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Ex ample: MC-Pr edi cti on f or Two
Consecuti ve Fr ames (cont.)
Prediction of
Current Frame
Prediction Error
(Residual)
Jo h n G . A p o sto l o p o ul o s
Pa g e 2 2
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
MC-Pr edi cti on and Bi -Di r ecti onal
MC-Pr edi cti on (P- and B-f r ames)
M oti on comp ensa ted p red i cti on: Pred i ct the current fra me
b a sed on reference fra me(s) w hi le comp ensa ti ng for the moti on
Exa mp les of b lock-b a sed moti on-comp ensa ted p red i cti on
(P-fra me) a nd b i -d i recti ona l p red i cti on (B-fra me):
16
15
14
13
12
11
10
9
8
7
6
5
4 3
2
1
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
Previous Frame B-Frame
16
15
14
13
12 11
10
9
8 7
6
5
4
3
2
1
Future Frame
16
15
14
13
12
11
10
9
8
7
6
5
4 3
2
1
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
Previous Frame P-Frame
Jo h n G . A p o sto l o p o ul o s
Pa g e 2 3
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Vi deo Compr essi on
M a i n a d d i ti on over i ma g e comp ressi on:
Exp loi t the temp ora l red und a ncy
Pred i ct current fra me b a sed on p revi ously cod ed fra mes
Three typ es of cod ed fra mes:
I-f r am e: Intra -cod ed fra me, cod ed i nd ep end ently of a ll
other fra mes
P-f r am e: Pred i cti vely cod ed fra me, cod ed b a sed on
p revi ously cod ed fra me
B-f r am e: Bi -d i recti ona lly p red i cted fra me, cod ed b a sed
on b oth p revi ous a nd future cod ed fra mes
I frame
P-frame
B-frame
Jo h n G . A p o sto l o p o ul o s
Pa g e 2 4
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Ex ample Use of I-,P-,B-f r ames:
MPEG Gr oup of Pi ctur es (GOP)
A rrow s show p red i cti on d ep end enci es b etw een fra mes
MPEG GOP
I
0
B
1
B
2
P
3
B
4
B
5
P
6
B
7
B
8
I
9
Jo h n G . A p o sto l o p o ul o s
Pa g e 2 5
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Summar y of Tempor al Pr ocessi ng
Use M C-p red i cti on (P & B fra mes) to red uce temp ora l red und a ncy
M C-p red i cti on yi eld s:
1 ) M oti on vectors
2 ) M C-p red i cti on error Cod e w i th conventi ona l i ma g e cod er
Someti mes M C-p red i cti on ma y p erform b a d l y
Exa mp les: Comp lex moti on, new i ma g ery (occlusi ons)
A p p roa ch:
1 . Id enti fy fra me or i nd i vi d ua l b locks w here p red i cti on fa i ls
2 . Cod e w i thout p red i cti on
Jo h n G . A p o sto l o p o ul o s
Pa g e 2 6
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Basi c Vi deo Compr essi on Ar chi tectur e
Exp loi ti ng the red und a nci es:
Temp ora l: M C-p red i cti on (P a nd B fra mes)
Sp a ti a l: Block DCT
Col or: Col or sp a ce conversi on
Sca la r q ua nti z a ti on of DCT coeffi ci ents
Zi g z a g sca nni ng , runleng th a nd Huffma n cod i ng of the
nonz ero q ua nti z ed DCT coeffi ci ents
Jo h n G . A p o sto l o p o ul o s
Pa g e 2 7
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Ex ample Vi deo Encoder
DCT
Huffman
Coding
Motion
Estimation
Motion
Compensation
Buffer
RGB
to
YUV
Motion Vectors
Motion Vectors
MC-Prediction
Error
Input
Video
Signal
Output
Bitstream
Quantize
Inverse
DCT
Inverse
Quantize
Previous
Reconstructed
Frame
Buffer fullness
Frame Store
Jo h n G . A p o sto l o p o ul o s
Pa g e 2 8
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Ex ample Vi deo Decoder
Huffman
Decoder
Motion
Compensation
Buffer YUV to RGB
Motion Vectors
Reconstructed
Frame
Output
Video
Signal
Input
Bitstream
MC-Prediction
Error
Inverse
DCT
Inverse
Quantize
Frame Store
Previous
Reconstructed
Frame
Jo h n G . A p o sto l o p o ul o s
Pa g e 2 9
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Outli ne
W hy d o w e need comp ressi on?
Ima g e comp ressi on
Color p rocessi ng , DCT, etc
Vi d eo comp ressi on
Exp loi t temp ora l d i mensi on of vi d eo si g na l Pred i cti on
G eneri c (M PEG -typ e) vi d eo cod er a rchi tecture
Current vi d eo comp ressi on sta nd a rd s
W ha t a re they?
W ha t d o the sta nd a rd s sp eci fy?
Bri ef overvi ew of H. 2 6 4 / M PEG -4 A VC vi d eo cod i ng sta nd a rd
Jo h n G . A p o sto l o p o ul o s
Pa g e 3 0
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Moti vati on f or Standar ds
G oa l of sta nd a rd s:
Ensuri ng i nterop era b i li ty: Ena b li ng communi ca ti on
b etw een d evi ces ma d e b y d i fferent ma nufa cturers
Promoti ng a technolog y or i nd ustry
Red uci ng costs
Jo h n G . A p o sto l o p o ul o s
Pa g e 3 1
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
What do the Standar ds Speci f y?
Encoder
Bitstream Decoder
Jo h n G . A p o sto l o p o ul o s
Pa g e 3 2
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
What do the Standar ds Speci f y?
N ot the encod er
N ot the d ecod er
Just the b i tstrea m synta x a nd the d ecod i ng p rocess (e. g . use IDCT,
b ut not how to i mp lement the IDCT)
Ena b les i mp roved encod i ng & d ecod i ng stra teg i es to b e
emp loyed i n a sta nd a rd -comp a ti b le ma nner
Encoder Bitstream Decoder
Scope of Standardization
(Decoding
Process)
Jo h n G . A p o sto l o p o ul o s
Pa g e 3 3
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Cur r ent Image and Vi deo
Compr essi on Standar ds
Standard Application Bit Rate
JPEG Continuous-tone still-image
compression
Variable
H.261 Video telephony and
teleconferencing over ISDN
p x 64 kb/s
MPEG-1 Video on digital storage media
(CD-ROM)
1.5 Mb/s
MPEG-2 Digital Television 2-20 Mb/s
H.263 Video telephony over PSTN 33.6-? kb/s
MPEG-4 Object-based coding, synthetic
content, interactivity
Variable
JPEG-2000 Improved still image compression Variable
H.264 /
MPEG-4 AVC
Improved video compression 10s kb/s to Mb/s
ISO : JPEG & M PEG fa mi ly of sta nd a rd s
ITU : H. 2 6 x fa mi ly of sta nd a rd s
Jo h n G . A p o sto l o p o ul o s
Pa g e 3 4
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
MPEG Gr oup of Pi ctur es (GOP) Str uctur e
Comp osed of I, P, a nd B fra mes
A rrow s show p red i cti on d ep end enci es
MPEG GOP
I
0
B
1
B
2
P
3
B
4
B
5
P
6
B
7
B
8
I
9
Peri od i c I-fra mes ena b le ra nd om a ccess i nto the cod ed b i tstrea m
Pa ra meters: (1 ) Sp a ci ng b etw een I fra mes, (2 ) numb er of B fra mes
b etw een I a nd P fra mes
Jo h n G . A p o sto l o p o ul o s
Pa g e 3 5
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Jo h n G . A p o sto l o p o ul o s
Pa g e 3 6
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Outli ne
W hy d o w e need comp ressi on?
Ima g e comp ressi on
Color p rocessi ng , DCT, etc
Vi d eo comp ressi on
Exp loi t temp ora l d i mensi on of vi d eo si g na l Pred i cti on
G eneri c (M PEG -typ e) vi d eo cod er a rchi tecture
Current vi d eo comp ressi on sta nd a rd s
W ha t a re they?
W ha t d o the sta nd a rd s sp eci fy?
Bri ef overvi ew of H. 2 6 4 / M PEG -4 A VC vi d eo cod i ng sta nd a rd
Jo h n G . A p o sto l o p o ul o s
Pa g e 3 7
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
H.264 / MPEG-4 Par t-10 Advanced
Vi deo Codi ng (AVC)
Joi ntly d esi g ned b y ISO a nd ITU sta nd a rd s b od i es
Know n b y d i fferent na mes: H. 2 6 4 , M PEG -4 Pa rt-1 0 A d va nced
Vi d eo Cod i ng (A VC), Joi nt Vi d eo Tea m (JVT), etc.
G oa ls:
Si g ni fi ca nt i mp rovement i n comp ressi on
N etw ork fri end li ness from g round up :
Pa cket-b a sed d eli very
Ca rry over RTP/ UDP/ IP, M PEG -2 TS, H. 3 2 4 M , M PEG -4 fi l e
forma t, etc.
Error & loss resi li ence
Exa ct ma tch d ecod i ng (e. g . i nteg er tra nsforms)
A d d i ti ona l recent g oa ls:
Professi ona l a p p li ca ti ons, e. g . > 8 b i ts p er sa mp le, 4 : 4 : 4 color
sa mp li ng ,
Jo h n G . A p o sto l o p o ul o s
Pa g e 3 8
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
H.264 / MPEG-4 AVC:
Enhancements over Pr i or Standar ds
Imp roved temp ora l p red i cti on:
A d a p ti ve b lock-si z e M C-p red i cti on (1 6 x1 6 to 4 x4 p i xels)
M ulti p le reference fra mes stored for p ossi b le use for
p red i cti on
-p i xel moti on vector a ccura cy
Deb locki ng fi lter w i thi n p red i cti on loop
Imp roved sp a ti a l p red i cti on:
Di recti ona l sp a ti a l p red i cti on of p i xels
Integ er DCT-li ke tra nsform (4 x4 p i xel b locks)
Imp roved entrop y cod i ng (p rob a b i li ty p red i cti on):
Context-b a sed a d a p ti ve b i na ry a ri thmeti c cod er (CA BA C)
Enha nced fea tures:
Imp roved error-resi l i ence techni q ues
SP & SI sw i tchi ng p i ctures
Jo h n G . A p o sto l o p o ul o s
Pa g e 3 9
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
H.264 / AVC Pr of i les & Levels
Profi le: Sub set of tools a p p li ca b le to sp eci fi c a p p li ca ti on(s)
H. 2 6 4 versi on 1 d efi ned 3 p rofi les:
Ba seli ne (e. g . , vi d eoconferenci ng & w i reless)
M a i n (e. g . , b roa d ca st, enterta i nment)
Extend ed (e. g . , strea mi ng ) [ Sup erset of Ba seli ne]
X SP & SI sl i ces
X M ore Error Resi li ence
X X Error Resi li ence
X CA BA C
X X Interla ced
X X B sl i ces
X X X I & P sli ces
Extend ed M a i n Ba sel i ne Cod i ng Tools
Levels sp eci fy constra i nts on ma xi mum comp uta ti on a nd memory
Pi xels p er second , cod ed b i t ra te, # reference fra mes
Jo h n G . A p o sto l o p o ul o s
Pa g e 4 0
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
H.264 / MPEG-4 AVC Amendment 1:
Fi deli ty Range Ex tensi ons (FREx t)
Desi g ned for p rofessi ona l a p p li ca ti ons w hi ch req ui re hi g her
q ua li ty i n terms of b i ts/ p i xel, b i t ra tes, sp a ti a l resoluti ons, etc.
N ew Hi g h p rofi l es:
Hi g h: A d d ed 8 x8 tra nsform, i mp roved q ua nti z a ti on
Hi g h 1 0 : A b ove p lus sup p ort for 1 0 b i ts/ p i xel
Hi g h 4 : 2 : 2 : A b ove p lus 4 : 2 : 2 forma t
Hi g h 4 : 4 : 4 : A b ove p lus 4 : 4 : 4 forma t & 1 2 b i ts/ p i xel
Hi g h p rofi le li kely for b roa d ca st & enterta i nment q ua li ty a p p s
Hi g h 4 : 2 : 2 for stud i o envi ronments
Jo h n G . A p o sto l o p o ul o s
Pa g e 4 1
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
~ 5 0 % red ucti on
(H. 2 6 4 vs M PEG -2 )
Exa mp le Comp a ri son: Results d ep end strong ly
on sp eci fi c seq uence & cod i ng tools emp loyed !
Jo h n G . A p o sto l o p o ul o s
Pa g e 4 2
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Summar y
O vervi ew of b a si c p ri nci p les & p ra cti ce of i ma g e a nd
vi d eo cod i ng , a nd key sta nd a rd s
Vi d eo cod i ng i s cri ti ca l ena b ler for ma ny a p p li ca ti ons:
Di g i ta l TV
DVD
Vi d eo strea mi ng
W i reless vi d eo, e. g . over 3 G cellula r netw orks
Future of vi d eo cod i ng :
Conti nued i mp rovements i n comp ressi on
N ew or i mp roved functi ona li ti es
e. g . Sca la b le Vi d eo Cod i ng (SVC) sta nd a rd und erw a y
Cross-la yer d esi g n
e. g . Cod i ng a nd p a cketi z a ti on or tra nsmi ssi on
Jo h n G . A p o sto l o p o ul o s
Pa g e 4 3
Vi deo
Codi ng
Se p te m b e r 2 7 , 2 0 0 5
Addi ti onal Inf or mati on
Vi d eo Strea mi ng : Concep ts, A lg ori thms, a nd Systems , HPL
Tech Rep ort HPL-2 0 0 2 -2 6 0
w w w . hp l . hp . com/ techrep orts/ 2 0 0 2 / HPL-2 0 0 2 -2 6 0 . html
Vi d eo Comp ressi on a nd Vi d eo Strea mi ng , lectures a t M IT
6 . 3 4 4 , Sp ri ng 2 0 0 4
w w w . mi t. ed u/ ~ 6 . 3 4 4 / Sp ri ng 2 0 0 4 / Sp ri ng 2 0 0 4 . html
Enterp ri se Strea mi ng : Di fferent Cha lleng es from Internet
Strea mi ng , HPL Tech Rep ort HPL-2 0 0 5 -9 8
w w w . hp l . hp . com/ techrep orts/ 2 0 0 5 / HPL-2 0 0 5 -9 8 . html

Você também pode gostar