
Unit-2 Multimedia information representation

Introduction
• All types of multimedia information are stored and processed within a computer in a digital form

• In the case of textual information consisting of strings of characters entered at a keyboard, for example, each character is represented by a unique combination of a fixed number of bits known as a codeword

• The complete text is then represented by a string of such codewords

• A signal whose amplitude varies continuously with time is known as an analog signal

• In order to store and process such signals - and hence these types of media - within a computer, it is necessary first to convert any time-varying analog signals into a digital form

• In addition, the signals used to operate devices such as loudspeakers - for speech and audio - and computer monitors - for the display of digitized images, for example - are also analog signals

• Thus the digital values representing such media types must be converted back into a corresponding time-varying analog form on output from the computer

• The conversion of an analog signal into a digital form is carried out using an electrical circuit known as a signal encoder, by first sampling the amplitude of the analog signal at repetitive time intervals and then converting the amplitude of each sample into a corresponding digital value

• The conversion of the stored digitized samples relating to a particular media type into their corresponding time-varying analog form is performed by an electrical circuit known as a signal decoder

Digitization principles

Analog signals

• The general properties relating to any time-varying analog signal are shown in Figure 2.1

• As we can see in part (a) of the figure, the amplitude of such signals varies continuously with time

• In addition, a mathematical technique known as Fourier analysis can be used to show that any time-varying analog signal is made up of a possibly infinite number of single-frequency sinusoidal signals whose amplitude and phase vary continuously with time relative to each other

• As an example, the highest and lowest frequency components of the signal shown in Figure 2.1(a) may be those shown in Figure 2.1(b)

• The range of frequencies of the sinusoidal components that make up a signal is called the signal bandwidth; two examples are shown in part (c) of the figure

• These relate to audio signals, the first a speech signal and the second a music signal produced by, say, an orchestra

• In terms of speech, humans produce sounds - which are converted into electrical signals by a microphone - that are made up of a range of sinusoidal signals varying in frequency between 50 Hz and 10 kHz

• In the case of a music signal, however, the range of signals is wider and varies between 15 Hz and 20 kHz, this being comparable with the limits of the sensitivity of the ear

• Ideally, when an analog signal is being transmitted through a network, the bandwidth of the transmission channel - that is, the range of frequencies the channel will pass - should be equal to or greater than the bandwidth of the signal

• If the bandwidth of the channel is less than this, then some of the low and/or high frequency components will be lost, thereby degrading the quality of the received signal

• This type of transmission channel is called a band-limiting channel and its effect is shown in Figure 2.1(d)
Encoder design

• The conversion of a time-varying analog signal into a digital form is carried out using a signal encoder

• It consists of two main parts: a band-limiting filter and an analog-to-digital converter (ADC)

• ADC – comprises a sample-and-hold circuit and a quantizer

• Band-limiting filter – removes selected higher frequency components from the source signal

• The output of the filter is then fed to the sample-and-hold circuit

• Sample-and-hold – samples the amplitude of the filtered signal at regular time intervals, holds each sample amplitude constant between samples, and feeds it to the quantizer

• Quantizer – converts each sample amplitude into a binary value known as a codeword

• The most significant bit (MSB) of each codeword indicates the polarity (sign) of the sample, positive or negative relative to the zero level

• Binary 0 – positive value, binary 1 – negative value
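A minimal Python sketch of the sample-and-hold plus quantizer stages described above; the sampling rate, codeword size and test signal are illustrative assumptions, and the band-limiting filter is taken as already applied:

    import numpy as np

    fs = 8000          # assumed sampling rate, in samples per second
    bits = 8           # assumed codeword size: 1 polarity bit + 7 magnitude bits

    # A band-limited test signal: a 1 kHz sinusoid lasting 10 ms (illustrative only)
    t = np.arange(0, 0.01, 1.0 / fs)             # the sample-and-hold instants
    signal = np.sin(2 * np.pi * 1000 * t)

    # Quantizer: map each sample amplitude in the range -1.0 .. +1.0 to a codeword.
    # MSB = polarity (0 = positive, 1 = negative), remaining bits = magnitude.
    levels = 2 ** (bits - 1) - 1                  # magnitude levels per polarity (127)
    magnitude = np.round(np.abs(signal) * levels).astype(int)
    polarity = (signal < 0).astype(int)
    codewords = (polarity << (bits - 1)) | magnitude

    print([format(int(c), "08b") for c in codewords[:5]])   # first few 8-bit codewords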


Sampling rate
• In relation to the sampling rate, the Nyquist sampling theorem states that, in order to obtain an accurate representation of a time-varying analog signal, its amplitude must be sampled at a minimum rate that is equal to or greater than twice the highest sinusoidal frequency component present in the signal

• This is known as the Nyquist rate and is normally expressed either in Hz or, more correctly, in samples per second (sps)

• Aliasing is the distortion caused by undersampling; it is prevented by applying an anti-aliasing (band-limiting) filter before the signal is sampled
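• For example, speech containing sinusoidal components up to 10 kHz must be sampled at a rate of at least 2 x 10 kHz = 20 ksps, while music with components up to 20 kHz requires a rate of at least 40 ksps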
Quantization Intervals
• Quantization error/noise – the difference between the actual signal amplitude and the corresponding nominal (quantized) amplitude

• Dynamic range – the ratio of the peak amplitude of a signal to its minimum amplitude, expressed in decibels (dB)
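• As a worked example (using the standard decibel definition): dynamic range D = 20 log10(Vmax / Vmin) dB, so a quantizer with n magnitude bits supports approximately 20 log10(2^n) ≈ 6n dB - roughly 48 dB for 8 magnitude bits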
Decoder design

• A low-pass filter only passes those frequency components that made up the original filtered signal; it is also known as a recovery or reconstruction filter
Text

• There are three types of text that are used to produce pages of documents:

1. Unformatted text

2. Formatted text

3. Hypertext

1. Unformatted (plaintext) – comprises strings of fixed-size characters from a limited character set

• It uses the ASCII character set – American Standard Code for Information Interchange

• ASCII uses 7 bits, giving 128 alternative characters

• Bit 7 is the MSB; for example, the character ‘M’ is represented by the codeword 1001101 (see the short check after this list)

• Printable characters – alphabetic, numeric and punctuation characters

• Control characters – (i) Format control characters – BS (backspace), DEL, ESC, SP, etc.

(ii) Information separators – FS (file separator), RS (record separator)

(iii) Transmission control characters – SOH (start of heading), STX (start of text), ETX (end of text)

• In an extended version of the character set, some character codes are replaced by mosaic characters used to build simple block graphics

• An example application is videotext/teletext
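A quick check of the ASCII example above, written as a minimal Python sketch (the formatting call is only for illustration):

    # 'M' has ASCII code 77 decimal, which is 1001101 in 7-bit binary
    codeword = format(ord("M"), "07b")
    print(codeword)   # prints 1001101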

2. Formatted (rich text) – comprises strings of characters of different styles, sizes and shapes, together with tables, graphics and images, and bold or italicized text

• Produced with word processing packages – used for the preparation of papers, books, magazines, journals, etc.

• Such documents have chapters, sections, paragraphs, etc., together with tables, graphics and pictures

3. Hypertext – documents having defined linkage points, referred to as ‘hyperlinks’, between them

• The linked set of pages stored on a particular server is accessed and viewed using a client program known as a browser

• Examples include Mozilla Firefox, Internet Explorer, Google Chrome, Opera, etc.

• The first page of a linked set is known as the home page

• Each page has a unique network-wide name known as a uniform resource locator (URL)

• Hypertext markup language (HTML) is used to describe the pages

• Directives in HTML are tags, written between angle brackets < >


Images
1. Computer graphics/graphics

2. Digitized documents

3. Digitized pictures

• An image is represented by a 2-dimensional matrix of individual picture elements known as pixels or pels
Graphics
• A range of software packages and programs is available for the creation of computer graphics

• VGA – video graphics array, a common type of display

• Attributes – include visual objects such as lines, circles and squares, the color of the border, its shadow and so on (hand-drawn free-form objects)

• Rendering – used to create solid objects (color fill)

• Different file formats – BMP (bitmap format), GIF (graphics interchange format), TIFF (tagged image file format)

• A VGA display is a matrix of 640 horizontal pixels by 480 vertical pixels

• Packages provide facilities to change the shape, size or color of objects and also to introduce complete predrawn images, either previously created by the author of the graphic or selected from a gallery of images that comes with the package (clip-art)

• With 8 bits per pixel, each pixel can have one of 256 different colors
Digitized documents
• A scanner of the type associated with facsimile (fax) machines is used

• Page scan – proceeds left to right, starting at the top of the page and ending at the bottom

• Fax machines represent each pixel with a single bit: a 0 for a white pel and a 1 for a black pel
Digitized pictures
• The three primary colors are red, green and blue (R, G and B)

• Additive color mixing – black is produced when all three primary colors are zero (produces a color image on a black surface) – used in display applications

• Subtractive color mixing – white is produced when all three chosen primary colors are zero (produces a color image on a white surface) – used in printing applications
Raster scan principle

• Scan – a set of discrete horizontal lines, the first of which starts at the top left corner of the screen and the last of which ends at the bottom right corner

• This is known as progressive scanning

• Each complete set of horizontal scan lines is called a frame

• N = 525 individual scan lines (North and South America and most of Asia)

• N = 625 (Europe and a number of other countries)

• The inside of the display screen of the picture tube is coated with a light-sensitive phosphor that emits light when energized by the electron beam

• The brightness of the light depends on the power in the electron beam

• A color tube uses three phosphors arranged as a triad; each pixel has the shape of the spot, whose size is 0.025 inches (0.635 mm)

• The frame refresh rate must be high enough to avoid flicker

• 60 Hz – 60 times per second (NTSC), 50 Hz – 50 times per second (PAL/CCIR/SECAM)
• Video RAM – used to store the pixel image

• Video controller – a hardware subsystem that reads the pixel values stored in the video RAM and converts them into the equivalent set of red, green and blue analog signals for output to the display

• Pixel depth – the number of bits per pixel

• Example – 12 bits, 4 bits per primary color, yielding 4096 different colors

• Example – 24 bits, 8 bits per primary color, yielding in excess of 16 million (2^24) colors

• Aspect ratio – the ratio of screen width to screen height

• SVGA – super VGA

• The memory required to store a single image is high: 307.2 kB per frame for VGA and 2.36 MB for SVGA (see Table 2.1)
Standard   Resolution         Number of colors   Memory required per frame (bytes)
VGA        640 x 480 x 8      256                307.2 kB
XGA        640 x 480 x 16     64 k               614.4 kB
           1024 x 768 x 8     256                786.432 kB
SVGA       800 x 600 x 16     64 k               960 kB
           1024 x 768 x 8     256                786.432 kB
           1024 x 768 x 24    16 M               2359.296 kB

Table 2.1 Example display resolutions and memory requirements
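The memory figures in Table 2.1 follow directly from the resolution and pixel depth; a small sketch of the calculation (the helper name is arbitrary):

    def frame_memory_kb(width, height, bits_per_pixel):
        # Bytes needed to hold one frame, expressed in kB (1 kB = 1000 bytes,
        # matching the figures quoted in Table 2.1)
        return width * height * bits_per_pixel / 8 / 1000

    print(frame_memory_kb(640, 480, 8))      # VGA            -> 307.2 kB
    print(frame_memory_kb(800, 600, 16))     # SVGA (800x600) -> 960.0 kB
    print(frame_memory_kb(1024, 768, 24))    # 1024x768x24    -> 2359.296 kB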


Digital cameras and scanners
• Digital cameras and scanners capture and store a digital image directly

• The camera may be a still-image camera or a video camera

• In a digital camera, a set of digitized images can be stored within the camera itself and later downloaded into the computer

• An image is captured within the camera/scanner using a solid-state device called an image sensor

• The light-sensitive cells of the sensor in a digital camera are called photosites

• The most widely used image sensor is a charge-coupled device (CCD)

• The set of charges on the first row of photosites is transferred to the readout register

• Each photosite in a row is coupled to the corresponding photosites in the two adjoining rows and, as each row is transferred to the readout register, the set of charges on each of the other rows in the matrix moves down to the next row of photosite positions

• Once in the readout register, the charge on each photosite position is shifted out, amplified and digitized using an ADC
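A rough sketch (not the actual sensor electronics) of the row-by-row readout sequence described above, using a tiny matrix of made-up charge values:

    import numpy as np

    # A tiny 4 x 4 matrix of photosite charges (arbitrary illustrative values)
    photosites = np.arange(16).reshape(4, 4)

    digitized_rows = []
    while photosites.shape[0] > 0:
        readout_register = photosites[0]   # top row is transferred to the readout register
        photosites = photosites[1:]        # remaining rows move down one row of positions
        # in the register the charges are shifted out one by one, amplified and
        # digitized by the ADC; here we simply record the row
        digitized_rows.append(readout_register.tolist())

    print(digitized_rows)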
Audio

PCM speech
• One application that uses PCM speech is the PSTN

• It includes two extra/additional circuits – a compressor and an expander

• With linear (equal) quantization intervals, the same level of quantization noise is produced irrespective of the magnitude of the input signal

• The effect of this is that the noise level is the same for both low-amplitude (quiet) signals and high-amplitude (loud) signals

• To overcome this, the quantization intervals are made non-linear (unequal), with narrower intervals used for smaller amplitude signals than for larger signals

• This is achieved by the compressor and expander

• Compression characteristics: μ-law (North America and Japan) and A-law (Europe and other countries)
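A minimal sketch of the μ-law compression characteristic (the companding law itself is standard; the function name and test values are illustrative). The expander applies the inverse mapping at the receiving end:

    import math

    MU = 255   # compression parameter commonly used with mu-law PCM

    def mu_law_compress(x):
        # Map a sample amplitude x in [-1, 1] onto the companded (non-linear) scale;
        # small amplitudes occupy proportionally more of the scale, so narrower
        # quantization intervals are effectively used for quiet signals
        sign = 1 if x >= 0 else -1
        return sign * math.log(1 + MU * abs(x)) / math.log(1 + MU)

    print(mu_law_compress(0.01))   # ~0.23 - a quiet sample is pushed well up the scale
    print(mu_law_compress(0.5))    # ~0.88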


CD Quality audio

• The discs used in CD players and CD-ROM drives are digital storage devices for stereophonic music and more general multimedia information streams

• The standard associated with these devices is known as CD-digital audio (CD-DA)

• Bit rate per channel = sampling rate x bits per sample
                       = 44.1 x 10^3 samples per second (44.1 ksps sampling rate) x 16 bits (standard)
                       = 705.6 kbps

• Total bit rate = 2 x 705.6 kbps = 1.411 Mbps
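The same bit-rate calculation written out as a few lines of Python (the variable names are arbitrary):

    sampling_rate = 44_100        # samples per second per channel
    bits_per_sample = 16
    channels = 2                  # stereophonic

    per_channel_bps = sampling_rate * bits_per_sample    # 705600 bps = 705.6 kbps
    total_bps = channels * per_channel_bps               # 1411200 bps ~ 1.411 Mbps
    print(per_channel_bps, total_bps)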


MIDI (Musical Instrument Digital Interface)
• The input device may be a piano-style keyboard, an electric guitar, etc.

• Supports special effects, amplification, etc.

• Each instrument is identified by a code – for example, piano is code 0 and violin is code 40

• Sound cards are sometimes used

Video

1. Entertainment – Broadcast television and VCR/DVD recordings

2. Interpersonal – Video telephony and video conferencing

3. Interactive – windows containing short video clips


Broadcast television
• Scanning sequence – a minimum refresh rate of 50 times per second is required to avoid flicker

• For smooth motion, however, a refresh rate of 25 times per second is sufficient

• To minimize the transmission bandwidth, the picture transmitted with each frame is divided into two halves

• Each half is known as a field, the first comprising only the odd scan lines and the second only the even scan lines

• The two fields are integrated together in the television receiver using a technique known as interlaced scanning (a short sketch of field separation follows this list)

• 525-line system – each field comprises 262.5 lines, of which 240 are visible

• 625-line system – each field comprises 312.5 lines, of which 288 are visible

• Refresh rate – 30/25 frames per second
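A small sketch of the field splitting and interlaced reassembly just described, using a toy frame (the array shapes are illustrative only):

    import numpy as np

    # A toy frame of 6 scan lines x 2 pixels (values are arbitrary)
    frame = np.arange(12).reshape(6, 2)

    odd_field = frame[0::2]    # first field: odd scan lines (1st, 3rd, 5th, ...)
    even_field = frame[1::2]   # second field: even scan lines (2nd, 4th, 6th, ...)

    # The receiver interleaves the two fields again to rebuild the complete frame
    rebuilt = np.empty_like(frame)
    rebuilt[0::2] = odd_field
    rebuilt[1::2] = even_field
    assert (rebuilt == frame).all()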

• R, G and B color signals are used for color television broadcasts

• Three main properties of a color:

1. Brightness – the amount of energy in the signal

2. Hue – the actual color of the source

3. Saturation – the strength of the color

• Luminance – refers to the brightness of the source

• Hue and saturation together are known as its chrominance

White = 0.299R + 0.587G + 0.114B

• Ys (luminance amplitude) = 0.299Rs + 0.587Gs + 0.114Bs

• Rs, Gs and Bs are the magnitudes of the three color signals

• Cb = Bs – Ys and Cr = Rs – Ys

• Y, Cb and Cr are combined together to produce a composite video signal

PAL : Y = 0.299R + 0.587G + 0.114B

      U = 0.493 (B – Y)

      V = 0.877 (R – Y)

NTSC : the chrominance signals are referred to as the I and Q signals

      Y = 0.299R + 0.587G + 0.114B

      I = 0.74 (R – Y) – 0.27 (B – Y)

      Q = 0.48 (R – Y) + 0.41 (B – Y)
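A short Python sketch of the luminance and color-difference (chrominance) calculation above (the function name and test values are only for illustration):

    def rgb_to_ycbcr(r, g, b):
        # Luminance from the weighted sum above, chrominance as the two
        # color-difference signals
        y = 0.299 * r + 0.587 * g + 0.114 * b
        cb = b - y
        cr = r - y
        return y, cb, cr

    # Pure white (R = G = B = 1) gives Y ~ 1 and near-zero chrominance
    print(rgb_to_ycbcr(1.0, 1.0, 1.0))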
Signal bandwidth

• The bandwidth of the transmission channel used for a color broadcast is the same as that used for a monochrome broadcast
Digital video

• The digitization of video signals has been carried out in television studios for many years

• For example, to perform conversions from one video format into another

• Some of the formats include:

1. 4:2:2 format – television studios

2. 4:2:0 format – video broadcast applications

3. HDTV format – high-definition television

4. SIF (source intermediate format)

5. CIF (common intermediate format) – video conferencing applications

6. QCIF (quarter CIF) – video telephony applications


Digitization format   System     Spatial resolution                    Temporal resolution
4:2:0                 525-line   Y = 640 x 480, Cb = Cr = 320 x 240    60 Hz
                      625-line   Y = 768 x 576, Cb = Cr = 384 x 288    50 Hz
SIF                   525-line   Y = 320 x 240, Cb = Cr = 160 x 120    30 Hz
                      625-line   Y = 384 x 288, Cb = Cr = 192 x 144    25 Hz
CIF                   -          Y = 384 x 288, Cb = Cr = 192 x 144    30 Hz
QCIF                  -          Y = 192 x 144, Cb = Cr = 96 x 72      15/7.5 Hz

Table 2.2 PC video digitization formats
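A small sketch showing how the raw (uncompressed) bit rate implied by one of the formats in Table 2.2 can be estimated, assuming 8 bits per sample (the function name and the choice of CIF are illustrative; practical systems rely on compression to reduce this rate):

    def raw_bit_rate_mbps(y_res, c_res, frames_per_sec, bits_per_sample=8):
        # Uncompressed bit rate implied by a digitization format: one luminance
        # plane plus two chrominance planes, each sample using bits_per_sample bits
        samples_per_frame = y_res[0] * y_res[1] + 2 * c_res[0] * c_res[1]
        return samples_per_frame * bits_per_sample * frames_per_sec / 1e6

    # CIF, using the resolutions and temporal rate from Table 2.2
    print(raw_bit_rate_mbps((384, 288), (192, 144), 30))   # ~39.8 Mbps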
