Academic Press Library in Signal Processing: Image and Video Compression and Multimedia

About this ebook

This fifth volume, edited and authored by world-leading experts, reviews the principles, methods, and techniques of important and emerging research topics and technologies in image and video compression and multimedia.

With this reference source you will:

  • Quickly grasp a new area of research
  • Understand the underlying principles of a topic and its application
  • Ascertain how a topic relates to other areas and learn of the research issues yet to be resolved

The volume also provides:

  • Quick tutorial reviews of important and emerging topics of research in image and video compression and multimedia
  • Comprehensive references to journal articles and other literature on which to build further, more specific and detailed knowledge
  • Content edited by leading figures in the field who, through their reputation, have been able to commission experts to write on particular topics
Language: English
Release date: Jun 12, 2014
ISBN: 9780124201576

    Book preview

    Academic Press Library in Signal Processing - Academic Press

    Table of Contents

    Cover image

    Title page

    Copyright

    Introduction

    Signal Processing at Your Fingertips!

    About the Editors

    Section Editors

    Author Biographies

    Section 1: Image/Video Compression

    Chapter 1. An Introduction to Video Coding

    Abstract

    Nomenclature

    5.01.1 Introduction

    5.01.2 Applications areas for video coding

    5.01.3 Requirements of a compression system

    5.01.4 The basics of compression

    5.01.5 Decorrelating transforms

    5.01.6 Symbol encoding

    5.01.7 Motion estimation

    5.01.8 The block-based motion-compensated video coding architecture

    5.01.9 Standardization of video coding systems

    5.01.10 Conclusions

    Additional resources

    Glossary of terms

    References

    Chapter 2. Motion Estimation—A Video Coding Viewpoint

    Abstract

    Nomenclature

    5.02.1 Introduction

    5.02.2 Motion representation and models

    5.02.3 Optical flow approaches

    5.02.4 Pel-recursive approaches

    5.02.5 Transform-domain approaches

    5.02.6 Block matching approaches

    5.02.7 Parametric motion estimation

    5.02.8 Multi-resolution approaches

    5.02.9 Motion compensation

    5.02.10 Performance assessment criteria for motion estimation algorithms

    5.02.11 Summary and concluding remarks

    Relevant websites

    Glossary

    References

    Chapter 3. High Efficiency Video Coding (HEVC) for Next Generation Video Applications

    Abstract

    Nomenclature

    5.03.1 Introduction

    5.03.2 Requirements and solutions for video compression

    5.03.3 Basic principles behind HEVC

    5.03.4 Performance evaluation

    5.03.5 Conclusions

    Relevant Websites

    Glossary

    References

    Chapter 4. Stereoscopic and Multi-View Video Coding

    Abstract

    Nomenclature

    5.04.1 Introduction

    5.04.2 Fundamentals of stereoscopic vision

    5.04.3 3D display technologies

    5.04.4 Applications of stereoscopic and multi-view video

    5.04.5 Compression of stereoscopic and multi-view video

    5.04.6 Quality evaluation of 3D video

    5.04.7 Conclusions

    Relevant Websites

    References

    Chapter 5. Perceptually Optimized Video Compression

    Abstract

    Nomenclature

    5.05.1 Introduction

    5.05.2 Perceptual coding tools in video coding architectures

    5.05.3 The modeling of the human visual system sensitivity to coding artifacts

    5.05.4 Integration of JND models in video coding architectures

    5.05.5 Practical perceptual video coding schemes

    5.05.6 Conclusions and considerations for future research directions

    Glossary

    References

    Chapter 6. How to Use Texture Analysis and Synthesis Methods for Video Compression

    Abstract

    Nomenclature

    5.06.1 Introduction

    5.06.2 Perception-oriented video coding strategies

    5.06.3 Block-based video coding techniques

    5.06.4 Region-based video coding techniques

    5.06.5 Conclusion

    Relevant websites of open source software

    Glossary

    References

    Chapter 7. Measuring Video Quality

    Abstract

    Nomenclature

    5.07.1 Introduction

    5.07.2 Background

    5.07.3 Subjective testing

    5.07.4 Subjective datasets

    5.07.5 Objective quality metrics

    5.07.6 Conclusions

    Additional resources

    References

    Chapter 8. Multiple Description Coding

    Abstract

    Nomenclature

    5.08.1 Introduction and history

    5.08.2 Theoretical basis

    5.08.3 Speech coding

    5.08.4 Image coding

    5.08.5 Video coding

    5.08.6 Network coding

    5.08.7 Stereoscopic 3D

    5.08.8 Other applications

    5.08.9 Implementations and patents

    5.08.10 Conclusion

    List of Relevant Websites

    References

    Chapter 9. Video Error Concealment

    Abstract

    Nomenclature

    5.09.1 Introduction

    5.09.2 Spatial error concealment (SEC)

    5.09.3 Temporal error concealment (TEC)

    5.09.4 Mode selection

    5.09.5 Discussion and conclusions

    Glossary

    References

    Section 2: Multimedia Signal Processing

    Chapter 10. Introduction to Multimedia Signal Processing

    References

    Chapter 11. Multimedia Streaming

    Abstract

    Overview

    5.11.1 Introduction

    5.11.2 Transport protocols

    5.11.3 Streaming on wired networks

    5.11.4 Streaming on wireless networks

    5.11.5 Error control, detection, concealment

    5.11.6 Scalable video coding

    5.11.7 Multiple Description Coding

    5.11.8 Network coding

    Conclusions

    Glossary

    References

    Chapter 12. Multimedia Content-Based Visual Retrieval

    Abstract

    5.12.1 Introduction

    5.12.2 General pipeline overview

    5.12.3 Local feature representation

    5.12.4 Feature quantization

    5.12.5 Index strategy

    5.12.6 Retrieval scoring

    5.12.7 Post-processing

    5.12.8 Conclusion

    Acknowledgment

    References

    Chapter 13. Joint Audio-Visual Processing for Video Copy Detection

    Abstract

    5.13.1 Introduction

    5.13.2 Visual-based video copy detection algorithm

    5.13.3 Audio-based video copy detection

    5.13.4 Joint audio- and visual-based video copy detection

    5.13.5 Copy and near-duplicate detection

    5.13.6 Region and partial content search

    5.13.7 Conclusion and future trends

    References

    Index

    Copyright

    Introduction

    Signal Processing at Your Fingertips!

    Let us flash back to the 1970s when the editors-in-chief of this e-reference were graduate students. One of the time-honored traditions then was to visit the libraries several times a week to keep track of the latest research findings. After your advisor and teachers, the librarians were your best friends. We visited the engineering and mathematics libraries of our universities every Friday afternoon and pored over the IEEE Transactions, Annals of Statistics, the Journal of the Royal Statistical Society, Biometrika, and other journals so that we could keep track of the recent results published there. Another ritual that was part of these outings was to take a sufficient number of coins so that papers of interest could be xeroxed. As there was no Internet, one would often request copies of reprints from authors by mailing postcards, and most authors would oblige. Our generation maintained thick folders of hardcopies of papers. Prof. Azriel Rosenfeld (one of RC’s mentors) maintained a library of over 30,000 papers going back to the early 1950s!

    Another fact to recall is that, in the absence of the Internet, research results were not so widely disseminated, and even when they were, there was a delay between when the results were published in technologically advanced western countries and when they became known to scientists in third world countries. For example, until the late 1990s, scientists in the US and most countries in Europe had a lead time of at least a year to 18 months, since it took that much time for papers to appear in journals after submission. Add to this the time it took for the Transactions to go by surface mail to various libraries in the world. Scientists who lived and worked in the more prosperous countries kept abreast of the progress in their fields by visiting each other or attending conferences.

    Let us race back to the 21st century! We live in and experience a world that is changing at rates unseen before in human history. The era of Information and Knowledge societies has had an impact on all aspects of our social as well as personal lives. In many ways, it has changed the way we experience and understand the world around us; that is, the way we learn. Such a change is much more obvious to the younger generation, which carries much less momentum from the past, compared to us, the older generation: a generation that grew up in the Internet age, the age of images and video games, the age of the iPad and Kindle, the age of the fast exchange of information. These new technologies are a part of their real world, and education and learning can no longer ignore this reality. Although many questions are still open for discussion among sociologists, one thing is certain: electronic publishing and dissemination, embodying new technologies, is here to stay. This is the only way that effective pedagogic tools can be developed and used to assist the learning process from now on. Many kids in the early school or even preschool years have their own iPads to access information on the Internet. When they grow up to study engineering, science, medicine, or law, we doubt they will ever visit a library, as they will by then expect all information to be available at their fingertips, literally!

    Another consequence of this development is the leveling of the playing field. Many institutions in less developed countries could not afford to buy the IEEE Transactions and other journals of repute. Even if they did, given the time between submission and publication of papers in journals and the time it took for the Transactions to be sent by surface mail, scientists and engineers in these countries were behind by two years or so. Also, most libraries did not acquire the proceedings of conferences, so there was a huge gap in awareness of what was going on in technologically advanced countries. The lucky few who could visit the US and some countries in Europe were able to keep up with the progress made there. This has changed. Anyone with an Internet connection can request or download papers from the sites of scientists. Thus there is a leveling of the playing field which will lead to more scientists and engineers being groomed all over the world.

    The aim of the Online Reference in Signal Processing project is to implement such a vision. We all know that when we ask any of our students to search for information, their first step will be to look on the web, and possibly in Wikipedia. This was the inspiration for our project: to develop a site, related to signal processing, where a selected set of reviewed articles becomes available at a first click. These articles are fully refereed and written by experts in their respective topics. Moreover, the authors have the luxury of updating their articles regularly, so as to keep up with the advances that take place as time evolves. This has a double benefit. Such articles, besides the more classical material, will also convey the most recent results, providing students and researchers with up-to-date information. In addition, the authors have the chance of making their articles a more permanent source of reference, one that keeps its freshness in spite of the passing of time.

    The other major advantage is that authors have the chance to provide, alongside their chapters, multimedia tools that clarify concepts and demonstrate more vividly the performance of various methods, in addition to the static figures and tables. Such tools can be updated at the authors’ will, building upon previous experience and comments. We hope that, in future editions, this aspect of the project will be further enriched and strengthened.

    In the previously stated context, the Online Reference in Signal Processing provides a revolutionary way of accessing, updating, and interacting with online content. In particular, the Online Reference will be a living, highly structured, and searchable peer-reviewed electronic reference in signal/image/video processing and related applications, using existing books and newly commissioned content. It gives tutorial overviews of the latest technologies and research, key equations, algorithms, applications, standards, code, and core principles, with links to key Elsevier journal articles and abstracts of non-Elsevier journals.

    The audience of the Online Reference in Signal Processing is intended to include practicing engineers in signal/image processing and applications, researchers, PhD students, postdocs, consultants, and policy makers in government. In particular, readers can benefit in the following ways:

    • To learn about new areas outside their own expertise.

    • To understand how their area of research is connected to other areas outside their expertise.

    • To learn how different areas are interconnected and impact on each other: the need for a helicopter perspective that reveals the wood rather than just the trees.

    • To keep up-to-date with new technologies as they develop: what they are about, what is their potential, what are the research issues that need to be resolved, and how can they be used.

    • To find the best and most appropriate journal papers, and to keep up-to-date with the newest and best papers as they are written.

    • To link principles to the new technologies.

    The signal processing topics have been divided into a number of subtopics, which have also dictated the way the different articles have been compiled together. Each of the subtopics has been coordinated by an Associate Editor (AE). In particular:

    1. Signal Processing Theory (Prof. P. Diniz)

    2. Machine Learning (Prof. J. Suykens)

    3. DSP for Communications (Prof. N. Sidiropoulos)

    4. Radar Signal Processing (Prof. F. Gini)

    5. Statistical SP (Prof. A. Zoubir)

    6. Array Signal Processing (Prof. M. Viberg)

    7. Image Enhancement and Restoration (Prof. H. J. Trussell)

    8. Image Analysis and Recognition (Prof. Anuj Srivastava)

    9. Video Processing (other than compression), Tracking, Super Resolution, Motion Estimation, etc. (Prof. A. R. Chowdhury)

    10. Hardware and Software for Signal Processing Applications (Prof. Ankur Srivastava)

    11. Speech Processing/Audio Processing (Prof. P. Naylor)

    12. Still Image Compression (Prof. David R. Bull)

    13. Video Compression (Prof. David R. Bull)

    14. Multimedia (Prof. Min Wu)

    We would like to thank all the Associate Editors for the time and effort they invested in inviting authors and coordinating the reviewing process. The Associate Editors have also provided succinct summaries of their areas.

    The articles included in the current edition comprise the first phase of the project. In the second phase, besides updates to the current articles, more articles will be included to further enrich the existing range of topics. We also envisage that, in future editions, besides the scientific articles, we will be able to include articles of historical value. Signal processing has now reached an age at which its history has to be traced back and written.

    Last but not least, we would like to thank all the authors for their efforts in contributing to this new and exciting project. We earnestly hope that, in the area of signal processing, this reference will help level the playing field by highlighting research progress in a timely and accessible manner to anyone who has access to the Internet. With this effort, the next breakthrough advances may come from anywhere in the world.

    The companion site for this work, http://booksite.elsevier.com/9780124166165, includes multimedia files (video/audio) and MATLAB code for selected chapters.

    Rama Chellappa

    Sergios Theodoridis

    Section 1: Image/Video Compression

    Outline

    Chapter 1 An Introduction to Video Coding

    Chapter 2 Motion Estimation—A Video Coding Viewpoint

    Chapter 3 High Efficiency Video Coding (HEVC) for Next Generation Video Applications

    Chapter 4 Stereoscopic and Multi-View Video Coding

    Chapter 5 Perceptually Optimized Video Compression

    Chapter 6 How to Use Texture Analysis and Synthesis Methods for Video Compression

    Chapter 7 Measuring Video Quality

    Chapter 8 Multiple Description Coding

    Chapter 9 Video Error Concealment

    Chapter 1

    An Introduction to Video Coding

    David R. Bull,    Bristol Vision Institute, University of Bristol, Bristol BS8 1UB, UK

    Abstract

    Visual information is the primary consumer of communications bandwidth across all broadcast, internet, and mobile networks. Users are demanding increased video quality, increased quantities of video content, more extensive access, and better reliability. This is creating a major tension between the available capacity per user in the network and the bit rates required to transmit video content at the desired quality. Network operators, content creators, and service providers therefore are all seeking better ways to transmit the highest quality video at the lowest bit rate, something that can only be achieved through video compression.

    This chapter provides an introduction to some of the most common image and video compression methods in use today and sets the scene for the rest of the contributions in later chapters. It first explains, in the context of a range of video applications, why compression is needed and what compression ratios are required. It then examines the basic video compression architecture, using the ubiquitous hybrid, block-based motion compensated codec. Finally it briefly examines why standards are so important in supporting interoperability.

    This chapter necessarily provides only an overview of video coding algorithms; the reader is referred to Ref. [1] for a more comprehensive description of the methods used in today’s compression systems.

    Keywords

    Image compression; Video compression; Video applications; Discrete cosine transform; Entropy coding; Motion estimation; Video standards

    Nomenclature

    1-D one dimensional

    2-D two dimensional

    3-D three dimensional

    AC alternating current. Used to denote all transform coefficients except the zero frequency coefficient

    ADSL asymmetric digital subscriber line

    ASP advanced simple profile (of MPEG-4)

    AVC advanced video codec (H.264)

    B bi-coded picture

    bpp bits per pixel

    bps bits per second

    CCIR international radio consultative committee (now ITU)

    CIF common intermediate format

    codec encoder and decoder

    CT computerized tomography

    CTU coding tree unit

    CU coding unit

    DC direct current. Refers to zero frequency transform coefficient.

    DCT discrete cosine transform

    DFD displaced frame difference

    DFT discrete Fourier transform

    DPCM differential pulse code modulation

    DVB digital video broadcasting

    EBU European Broadcasting Union

    FD frame difference

    fps frames per second

    GOB group of blocks

    GOP group of pictures

    HDTV high definition television

    HEVC high efficiency video codec (H.265)

    HVS human visual system

    I intra coded picture

    IEC International Electrotechnical Commission

    IEEE Institute of Electrical and Electronic Engineers

    IP internet protocol

    ISDN integrated services digital network

    ISO International Standards Organization

    ITU International Telecommunications Union. -R Radio; -T Telecommunications

    JPEG Joint Photographic Experts Group

    kbps kilobits per second

    LTE long term evolution (4G mobile radio technology)

    MB macroblock

    mbps mega bits per second

    MC motion compensation

    MCP motion compensated prediction

    ME motion estimation

    MEC motion estimation and compensation

    MPEG Motion Picture Experts Group

    MRI magnetic resonance imaging

    MV motion vector

    P predicted picture

    PSNR peak signal to noise ratio

    QAM quadrature amplitude modulation

    QCIF quarter CIF resolution

    QPSK quadrature phase shift keying

    RGB red, green, and blue color primaries

    SG study group (of ITU)

    SMPTE Society of Motion Picture and Television Engineers

    TV television

    UHDTV ultra high definition television

    UMTS universal mobile telecommunications system

    VDSL very high bit rate digital subscriber line

    VLC variable length coding

    VLD variable length decoding

    YCbCr color coordinate system comprising luminance, Y, and two chrominance channels, Cb and Cr

    5.01.1 Introduction

    Visual information is the primary consumer of communications bandwidth across all broadcast, internet, and mobile networks. Users are demanding increased video quality, increased quantities of video content, more extensive access, and better reliability. This is creating a major tension between the available capacity per user in the network and the bit rates required to transmit video content at the desired quality. Network operators, content creators, and service providers therefore are all seeking better ways to transmit the highest quality video at the lowest bit rate, something that can only be achieved through video compression.

    This chapter provides an introduction to some of the most common image and video compression methods in use today and sets the scene for the rest of the contributions in later chapters. It first explains, in the context of a range of video applications, why compression is needed and what compression ratios are required. It then examines the basic video compression architecture, using the ubiquitous hybrid, block-based motion compensated codec. Finally it briefly examines why standards are so important in supporting interoperability.

    This chapter necessarily provides only an overview of video coding algorithms; the reader is referred to Ref. [1] for a more comprehensive description of the methods used in today’s compression systems.

    5.01.2 Applications areas for video coding

    By 2020 it is predicted that the number of network-connected devices will reach 1000 times the world’s population; there will be 7 trillion connected devices for 7 billion people [2]. Cisco predict [3] that this will result in 1.3 zettabytes of global internet traffic in 2016, with over 80% of this being video traffic. This explosion in video technology and the associated demand for video content are driven by:

    • Increased numbers of users with increased expectations of quality and mobility.

    • Increased amounts of user generated content available through social networking and download sites.

    • The emergence of new ways of working using distributed applications and environments such as the cloud.

    • Emerging immersive and interactive entertainment formats for film, television, and streaming.

    5.01.2.1 Markets for video technology

    A huge and increasing number of applications rely on video technology. These include:

    5.01.2.1.1 Consumer video

    Entertainment, personal communications, and social interaction provide the primary applications in consumer video, and these will dominate the video landscape of the future. There has, for example, been a massive increase in the consumption and sharing of content on mobile devices and this is likely to be the major driver over the coming years. The key drivers in this sector are:

    • Broadcast television, digital cinema and the demand for more immersive content (3-D, multiview, higher resolution, frame rate, and dynamic range).

    • Internet streaming, peer to peer distribution, and personal mobile communication systems.

    • Social networking, user-generated content, and content-based search and retrieval.

    • In-home wireless content distribution systems and gaming.

    5.01.2.1.2 Surveillance

    We have become increasingly aware of our safety and security, and video monitoring is playing an increasingly important role in this respect. It is estimated that the market for networked cameras (non-consumer) [4] will be $4.5 billion in 2017. Aligned with this, there will be an even larger growth in video analytics. The key drivers in this sector are:

    • Surveillance of public spaces and high profile events.

    • National security.

    • Battlefield situational awareness, threat detection, classification, and tracking.

    • Emergency services, including police, ambulance, and fire.

    5.01.2.1.3 Business and automation

    Visual communications are playing an increasingly important role in business. For example, the demand for higher quality video conferencing and the sharing of visual content have increased. Similarly in the field of automation, vision-based systems are playing a key role in transportation systems and are now underpinning many manufacturing processes, often demanding the storage or distribution of compressed video content. The drivers in this case can be summarized as:

    • Video conferencing, tele-working, and other interactive services.

    • Publicity, advertising, news, and journalism.

    • Design, modeling, simulation.

    • Transport systems, including vehicle guidance, assistance, and protection.

    • Automated manufacturing and robotic systems.

    5.01.2.1.4 Healthcare

    Monitoring the health of the population is becoming increasingly dependent on imaging methods to aid diagnoses. Methods such as CT and MRI produce enormous amounts of data for each scan and these need to be stored as efficiently as possible while retaining the highest quality. Video is also becoming increasingly important as a point-of-care technology for monitoring patients in their own homes. The primary healthcare drivers for compression are:

    • Point-of-care monitoring.

    • Emergency services and remote diagnoses.

    • Tele-surgery.

    • Medical imaging.

    It is clear that all of the above application areas require considerable trade-offs to be made between cost, complexity, robustness, and performance. These issues are addressed further in the following section.

    5.01.3 Requirements of a compression system

    5.01.3.1 Requirements

    The primary requirement of a video compression system is to produce the highest quality at the lowest bit rate. Other desirable features include:

    • Robustness to loss: We want to maintain high quality when signals are transmitted over error-prone channels by ensuring that the bitstream is error-resilient.

    • Reconfigurability and flexibility: To support delivery over time-varying channels or heterogeneous networks.

    • Low complexity: Particularly for low power portable implementations.

    • Low delay: To support interactivity.

    • Authentication and rights management: To support conditional access, content ownership verification, or to detect tampering.

    • Standardization: To support interoperability.

    5.01.3.2 Trade-offs

    In practice, it is usual that a compromise must be made in terms of trade-offs between these features because of cost or complexity constraints and because of limited bandwidth or lossy channels. Areas of possible compromise include:

    Lossy vs lossless compression: We must exploit any redundancy in the image or video signal in such a way that it delivers the desired compression with the minimum perceived distortion. This usually means that the original signal cannot be perfectly reconstructed.

    Rate vs quality: In order to compromise between bit rate and quality, we must trade off parameters such as frame rate, spatial resolution (luma and chroma), dynamic range, prediction mode, and latency. A codec will include a rate-distortion optimization mechanism that makes coding decisions (for example, relating to prediction mode, block size, etc.) based on a rate-distortion objective function [1,5,6]; see the sketch after this list.

    Complexity vs cost: In general, as additional features are incorporated, the video encoder will become more complex. However, more complex architectures invariably are more expensive and may introduce more delay.

    Delay vs performance: Low latency is important in interactive applications. However, increased performance can often be obtained if greater latency can be tolerated.

    Redundancy vs error resilience: Conventionally in data transmission applications, channel and source coding have been treated independently, with source compression used to remove picture redundancy and error detection and correction mechanisms added to protect the bitstream against errors. However, in the case of video coding, alternative mechanisms exist for making the compressed bitstream more resilient to errors, or dynamic channel conditions, or for concealing errors at the decoder. Some of these are discussed in Chapters 8 and 9.
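
    To make the rate vs quality trade-off concrete, the following sketch evaluates the Lagrangian objective J = D + λR that such rate-distortion optimization typically minimizes [5,6]. The candidate modes, their distortion/rate values, the value of λ, and the function names are purely illustrative assumptions:

    ```python
    def rd_cost(distortion, rate, lmbda):
        """Lagrangian rate-distortion cost J = D + lambda * R."""
        return distortion + lmbda * rate

    # Hypothetical mode candidates: (name, distortion as SSD, rate in bits).
    modes = [("intra", 1200.0, 4500), ("inter_16x16", 800.0, 5200), ("skip", 2100.0, 40)]
    best = min(modes, key=lambda m: rd_cost(m[1], m[2], lmbda=0.3))
    print(best[0])  # -> "skip": the lowest cost J at this illustrative operating point
    ```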

    5.01.3.3 How much do we need to compress?

    Typical video compression ratio requirements currently lie between 100:1 and 200:1. However, this could increase to many hundreds or even thousands to one as new, more demanding formats emerge.

    5.01.3.3.1 Bit rate requirements

    Pictures are normally acquired as an array of color samples, usually based on combinations of the red, green, and blue primaries. They are then usually converted to some other more convenient color space, such as Y, Cb, Cr, that encodes luminance separately from two color difference signals [1]. Table 1.1 shows typical sampling parameters for a range of common video formats. Without any compression, it can be seen that, even for the lower resolution formats, the bit rate requirements are high: much higher than what is normally provided by today’s communication channels. Note that the chrominance signals are encoded at a reduced resolution, as indicated by the 4:2:2 and 4:2:0 labels. Also note that two formats are included for the HDTV case (the same could be done for the other formats); for broadcast quality systems, the 4:2:2 format is more representative of the original bit rate, as this is what is produced by most high quality cameras. The 4:2:0 format, on the other hand, is that normally employed for transmission after compression.

    Table 1.1

    Typical Parameters for Common Digital Video Formats and their (Uncompressed) Bit Rate Requirements

    UHDTV = Ultra High Definition Television; HDTV = High Definition Television; SDTV = Standard Definition Television; CIF = Common Intermediate Format; QCIF = Quarter CIF.

    (a) Encoding at 10 bits.

    Finally, it is worth highlighting that the situation is actually worse than that shown in Table 1.1, especially for the new Ultra High Definition Television (UHDTV) standard [7], where higher frame rates and longer wordlengths will normally be used. For example, at 120 frames per second (fps) with a 10 bit wordlength for each sample, the raw bit rate increases to 60 Gbps for a single video stream! This will increase even further if 3-D or multiview formats are employed.

    5.01.3.3.2 Bandwidth availability

    Let us now examine the bandwidths available in typical communication channels, as summarized in Table 1.2. This table shows the theoretical maximum bit rates under optimum operating conditions, and it should be noted that these are rarely, if ever, achieved in practice. The bit rates available to an individual user at the application layer will normally be significantly lower than the figures quoted in Table 1.2. The effective throughput is influenced by a large range of internal and external factors including: overheads due to link layer and application layer protocols; network contention, congestion, and numbers of users; asymmetry between download and upload rates; and of course the prevailing channel conditions. In particular, as channel conditions deteriorate, modulation and coding schemes will need to be increasingly robust. This results in lower spectral efficiency, with increased coding overhead needed in order to maintain a given quality. The number of retransmissions will also inevitably increase as the channel worsens. As an example, DVB-T2 will reduce from 50 Mbps (256QAM at 5/6 code rate) to around 7.5 Mbps when channel conditions dictate a change in modulation and coding mode down to rate-1/2 QPSK. Similarly for 802.11n, realistic bandwidths per user can easily fall well below 10 Mbps. 3G download speeds rarely, if ever, reach 384 kbps; more frequently they will be less than 100 kbps.

    Table 1.2

    Theoretical Bandwidth Characteristics for Common Communication Systems

    Consider, as an example, streaming video to a mobile device at VGA resolution (640 × 480 pixels) at 30 fps, with a bit rate between 0.5 and 1 Mbps encoded using the H.264/AVC [8] standard. In this case the raw bit rate, assuming color subsampling in 4:2:0 format, will be 110.6 Mbps. As we can see, this is between 100 and 200 times the bit rate supported for transmission.
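
    As a quick check of the figures quoted above, the short sketch below recomputes the raw rates (the helper function is ours; 4:2:0 subsampling carries 1.5 samples per pixel):

    ```python
    def raw_bitrate(width, height, fps, bits_per_sample, samples_per_pixel=1.5):
        """Uncompressed bit rate in bits/s; 4:2:0 sampling -> 1.5 samples per pixel."""
        return width * height * samples_per_pixel * bits_per_sample * fps

    vga = raw_bitrate(640, 480, 30, 8)        # the mobile streaming example
    uhd = raw_bitrate(7680, 4320, 120, 10)    # the UHDTV case from Section 5.01.3.3.1
    print(f"VGA raw rate: {vga / 1e6:.1f} Mbps")         # ~110.6 Mbps
    print(f"Ratio at 1 Mbps target: {vga / 1e6:.0f}:1")  # ~111:1
    print(f"UHDTV raw rate: {uhd / 1e9:.1f} Gbps")       # ~59.7 Gbps
    ```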

    5.01.4 The basics of compression

    A simplified block diagram of a video compression system is shown in Figure 1.1. This shows an input being encoded, transmitted, and decoded.

    Figure 1.1 Simplified high level video compression architecture.

    5.01.4.1 Still image encoding

    If we ignore the blocks labeled as motion compensation, the diagram in Figure 1.1 describes a still image encoding system, such as that used in JPEG [9]. The intra-frame encoder performs coding of the picture without reference to any other frames. This is normally achieved by exploiting spatial redundancy through transform-based decorrelation followed by variable length symbol encoding (VLC). The image is then conditioned for transmission using some means of error-resilient coding that makes the encoded bitstream more robust to channel errors. At the decoder, the inverse operations are performed and the original image is reconstructed at the output.

    5.01.4.2 Video encoding

    A video signal can be considered as a sequence of still images, acquired typically at a rate of 24, 25, 30, 50, or 60 fps. Although it is possible to encode a video sequence as a series of still images using intra-frame methods as described above, we can achieve significantly higher coding efficiency if we also exploit the temporal redundancy that exists in most natural video sequences. This is achieved using inter-frame motion prediction as represented by the motion compensation block in Figure 1.1. This block predicts the structure of the incoming video frame based on the contents of previously encoded frames. The encoding continues as for the intra-frame case, except this time the intra-frame encoder block processes the low energy residual signal remaining after prediction, rather than the original frame. After variable length encoding, the encoded signal will be buffered prior to transmission. The buffer serves to smooth out content-dependent variations in the output bit rate and the buffer is managed by a rate controller algorithm which adjusts coding parameters in order to match the video output to the instantaneous capacity of the channel.

    Because of the reliance on both spatial and temporal prediction, compressed video bitstreams are more prone to channel errors than still images, suffering from temporal as well as spatial error propagation. Methods of mitigating this, by making the bitstream more robust and by correcting or concealing the resulting artifacts, are described in later chapters.

    5.01.4.3 Coding units and macroblocks

    Most coding standards partition each picture into smaller fixed-size units for processing. Historically, the basic unit has been a 16 × 16 block, comprising luma and chroma information, called a macroblock.

    5.01.4.3.1 Macroblocks

    A typical macroblock structure is illustrated in Figure 1.2; in 4:2:0 format it comprises a 16 × 16 block of luma samples together with two 8 × 8 blocks of chroma samples, and it may be further partitioned at lower levels for prediction and transformation.

    Figure 1.2 Typical macroblock structure.

    5.01.4.3.2 Coding tree units

    The recent HEVC coding standard replaces the macroblock with a larger coding tree unit (CTU), which can span up to 64 × 64 luma samples. It also provides much more flexibility in terms of block partitioning to support its various prediction modes. Further details on the HEVC standard are provided in Chapter 3.

    5.01.4.4 Video quality assessment

    The most obvious way of assessing video quality is to ask a human viewer. Subjective testing methodologies have therefore become an important component in the design and optimization of new compression systems. However, such tests are costly and time consuming and cannot be used for real-time rate-distortion optimization. Hence, objective metrics are frequently used instead, as these can provide an instantaneous estimate of video quality. These are discussed alongside subjective evaluation methods in Chapter 7. In particular, metrics that can more accurately predict visual quality, aligned with the human visual system (HVS), are highly significant in the context of future coding strategies such as those discussed in Chapters 5 and 6.
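
    For the objective route, the PSNR metric listed in the Nomenclature is the most common choice in codec development. A minimal sketch for 8-bit frames follows (the function name and interface are our own):

    ```python
    import numpy as np

    def psnr(reference, decoded, peak=255.0):
        """Peak signal-to-noise ratio (dB) between reference and decoded frames."""
        err = reference.astype(np.float64) - decoded.astype(np.float64)
        mse = np.mean(err ** 2)
        return float("inf") if mse == 0 else 10.0 * np.log10(peak ** 2 / mse)
    ```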

    5.01.5 Decorrelating transforms

    Transformation presents a convenient basis for compression and this comes about through three mechanisms:

    1. It provides data decorrelation and creates a frequency-related distribution of energy allowing low energy coefficients to be discarded.

    2. Retained coefficients can be quantized, using a scalar quantizer, according to their perceptual importance.

    3. The sparse matrix of all remaining quantized coefficients exhibits symbol redundancy which can be exploited using variable length coding.

    In practice, the transform is chosen to provide a compromise between complexity and decorrelation performance. Transformation maps the raw input data into a representation more amenable to compression. Decorrelating transforms, when applied to correlated data, such as natural images, produce energy compaction in the transform domain coefficients, and these can be quantized to reduce the dynamic range of the transformed output according to a fidelity and/or bit rate criterion. For correlated spatial data, the resulting block of coefficients after quantization will be sparse. Quantization is not a reversible process; hence, once quantized, the original signal cannot be perfectly reconstructed and some degree of signal distortion is introduced. This is thus the basis of lossy compression.

    5.01.5.1 The discrete cosine transform (DCT)

    The discrete cosine transform was first introduced by Ahmed et al. in 1974 [12] and is the most widely used unitary transform for image and video coding applications. Like the discrete Fourier transform, the DCT provides information about a signal in the frequency domain. However, unlike the DFT, the DCT of a real-valued signal is itself real valued and importantly it also does not introduce artifacts due to periodic extension of the input data.

    With the DFT, a finite length data sequence is naturally extended by periodic extension. Discontinuities in the time (or spatial) domain therefore produce ringing or spectral leakage in the frequency domain. This can be avoided if the data sequence is symmetrically (rather than periodically) extended prior to application of the DFT. This produces an even sequence which has the added benefit of yielding real-valued coefficients. The DCT is not as useful as the DFT for frequency domain signal analysis due to its deficiencies when representing pure sinusoidal waveforms. However, in its primary role of signal compression, it performs exceptionally well.

    As we will see, the DCT has good energy compaction properties and its performance approaches that of the optimum transform for correlated image data. The 1-D DCT, in its most popular form, is given by:

    $$C(k) = \alpha(k) \sum_{n=0}^{N-1} x(n) \cos\left[\frac{(2n+1)k\pi}{2N}\right], \qquad \alpha(k) = \begin{cases} \sqrt{1/N}, & k = 0 \\ \sqrt{2/N}, & k \neq 0 \end{cases} \quad (1.1)$$

    where the $x(n)$, $n = 0, \ldots, N-1$, are the input data. Similarly, the 2-D DCT is given by:

    $$C(k,l) = \alpha(k)\,\alpha(l) \sum_{m=0}^{N-1} \sum_{n=0}^{N-1} x(m,n) \cos\left[\frac{(2m+1)k\pi}{2N}\right] \cos\left[\frac{(2n+1)l\pi}{2N}\right] \quad (1.2)$$

    The basis functions for the 2-D DCT with N = 8 are shown in Figure 1.3. Further details on the derivation and characteristics of the DCT can be found in [1].

    Figure 1.3 2-D DCT basis functions for N = 8.
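
    Because Eq. (1.2) is separable, the 2-D DCT can be computed by applying the 1-D transform of Eq. (1.1) to rows and then columns. A minimal numpy sketch follows (orthonormal form; the function name and the test block are illustrative):

    ```python
    import numpy as np

    def dct2(x):
        """2-D DCT of an N x N block via the separable form of Eq. (1.2)."""
        N = x.shape[0]
        n = np.arange(N)
        k = n.reshape(-1, 1)
        # Row k of C holds the 1-D basis of Eq. (1.1): alpha(k) cos((2n+1)k pi / 2N).
        C = np.sqrt(2.0 / N) * np.cos((2 * n + 1) * k * np.pi / (2 * N))
        C[0, :] = np.sqrt(1.0 / N)
        return C @ x @ C.T  # transform rows, then columns

    # Energy compaction: a smooth, correlated block concentrates energy at low frequencies.
    block = np.outer(np.linspace(100, 160, 8), np.linspace(1.0, 1.2, 8))
    print(np.round(dct2(block), 1))  # large DC term top-left, near-zero elsewhere
    ```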

    5.01.5.2 Coefficient quantization

    Quantization is an important step in lossy compression as it provides the basis for creating a sparse matrix of quantized coefficients that can be efficiently entropy coded for transmission. It is however an irreversible operation and must be carefully managed—one of the challenges is to perform quantization in such a way as to minimize its psychovisual impact. The quantizer comprises a set of decision levels and a set of reconstruction levels.

    Intra-frame transform coefficients are normally quantized using a uniform quantizer, with the coefficients pre-weighted to reflect the frequency dependent sensitivity of the human visual system. A general expression which captures this is given in Eq. (1.3), where $C(u,v)$ is a transform coefficient, $Q$ is the quantizer step-size, $k$ is a constant, and $W$ is a coefficient-dependent weighting matrix obtained from psychovisual experiments:

    $$\hat{C}(u,v) = \mathrm{round}\left(\frac{k\,C(u,v)}{Q\,W(u,v)}\right) \quad (1.3)$$

    After transmission or storage, we must rescale the quantized transform coefficients prior to inverse transformation, thus:

    $$\tilde{C}(u,v) = \frac{Q\,W(u,v)\,\hat{C}(u,v)}{k} \quad (1.4)$$

    An example of the effects of coefficient quantization on reconstruction quality for a range of block types is shown in Figure 1.4. It can be observed that more textured blocks require larger numbers of coefficients in order to create a good approximation to the original content. The best reconstruction can be achieved with fewer coefficients for the case of untextured blocks, as shown for the left-hand block in the figure.

    Figure 1.4 Effects of coefficient quantization on various types of data block.
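
    The following sketch implements Eqs. (1.3) and (1.4) directly. The choice k = 16 mirrors MPEG-2-style intra quantization and is an assumption for illustration, as are the function names:

    ```python
    import numpy as np

    def quantize(C, Q, W, k=16):
        """Forward quantization, Eq. (1.3): perceptually weighted uniform quantizer."""
        return np.round(k * C / (Q * W)).astype(int)

    def rescale(levels, Q, W, k=16):
        """Decoder-side rescaling, Eq. (1.4); the rounding loss is not recoverable."""
        return levels * Q * W / k

    # Round trip: rescale(quantize(C, Q, W), Q, W) approximates C; the residual
    # difference is exactly the quantization distortion discussed above.
    ```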

    5.01.6 Symbol encoding

    The sparsity of the quantized coefficient matrix can be exploited (typically by run-length coding) to produce a compact sequence of symbols. The symbol encoder assigns a codeword (a binary string) to each symbol. The code is designed to reduce coding redundancy and it normally uses variable length codewords. This operation is reversible.

    5.01.6.1 Dealing with sparse matrices

    After applying a forward transform and quantization, the resulting matrix contains a relatively small proportion of non-zero entries with most of its energy compacted toward the lower frequencies (i.e. the top left corner of the matrix). In such cases, run-length coding (RLC) can be used to efficiently represent long strings of identical values by grouping them into a single symbol which codes the value and the number of repetitions. This is a simple and effective method of reducing redundancies in a sequence.

    Consider, for example, the block of data and its transform coefficients shown in Figure 1.5. If we scan the coefficient matrix using a zig-zag pattern, as shown in the figure, the significant low frequency coefficients are grouped at the start of the scan and are followed by long runs of zeros; this is more efficient than scanning by rows or columns.

    Figure 1.5 Zig-zag scanning prior to variable length coding.
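
    A small sketch of the zig-zag scan and the run-length grouping described above; the (value, run) symbol format is one simple choice among several used in practice:

    ```python
    import numpy as np

    def zigzag(block):
        """Return the entries of an N x N block in zig-zag order (cf. Figure 1.5)."""
        N = block.shape[0]
        # Sort positions by anti-diagonal, alternating direction on each diagonal.
        order = sorted(((i, j) for i in range(N) for j in range(N)),
                       key=lambda p: (p[0] + p[1],
                                      p[0] if (p[0] + p[1]) % 2 else p[1]))
        return [block[i, j] for i, j in order]

    def run_length_encode(scan):
        """Group repeated values into (value, run) symbols; zero runs collapse."""
        symbols, i = [], 0
        while i < len(scan):
            j = i
            while j < len(scan) and scan[j] == scan[i]:
                j += 1
            symbols.append((int(scan[i]), j - i))
            i = j
        return symbols

    quantized = np.array([[52, 10, 0, 0],
                          [ 8,  0, 0, 0],
                          [ 0,  0, 0, 0],
                          [ 0,  0, 0, 0]])
    print(run_length_encode(zigzag(quantized)))  # [(52, 1), (10, 1), (8, 1), (0, 13)]
    ```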

    5.01.6.2 Entropy encoding

    Several methods exist that can exploit data statistics during symbol encoding. The most relevant of these in the context of image and video encoding are:

    • Huffman coding [13]: This is a method for coding individual symbols at a rate close to the first-order entropy, often used in conjunction with other techniques in a lossy codec.

    • Arithmetic coding [14–16]: This is a more sophisticated method which is capable of achieving fractional bit rates for symbols, thereby providing greater compression efficiency for more common symbols.

    Frequently, the above methods are used in combination. For example, DC DCT coefficients are often encoded using a combination of predictive coding (DPCM) and either Huffman or arithmetic coding. Furthermore, motion vectors are similarly encoded using a form of predictive coding to condition the data prior to entropy coding.
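
    To illustrate symbol coding at close to the first-order entropy, here is a minimal Huffman codebook constructor in the spirit of [13]. This is a sketch, not the code-table construction mandated by any particular standard:

    ```python
    import heapq
    from collections import Counter

    def huffman_code(symbols):
        """Build a Huffman codebook {symbol: bitstring} from observed frequencies."""
        freq = Counter(symbols)
        if len(freq) == 1:                   # degenerate case: one distinct symbol
            return {next(iter(freq)): "0"}
        # Heap entries: (subtree weight, tiebreak id, partial codebook).
        heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(freq.items())]
        heapq.heapify(heap)
        uid = len(heap)
        while len(heap) > 1:
            w1, _, c1 = heapq.heappop(heap)  # merge the two least probable subtrees
            w2, _, c2 = heapq.heappop(heap)
            merged = {s: "0" + b for s, b in c1.items()}
            merged.update({s: "1" + b for s, b in c2.items()})
            heapq.heappush(heap, (w1 + w2, uid, merged))
            uid += 1
        return heap[0][2]

    print(huffman_code("aaaabbc"))  # 'a' gets a 1-bit code; 'b' and 'c' get 2 bits
    ```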

    5.01.7 Motion estimation

    For still natural images, significant spatial redundancies exist, and we have seen that these can be exploited via decorrelating transforms. The simplest approach to encoding a sequence of moving images is thus to apply an intra-frame (still image) coding method to each frame. This can have some benefits, especially in terms of the error resilience properties of the codec, but it generally results in limited compression performance.

    For real-time video transmission over low bandwidth channels, there is often insufficient capacity to code each frame in a video sequence independently (25–30 fps is required to avoid flicker). The solution is thus to exploit the temporal correlation that exists between temporally adjacent frames in a video sequence. This inter-frame redundancy can be reduced through motion prediction, resulting in further improvements in coding efficiency.

    In motion compensated prediction, a motion model (usually block-based translation only) is assumed and motion estimation (ME) is used to estimate the motion that occurs between the reference frame and the current frame. Once the motion is estimated, a process known as motion compensation (MC) is invoked to use the motion information from ME to modify the contents of the reference frame, according to the motion model, in order to produce a prediction of the current frame. The prediction is called a motion-compensated prediction (MCP) or a displaced frame (DF). The prediction error is known as the displaced frame difference (DFD) signal. Figure 1.6 shows how the pdf of pixel values is modified for FD and DFD frames, compared to an original frame from the Football sequence.

    Figure 1.6 Probability distributions for an original frame and FD and DFD frames from the Football sequence.

    A thorough description of motion estimation methods and their performance is provided in Chapter 2 and in [1].
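
    As a concrete instance of the block-based approach described above, the sketch below performs exhaustive (full-search) block matching, minimizing the sum of absolute differences (SAD) over a square search window. The interface and window size are illustrative choices:

    ```python
    import numpy as np

    def full_search(cur_block, ref_frame, top, left, search_range=8):
        """Return the motion vector (dy, dx) minimizing SAD, plus the SAD itself."""
        n = cur_block.shape[0]
        cur = cur_block.astype(np.int64)
        best_sad, best_mv = np.inf, (0, 0)
        for dy in range(-search_range, search_range + 1):
            for dx in range(-search_range, search_range + 1):
                y, x = top + dy, left + dx
                if y < 0 or x < 0 or y + n > ref_frame.shape[0] or x + n > ref_frame.shape[1]:
                    continue  # candidate block would fall outside the reference frame
                cand = ref_frame[y:y + n, x:x + n].astype(np.int64)
                sad = np.abs(cur - cand).sum()
                if sad < best_sad:
                    best_sad, best_mv = sad, (dy, dx)
        return best_mv, best_sad
    ```

    Subtracting the best-matching reference block from the current block yields the DFD signal, whose narrowed distribution is illustrated in Figure 1.6.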

    5.01.8 The block-based motion-compensated video coding architecture

    5.01.8.1 Picture types and prediction modes

    5.01.8.1.1 Prediction modes

    Four main classes of prediction are used in video compression:

    • Intra-prediction: Blocks in the picture are predicted spatially from data adjacent to the current block being coded.

    • Forward prediction: The reference picture occurs temporally before the current picture.

    • Backward prediction: The reference picture occurs temporally after the current picture.

    • Bidirectional prediction: Two (or more) reference pictures (forward and backward) are employed and the candidate predictions are combined in some way to form the final prediction.

    5.01.8.1.2 Picture types

    Three major types of picture (or frame) are employed in most video codecs:

    • I-pictures: These are intra-coded (coded without reference to any other pictures).

    • P-pictures: These are inter-coded with forward (or backward) prediction from another I- or P-picture.

    • B-pictures: These are inter-coded with prediction from more than one I- and/or P-picture.

    Coded pictures are arranged in a sequence known as a Group of Pictures (GOP). A typical GOP structure comprising 12 frames is shown in Figure 1.7. A GOP will contain one I-picture and zero or more P- and B- pictures. The 12 frame GOP in Figure 1.7 is sometimes referred to as an IBBPBBPBBPBB structure and it is clear, for reasons of causality, that the encoding order is different to that shown in the figure since the P-pictures must be encoded prior to the preceding B-pictures.

    Figure 1.7 Typical group of pictures structure.
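
    The reordering implied by this causality constraint can be sketched as follows: each B-picture waits until the future reference it depends on has been coded. This is a simplified model; in practice the final B-pictures of a GOP reference the next GOP's I-picture:

    ```python
    def coding_order(display_order):
        """Reorder pictures for encoding: B-pictures follow their forward reference."""
        coded, held_b = [], []
        for pic in display_order:
            if pic == "B":
                held_b.append(pic)   # defer until the next reference picture is coded
            else:
                coded.append(pic)    # I- and P-pictures are coded first
                coded.extend(held_b)
                held_b.clear()
        return coded + held_b        # trailing Bs really wait for the next GOP's I

    print("".join(coding_order("IBBPBBPBBPBB")))  # -> IPBBPBBPBBBB
    ```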

    5.01.8.2 Operation of the video encoder

    5.01.8.2.1 Intra-mode encoding

    A generic structure of a video encoder is given in Figure 1.8. We first describe the operation of the encoder in intra-mode:

    1. The inter/intra switch is placed in the intra position.

    2. The input frame is transformed and the resulting coefficients are quantized.

    3. The transformed frame is then entropy coded and transmitted to the channel.

    4. Replicating the decoder operations, the quantized coefficients are also rescaled and inverse transformed locally to produce a reconstructed frame.

    5. The reconstructed frame is stored in the reference frame buffer for use in predicting subsequent frames.

    Figure 1.8 The block-based motion-compensated video encoder.

    5.01.8.2.2 Inter-mode encoding

    After the first intra-frame is encoded, the following frames in the GOP will be encoded in inter-mode, as described below:

    1. The inter/intra switch is placed in the inter position.

    2. Motion estimation is performed between the current frame and the stored reference frame, yielding a motion vector for each block.

    3. Motion compensation uses these vectors to form a prediction of the current frame from the reference frame.

    4. The prediction is subtracted from the current frame to produce the displaced frame difference (DFD) signal.

    5. The DFD is transformed and the resulting coefficients are quantized.

    6. The transformed DFD, motion vectors and control parameters are then entropy coded and transmitted to the channel.

    7. Replicating the decoder operations, the quantized DFD is rescaled, inverse transformed, and added to the motion-compensated prediction to produce a locally decoded frame.

    8. The locally decoded frame is stored in the reference frame buffer for use in predicting subsequent frames.

    5.01.8.3 Operation of the video decoder

    The structure of the video decoder is illustrated in Figure 1.9 and described below. By comparing the encoder and decoder architectures, it can be seen that the encoder contains a complete replica of the decoder in its prediction feedback loop. This ensures that (in the absence of channel errors) there is no drift between the encoder and decoder operations. Its operation is as follows, firstly in intra-mode:

    1. The inter/intra switch is placed in the intra position.

    2. Entropy decoding is then performed on the transmitted control parameters and quantized coefficients.

    3. The quantized coefficients are rescaled and inverse transformed to reconstruct the decoded frame.

    4. The decoded frame is output for display and stored in the reference frame buffer.

    Similarly in inter-mode:

    1. The inter/intra switch is placed in the inter position.

    2. Entropy decoding is first performed on the control parameters, quantized DFD coefficients, and motion vectors.

    3. The quantized DFD coefficients are rescaled and inverse transformed to recover the decoded DFD signal.

    4. The motion vectors are used to form a motion-compensated prediction from the stored reference frame, and this prediction is added to the decoded DFD to reconstruct the current frame.

    5. The reconstructed frame is output for display and stored in the reference frame buffer.

    Figure 1.9 The block-based motion-compensated video decoder.

    5.01.9 Standardization of video coding systems

    Standardization of image and video formats and compression methods has been instrumental in the success and universal adoption of video technology. An overview of coding standards is provided below and a more detailed description of the primary features of the most recent standard (HEVC) is provided in Chapter 3.

    Standards are essential for interoperability, enabling material from different sources to be processed and transmitted over a wide range of networks or stored on a wide range of devices. This interoperability opens up an enormous market for video equipment, which can exploit the advantages of volume manufacturing, while also providing the widest possible range of services for users. Video coding standards define the bitstream format and decoding process, not (for the most part) the encoding process. This is illustrated in Figure 1.10. A standard-compliant encoder is thus one that produces a compliant bitstream, and a standard-compliant decoder is one that can decode a standard-compliant bitstream. The real challenge lies in bitstream generation, i.e. the encoding, and this is where manufacturers can differentiate their products in terms of coding efficiency, complexity, or other attributes. Finally, it is important to note that the fact that an encoder is standard-compliant provides no guarantee of absolute video quality.

    Figure 1.10 The scope of standardization.

    5.01.9.1 A brief history of video encoding standards

    A chronology of video coding standards is represented in Figure 1.11. This shows how the International Standards Organization (ISO) and the International Telecommunications Union (ITU-T) have worked both independently and in collaboration on various standards. In recent years, most ventures have benefited from close collaborative working.

    Figure 1.11 A chronology of video coding standards from 1990 to the present date.

    Study Group SG.XV of the CCITT (now ITU-T) produced the first international video coding standard, H.120, in 1984. H.120 addressed videoconferencing applications at 2.048 Mbps and 1.544 Mbps for 625/50 and 525/60 TV systems respectively, but the standard was never a commercial success. It was followed in 1990 by H.261, targeted at ISDN conferencing applications. This was the first block-based hybrid compression algorithm, using a combination of transformation (the Discrete Cosine Transform (DCT)), temporal Differential Pulse Code Modulation (DPCM), and motion compensation. This architecture has stood the test of time, as all major video coding standards since have been based on it.

    In 1988 the Moving Picture Experts Group (MPEG) was founded, delivering in 1992 a video coding algorithm (MPEG-1) targeted at digital storage media at 1.5 Mbps. This was followed in 1994 by MPEG-2 [18], specifically targeted at the emerging digital video broadcasting market. MPEG-2 was instrumental, through its inclusion in all set-top boxes for more than a decade, in truly underpinning the digital broadcasting revolution. A little later in the 1990s, ITU-T produced the H.263 standard [19]. This addressed the emerging mobile telephony, internet, and conferencing markets of the time. Although mobile applications were slower than expected to take off, H.263 had a significant impact on conferencing, surveillance, and applications based on the then-new Internet Protocol.

    MPEG-4 [20] was a hugely ambitious project that sought to introduce new approaches based on object-based as well as, or instead of, waveform-based methods. It was found to be too complex and only its Advanced Simple Profile (ASP) was used in practice. This formed the basis for the emerging digital camera technology of the time.

    Around the same time, ITU-T started its work on H.264, delivering the standard in partnership with ISO/IEC in 2004 [8]. In the same way that MPEG-2 transformed the digital broadcasting landscape, so H.264/AVC has transformed the mobile communications and internet video domains; it is by far the most ubiquitous video coding standard to date. Most recently, in 2013, the joint activities of ISO and ITU-T delivered the HEVC standard [10,21], offering bit rate reductions of up to 50% compared with H.264/AVC.

    Further extensions to video compression standards have been developed that enable the efficient processing of new formats, in particular more immersive formats such as stereoscopic 3-D. The compression challenges and architectures associated with stereoscopic and multiview coding are described in detail in Chapter 4.

    5.01.9.2 The performance of current standards

    An excellent comparison of coding standards is provided by Ohm et al. in [21], where a comprehensive comparison between HEVC and previous standards is reported. We reproduce their results here in Table 1.3, noting that even greater improvements were reported for interactive applications. As a rule of thumb, video coding standards have, since the introduction of H.120 in 1984, delivered a halving of bit rate for equivalent video quality every 10 years. This is evidenced by the results in Table 1.3.

    Table 1.3

    Comparison of Video Coding Standards for Entertainment Applications. Average Bit-Rate Savings are Shown for Equal Objective Quality Values Measured Using PSNR

    5.01.10 Conclusions

    This chapter has introduced the requirements for, and applications of video compression. It has examined the architecture of a compression system in the context of modern communication systems and has shown that its design is often a compromise between coding rate, picture quality, and implementation complexity. The basic operation of a block-based motion compensated video encoding system has been described, explaining its major components, and its operating characteristics. Finally the justification for universal compression standards has been presented, and the performance of recent standards compared in terms of their rate-distortion characteristics.

    The remainder of this section will cover many of these topics in more detail and place them in the context of current and future standards, formats, and transmission requirements. Chapter 2 expands on the important area of motion estimation, demonstrating efficient means of achieving temporal decorrelation in the coding process. Chapter 3 provides the reader with an overview of the new HEVC standard, and Chapter 4 considers the extensions to compression mechanisms that are required to deal with multiview and stereoscopic content. Chapters 5 and 6 provide an insight into future coding possibilities based on an increased awareness of perceptual properties and limitations. Chapter 7 covers the important area of how we can assess video quality, both for comparing the performance of different codecs and for optimizing coding decisions within the compression process. Finally, Chapters 8 and 9 address the topics of content delivery, network adaptation, and error concealment.

    Additional resources

    1. http://www.poynton.com/Poynton-video-eng.html. Lots of information on color conversion, formats, and video preprocessing.

    2. http://mpeg.chiariglione.org/. This is the home page of the Moving Picture Experts Group (MPEG), a working group of ISO/IEC with the mission to develop standards for coded representation of digital audio and video and related data. Lots of information on standards with tutorial content.

    Glossary of terms

    1080p30 a means of representing the sampling structure of the video format. In this case representing an HD format of 1080 lines with progressive temporal sampling at 30 fps

    Discrete cosine transform a popular decorrelating transform used in image and video coding. Achieves close to optimum performance with a fixed set of basis functions

    Entropy coding the process used to efficiently encode symbols formed from scans of quantized transform coefficients (or motion vectors). A reversible process, usually performed using Huffman or arithmetic coding

    Error resilience the process of making an encoded bitstream more robust to the effects of loss during transmission

    Lossless coding a reversible coding process where the original signal can be exactly reconstructed after compression

    Lossy coding an irreversible coding process where the original signal cannot be exactly reconstructed after compression. Distortions are introduced during coding

    Macroblock the basic coding unit used in many standards. Typically comprising (for 4:2:0) a luma block of 16 by 16 samples and two chroma blocks each of 8 by 8 samples

    Motion estimation the process of temporally predicting the current picture based on the content of other pictures in the sequence

    Quantization the process of approximating the output from the decorrelating transform. A primary mechanism for achieving rate-distortion trade-offs during coding

    Streaming compressed video delivery from a server. Media is constantly received by and presented to an end-user while being delivered

    Zettabyte 10²¹ bytes (one thousand exabytes, or one billion terabytes)

    X:Y:Z a means of describing the color subsampling method used in a format or by a codec, e.g. 4:2:0 means that the chroma (color difference) signals are subsampled by a factor of two horizontally and vertically prior to coding or transmission

    References

    1. Bull D. Communicating Pictures. Academic Press 2014.

    2. .

    3. Cisco Visual Networking Index: Forecast and Methodology, 2011–2016 (update 2012–2017).

    4. Network Camera and Video Analytics Market – Global Forecast, Trend & Analysis – Segmentation by Technology, Function, Resolution, Product & Service Type, System Architecture, Verticals, Application and Geography (2012–2017) Report by marketsandmarkets.com, Report Code: SE 1238, 2012.

    5. Ortega A, Ramchandran K. Rate-distortion methods for image and video compression. IEEE Signal Process Mag. 1998;15(6):23–50.

    6. Sullivan G, Wiegand T. Rate-distortion optimization for video compression. IEEE Signal Process Mag. 1998;15(6):74–90.

    7. Recommendation ITU-R BT.2020 (08/2012), Parameter Values for Ultra-High Definition Television Systems for Production and International Programme Exchange, ITU-R, 2012.

    8. ITU-T and ISO/IEC JTC 1, Advanced Video Coding for Generic Audiovisual Services, ITU-T Rec. H.264 and ISO/IEC 14496-10 (AVC), Version 1, 2003; Version 2, 2004; Versions 3, 4, 2005; Versions 5, 6, 2006; Versions 7, 8, 2007; Versions 9, 10, 11, 2009; Versions 12,
