Loading…

Rate-distortion optimal bit allocation for object-based video coding

In object-based video encoding, the encoding of the video data is decoupled into the encoding of shape, motion, and texture information, which enables certain functionalities, like content-based interactivity and content-based scalability. The fundamental problem, however, of how to jointly encode t...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on circuits and systems for video technology 2005-09, Vol.15 (9), p.1113-1123
Main Authors: Haohong Wang, Schuster, G.M., Katsaggelos, A.K.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c354t-f76f0cfd4265a8f6a94939564c5d2420b86bb70e0011a2b458b31e9cdb2a17c23
cites cdi_FETCH-LOGICAL-c354t-f76f0cfd4265a8f6a94939564c5d2420b86bb70e0011a2b458b31e9cdb2a17c23
container_end_page 1123
container_issue 9
container_start_page 1113
container_title IEEE transactions on circuits and systems for video technology
container_volume 15
creator Haohong Wang
Schuster, G.M.
Katsaggelos, A.K.
description In object-based video encoding, the encoding of the video data is decoupled into the encoding of shape, motion, and texture information, which enables certain functionalities, like content-based interactivity and content-based scalability. The fundamental problem, however, of how to jointly encode this separate information to reach the best coding efficiency has not been studied thoroughly. In this paper, we present an operational rate-distortion optimal scheme for the allocation of bits among shape, motion, and texture in object-based video encoding. Our approach is based on Lagrangian relaxation and dynamic programming. We implement our algorithm on the MPEG-4 video verification model, although it is applicable to any object-based video encoding scheme. The performance is accessed utilizing a proposed metric that jointly captures the distortion due to the encoding of the shape and texture. Experimental results demonstrate that the gains of lossy shape encoding depend on the percentage the shape bits occupy out of the total bit budget. This gain may be small or may be realized at very low bit rates for certain typical scenes.
doi_str_mv 10.1109/TCSVT.2005.852629
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TCSVT_2005_852629</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1501879</ieee_id><sourcerecordid>2543579501</sourcerecordid><originalsourceid>FETCH-LOGICAL-c354t-f76f0cfd4265a8f6a94939564c5d2420b86bb70e0011a2b458b31e9cdb2a17c23</originalsourceid><addsrcrecordid>eNpdkFtLAzEQhYMoWKs_QHxZBPFpa5LN9VHqFQqCVl9Dks1KynZTk1Tw37vbLQg-zTDzzeHMAeAcwRlCUN4s528fyxmGkM4ExQzLAzBBlIoSY0gP-x5SVAqM6DE4SWkFISKC8Am4e9XZlbVPOcTsQ1eETfZr3RbG50K3bbB6N25CLIJZOZtLo5Ori29fu1DYUPvu8xQcNbpN7mxfp-D94X45fyoXL4_P89tFaStKctlw1kDb1AQzqkXDtCSykpQRS2tMMDSCGcOh680hjQ2hwlTISVsbrBG3uJqC61F3E8PX1qWs1j5Z17a6c2GblJAMcVYh3pOX_8hV2MauN6ckwpDDgZsCNEI2hpSia9Qm9r_HH4WgGlJVu1TVkKoaU-1vrvbCOlndNlF31qe_Qw4FY2TQvhg575z7W1OIBJfVL6stf28</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>912070176</pqid></control><display><type>article</type><title>Rate-distortion optimal bit allocation for object-based video coding</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Haohong Wang ; Schuster, G.M. ; Katsaggelos, A.K.</creator><creatorcontrib>Haohong Wang ; Schuster, G.M. ; Katsaggelos, A.K.</creatorcontrib><description>In object-based video encoding, the encoding of the video data is decoupled into the encoding of shape, motion, and texture information, which enables certain functionalities, like content-based interactivity and content-based scalability. The fundamental problem, however, of how to jointly encode this separate information to reach the best coding efficiency has not been studied thoroughly. In this paper, we present an operational rate-distortion optimal scheme for the allocation of bits among shape, motion, and texture in object-based video encoding. Our approach is based on Lagrangian relaxation and dynamic programming. We implement our algorithm on the MPEG-4 video verification model, although it is applicable to any object-based video encoding scheme. The performance is accessed utilizing a proposed metric that jointly captures the distortion due to the encoding of the shape and texture. Experimental results demonstrate that the gains of lossy shape encoding depend on the percentage the shape bits occupy out of the total bit budget. This gain may be small or may be realized at very low bit rates for certain typical scenes.</description><identifier>ISSN: 1051-8215</identifier><identifier>EISSN: 1558-2205</identifier><identifier>DOI: 10.1109/TCSVT.2005.852629</identifier><identifier>CODEN: ITCTEM</identifier><language>eng</language><publisher>New York, NY: IEEE</publisher><subject>Allocations ; Applied sciences ; Bit rate ; Coding ; Distortion ; Dynamic programming ; Encoding ; Exact sciences and technology ; Gain ; Image processing ; Information, signal and communications theory ; Lagrangian functions ; Layout ; MPEG 4 Standard ; MPEG-4 ; object-based video ; Optimization ; Rate-distortion ; Scalability ; Shape ; shape coding ; Signal processing ; Studies ; Surface layer ; Systems, networks and services of telecommunications ; Telecommunications ; Telecommunications and information theory ; Texture ; Transmission and modulation (techniques and equipments) ; Video coding</subject><ispartof>IEEE transactions on circuits and systems for video technology, 2005-09, Vol.15 (9), p.1113-1123</ispartof><rights>2005 INIST-CNRS</rights><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2005</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c354t-f76f0cfd4265a8f6a94939564c5d2420b86bb70e0011a2b458b31e9cdb2a17c23</citedby><cites>FETCH-LOGICAL-c354t-f76f0cfd4265a8f6a94939564c5d2420b86bb70e0011a2b458b31e9cdb2a17c23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1501879$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,54796</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=17086646$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Haohong Wang</creatorcontrib><creatorcontrib>Schuster, G.M.</creatorcontrib><creatorcontrib>Katsaggelos, A.K.</creatorcontrib><title>Rate-distortion optimal bit allocation for object-based video coding</title><title>IEEE transactions on circuits and systems for video technology</title><addtitle>TCSVT</addtitle><description>In object-based video encoding, the encoding of the video data is decoupled into the encoding of shape, motion, and texture information, which enables certain functionalities, like content-based interactivity and content-based scalability. The fundamental problem, however, of how to jointly encode this separate information to reach the best coding efficiency has not been studied thoroughly. In this paper, we present an operational rate-distortion optimal scheme for the allocation of bits among shape, motion, and texture in object-based video encoding. Our approach is based on Lagrangian relaxation and dynamic programming. We implement our algorithm on the MPEG-4 video verification model, although it is applicable to any object-based video encoding scheme. The performance is accessed utilizing a proposed metric that jointly captures the distortion due to the encoding of the shape and texture. Experimental results demonstrate that the gains of lossy shape encoding depend on the percentage the shape bits occupy out of the total bit budget. This gain may be small or may be realized at very low bit rates for certain typical scenes.</description><subject>Allocations</subject><subject>Applied sciences</subject><subject>Bit rate</subject><subject>Coding</subject><subject>Distortion</subject><subject>Dynamic programming</subject><subject>Encoding</subject><subject>Exact sciences and technology</subject><subject>Gain</subject><subject>Image processing</subject><subject>Information, signal and communications theory</subject><subject>Lagrangian functions</subject><subject>Layout</subject><subject>MPEG 4 Standard</subject><subject>MPEG-4</subject><subject>object-based video</subject><subject>Optimization</subject><subject>Rate-distortion</subject><subject>Scalability</subject><subject>Shape</subject><subject>shape coding</subject><subject>Signal processing</subject><subject>Studies</subject><subject>Surface layer</subject><subject>Systems, networks and services of telecommunications</subject><subject>Telecommunications</subject><subject>Telecommunications and information theory</subject><subject>Texture</subject><subject>Transmission and modulation (techniques and equipments)</subject><subject>Video coding</subject><issn>1051-8215</issn><issn>1558-2205</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><recordid>eNpdkFtLAzEQhYMoWKs_QHxZBPFpa5LN9VHqFQqCVl9Dks1KynZTk1Tw37vbLQg-zTDzzeHMAeAcwRlCUN4s528fyxmGkM4ExQzLAzBBlIoSY0gP-x5SVAqM6DE4SWkFISKC8Am4e9XZlbVPOcTsQ1eETfZr3RbG50K3bbB6N25CLIJZOZtLo5Ori29fu1DYUPvu8xQcNbpN7mxfp-D94X45fyoXL4_P89tFaStKctlw1kDb1AQzqkXDtCSykpQRS2tMMDSCGcOh680hjQ2hwlTISVsbrBG3uJqC61F3E8PX1qWs1j5Z17a6c2GblJAMcVYh3pOX_8hV2MauN6ckwpDDgZsCNEI2hpSia9Qm9r_HH4WgGlJVu1TVkKoaU-1vrvbCOlndNlF31qe_Qw4FY2TQvhg575z7W1OIBJfVL6stf28</recordid><startdate>20050901</startdate><enddate>20050901</enddate><creator>Haohong Wang</creator><creator>Schuster, G.M.</creator><creator>Katsaggelos, A.K.</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>F28</scope><scope>FR3</scope></search><sort><creationdate>20050901</creationdate><title>Rate-distortion optimal bit allocation for object-based video coding</title><author>Haohong Wang ; Schuster, G.M. ; Katsaggelos, A.K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c354t-f76f0cfd4265a8f6a94939564c5d2420b86bb70e0011a2b458b31e9cdb2a17c23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Allocations</topic><topic>Applied sciences</topic><topic>Bit rate</topic><topic>Coding</topic><topic>Distortion</topic><topic>Dynamic programming</topic><topic>Encoding</topic><topic>Exact sciences and technology</topic><topic>Gain</topic><topic>Image processing</topic><topic>Information, signal and communications theory</topic><topic>Lagrangian functions</topic><topic>Layout</topic><topic>MPEG 4 Standard</topic><topic>MPEG-4</topic><topic>object-based video</topic><topic>Optimization</topic><topic>Rate-distortion</topic><topic>Scalability</topic><topic>Shape</topic><topic>shape coding</topic><topic>Signal processing</topic><topic>Studies</topic><topic>Surface layer</topic><topic>Systems, networks and services of telecommunications</topic><topic>Telecommunications</topic><topic>Telecommunications and information theory</topic><topic>Texture</topic><topic>Transmission and modulation (techniques and equipments)</topic><topic>Video coding</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Haohong Wang</creatorcontrib><creatorcontrib>Schuster, G.M.</creatorcontrib><creatorcontrib>Katsaggelos, A.K.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><jtitle>IEEE transactions on circuits and systems for video technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Haohong Wang</au><au>Schuster, G.M.</au><au>Katsaggelos, A.K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Rate-distortion optimal bit allocation for object-based video coding</atitle><jtitle>IEEE transactions on circuits and systems for video technology</jtitle><stitle>TCSVT</stitle><date>2005-09-01</date><risdate>2005</risdate><volume>15</volume><issue>9</issue><spage>1113</spage><epage>1123</epage><pages>1113-1123</pages><issn>1051-8215</issn><eissn>1558-2205</eissn><coden>ITCTEM</coden><abstract>In object-based video encoding, the encoding of the video data is decoupled into the encoding of shape, motion, and texture information, which enables certain functionalities, like content-based interactivity and content-based scalability. The fundamental problem, however, of how to jointly encode this separate information to reach the best coding efficiency has not been studied thoroughly. In this paper, we present an operational rate-distortion optimal scheme for the allocation of bits among shape, motion, and texture in object-based video encoding. Our approach is based on Lagrangian relaxation and dynamic programming. We implement our algorithm on the MPEG-4 video verification model, although it is applicable to any object-based video encoding scheme. The performance is accessed utilizing a proposed metric that jointly captures the distortion due to the encoding of the shape and texture. Experimental results demonstrate that the gains of lossy shape encoding depend on the percentage the shape bits occupy out of the total bit budget. This gain may be small or may be realized at very low bit rates for certain typical scenes.</abstract><cop>New York, NY</cop><pub>IEEE</pub><doi>10.1109/TCSVT.2005.852629</doi><tpages>11</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1051-8215
ispartof IEEE transactions on circuits and systems for video technology, 2005-09, Vol.15 (9), p.1113-1123
issn 1051-8215
1558-2205
language eng
recordid cdi_crossref_primary_10_1109_TCSVT_2005_852629
source IEEE Electronic Library (IEL) Journals
subjects Allocations
Applied sciences
Bit rate
Coding
Distortion
Dynamic programming
Encoding
Exact sciences and technology
Gain
Image processing
Information, signal and communications theory
Lagrangian functions
Layout
MPEG 4 Standard
MPEG-4
object-based video
Optimization
Rate-distortion
Scalability
Shape
shape coding
Signal processing
Studies
Surface layer
Systems, networks and services of telecommunications
Telecommunications
Telecommunications and information theory
Texture
Transmission and modulation (techniques and equipments)
Video coding
title Rate-distortion optimal bit allocation for object-based video coding
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T07%3A42%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Rate-distortion%20optimal%20bit%20allocation%20for%20object-based%20video%20coding&rft.jtitle=IEEE%20transactions%20on%20circuits%20and%20systems%20for%20video%20technology&rft.au=Haohong%20Wang&rft.date=2005-09-01&rft.volume=15&rft.issue=9&rft.spage=1113&rft.epage=1123&rft.pages=1113-1123&rft.issn=1051-8215&rft.eissn=1558-2205&rft.coden=ITCTEM&rft_id=info:doi/10.1109/TCSVT.2005.852629&rft_dat=%3Cproquest_cross%3E2543579501%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c354t-f76f0cfd4265a8f6a94939564c5d2420b86bb70e0011a2b458b31e9cdb2a17c23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=912070176&rft_id=info:pmid/&rft_ieee_id=1501879&rfr_iscdi=true