Parallel Deblocking Filtering in MPEG-4 AVC/H.264 on Massively Parallel Architectures

Bibliographic Details
Published in: IEEE transactions on circuits and systems for video technology, 2011-01, Vol. 21 (1), p. 96-100
Main Authors: Pieters, Bart, Hollemeersch, Charles-Frederik J, De Cock, Jan, Lambert, Peter, De Neve, Wesley, Van de Walle, Rik
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c400t-5eac732332f59f9287b79bc3185c93f7f8f05ed120eabfbbb1c7271bb23d1ba23
cites cdi_FETCH-LOGICAL-c400t-5eac732332f59f9287b79bc3185c93f7f8f05ed120eabfbbb1c7271bb23d1ba23
container_end_page 100
container_issue 1
container_start_page 96
container_title IEEE transactions on circuits and systems for video technology
container_volume 21
creator Pieters, Bart
Hollemeersch, Charles-Frederik J
De Cock, Jan
Lambert, Peter
De Neve, Wesley
Van de Walle, Rik
description The deblocking filter in the MPEG-4 AVC/H.264 standard is computationally complex because of its high content adaptivity, resulting in a significant number of data dependencies. These data dependencies interfere with parallel filtering of multiple macroblocks (MBs) on massively parallel architectures. In this letter, we introduce a novel MB partitioning scheme for concurrent deblocking in the MPEG-4 AVC/H.264 standard, based on our idea of deblocking filter independency, a corrected version of the limited error propagation effect proposed in the literature. Our proposed scheme enables concurrent MB deblocking of luma samples with limited synchronization effort, independently of slice configuration, and is compliant with the MPEG-4 AVC/H.264 standard. We implemented the method on the massively parallel architecture of the graphics processing unit (GPU). Experimental results show that our GPU implementation achieves faster-than-real-time deblocking at 1309 frames per second for 1080p video pictures. Both software-based deblocking filters and state-of-the-art GPU-enabled algorithms are outperformed in terms of speed by factors up to 10.2 and 19.5, respectively, for 1080p video pictures.
doi_str_mv 10.1109/TCSVT.2011.2105553
format article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_proquest_miscellaneous_864420148</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5686922</ieee_id><sourcerecordid>864420148</sourcerecordid><originalsourceid>FETCH-LOGICAL-c400t-5eac732332f59f9287b79bc3185c93f7f8f05ed120eabfbbb1c7271bb23d1ba23</originalsourceid><addsrcrecordid>eNpdkEFrGzEQhZeSQpO0f6C9LIXQ0zqa0corHY3jxIGUBurkKiR5lCiVdx1pXci_725tfMhpHjPfG2ZeUXwFNgFg6nI1__24miADmCAwIQT_UJyCELJCZOJk0ExAJRHEp-Is5xfGoJZ1c1o83JtkYqRYXpGNnfsT2qfyOsSe0qhCW_68X9xUdTl7nF8uJzity27omZzDX4pv5dE-S-459OT6XaL8ufjoTcz05VDPi4frxWq-rO5-3dzOZ3eVqxnrK0HGNRw5Ry-UVygb2yjrOEjhFPeNl54JWgMyMtZba8E12IC1yNdgDfLz4sd-7zZ1rzvKvd6E7ChG01K3y1pO6xrHVwfy-zvypduldjhOS8GbGlGOEO4hl7qcE3m9TWFj0psGpsec9f-c9ZizPuQ8mC4Om012JvpkWhfy0YlcgQKhBu7bngtEdByLqZwqRP4Py-uEUQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>853742288</pqid></control><display><type>article</type><title>Parallel Deblocking Filtering in MPEG-4 AVC/H.264 on Massively Parallel Architectures</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Pieters, Bart ; Hollemeersch, Charles-Frederik J ; De Cock, Jan ; Lambert, Peter ; De Neve, Wesley ; Van de Walle, Rik</creator><creatorcontrib>Pieters, Bart ; Hollemeersch, Charles-Frederik J ; De Cock, Jan ; Lambert, Peter ; De Neve, Wesley ; Van de Walle, Rik</creatorcontrib><description>The deblocking filter in the MPEG-4 AVC/H.264 standard is computationally complex because of its high content adaptivity, resulting in a significant number of data dependencies. These data dependencies interfere with parallel filtering of multiple macroblocks (MBs) on massively parallel architectures. 
In this letter, we introduce a novel MB partitioning scheme for concurrent deblocking in the MPEG-4 AVC/H.264 standard, based on our idea of deblocking filter independency, a corrected version of the limited error propagation effect proposed in the letter. Our proposed scheme enables concurrent MB deblocking of luma samples with limited synchronization effort, independently of slice configuration, and is compliant with the MPEG-4 H.264/AVC standard. We implemented the method on the massively parallel architecture of the graphics processing unit (GPU). Experimental results show that our GPU implementation achieves faster-than real-time deblocking at 1309 frames per second for 1080p video pictures. Both software-based deblocking filters and state-of-the-art GPU-enabled algorithms are outperformed in terms of speed by factors up to 10.2 and 19.5, respectively, for 1080p video pictures.</description><identifier>ISSN: 1051-8215</identifier><identifier>EISSN: 1558-2205</identifier><identifier>DOI: 10.1109/TCSVT.2011.2105553</identifier><identifier>CODEN: ITCTEM</identifier><language>eng</language><publisher>New York, NY: IEEE</publisher><subject>Algorithms ; Applied sciences ; Architecture ; Automatic voltage control ; Circuits ; Computer architecture ; Deblocking ; Detection, estimation, filtering, equalization, prediction ; Electronics ; Exact sciences and technology ; Filtering ; Filtration ; Frames per second ; GPU ; Graphics processing unit ; Image processing ; in-loop filtering ; Information, signal and communications theory ; Integrated circuits ; Integrated circuits by function (including memories and processors) ; massively parallel ; MPEG-4 AVC/H.264 ; Parallel processing ; Pictures ; Semiconductor electronics. Microelectronics. Optoelectronics. 
Solid state devices ; Signal and communications theory ; Signal processing ; Signal, noise ; Streaming media ; Studies ; Synchronization ; Telecommunications and information theory ; Transform coding</subject><ispartof>IEEE transactions on circuits and systems for video technology, 2011-01, Vol.21 (1), p.96-100</ispartof><rights>2015 INIST-CNRS</rights><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Jan 2011</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c400t-5eac732332f59f9287b79bc3185c93f7f8f05ed120eabfbbb1c7271bb23d1ba23</citedby><cites>FETCH-LOGICAL-c400t-5eac732332f59f9287b79bc3185c93f7f8f05ed120eabfbbb1c7271bb23d1ba23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5686922$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,4024,27923,27924,27925,54796</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=23919159$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Pieters, Bart</creatorcontrib><creatorcontrib>Hollemeersch, Charles-Frederik J</creatorcontrib><creatorcontrib>De Cock, Jan</creatorcontrib><creatorcontrib>Lambert, Peter</creatorcontrib><creatorcontrib>De Neve, Wesley</creatorcontrib><creatorcontrib>Van de Walle, Rik</creatorcontrib><title>Parallel Deblocking Filtering in MPEG-4 AVC/H.264 on Massively Parallel Architectures</title><title>IEEE transactions on circuits and systems for video technology</title><addtitle>TCSVT</addtitle><description>The deblocking filter in the MPEG-4 AVC/H.264 standard is computationally complex because of its high content adaptivity, resulting in a significant number of data dependencies. 
These data dependencies interfere with parallel filtering of multiple macroblocks (MBs) on massively parallel architectures. In this letter, we introduce a novel MB partitioning scheme for concurrent deblocking in the MPEG-4 AVC/H.264 standard, based on our idea of deblocking filter independency, a corrected version of the limited error propagation effect proposed in the letter. Our proposed scheme enables concurrent MB deblocking of luma samples with limited synchronization effort, independently of slice configuration, and is compliant with the MPEG-4 H.264/AVC standard. We implemented the method on the massively parallel architecture of the graphics processing unit (GPU). Experimental results show that our GPU implementation achieves faster-than real-time deblocking at 1309 frames per second for 1080p video pictures. Both software-based deblocking filters and state-of-the-art GPU-enabled algorithms are outperformed in terms of speed by factors up to 10.2 and 19.5, respectively, for 1080p video pictures.</description><subject>Algorithms</subject><subject>Applied sciences</subject><subject>Architecture</subject><subject>Automatic voltage control</subject><subject>Circuits</subject><subject>Computer architecture</subject><subject>Deblocking</subject><subject>Detection, estimation, filtering, equalization, prediction</subject><subject>Electronics</subject><subject>Exact sciences and technology</subject><subject>Filtering</subject><subject>Filtration</subject><subject>Frames per second</subject><subject>GPU</subject><subject>Graphics processing unit</subject><subject>Image processing</subject><subject>in-loop filtering</subject><subject>Information, signal and communications theory</subject><subject>Integrated circuits</subject><subject>Integrated circuits by function (including memories and processors)</subject><subject>massively parallel</subject><subject>MPEG-4 AVC/H.264</subject><subject>Parallel 
processing</subject><subject>Pictures</subject><subject>Semiconductor electronics. Microelectronics. Optoelectronics. Solid state devices</subject><subject>Signal and communications theory</subject><subject>Signal processing</subject><subject>Signal, noise</subject><subject>Streaming media</subject><subject>Studies</subject><subject>Synchronization</subject><subject>Telecommunications and information theory</subject><subject>Transform coding</subject><issn>1051-8215</issn><issn>1558-2205</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><recordid>eNpdkEFrGzEQhZeSQpO0f6C9LIXQ0zqa0corHY3jxIGUBurkKiR5lCiVdx1pXci_725tfMhpHjPfG2ZeUXwFNgFg6nI1__24miADmCAwIQT_UJyCELJCZOJk0ExAJRHEp-Is5xfGoJZ1c1o83JtkYqRYXpGNnfsT2qfyOsSe0qhCW_68X9xUdTl7nF8uJzity27omZzDX4pv5dE-S-459OT6XaL8ufjoTcz05VDPi4frxWq-rO5-3dzOZ3eVqxnrK0HGNRw5Ry-UVygb2yjrOEjhFPeNl54JWgMyMtZba8E12IC1yNdgDfLz4sd-7zZ1rzvKvd6E7ChG01K3y1pO6xrHVwfy-zvypduldjhOS8GbGlGOEO4hl7qcE3m9TWFj0psGpsec9f-c9ZizPuQ8mC4Om012JvpkWhfy0YlcgQKhBu7bngtEdByLqZwqRP4Py-uEUQ</recordid><startdate>201101</startdate><enddate>201101</enddate><creator>Pieters, Bart</creator><creator>Hollemeersch, Charles-Frederik J</creator><creator>De Cock, Jan</creator><creator>Lambert, Peter</creator><creator>De Neve, Wesley</creator><creator>Van de Walle, Rik</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><general>The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>F28</scope><scope>FR3</scope></search><sort><creationdate>201101</creationdate><title>Parallel Deblocking Filtering in MPEG-4 AVC/H.264 on Massively Parallel Architectures</title><author>Pieters, Bart ; Hollemeersch, Charles-Frederik J ; De Cock, Jan ; Lambert, Peter ; De Neve, Wesley ; Van de Walle, Rik</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c400t-5eac732332f59f9287b79bc3185c93f7f8f05ed120eabfbbb1c7271bb23d1ba23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Algorithms</topic><topic>Applied sciences</topic><topic>Architecture</topic><topic>Automatic voltage control</topic><topic>Circuits</topic><topic>Computer architecture</topic><topic>Deblocking</topic><topic>Detection, estimation, filtering, equalization, prediction</topic><topic>Electronics</topic><topic>Exact sciences and technology</topic><topic>Filtering</topic><topic>Filtration</topic><topic>Frames per second</topic><topic>GPU</topic><topic>Graphics processing unit</topic><topic>Image processing</topic><topic>in-loop filtering</topic><topic>Information, signal and communications theory</topic><topic>Integrated circuits</topic><topic>Integrated circuits by function (including memories and processors)</topic><topic>massively parallel</topic><topic>MPEG-4 AVC/H.264</topic><topic>Parallel processing</topic><topic>Pictures</topic><topic>Semiconductor electronics. Microelectronics. Optoelectronics. 
Solid state devices</topic><topic>Signal and communications theory</topic><topic>Signal processing</topic><topic>Signal, noise</topic><topic>Streaming media</topic><topic>Studies</topic><topic>Synchronization</topic><topic>Telecommunications and information theory</topic><topic>Transform coding</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pieters, Bart</creatorcontrib><creatorcontrib>Hollemeersch, Charles-Frederik J</creatorcontrib><creatorcontrib>De Cock, Jan</creatorcontrib><creatorcontrib>Lambert, Peter</creatorcontrib><creatorcontrib>De Neve, Wesley</creatorcontrib><creatorcontrib>Van de Walle, Rik</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998–Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><jtitle>IEEE transactions on circuits and systems for video technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pieters, Bart</au><au>Hollemeersch, Charles-Frederik J</au><au>De Cock, Jan</au><au>Lambert, Peter</au><au>De Neve, Wesley</au><au>Van de Walle, 
Rik</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Parallel Deblocking Filtering in MPEG-4 AVC/H.264 on Massively Parallel Architectures</atitle><jtitle>IEEE transactions on circuits and systems for video technology</jtitle><stitle>TCSVT</stitle><date>2011-01</date><risdate>2011</risdate><volume>21</volume><issue>1</issue><spage>96</spage><epage>100</epage><pages>96-100</pages><issn>1051-8215</issn><eissn>1558-2205</eissn><coden>ITCTEM</coden><abstract>The deblocking filter in the MPEG-4 AVC/H.264 standard is computationally complex because of its high content adaptivity, resulting in a significant number of data dependencies. These data dependencies interfere with parallel filtering of multiple macroblocks (MBs) on massively parallel architectures. In this letter, we introduce a novel MB partitioning scheme for concurrent deblocking in the MPEG-4 AVC/H.264 standard, based on our idea of deblocking filter independency, a corrected version of the limited error propagation effect proposed in the letter. Our proposed scheme enables concurrent MB deblocking of luma samples with limited synchronization effort, independently of slice configuration, and is compliant with the MPEG-4 H.264/AVC standard. We implemented the method on the massively parallel architecture of the graphics processing unit (GPU). Experimental results show that our GPU implementation achieves faster-than real-time deblocking at 1309 frames per second for 1080p video pictures. Both software-based deblocking filters and state-of-the-art GPU-enabled algorithms are outperformed in terms of speed by factors up to 10.2 and 19.5, respectively, for 1080p video pictures.</abstract><cop>New York, NY</cop><pub>IEEE</pub><doi>10.1109/TCSVT.2011.2105553</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1051-8215
ispartof IEEE transactions on circuits and systems for video technology, 2011-01, Vol.21 (1), p.96-100
issn 1051-8215
1558-2205
language eng
recordid cdi_proquest_miscellaneous_864420148
source IEEE Electronic Library (IEL) Journals
subjects Algorithms
Applied sciences
Architecture
Automatic voltage control
Circuits
Computer architecture
Deblocking
Detection, estimation, filtering, equalization, prediction
Electronics
Exact sciences and technology
Filtering
Filtration
Frames per second
GPU
Graphics processing unit
Image processing
in-loop filtering
Information, signal and communications theory
Integrated circuits
Integrated circuits by function (including memories and processors)
massively parallel
MPEG-4 AVC/H.264
Parallel processing
Pictures
Semiconductor electronics. Microelectronics. Optoelectronics. Solid state devices
Signal and communications theory
Signal processing
Signal, noise
Streaming media
Studies
Synchronization
Telecommunications and information theory
Transform coding
title Parallel Deblocking Filtering in MPEG-4 AVC/H.264 on Massively Parallel Architectures
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T14%3A55%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Parallel%20Deblocking%20Filtering%20in%20MPEG-4%20AVC/H.264%20on%20Massively%20Parallel%20Architectures&rft.jtitle=IEEE%20transactions%20on%20circuits%20and%20systems%20for%20video%20technology&rft.au=Pieters,%20Bart&rft.date=2011-01&rft.volume=21&rft.issue=1&rft.spage=96&rft.epage=100&rft.pages=96-100&rft.issn=1051-8215&rft.eissn=1558-2205&rft.coden=ITCTEM&rft_id=info:doi/10.1109/TCSVT.2011.2105553&rft_dat=%3Cproquest_ieee_%3E864420148%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c400t-5eac732332f59f9287b79bc3185c93f7f8f05ed120eabfbbb1c7271bb23d1ba23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=853742288&rft_id=info:pmid/&rft_ieee_id=5686922&rfr_iscdi=true