Loading…

Algorithm optimization and hardware implementation for Merge mode in HEVC

Merge mode is a new tool for improving inter-frame coding efficiency in high-efficiency video coding. This tool can save the bitrate for the motion vector by sharing this vector with neighboring blocks. Merge is a process that selects a candidate motion vector by calculating the cost of rate-distort...

Full description

Saved in:
Bibliographic Details
Published in:Journal of real-time image processing 2020-06, Vol.17 (3), p.623-630
Main Authors: Shi, Long-zhao, Gao, Xiaohong, Yang, Xiuzhi, Chen, Zhifeng, Zheng, Mingkui
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203
cites cdi_FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203
container_end_page 630
container_issue 3
container_start_page 623
container_title Journal of real-time image processing
container_volume 17
creator Shi, Long-zhao
Gao, Xiaohong
Yang, Xiuzhi
Chen, Zhifeng
Zheng, Mingkui
description Merge mode is a new tool for improving inter-frame coding efficiency in high-efficiency video coding. This tool can save the bitrate for the motion vector by sharing this vector with neighboring blocks. Merge is a process that selects a candidate motion vector by calculating the cost of rate-distortion. However, this process requires a large number of complex computations and memory access, thereby resulting in the low efficiency of hardware implementation. This paper proposes a new Merge candidate decision scheme that determines the most favorable Merge candidate from a full list of candidates by comparing the sum of absolute transformed difference with the weighted header bit instead of performing a complex calculation for sum of squared difference and entropy coding process in HM16.7. The simulation results show that the performance of the proposed algorithm is close to that of HM16.7 and increases the BD-rate only by 0.22–1.21%. The multilevel pipelines architecture is also exploited in the hardware design. The weighted header bit operation is performed by using the look-up table, which reduces both the complexity and encoding clock cycle. The designed system is implemented with a register transfer level code. The synthesis results from the Design Compiler show that compared with other architecture, the proposed architecture offers great advantages in resource utilization and can process 1920 × 1080 at 353 frame/s for P-slices with a clock frequency of 1057 MHz and logic gate count of 285.2 K.
doi_str_mv 10.1007/s11554-018-0818-4
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2918675836</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2918675836</sourcerecordid><originalsourceid>FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203</originalsourceid><addsrcrecordid>eNp1kEFPwzAMhSMEEmPwA7hV4lxw0iZpj9M02KQhLsA1Shtn67Q2JemE4NcvUxGcuNiW3nu2_BFyS-GeAsiHQCnneQq0SKGIJT8jE1oImhaMlue_M8AluQphByCkyPiErGb7jfPNsG0T1w9N23zroXFdojuTbLU3n9pj0rT9HlvshlGzzifP6DeYtM5EtUuWi_f5Nbmweh_w5qdPydvj4nW-TNcvT6v5bJ3WGRVDKk0tKCK1Ntc1VqAzCyWXdQm2YlpgrZnNrbEgDdOohRDGSgqCGy6qikE2JXfj3t67jwOGQe3cwXfxpGJlfFPyIhPRRUdX7V0IHq3qfdNq_6UoqBMxNRJTkZg6EVN5zLAxE6K326D_2_x_6AhID28G</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2918675836</pqid></control><display><type>article</type><title>Algorithm optimization and hardware implementation for Merge mode in HEVC</title><source>Springer Nature</source><creator>Shi, Long-zhao ; Gao, Xiaohong ; Yang, Xiuzhi ; Chen, Zhifeng ; Zheng, Mingkui</creator><creatorcontrib>Shi, Long-zhao ; Gao, Xiaohong ; Yang, Xiuzhi ; Chen, Zhifeng ; Zheng, Mingkui</creatorcontrib><description>Merge mode is a new tool for improving inter-frame coding efficiency in high-efficiency video coding. This tool can save the bitrate for the motion vector by sharing this vector with neighboring blocks. Merge is a process that selects a candidate motion vector by calculating the cost of rate-distortion. However, this process requires a large number of complex computations and memory access, thereby resulting in the low efficiency of hardware implementation. This paper proposes a new Merge candidate decision scheme that determines the most favorable Merge candidate from a full list of candidates by comparing the sum of absolute transformed difference with the weighted header bit instead of performing a complex calculation for sum of squared difference and entropy coding process in HM16.7. The simulation results show that the performance of the proposed algorithm is close to that of HM16.7 and increases the BD-rate only by 0.22–1.21%. The multilevel pipelines architecture is also exploited in the hardware design. The weighted header bit operation is performed by using the look-up table, which reduces both the complexity and encoding clock cycle. The designed system is implemented with a register transfer level code. The synthesis results from the Design Compiler show that compared with other architecture, the proposed architecture offers great advantages in resource utilization and can process 1920 × 1080 at 353 frame/s for P-slices with a clock frequency of 1057 MHz and logic gate count of 285.2 K.</description><identifier>ISSN: 1861-8200</identifier><identifier>EISSN: 1861-8219</identifier><identifier>DOI: 10.1007/s11554-018-0818-4</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Algorithms ; Candidates ; Coding standards ; Complexity ; Computer architecture ; Computer Graphics ; Computer Science ; Design optimization ; Efficiency ; Gate counting ; Hardware ; Image Processing and Computer Vision ; Logic circuits ; Lookup tables ; Mathematical analysis ; Multimedia Information Systems ; Original Research Paper ; Pattern Recognition ; Resource utilization ; Signal,Image and Speech Processing ; Sums ; Video compression</subject><ispartof>Journal of real-time image processing, 2020-06, Vol.17 (3), p.623-630</ispartof><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2018</rights><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2018.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203</citedby><cites>FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids></links><search><creatorcontrib>Shi, Long-zhao</creatorcontrib><creatorcontrib>Gao, Xiaohong</creatorcontrib><creatorcontrib>Yang, Xiuzhi</creatorcontrib><creatorcontrib>Chen, Zhifeng</creatorcontrib><creatorcontrib>Zheng, Mingkui</creatorcontrib><title>Algorithm optimization and hardware implementation for Merge mode in HEVC</title><title>Journal of real-time image processing</title><addtitle>J Real-Time Image Proc</addtitle><description>Merge mode is a new tool for improving inter-frame coding efficiency in high-efficiency video coding. This tool can save the bitrate for the motion vector by sharing this vector with neighboring blocks. Merge is a process that selects a candidate motion vector by calculating the cost of rate-distortion. However, this process requires a large number of complex computations and memory access, thereby resulting in the low efficiency of hardware implementation. This paper proposes a new Merge candidate decision scheme that determines the most favorable Merge candidate from a full list of candidates by comparing the sum of absolute transformed difference with the weighted header bit instead of performing a complex calculation for sum of squared difference and entropy coding process in HM16.7. The simulation results show that the performance of the proposed algorithm is close to that of HM16.7 and increases the BD-rate only by 0.22–1.21%. The multilevel pipelines architecture is also exploited in the hardware design. The weighted header bit operation is performed by using the look-up table, which reduces both the complexity and encoding clock cycle. The designed system is implemented with a register transfer level code. The synthesis results from the Design Compiler show that compared with other architecture, the proposed architecture offers great advantages in resource utilization and can process 1920 × 1080 at 353 frame/s for P-slices with a clock frequency of 1057 MHz and logic gate count of 285.2 K.</description><subject>Algorithms</subject><subject>Candidates</subject><subject>Coding standards</subject><subject>Complexity</subject><subject>Computer architecture</subject><subject>Computer Graphics</subject><subject>Computer Science</subject><subject>Design optimization</subject><subject>Efficiency</subject><subject>Gate counting</subject><subject>Hardware</subject><subject>Image Processing and Computer Vision</subject><subject>Logic circuits</subject><subject>Lookup tables</subject><subject>Mathematical analysis</subject><subject>Multimedia Information Systems</subject><subject>Original Research Paper</subject><subject>Pattern Recognition</subject><subject>Resource utilization</subject><subject>Signal,Image and Speech Processing</subject><subject>Sums</subject><subject>Video compression</subject><issn>1861-8200</issn><issn>1861-8219</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp1kEFPwzAMhSMEEmPwA7hV4lxw0iZpj9M02KQhLsA1Shtn67Q2JemE4NcvUxGcuNiW3nu2_BFyS-GeAsiHQCnneQq0SKGIJT8jE1oImhaMlue_M8AluQphByCkyPiErGb7jfPNsG0T1w9N23zroXFdojuTbLU3n9pj0rT9HlvshlGzzifP6DeYtM5EtUuWi_f5Nbmweh_w5qdPydvj4nW-TNcvT6v5bJ3WGRVDKk0tKCK1Ntc1VqAzCyWXdQm2YlpgrZnNrbEgDdOohRDGSgqCGy6qikE2JXfj3t67jwOGQe3cwXfxpGJlfFPyIhPRRUdX7V0IHq3qfdNq_6UoqBMxNRJTkZg6EVN5zLAxE6K326D_2_x_6AhID28G</recordid><startdate>20200601</startdate><enddate>20200601</enddate><creator>Shi, Long-zhao</creator><creator>Gao, Xiaohong</creator><creator>Yang, Xiuzhi</creator><creator>Chen, Zhifeng</creator><creator>Zheng, Mingkui</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope></search><sort><creationdate>20200601</creationdate><title>Algorithm optimization and hardware implementation for Merge mode in HEVC</title><author>Shi, Long-zhao ; Gao, Xiaohong ; Yang, Xiuzhi ; Chen, Zhifeng ; Zheng, Mingkui</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Candidates</topic><topic>Coding standards</topic><topic>Complexity</topic><topic>Computer architecture</topic><topic>Computer Graphics</topic><topic>Computer Science</topic><topic>Design optimization</topic><topic>Efficiency</topic><topic>Gate counting</topic><topic>Hardware</topic><topic>Image Processing and Computer Vision</topic><topic>Logic circuits</topic><topic>Lookup tables</topic><topic>Mathematical analysis</topic><topic>Multimedia Information Systems</topic><topic>Original Research Paper</topic><topic>Pattern Recognition</topic><topic>Resource utilization</topic><topic>Signal,Image and Speech Processing</topic><topic>Sums</topic><topic>Video compression</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shi, Long-zhao</creatorcontrib><creatorcontrib>Gao, Xiaohong</creatorcontrib><creatorcontrib>Yang, Xiuzhi</creatorcontrib><creatorcontrib>Chen, Zhifeng</creatorcontrib><creatorcontrib>Zheng, Mingkui</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Journal of real-time image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shi, Long-zhao</au><au>Gao, Xiaohong</au><au>Yang, Xiuzhi</au><au>Chen, Zhifeng</au><au>Zheng, Mingkui</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Algorithm optimization and hardware implementation for Merge mode in HEVC</atitle><jtitle>Journal of real-time image processing</jtitle><stitle>J Real-Time Image Proc</stitle><date>2020-06-01</date><risdate>2020</risdate><volume>17</volume><issue>3</issue><spage>623</spage><epage>630</epage><pages>623-630</pages><issn>1861-8200</issn><eissn>1861-8219</eissn><abstract>Merge mode is a new tool for improving inter-frame coding efficiency in high-efficiency video coding. This tool can save the bitrate for the motion vector by sharing this vector with neighboring blocks. Merge is a process that selects a candidate motion vector by calculating the cost of rate-distortion. However, this process requires a large number of complex computations and memory access, thereby resulting in the low efficiency of hardware implementation. This paper proposes a new Merge candidate decision scheme that determines the most favorable Merge candidate from a full list of candidates by comparing the sum of absolute transformed difference with the weighted header bit instead of performing a complex calculation for sum of squared difference and entropy coding process in HM16.7. The simulation results show that the performance of the proposed algorithm is close to that of HM16.7 and increases the BD-rate only by 0.22–1.21%. The multilevel pipelines architecture is also exploited in the hardware design. The weighted header bit operation is performed by using the look-up table, which reduces both the complexity and encoding clock cycle. The designed system is implemented with a register transfer level code. The synthesis results from the Design Compiler show that compared with other architecture, the proposed architecture offers great advantages in resource utilization and can process 1920 × 1080 at 353 frame/s for P-slices with a clock frequency of 1057 MHz and logic gate count of 285.2 K.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s11554-018-0818-4</doi><tpages>8</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1861-8200
ispartof Journal of real-time image processing, 2020-06, Vol.17 (3), p.623-630
issn 1861-8200
1861-8219
language eng
recordid cdi_proquest_journals_2918675836
source Springer Nature
subjects Algorithms
Candidates
Coding standards
Complexity
Computer architecture
Computer Graphics
Computer Science
Design optimization
Efficiency
Gate counting
Hardware
Image Processing and Computer Vision
Logic circuits
Lookup tables
Mathematical analysis
Multimedia Information Systems
Original Research Paper
Pattern Recognition
Resource utilization
Signal,Image and Speech Processing
Sums
Video compression
title Algorithm optimization and hardware implementation for Merge mode in HEVC
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T14%3A40%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Algorithm%20optimization%20and%20hardware%20implementation%20for%20Merge%20mode%20in%20HEVC&rft.jtitle=Journal%20of%20real-time%20image%20processing&rft.au=Shi,%20Long-zhao&rft.date=2020-06-01&rft.volume=17&rft.issue=3&rft.spage=623&rft.epage=630&rft.pages=623-630&rft.issn=1861-8200&rft.eissn=1861-8219&rft_id=info:doi/10.1007/s11554-018-0818-4&rft_dat=%3Cproquest_cross%3E2918675836%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2918675836&rft_id=info:pmid/&rfr_iscdi=true