Loading…
Algorithm optimization and hardware implementation for Merge mode in HEVC
Merge mode is a new tool for improving inter-frame coding efficiency in high-efficiency video coding. This tool can save the bitrate for the motion vector by sharing this vector with neighboring blocks. Merge is a process that selects a candidate motion vector by calculating the cost of rate-distort...
Saved in:
Published in: | Journal of real-time image processing 2020-06, Vol.17 (3), p.623-630 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203 |
---|---|
cites | cdi_FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203 |
container_end_page | 630 |
container_issue | 3 |
container_start_page | 623 |
container_title | Journal of real-time image processing |
container_volume | 17 |
creator | Shi, Long-zhao Gao, Xiaohong Yang, Xiuzhi Chen, Zhifeng Zheng, Mingkui |
description | Merge mode is a new tool for improving inter-frame coding efficiency in high-efficiency video coding. This tool can save the bitrate for the motion vector by sharing this vector with neighboring blocks. Merge is a process that selects a candidate motion vector by calculating the cost of rate-distortion. However, this process requires a large number of complex computations and memory access, thereby resulting in the low efficiency of hardware implementation. This paper proposes a new Merge candidate decision scheme that determines the most favorable Merge candidate from a full list of candidates by comparing the sum of absolute transformed difference with the weighted header bit instead of performing a complex calculation for sum of squared difference and entropy coding process in HM16.7. The simulation results show that the performance of the proposed algorithm is close to that of HM16.7 and increases the BD-rate only by 0.22–1.21%. The multilevel pipelines architecture is also exploited in the hardware design. The weighted header bit operation is performed by using the look-up table, which reduces both the complexity and encoding clock cycle. The designed system is implemented with a register transfer level code. The synthesis results from the Design Compiler show that compared with other architecture, the proposed architecture offers great advantages in resource utilization and can process 1920 × 1080 at 353 frame/s for P-slices with a clock frequency of 1057 MHz and logic gate count of 285.2 K. |
doi_str_mv | 10.1007/s11554-018-0818-4 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2918675836</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2918675836</sourcerecordid><originalsourceid>FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203</originalsourceid><addsrcrecordid>eNp1kEFPwzAMhSMEEmPwA7hV4lxw0iZpj9M02KQhLsA1Shtn67Q2JemE4NcvUxGcuNiW3nu2_BFyS-GeAsiHQCnneQq0SKGIJT8jE1oImhaMlue_M8AluQphByCkyPiErGb7jfPNsG0T1w9N23zroXFdojuTbLU3n9pj0rT9HlvshlGzzifP6DeYtM5EtUuWi_f5Nbmweh_w5qdPydvj4nW-TNcvT6v5bJ3WGRVDKk0tKCK1Ntc1VqAzCyWXdQm2YlpgrZnNrbEgDdOohRDGSgqCGy6qikE2JXfj3t67jwOGQe3cwXfxpGJlfFPyIhPRRUdX7V0IHq3qfdNq_6UoqBMxNRJTkZg6EVN5zLAxE6K326D_2_x_6AhID28G</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2918675836</pqid></control><display><type>article</type><title>Algorithm optimization and hardware implementation for Merge mode in HEVC</title><source>Springer Nature</source><creator>Shi, Long-zhao ; Gao, Xiaohong ; Yang, Xiuzhi ; Chen, Zhifeng ; Zheng, Mingkui</creator><creatorcontrib>Shi, Long-zhao ; Gao, Xiaohong ; Yang, Xiuzhi ; Chen, Zhifeng ; Zheng, Mingkui</creatorcontrib><description>Merge mode is a new tool for improving inter-frame coding efficiency in high-efficiency video coding. This tool can save the bitrate for the motion vector by sharing this vector with neighboring blocks. Merge is a process that selects a candidate motion vector by calculating the cost of rate-distortion. However, this process requires a large number of complex computations and memory access, thereby resulting in the low efficiency of hardware implementation. This paper proposes a new Merge candidate decision scheme that determines the most favorable Merge candidate from a full list of candidates by comparing the sum of absolute transformed difference with the weighted header bit instead of performing a complex calculation for sum of squared difference and entropy coding process in HM16.7. The simulation results show that the performance of the proposed algorithm is close to that of HM16.7 and increases the BD-rate only by 0.22–1.21%. The multilevel pipelines architecture is also exploited in the hardware design. The weighted header bit operation is performed by using the look-up table, which reduces both the complexity and encoding clock cycle. The designed system is implemented with a register transfer level code. The synthesis results from the Design Compiler show that compared with other architecture, the proposed architecture offers great advantages in resource utilization and can process 1920 × 1080 at 353 frame/s for P-slices with a clock frequency of 1057 MHz and logic gate count of 285.2 K.</description><identifier>ISSN: 1861-8200</identifier><identifier>EISSN: 1861-8219</identifier><identifier>DOI: 10.1007/s11554-018-0818-4</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Algorithms ; Candidates ; Coding standards ; Complexity ; Computer architecture ; Computer Graphics ; Computer Science ; Design optimization ; Efficiency ; Gate counting ; Hardware ; Image Processing and Computer Vision ; Logic circuits ; Lookup tables ; Mathematical analysis ; Multimedia Information Systems ; Original Research Paper ; Pattern Recognition ; Resource utilization ; Signal,Image and Speech Processing ; Sums ; Video compression</subject><ispartof>Journal of real-time image processing, 2020-06, Vol.17 (3), p.623-630</ispartof><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2018</rights><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2018.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203</citedby><cites>FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids></links><search><creatorcontrib>Shi, Long-zhao</creatorcontrib><creatorcontrib>Gao, Xiaohong</creatorcontrib><creatorcontrib>Yang, Xiuzhi</creatorcontrib><creatorcontrib>Chen, Zhifeng</creatorcontrib><creatorcontrib>Zheng, Mingkui</creatorcontrib><title>Algorithm optimization and hardware implementation for Merge mode in HEVC</title><title>Journal of real-time image processing</title><addtitle>J Real-Time Image Proc</addtitle><description>Merge mode is a new tool for improving inter-frame coding efficiency in high-efficiency video coding. This tool can save the bitrate for the motion vector by sharing this vector with neighboring blocks. Merge is a process that selects a candidate motion vector by calculating the cost of rate-distortion. However, this process requires a large number of complex computations and memory access, thereby resulting in the low efficiency of hardware implementation. This paper proposes a new Merge candidate decision scheme that determines the most favorable Merge candidate from a full list of candidates by comparing the sum of absolute transformed difference with the weighted header bit instead of performing a complex calculation for sum of squared difference and entropy coding process in HM16.7. The simulation results show that the performance of the proposed algorithm is close to that of HM16.7 and increases the BD-rate only by 0.22–1.21%. The multilevel pipelines architecture is also exploited in the hardware design. The weighted header bit operation is performed by using the look-up table, which reduces both the complexity and encoding clock cycle. The designed system is implemented with a register transfer level code. The synthesis results from the Design Compiler show that compared with other architecture, the proposed architecture offers great advantages in resource utilization and can process 1920 × 1080 at 353 frame/s for P-slices with a clock frequency of 1057 MHz and logic gate count of 285.2 K.</description><subject>Algorithms</subject><subject>Candidates</subject><subject>Coding standards</subject><subject>Complexity</subject><subject>Computer architecture</subject><subject>Computer Graphics</subject><subject>Computer Science</subject><subject>Design optimization</subject><subject>Efficiency</subject><subject>Gate counting</subject><subject>Hardware</subject><subject>Image Processing and Computer Vision</subject><subject>Logic circuits</subject><subject>Lookup tables</subject><subject>Mathematical analysis</subject><subject>Multimedia Information Systems</subject><subject>Original Research Paper</subject><subject>Pattern Recognition</subject><subject>Resource utilization</subject><subject>Signal,Image and Speech Processing</subject><subject>Sums</subject><subject>Video compression</subject><issn>1861-8200</issn><issn>1861-8219</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp1kEFPwzAMhSMEEmPwA7hV4lxw0iZpj9M02KQhLsA1Shtn67Q2JemE4NcvUxGcuNiW3nu2_BFyS-GeAsiHQCnneQq0SKGIJT8jE1oImhaMlue_M8AluQphByCkyPiErGb7jfPNsG0T1w9N23zroXFdojuTbLU3n9pj0rT9HlvshlGzzifP6DeYtM5EtUuWi_f5Nbmweh_w5qdPydvj4nW-TNcvT6v5bJ3WGRVDKk0tKCK1Ntc1VqAzCyWXdQm2YlpgrZnNrbEgDdOohRDGSgqCGy6qikE2JXfj3t67jwOGQe3cwXfxpGJlfFPyIhPRRUdX7V0IHq3qfdNq_6UoqBMxNRJTkZg6EVN5zLAxE6K326D_2_x_6AhID28G</recordid><startdate>20200601</startdate><enddate>20200601</enddate><creator>Shi, Long-zhao</creator><creator>Gao, Xiaohong</creator><creator>Yang, Xiuzhi</creator><creator>Chen, Zhifeng</creator><creator>Zheng, Mingkui</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope></search><sort><creationdate>20200601</creationdate><title>Algorithm optimization and hardware implementation for Merge mode in HEVC</title><author>Shi, Long-zhao ; Gao, Xiaohong ; Yang, Xiuzhi ; Chen, Zhifeng ; Zheng, Mingkui</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Candidates</topic><topic>Coding standards</topic><topic>Complexity</topic><topic>Computer architecture</topic><topic>Computer Graphics</topic><topic>Computer Science</topic><topic>Design optimization</topic><topic>Efficiency</topic><topic>Gate counting</topic><topic>Hardware</topic><topic>Image Processing and Computer Vision</topic><topic>Logic circuits</topic><topic>Lookup tables</topic><topic>Mathematical analysis</topic><topic>Multimedia Information Systems</topic><topic>Original Research Paper</topic><topic>Pattern Recognition</topic><topic>Resource utilization</topic><topic>Signal,Image and Speech Processing</topic><topic>Sums</topic><topic>Video compression</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shi, Long-zhao</creatorcontrib><creatorcontrib>Gao, Xiaohong</creatorcontrib><creatorcontrib>Yang, Xiuzhi</creatorcontrib><creatorcontrib>Chen, Zhifeng</creatorcontrib><creatorcontrib>Zheng, Mingkui</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Journal of real-time image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shi, Long-zhao</au><au>Gao, Xiaohong</au><au>Yang, Xiuzhi</au><au>Chen, Zhifeng</au><au>Zheng, Mingkui</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Algorithm optimization and hardware implementation for Merge mode in HEVC</atitle><jtitle>Journal of real-time image processing</jtitle><stitle>J Real-Time Image Proc</stitle><date>2020-06-01</date><risdate>2020</risdate><volume>17</volume><issue>3</issue><spage>623</spage><epage>630</epage><pages>623-630</pages><issn>1861-8200</issn><eissn>1861-8219</eissn><abstract>Merge mode is a new tool for improving inter-frame coding efficiency in high-efficiency video coding. This tool can save the bitrate for the motion vector by sharing this vector with neighboring blocks. Merge is a process that selects a candidate motion vector by calculating the cost of rate-distortion. However, this process requires a large number of complex computations and memory access, thereby resulting in the low efficiency of hardware implementation. This paper proposes a new Merge candidate decision scheme that determines the most favorable Merge candidate from a full list of candidates by comparing the sum of absolute transformed difference with the weighted header bit instead of performing a complex calculation for sum of squared difference and entropy coding process in HM16.7. The simulation results show that the performance of the proposed algorithm is close to that of HM16.7 and increases the BD-rate only by 0.22–1.21%. The multilevel pipelines architecture is also exploited in the hardware design. The weighted header bit operation is performed by using the look-up table, which reduces both the complexity and encoding clock cycle. The designed system is implemented with a register transfer level code. The synthesis results from the Design Compiler show that compared with other architecture, the proposed architecture offers great advantages in resource utilization and can process 1920 × 1080 at 353 frame/s for P-slices with a clock frequency of 1057 MHz and logic gate count of 285.2 K.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s11554-018-0818-4</doi><tpages>8</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1861-8200 |
ispartof | Journal of real-time image processing, 2020-06, Vol.17 (3), p.623-630 |
issn | 1861-8200 1861-8219 |
language | eng |
recordid | cdi_proquest_journals_2918675836 |
source | Springer Nature |
subjects | Algorithms Candidates Coding standards Complexity Computer architecture Computer Graphics Computer Science Design optimization Efficiency Gate counting Hardware Image Processing and Computer Vision Logic circuits Lookup tables Mathematical analysis Multimedia Information Systems Original Research Paper Pattern Recognition Resource utilization Signal,Image and Speech Processing Sums Video compression |
title | Algorithm optimization and hardware implementation for Merge mode in HEVC |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T14%3A40%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Algorithm%20optimization%20and%20hardware%20implementation%20for%20Merge%20mode%20in%20HEVC&rft.jtitle=Journal%20of%20real-time%20image%20processing&rft.au=Shi,%20Long-zhao&rft.date=2020-06-01&rft.volume=17&rft.issue=3&rft.spage=623&rft.epage=630&rft.pages=623-630&rft.issn=1861-8200&rft.eissn=1861-8219&rft_id=info:doi/10.1007/s11554-018-0818-4&rft_dat=%3Cproquest_cross%3E2918675836%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c316t-7dc61ee1ff4aceb0a3f0957c90fb2a6eca2f4fdf07d2aea666df71065d56bb203%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2918675836&rft_id=info:pmid/&rfr_iscdi=true |