{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,20]],"date-time":"2025-10-20T10:25:11Z","timestamp":1760955911170,"version":"3.41.0"},"reference-count":37,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2019,1,10]],"date-time":"2019-01-10T00:00:00Z","timestamp":1547078400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Des. Autom. Electron. Syst."],"published-print":{"date-parts":[[2019,3,31]]},"abstract":"<jats:p>Variable block size motion estimation has contributed greatly to achieving an optimal interframe encoding, but involves high computational complexity and huge memory access, which is the most critical bottleneck in ultra-high-definition video encoding. This article presents a hardware-efficient block matching algorithm with an efficient hardware design that is able to reduce the computational complexity of motion estimation while providing a sustained and steady coding performance for high-quality video encoding. A three-level memory organization is proposed to reduce memory bandwidth requirement while supporting a predictive common search window. By applying multiple search strategies and early termination, the proposed design provides 1.8 to 3.7 times higher hardware efficiency than other works. Furthermore, on-chip memory has been reduced by 96.5% and off-chip bandwidth requirement has been reduced by 39.4% thanks to the proposed three-level memory organization. The corresponding power consumption is only 198mW at the highest working frequency of 500MHz. 
The proposed design is attractive for high-quality video encoding in real-time applications with low power consumption.<\/jats:p>","DOI":"10.1145\/3290408","type":"journal-article","created":{"date-parts":[[2019,1,10]],"date-time":"2019-01-10T13:36:47Z","timestamp":1547127407000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":12,"title":["A Hardware-Efficient Block Matching Algorithm and Its Hardware Design for Variable Block Size Motion Estimation in Ultra-High-Definition Video Encoding"],"prefix":"10.1145","volume":"24","author":[{"given":"Jianwei","family":"Zheng","sequence":"first","affiliation":[{"name":"Xiamen University and University of Illinois at Urbana-Champaign, Urbana, IL"}]},{"given":"Chao","family":"Lu","sequence":"additional","affiliation":[{"name":"Southern Illinois University Carbondale, Carbondale, IL"}]},{"given":"Jiefeng","family":"Guo","sequence":"additional","affiliation":[{"name":"Xiamen University, Xiamen, China"}]},{"given":"Deming","family":"Chen","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, Urbana, IL"}]},{"given":"Donghui","family":"Guo","sequence":"additional","affiliation":[{"name":"Xiamen University, Xiamen, 
China"}]}],"member":"320","published-online":{"date-parts":[[2019,1,10]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2012.2221192"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2003.815165"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVLSI.2008.2000526"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2012.2221191"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/76.313138"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/76.499840"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/83.821744"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSAP.2010.12"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2011.2148450"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1455229.1455246"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2011.2171677"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSSC.2009.2031787"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3144534"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSSC.2013.2293136"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSII.2004.829555"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2011.2165592"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/76.981846"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2006.871388"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVLSI.2009.2017122"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2010.2058476"},{"volume-title":"27th International Conference on Field Programmable Logic and Applications (FPL\u201917)","author":"Zhang X.","key":"e_1_2_1_21_1","unstructured":"X. Zhang, X. Liu, A. Ramachandran, C. Zhuge, S. Tang, P. Ouyang, Z. Cheng, K. Rupnow, and D. Chen. 2017. High-performance video content recognition with long-term recurrent convolutional network for FPGA. In 27th International Conference on Field Programmable Logic and Applications (FPL\u201917). IEEE, 1--4."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2005.848306"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2009.2020322"},{"volume-title":"International Conference on Field-Programmable Technology. IEEE, 1--8.","author":"Rupnow K.","key":"e_1_2_1_24_1","unstructured":"K. Rupnow, Y. Liang, Y. Li, D. Min, M. Do, and D. Chen. 2011. High level synthesis of stereo matching: Productivity, performance, and software constraints. In International Conference on Field-Programmable Technology. IEEE, 1--8."},{"volume-title":"5th International Conference on Digital Information and Communication Technology and Its Applications (DICTAP\u201915)","author":"AlQaralleh E. A.","key":"e_1_2_1_25_1","unstructured":"E. A. AlQaralleh, Y. A. Alqudah, and B. H. Sababha. 2015. Hardware efficient early termination mechanism in motion estimation for H.264\/AVC. In 5th International Conference on Digital Information and Communication Technology and Its Applications (DICTAP\u201915). IEEE, 13--17."},{"key":"e_1_2_1_26_1","volume-title":"Efficient algorithm adaptations and fully-parallel hardware architecture for HEVC intra encoder","author":"Zhang Yuanzhi","year":"2018","unstructured":"Yuanzhi Zhang and Chao Lu. 2018. Efficient algorithm adaptations and fully-parallel hardware architecture for HEVC intra encoder. IEEE Transactions on Circuits and Systems for Video Technology, early access (2018)."},{"volume-title":"IEEE International Symposium on Circuits and Systems. IEEE, 625--628","author":"Lin W.","key":"e_1_2_1_27_1","unstructured":"W. Lin, K. Panusopone, D. Baylon, and M. T. Sun. 2009. A new class-based early termination method for fast motion estimation in video coding. In IEEE International Symposium on Circuits and Systems. IEEE, 625--628."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2753758"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2002.806251"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1093\/ietfec\/e89-a.4.979"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2012.88"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2013.2290385"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSI.2005.854409"},{"key":"e_1_2_1_34_1","unstructured":"JVT. 2017. 
Joint Video Team Reference Software JM 18.0 {Online}. http:\/\/iphome.hhi.de\/suehring\/tml\/download. (2017)."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2016.2583979"},{"volume-title":"Derf\u2019s Test Media Collection {Online}. Retrieved","year":"2018","key":"e_1_2_1_36_1","unstructured":"Xiph.org. 2017. Derf\u2019s Test Media Collection {Online}. Retrieved December 5, 2018 from http:\/\/media.xiph.org\/video\/derf\/."},{"key":"e_1_2_1_37_1","volume-title":"13th VCEG-M33 Meeting. ITU-T, 1--5.","author":"Bjontegaard G.","year":"2001","unstructured":"G. Bjontegaard. 2001. Calculation of average PSNR differences between RD-curves. In 13th VCEG-M33 Meeting. ITU-T, 1--5."}],"container-title":["ACM Transactions on Design Automation of Electronic Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3290408","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3290408","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:58:05Z","timestamp":1750208285000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3290408"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,1,10]]},"references-count":37,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2019,3,31]]}},"alternative-id":["10.1145\/3290408"],"URL":"https:\/\/doi.org\/10.1145\/3290408","relation":{},"ISSN":["1084-4309","1557-7309"],"issn-type":[{"type":"print","value":"1084-4309"},{"type":"electronic","value":"1557-7309"}],"subject":[],"published":{"date-parts":[[2019,1,10]]},"assertion":[{"value":"2018-06-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-01-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}