{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,13]],"date-time":"2026-01-13T21:42:10Z","timestamp":1768340530661,"version":"3.49.0"},"reference-count":51,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2024,1,22]],"date-time":"2024-01-22T00:00:00Z","timestamp":1705881600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2024,5,31]]},"abstract":"<jats:p>As a practical technique in mainstream video coding applications, rate control dominates important to ensure compression quality with limited bitrates constraints. However, most rate control methods mainly focus on objective quality while ignoring the perceptual quality improvement for human eyes. In this paper, we propose a two-stage rate control algorithm to optimize the perceptual quality at the frame encoding stage and the coding tree unit (CTU) encoding stage for high efficiency video coding (HEVC), respectively. Firstly, for the frame encoding stage, with inter-frame distortion dependency consideration, a frame-level rate control method is presented by adjusting the frame-level Lagrange multiplier adaptively with a preprocessing method. Secondly, for the CTU encoding stage, we propose a saliency-based CTU-level perceptual quality rate control algorithm, which employs CTU-level saliency weight to adjust the perceptual rate-distortion (R-D) model. We conduct the CTU-level rate control by an optimized Lagrange multiplier and quantization parameter (QP) to achieve perceptual quality optimization. Extensive experimental results reveal that, compared with state-of-the-art rate control methods on HEVC, our algorithm achieves significant perceptual coding performance with improved subjective visual quality.<\/jats:p>","DOI":"10.1145\/3636510","type":"journal-article","created":{"date-parts":[[2023,12,13]],"date-time":"2023-12-13T11:42:50Z","timestamp":1702467770000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Two-Stage Perceptual Quality Oriented Rate Control Algorithm\u00a0for HEVC"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0009-0005-0830-1830","authenticated-orcid":false,"given":"Yunyao","family":"Yan","sequence":"first","affiliation":[{"name":"School of Electronic and Computer Engineering, Peking University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5831-1897","authenticated-orcid":false,"given":"Guoqing","family":"Xiang","sequence":"additional","affiliation":[{"name":"School of Computer Science, Peking University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2778-3768","authenticated-orcid":false,"given":"Huizhu","family":"Jia","sequence":"additional","affiliation":[{"name":"School of Computer Science, Peking University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9765-4523","authenticated-orcid":false,"given":"Jie","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Electronic and Computer Engineering, Peking University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8479-6960","authenticated-orcid":false,"given":"Xiaofeng","family":"Huang","sequence":"additional","affiliation":[{"name":"School of Communication Engineering, Hangzhou Dianzi University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4325-7741","authenticated-orcid":false,"given":"Xiaodong","family":"Xie","sequence":"additional","affiliation":[{"name":"School of Computer Science, Peking University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,1,22]]},"reference":[{"key":"e_1_3_1_2_2","article-title":"Common test conditions and software reference configurations","volume":"12","author":"Bossen Frank","year":"2013","unstructured":"Frank Bossen et\u00a0al. 2013. Common test conditions and software reference configurations. JCTVC-L1100 12 (2013), 7.","journal-title":"JCTVC-L1100"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2021.3101953"},{"issue":"9","key":"e_1_3_1_4_2","doi-asserted-by":"crossref","first-page":"4541","DOI":"10.1109\/TIP.2019.2911180","article-title":"An optimized rate control for low-delay H. 265\/HEVC","volume":"28","author":"Chen Zhenzhong","year":"2019","unstructured":"Zhenzhong Chen and Xiang Pan. 2019. An optimized rate control for low-delay H. 265\/HEVC. IEEE Transactions on Image Processing 28, 9 (2019), 4541\u20134552.","journal-title":"IEEE Transactions on Image Processing"},{"key":"e_1_3_1_5_2","first-page":"1","article-title":"Rate control based on unified RQ model for HEVC","author":"Choi Hyomin","year":"2012","unstructured":"Hyomin Choi, Junghak Nam, Jonghun Yoo, D. Sim, and I. V. Bajic. 2012. Rate control based on unified RQ model for HEVC. ITU-T SG16 Contribution, JCTVC-H0213 (2012), 1\u201313.","journal-title":"ITU-T SG16 Contribution, JCTVC-H0213"},{"key":"e_1_3_1_6_2","volume-title":"Proceedings of VLBV Workshop","author":"Vito Fabio De","year":"2005","unstructured":"Fabio De Vito, Tanir Ozcelebi, Reha Civanlar, A. Murat Tekalp, and Juan Carlos De Martin. 2005. Rate control for GOP-level rate adaptation in H. 264 video coding. In Proceedings of VLBV Workshop."},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2015.2444671"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2016.2535254"},{"issue":"1","key":"e_1_3_1_9_2","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1109\/TCSVT.2017.2769703","article-title":"Temporal-layer-motivated lambda domain picture level rate control for random-access configuration in H. 265\/HEVC","volume":"29","author":"Gong Yanchao","year":"2017","unstructured":"Yanchao Gong, Shuai Wan, Kaifang Yang, Hong Ren Wu, and Ying Liu. 2017. Temporal-layer-motivated lambda domain picture level rate control for random-access configuration in H. 265\/HEVC. IEEE Transactions on Circuits and Systems for Video Technology 29, 1 (2017), 156\u2013170.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"issue":"2","key":"e_1_3_1_10_2","first-page":"270","article-title":"Optimal bit allocation at frame level for rate control in HEVC","volume":"65","author":"Guo Hongwei","year":"2018","unstructured":"Hongwei Guo, Ce Zhu, Shengxi Li, and Yanbo Gao. 2018. Optimal bit allocation at frame level for rate control in HEVC. IEEE Transactions on Broadcasting 65, 2 (2018), 270\u2013281.","journal-title":"IEEE Transactions on Broadcasting"},{"issue":"1","key":"e_1_3_1_11_2","first-page":"113","article-title":"Inter-block dependency-based CTU level rate control for HEVC","volume":"66","author":"Guo Hongwei","year":"2019","unstructured":"Hongwei Guo, Ce Zhu, Mai Xu, and Shuai Li. 2019. Inter-block dependency-based CTU level rate control for HEVC. IEEE Transactions on Broadcasting 66, 1 (2019), 113\u2013126.","journal-title":"IEEE Transactions on Broadcasting"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/TBC.2023.3247953"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2818439"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2015.2389491"},{"key":"e_1_3_1_15_2","first-page":"1","volume-title":"2019 IEEE Visual Communications and Image Processing (VCIP \u201919)","author":"Ku ChungWen","year":"2019","unstructured":"ChungWen Ku, Guoqing Xiang, Feng Qi, Wei Yan, Yuan Li, and Xiaodong Xie. 2019. Bit allocation based on visual saliency in HEVC. In 2019 IEEE Visual Communications and Image Processing (VCIP \u201919). IEEE, 1\u20134."},{"issue":"3","key":"e_1_3_1_16_2","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1109\/TCSVT.2013.2276880","article-title":"A frame-level rate control scheme based on texture and nontexture rate models for high efficiency video coding","volume":"24","author":"Lee B.","year":"2013","unstructured":"B. Lee, M. Kim, and T. Q. Nguyen. 2013. A frame-level rate control scheme based on texture and nontexture rate models for high efficiency video coding. IEEE Transactions on Circuits and Systems for Video Technology 24, 3 (2013), 465\u2013479.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"key":"e_1_3_1_17_2","first-page":"1","article-title":"Rate control by R-lambda model for HEVC","author":"Li Bin","year":"2012","unstructured":"Bin Li, Houqiang Li, Li Li, and Jinlei Zhang. 2012. Rate control by R-lambda model for HEVC. ITU-T SG16 Contribution, JCTVC-K0103 (2012), 1\u20135.","journal-title":"ITU-T SG16 Contribution, JCTVC-K0103"},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2016.2598672"},{"issue":"11","key":"e_1_3_1_19_2","first-page":"2409","article-title":"Optimal bit allocation for CTU level rate control in HEVC","volume":"27","author":"Li Shengxi","year":"2016","unstructured":"Shengxi Li, Mai Xu, Zulin Wang, and Xiaoyan Sun. 2016. Optimal bit allocation for CTU level rate control in HEVC. IEEE Transactions on Circuits and Systems for Video Technology 27, 11 (2016), 2409\u20132424.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"issue":"5","key":"e_1_3_1_20_2","doi-asserted-by":"crossref","first-page":"887","DOI":"10.1007\/s11760-019-01620-3","article-title":"A perceptual rate control algorithm based on luminance adaptation for HEVC encoders","volume":"14","author":"Lim Woong","year":"2020","unstructured":"Woong Lim and Donggyu Sim. 2020. A perceptual rate control algorithm based on luminance adaptation for HEVC encoders. Signal, Image and Video Processing 14, 5 (2020), 887\u2013895.","journal-title":"Signal, Image and Video Processing"},{"key":"e_1_3_1_21_2","article-title":"\\(\\lambda\\) -domain VVC rate control based on game theory","author":"Lin Jielian","year":"2022","unstructured":"Jielian Lin, Aiping Huang, Keke Zhang, Xu Wang, and Tiesong Zhao. 2022. \\(\\lambda\\) -domain VVC rate control based on game theory. arXiv preprint arXiv:2205.03595 (2022).","journal-title":"arXiv preprint arXiv:2205.03595"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3072225"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3078622"},{"issue":"4","key":"e_1_3_1_24_2","first-page":"2371","article-title":"High efficiency rate control for versatile video coding based on composite cauchy distribution","volume":"32","author":"Mao Yunhao","year":"2021","unstructured":"Yunhao Mao, Meng Wang, Shiqi Wang, and Sam Kwong. 2021. High efficiency rate control for versatile video coding based on composite cauchy distribution. IEEE Transactions on Circuits and Systems for Video Technology 32, 4 (2021), 2371\u20132384.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1145\/3380827"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/BMSB.2017.7986143"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2012.175"},{"key":"e_1_3_1_28_2","article-title":"Methodology for the subjective assessment of the quality of television pictures","author":"Series B. T.","year":"2012","unstructured":"B. T. Series. 2012. Methodology for the subjective assessment of the quality of television pictures. Recommendation ITU-R BT (2012), 500\u201313.","journal-title":"Recommendation ITU-R BT"},{"key":"e_1_3_1_29_2","first-page":"89","volume-title":"2013 Picture Coding Symposium (PCS \u201913)","author":"Si Junjun","year":"2013","unstructured":"Junjun Si, Siwei Ma, and Wen Gao. 2013. Efficient bit allocation and CTU level rate control for high efficiency video coding. In 2013 Picture Coding Symposium (PCS \u201913). IEEE, 89\u201392."},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2012.2221191"},{"key":"e_1_3_1_31_2","doi-asserted-by":"crossref","first-page":"883","DOI":"10.1117\/12.411871","volume-title":"Visual Communications and Image Processing 2001","author":"Tourapis Alexis Michael","year":"2000","unstructured":"Alexis Michael Tourapis, Oscar Chi Lim Au, and Ming Lei Liou. 2000. Predictive motion vector field adaptive search technique (PMVFAST): Enhancing block-based motion estimation. In Visual Communications and Image Processing 2001, Vol. 4310. SPIE, 883\u2013892."},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-020-09442-z"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-017-4914-4"},{"key":"e_1_3_1_34_2","first-page":"1","volume-title":"2018 IEEE International Symposium on Circuits and Systems (ISCAS \u201918)","author":"Wang Hao","year":"2018","unstructured":"Hao Wang, Li Song, Rong Xie, Zhengyi Luo, and Xiangwen Wang. 2018. Masking effects based rate control scheme for high efficiency video coding. In 2018 IEEE International Symposium on Circuits and Systems (ISCAS \u201918). IEEE, 1\u20135."},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-019-7680-7"},{"key":"e_1_3_1_36_2","first-page":"1","volume-title":"2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP \u201915)","author":"Wang Shiqi","year":"2015","unstructured":"Shiqi Wang, Abdul Rehman, Kai Zeng, and Zhou Wang. 2015. SSIM-inspired two-pass rate control for high efficiency video coding. In 2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP \u201915). IEEE, 1\u20135."},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/DCC.2015.35"},{"key":"e_1_3_1_38_2","first-page":"1581","volume-title":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP \u201916)","author":"Wu Jinjian","year":"2016","unstructured":"Jinjian Wu, Guangming Shi, Weisi Lin, and C. C. Jay Kuo. 2016. Enhanced just noticeable difference model with visual regularity consideration. In 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP \u201916). IEEE, 1581\u20131585."},{"key":"e_1_3_1_39_2","first-page":"1","volume-title":"2020 IEEE International Conference on Consumer Electronics (ICCE \u201920)","author":"Xiang Guoqing","year":"2020","unstructured":"Guoqing Xiang, Huizhu Jia, Lin Ding, Fan Yang, Yuan Li, and Xiaodong Xie. 2020. Perceptual CTU level bit allocation for AVS2. In 2020 IEEE International Conference on Consumer Electronics (ICCE \u201920). IEEE, 1\u20134."},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jvcir.2017.11.011"},{"issue":"1","key":"e_1_3_1_41_2","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1109\/TBC.2021.3120916","article-title":"Perceptual quality consistency oriented CTU level rate control for HEVC intra coding","volume":"68","author":"Xiang Guoqing","year":"2021","unstructured":"Guoqing Xiang, Xinfeng Zhang, Xiaofeng Huang, Fan Yang, Chuang Zhu, Huizhu Jia, and Xiaodong Xie. 2021. Perceptual quality consistency oriented CTU level rate control for HEVC intra coding. IEEE Transactions on Broadcasting 68, 1 (2021), 69\u201382.","journal-title":"IEEE Transactions on Broadcasting"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2016.2525004"},{"issue":"1","key":"e_1_3_1_43_2","first-page":"369","article-title":"Learning to detect video saliency with HEVC features","volume":"26","author":"Xu Mai","year":"2016","unstructured":"Mai Xu, Lai Jiang, Xiaoyan Sun, Zhaoting Ye, and Zulin Wang. 2016. Learning to detect video saliency with HEVC features. IEEE Transactions on Image Processing 26, 1 (2016), 369\u2013385.","journal-title":"IEEE Transactions on Image Processing"},{"key":"e_1_3_1_44_2","article-title":"An adaptive spatio-temporal perception aware quantization algorithm for AVS2","volume":"73","author":"Yan Yunyao","year":"2020","unstructured":"Yunyao Yan, Guoqing Xiang, Yuan Li, Xiaodong Xie, and Huizhu Jia. 2020. An adaptive spatio-temporal perception aware quantization algorithm for AVS2. Journal of Visual Communication and Image Representation 73 (2020), 102917.","journal-title":"Journal of Visual Communication and Image Representation"},{"key":"e_1_3_1_45_2","first-page":"1690","volume-title":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","author":"Yeo Chuohao","year":"2013","unstructured":"Chuohao Yeo, Hui Li Tan, and Yih Han Tan. 2013. SSIM-based adaptive quantization in HEVC. In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 1690\u20131694."},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2015.2477682"},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11554-011-0237-2"},{"issue":"2","key":"e_1_3_1_48_2","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1109\/TCE.2021.3065636","article-title":"Hybrid distortion-based rate-distortion optimization and rate control for H. 265\/HEVC","volume":"67","author":"Yuan Hui","year":"2021","unstructured":"Hui Yuan, Qun Wang, Qi Liu, Junyan Huo, and Peng Li. 2021. Hybrid distortion-based rate-distortion optimization and rate control for H. 265\/HEVC. IEEE Transactions on Consumer Electronics 67, 2 (2021), 97\u2013106.","journal-title":"IEEE Transactions on Consumer Electronics"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-020-09721-9"},{"key":"e_1_3_1_50_2","article-title":"Optimum quality control algorithm for versatile video coding","author":"Zhou Mingliang","year":"2022","unstructured":"Mingliang Zhou, Xuekai Wei, Cheng Ji, Tao Xiang, and Bin Fang. 2022. Optimum quality control algorithm for versatile video coding. IEEE Transactions on Broadcasting (2022).","journal-title":"IEEE Transactions on Broadcasting"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/tcsvt.2019.2959807"},{"key":"e_1_3_1_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/3107616"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3636510","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3636510","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:54:11Z","timestamp":1750287251000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3636510"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,22]]},"references-count":51,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2024,5,31]]}},"alternative-id":["10.1145\/3636510"],"URL":"https:\/\/doi.org\/10.1145\/3636510","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,1,22]]},"assertion":[{"value":"2023-03-03","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-11-24","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-01-22","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}