{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,7]],"date-time":"2026-05-07T16:24:33Z","timestamp":1778171073719,"version":"3.51.4"},"reference-count":105,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2023,12,11]],"date-time":"2023-12-11T00:00:00Z","timestamp":1702252800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2024,4,30]]},"abstract":"<jats:p>\n            Recently, with the growing popularity of mobile devices as well as video sharing platforms (e.g., YouTube, Facebook, TikTok, and Twitch), User-Generated Content (UGC) videos have become increasingly common and now account for a large portion of multimedia traffic on the internet. Unlike professionally generated videos produced by filmmakers and videographers, typically, UGC videos contain multiple authentic distortions, generally introduced during capture and processing by naive users. Quality prediction of UGC videos is of paramount importance to optimize and monitor their processing in hosting platforms, such as their coding, transcoding, and streaming. However, blind quality prediction of UGC is quite challenging, because the degradations of UGC videos are unknown and very diverse, in addition to the unavailability of pristine reference. Therefore, in this article, we propose an accurate and efficient Blind Video Quality Assessment (BVQA) model for UGC videos, which we name 2BiVQA for double Bi-LSTM Video Quality Assessment. 2BiVQA metric consists of three main blocks, including a pre-trained Convolutional Neural Network to extract discriminative features from image patches, which are then fed into two Recurrent Neural Networks for spatial and temporal pooling. Specifically, we use two Bi-directional Long Short-term Memory networks, the first is used to capture short-range dependencies between image patches, while the second allows capturing long-range dependencies between frames to account for the temporal memory effect. Experimental results on recent large-scale UGC VQA datasets show that 2BiVQA achieves high performance at lower computational cost than most state-of-the-art VQA models. The source code of our 2BiVQA metric is made publicly available at\n            <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"https:\/\/github.com\/atelili\/2BiVQA\">https:\/\/github.com\/atelili\/2BiVQA<\/jats:ext-link>\n            .\n          <\/jats:p>","DOI":"10.1145\/3632178","type":"journal-article","created":{"date-parts":[[2023,11,8]],"date-time":"2023-11-08T11:56:42Z","timestamp":1699444602000},"page":"1-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["2BiVQA: Double Bi-LSTM-based Video Quality Assessment of UGC Videos"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2659-7840","authenticated-orcid":false,"given":"Ahmed","family":"Telili","sequence":"first","affiliation":[{"name":"Univ. Rennes, INSA Rennes, CNRS, IETR - UMR 6164, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6453-8588","authenticated-orcid":false,"given":"Sid Ahmed","family":"Fezza","sequence":"additional","affiliation":[{"name":"National Higher School of Telecommunications and ICT, Algeria"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0143-1756","authenticated-orcid":false,"given":"Wassim","family":"Hamidouche","sequence":"additional","affiliation":[{"name":"Univ. Rennes, INSA Rennes, CNRS, IETR - UMR 6164, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-1399-4506","authenticated-orcid":false,"given":"Hanene F. Z.","family":"Brachemi Meftah","sequence":"additional","affiliation":[{"name":"National Higher School of Telecommunications and ICT, Algeria"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,12,11]]},"reference":[{"key":"e_1_3_2_2_2","first-page":"619","volume-title":"Proceedings of the 25th IEEE International Conference on Image Processing (ICIP\u201918)","author":"Ahn Sewoong","year":"2018","unstructured":"Sewoong Ahn and Sanghoon Lee. 2018. Deep blind video quality assessment based on temporal human perception. In Proceedings of the 25th IEEE International Conference on Image Processing (ICIP\u201918). IEEE, 619\u2013623."},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.5555\/2322561.2323684"},{"issue":"6","key":"e_1_3_2_4_2","doi-asserted-by":"crossref","DOI":"10.2352\/J.ImagingSci.Technol.2016.60.6.060410","article-title":"Image quality assessment by comparing CNN features between images","volume":"60","author":"Amirshahi S. A.","year":"2016","unstructured":"S. A. Amirshahi, M. Pedersen, and S. X. Yu. 2016. Image quality assessment by comparing CNN features between images. J. Imag. Sci. Technol. 60, 6 (2016), 60410\u20131.","journal-title":"J. Imag. Sci. Technol."},{"key":"e_1_3_2_5_2","unstructured":"Alexey Bochkovskiy Chien-Yao Wang and Hong-Yuan Mark Liao. 2020. YOLOv4: Optimal speed and accuracy of object detection. Retrieved from https:\/\/arxiv.org\/abs\/2004.10934"},{"key":"e_1_3_2_6_2","volume":"6","author":"BT ITU Recommendation","year":"2012","unstructured":"ITU Recommendation BT. 2012. Methodology for the Subjective Assessment of the Quality of Television Pictures. Int. Telecommun. Union 6 (2012).","journal-title":"Methodology for the Subjective Assessment of the Quality of Television Pictures. Int. Telecommun. Union"},{"key":"e_1_3_2_7_2","doi-asserted-by":"crossref","unstructured":"Y. Cao X. Min W. Sun and G. Zhai. 2023. Subjective and objective audio-visual quality assessment for user generated content. In IEEE Transactions on Image Processing Vol. 32 3847\u20133861.","DOI":"10.1109\/TIP.2023.3290528"},{"key":"e_1_3_2_8_2","article-title":"Cisco annual internet report (2018\u20132023) white paper","author":"Cisco U","year":"2020","unstructured":"U Cisco. 2020. Cisco annual internet report (2018\u20132023) white paper. Cisco, San Jose, CA.","journal-title":"Cisco, San Jose, CA"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_10_2","doi-asserted-by":"crossref","first-page":"60760A","DOI":"10.1117\/12.645089","volume-title":"Digital Publishing","author":"Feng Xiaojun","year":"2006","unstructured":"Xiaojun Feng and Jan P. Allebach. 2006. Measurement of ringing artifacts in JPEG images. In Digital Publishing, Vol. 6076. International Society for Optics and Photonics, 60760A."},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2017.01.054"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2500021"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1167\/17.1.32"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3077642"},{"key":"e_1_3_2_16_2","first-page":"273","volume-title":"Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding","author":"Graves Alex","year":"2013","unstructured":"Alex Graves, Navdeep Jaitly, and Abdel-rahman Mohamed. 2013. Hybrid speech recognition with deep bidirectional LSTM. In Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding. IEEE, 273\u2013278."},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2005.06.042"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/TBC.2014.2344471"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1142\/S0218488598000094"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_22_2","first-page":"1","volume-title":"Proceedings of the 9th International Conference on Quality of Multimedia Experience (QoMEX\u201917)","author":"Hosu Vlad","year":"2017","unstructured":"Vlad Hosu, Franz Hahn, Mohsen Jenadeleh, Hanhe Lin, Hui Men, Tam\u00e1s Szir\u00e1nyi, Shujun Li, and Dietmar Saupe. 2017. The Konstanz natural video database (KoNViD-1k). In Proceedings of the 9th International Conference on Quality of Multimedia Experience (QoMEX\u201917). IEEE, 1\u20136."},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2020.2967829"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.243"},{"key":"e_1_3_2_25_2","article-title":"XGC-VQA: A unified video quality assessment model for user, professionally, and occupationally-generated content","author":"Huang Xinhui","year":"2023","unstructured":"Xinhui Huang, Chunyi Li, Abdelhak Bentaleb, Roger Zimmermann, and Guangtao Zhai. 2023. XGC-VQA: A unified video quality assessment model for user, professionally, and occupationally-generated content. Retrieved from https:\/\/arXiv:2303.13859","journal-title":"Retrieved from https:\/\/arXiv:2303.13859"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46475-6_43"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3130541"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.224"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2017.2736018"},{"key":"e_1_3_2_30_2","first-page":"219","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV\u201918)","author":"Kim Woojae","year":"2018","unstructured":"Woojae Kim, Jongyoo Kim, Sewoong Ahn, Jinwoo Kim, and Sanghoon Lee. 2018. Deep video quality assessor: From spatio-temporal visual sensitivity to a convolutional neural aggregation network. In Proceedings of the European Conference on Computer Vision (ECCV\u201918). 219\u2013234."},{"key":"e_1_3_2_31_2","volume-title":"Proceedings of the 3rd International Conference on Learning Representations (ICLR\u201915)","author":"Kingma Diederik P.","year":"2015","unstructured":"Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations (ICLR\u201915), Yoshua Bengio and Yann LeCun (Eds.)."},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2923051"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2685941"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1145\/3343031.3351028"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-020-01408-w"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2016.2568752"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1145\/3240508.3240643"},{"key":"e_1_3_2_38_2","unstructured":"Wei Liu Andrew Rabinovich and Alexander C. Berg. 2015. ParseNet: Looking wider to see better. Retrieved from https:\/\/arxiv.org\/abs\/1506.04579"},{"issue":"3","key":"e_1_3_2_39_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3414837","article-title":"Blind image quality assessment by natural scene statistics and perceptual characteristics","volume":"16","author":"Liu Yutao","year":"2020","unstructured":"Yutao Liu, Ke Gu, Xiu Li, and Yongbing Zhang. 2020. Blind image quality assessment by natural scene statistics and perceptual characteristics. ACM Trans. Multimedia Comput. Commun. Appl. 16, 3 (2020), 1\u201391.","journal-title":"ACM Trans. Multimedia Comput. Commun. Appl."},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2015.2436908"},{"key":"e_1_3_2_41_2","first-page":"2400","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201916)","author":"Manasa K.","year":"2016","unstructured":"K. Manasa and Sumohana S. Channappayya. 2016. An optical flow-based no-reference video quality assessment algorithm. In Proceedings of the IEEE International Conference on Image Processing (ICIP\u201916). IEEE, 2400\u20132404."},{"key":"e_1_3_2_42_2","first-page":"III","volume-title":"Proceedings of the International Conference on Image Processing","volume":"3","author":"Marziliano Pina","year":"2002","unstructured":"Pina Marziliano, Frederic Dufaux, Stefan Winkler, and Touradj Ebrahimi. 2002. A no-reference perceptual blur metric. In Proceedings of the International Conference on Image Processing, Vol. 3. IEEE, III."},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2017.2788206"},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.1145\/3470970"},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2735192"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/TBC.2018.2816783"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2020.2988148"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1145\/3503161.3548064"},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2012.2214050"},{"issue":"1","key":"e_1_3_2_50_2","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1109\/TIP.2015.2502725","article-title":"A completely blind video integrity oracle","volume":"25","author":"Mittal Anish","year":"2015","unstructured":"Anish Mittal, Michele A. Saad, and Alan C. Bovik. 2015. A completely blind video integrity oracle. IEEE Trans. Image Process. 25, 1 (2015), 289\u2013300.","journal-title":"IEEE Trans. Image Process."},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2012.2227726"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2010.2043888"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2011.2147325"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSTSP.2009.2014806"},{"key":"e_1_3_2_55_2","first-page":"3","volume-title":"Proceedings of the Data Compression Conference","author":"Norkin Andrey","year":"2018","unstructured":"Andrey Norkin and Neil Birkbeck. 2018. Film grain synthesis for AV1 video codec. In Proceedings of the Data Compression Conference. IEEE, 3\u201312."},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2012.2219551"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2440172"},{"key":"e_1_3_2_58_2","first-page":"583","volume-title":"Proceedings of the Conference on Visual Communications and Image Processing","volume":"5150","author":"Pinson Margaret H.","year":"2003","unstructured":"Margaret H. Pinson and Stephen Wolf. 2003. An objective method for combining multiple subjective data sets. In Proceedings of the Conference on Visual Communications and Image Processing, Vol. 5150. International Society for Optics and Photonics, 583\u2013592."},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.73.814"},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2010.2045550"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2012.2191563"},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2014.2299154"},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.1109\/78.650093"},{"key":"e_1_3_2_65_2","first-page":"1153","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201911)","author":"Seshadrinathan Kalpana","year":"2011","unstructured":"Kalpana Seshadrinathan and Alan C. Bovik. 2011. Temporal hysteresis model of time varying subjective video quality. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201911). IEEE, 1153\u20131156."},{"key":"e_1_3_2_66_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2010.2042111"},{"key":"e_1_3_2_67_2","doi-asserted-by":"crossref","unstructured":"Wenhao Shen Mingliang Zhou Xingran Liao Weijia Jia Tao Xiang Bin Fang and Zhaowei Shang. 2022. An end-to-end no-reference video quality assessment method with hierarchical spatiotemporal feature representation. IEEE Transactions on Broadcasting 68 3 (2022) 651\u2013660.","DOI":"10.1109\/TBC.2022.3164332"},{"key":"e_1_3_2_68_2","first-page":"3285","volume-title":"Proceedings of the IEEE International Conference on Big Data (Big Data\u201919)","author":"Siami-Namini Sima","year":"2019","unstructured":"Sima Siami-Namini, Neda Tavakoli, and Akbar Siami Namin. 2019. The performance of LSTM and BiLSTM in forecasting time series. In Proceedings of the IEEE International Conference on Big Data (Big Data\u201919). IEEE, 3285\u20133292."},{"key":"e_1_3_2_69_2","volume-title":"Proceedings of the 3rd International Conference on Learning Representations (ICLR\u201915)","author":"Simonyan Karen","year":"2015","unstructured":"Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In Proceedings of the 3rd International Conference on Learning Representations (ICLR\u201915), Yoshua Bengio and Yann LeCun (Eds.)."},{"issue":"2","key":"e_1_3_2_70_2","doi-asserted-by":"crossref","first-page":"612","DOI":"10.1109\/TIP.2018.2869673","article-title":"Large-scale study of perceptual video quality","volume":"28","author":"Sinno Zeina","year":"2018","unstructured":"Zeina Sinno and Alan Conrad Bovik. 2018. Large-scale study of perceptual video quality. IEEE Trans. Image Process. 28, 2 (2018), 612\u2013627.","journal-title":"IEEE Trans. Image Process."},{"key":"e_1_3_2_71_2","first-page":"1750","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201919)","author":"Sinno Zeina","year":"2019","unstructured":"Zeina Sinno and Alan C. Bovik. 2019. Spatio-temporal measures of naturalness. In Proceedings of the IEEE International Conference on Image Processing (ICIP\u201919). IEEE, 1750\u20131754."},{"key":"e_1_3_2_72_2","doi-asserted-by":"publisher","DOI":"10.1145\/3503161.3548329"},{"key":"e_1_3_2_73_2","doi-asserted-by":"crossref","DOI":"10.1109\/JSTSP.2023.3270621","article-title":"Blind quality assessment for in-the-wild images via hierarchical feature fusion and iterative mixed database training","author":"Sun Wei","year":"2023","unstructured":"Wei Sun, Xiongkuo Min, Danyang Tu, Siwei Ma, and Guangtao Zhai. 2023. Blind quality assessment for in-the-wild images via hierarchical feature fusion and iterative mixed database training. IEEE J. Select. Top. Signal Process. (2023).","journal-title":"IEEE J. Select. Top. Signal Process."},{"key":"e_1_3_2_74_2","first-page":"1","volume-title":"Proceedings of the IEEE International Conference on Multimedia and Expo Workshops (ICMEW\u201921)","author":"Sun Wei","year":"2021","unstructured":"Wei Sun, Tao Wang, Xiongkuo Min, Fuwang Yi, and Guangtao Zhai. 2021. Deep learning based full-reference and no-reference quality assessment models for compressed UGC videos. In Proceedings of the IEEE International Conference on Multimedia and Expo Workshops (ICMEW\u201921). IEEE, 1\u20136."},{"key":"e_1_3_2_75_2","first-page":"6105","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Tan Mingxing","year":"2019","unstructured":"Mingxing Tan and Quoc Le. 2019. Efficientnet: Rethinking model scaling for convolutional neural networks. In Proceedings of the International Conference on Machine Learning. PMLR, 6105\u20136114."},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","DOI":"10.1145\/3464393"},{"key":"e_1_3_2_77_2","first-page":"141","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201920)","author":"Tu Zhengzhong","year":"2020","unstructured":"Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Neil Birkbeck, Balu Adsumilli, and Alan C. Bovik. 2020. A comparative evaluation of temporal pooling methods for blind video quality assessment. In Proceedings of the IEEE International Conference on Image Processing (ICIP\u201920). IEEE, 141\u2013145."},{"key":"e_1_3_2_78_2","first-page":"2712","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201920)","author":"Tu Zhengzhong","year":"2020","unstructured":"Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, and Alan C. Bovik. 2020. Bband index: A no-reference banding artifact predictor. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201920). IEEE, 2712\u20132716."},{"key":"e_1_3_2_79_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3072221"},{"key":"e_1_3_2_80_2","doi-asserted-by":"publisher","DOI":"10.1109\/OJSP.2021.3090333"},{"key":"e_1_3_2_81_2","first-page":"467","volume-title":"Proceedings of the Congress on Image and Signal Processing","volume":"1","author":"Wang Xin","year":"2008","unstructured":"Xin Wang, Baofeng Tian, Chao Liang, and Dongcheng Shi. 2008. Blind image quality assessment for measuring image blur. In Proceedings of the Congress on Image and Signal Processing, Vol. 1. IEEE, 467\u2013470."},{"key":"e_1_3_2_82_2","first-page":"1","volume-title":"Proceedings of the IEEE 21st International Workshop on Multimedia Signal Processing (MMSP\u201919)","author":"Wang Yilin","year":"2019","unstructured":"Yilin Wang, Sasi Inguva, and Balu Adsumilli. 2019. YouTube UGC dataset for video compression research. In Proceedings of the IEEE 21st International Workshop on Multimedia Signal Processing (MMSP\u201919). IEEE, 1\u20135."},{"key":"e_1_3_2_83_2","first-page":"2067","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201916)","author":"Wang Yilin","year":"2016","unstructured":"Yilin Wang, Sang-Uok Kum, Chao Chen, and Anil Kokaram. 2016. A perceptual visibility metric for banding artifacts. In Proceedings of the IEEE International Conference on Image Processing (ICIP\u201916). IEEE, 2067\u20132071."},{"key":"e_1_3_2_84_2","first-page":"981","volume-title":"Proceedings International Conference on Image Processing","volume":"3","author":"Wang Zhou","year":"2000","unstructured":"Zhou Wang, Alan C. Bovik, and Brian L. Evan. 2000. Blind measurement of blocking artifacts in images. In Proceedings International Conference on Image Processing, Vol. 3. IEEE, 981\u2013984."},{"key":"e_1_3_2_85_2","volume-title":"Proceedings of the Electronic Imaging Science and Technology Conference","author":"Watson Andrew B.","year":"1997","unstructured":"Andrew B. Watson and Cynthia H. Null. 1997. Digital images and human vision. In Proceedings of the Electronic Imaging Science and Technology Conference."},{"key":"e_1_3_2_86_2","first-page":"538","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Wu Haoning","year":"2022","unstructured":"Haoning Wu, Chaofeng Chen, Jingwen Hou, Liang Liao, Annan Wang, Wenxiu Sun, Qiong Yan, and Weisi Lin. 2022. Fast-vqa: Efficient end-to-end video quality assessment with fragment sampling. In Proceedings of the European Conference on Computer Vision. Springer, 538\u2013554."},{"key":"e_1_3_2_87_2","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV\u201923)","author":"Wu Haoning","year":"2023","unstructured":"Haoning Wu, Erli Zhang, Liang Liao, Chaofeng Chen, Jingwen Hou, Annan Wang, Wenxiu Sun, Qiong Yan, and Weisi Lin. 2023. Exploring video quality assessment on user generated contents from aesthetic and technical perspectives. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV\u201923)."},{"key":"e_1_3_2_88_2","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475486"},{"key":"e_1_3_2_89_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2016.2585880"},{"key":"e_1_3_2_90_2","first-page":"491","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201914)","author":"Xu Jingtao","year":"2014","unstructured":"Jingtao Xu, Peng Ye, Yong Liu, and David Doermann. 2014. No-reference video quality assessment via feature learning. In Proceedings of the IEEE International Conference on Image Processing (ICIP\u201914). IEEE, 491\u2013495."},{"key":"e_1_3_2_91_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2014.2355716"},{"key":"e_1_3_2_92_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2938900"},{"key":"e_1_3_2_93_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2020.03.072"},{"key":"e_1_3_2_94_2","first-page":"1098","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Ye Peng","year":"2012","unstructured":"Peng Ye, Jayant Kumar, Le Kang, and David Doermann. 2012. Unsupervised feature learning framework for no-reference image quality assessment. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 1098\u20131105."},{"key":"e_1_3_2_95_2","first-page":"1414","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201921)","author":"Yi Fuwang","year":"2021","unstructured":"Fuwang Yi, Mianyi Chen, Wei Sun, Xiongkuo Min, Yuan Tian, and Guangtao Zhai. 2021. Attention based network for no-reference UGC video quality assessment. In Proceedings of the IEEE International Conference on Image Processing (ICIP\u201921). IEEE, 1414\u20131418."},{"key":"e_1_3_2_96_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00363"},{"key":"e_1_3_2_97_2","first-page":"2349","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201919)","author":"You Junyong","year":"2019","unstructured":"Junyong You and Jari Korhonen. 2019. Deep neural networks for no-reference video quality assessment. In Proceedings of the IEEE International Conference on Image Processing (ICIP\u201919). IEEE, 2349\u20132353."},{"key":"e_1_3_2_98_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11432-019-2757-1"},{"key":"e_1_3_2_99_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2426416"},{"key":"e_1_3_2_100_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00068"},{"key":"e_1_3_2_101_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2502579"},{"key":"e_1_3_2_102_2","doi-asserted-by":"publisher","DOI":"10.1117\/1.JEI.22.4.043025"},{"key":"e_1_3_2_103_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.image.2014.05.004"},{"key":"e_1_3_2_104_2","doi-asserted-by":"publisher","DOI":"10.1145\/2659520"},{"key":"e_1_3_2_105_2","first-page":"1746","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhang Zicheng","year":"2023","unstructured":"Zicheng Zhang, Wei Wu, Wei Sun, Danyang Tu, Wei Lu, Xiongkuo Min, Ying Chen, and Guangtao Zhai. 2023. MD-VQA: Multi-dimensional quality assessment for UGC live videos. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1746\u20131755."},{"key":"e_1_3_2_106_2","doi-asserted-by":"publisher","DOI":"10.1145\/3183512"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3632178","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3632178","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:49:56Z","timestamp":1750286996000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3632178"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,11]]},"references-count":105,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,4,30]]}},"alternative-id":["10.1145\/3632178"],"URL":"https:\/\/doi.org\/10.1145\/3632178","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,12,11]]},"assertion":[{"value":"2023-04-13","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-10-22","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-12-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}