{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,29]],"date-time":"2026-01-29T23:19:05Z","timestamp":1769728745665,"version":"3.49.0"},"reference-count":79,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2017,7,12]],"date-time":"2017-07-12T00:00:00Z","timestamp":1499817600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Major Scientific and Technological Innovation Project of Hubei Province","award":["2015AAA013"],"award-info":[{"award-number":["2015AAA013"]}]},{"DOI":"10.13039\/501100001809","name":"Natural Science Foundation of China","doi-asserted-by":"crossref","award":["U1536203, 61572493, 61572214 and 61672254"],"award-info":[{"award-number":["U1536203, 61572493, 61572214 and 61672254"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Nature Science Foundation of the Open University of China","award":["G16F3702Z and G16F2505Q"],"award-info":[{"award-number":["G16F3702Z and G16F2505Q"]}]},{"name":"Major Scientific Research Project of Yunnan Provincial Education Department","award":["2015Z169"],"award-info":[{"award-number":["2015Z169"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2017,9,30]]},"abstract":"<jats:p>Adult image and video recognition is an important and challenging problem in the real world. Low-level feature cues do not produce good enough information, especially when the dataset is very large and has various data distributions. This issue raises a serious problem for conventional approaches. In this article, we tackle this problem by proposing a deep multicontext network with fine-to-coarse strategy for adult image and video recognition. 
We employ a deep convolutional network to model fusion features of sensitive objects in images. Global contexts and local contexts are both taken into consideration and are jointly modeled in a unified multicontext deep learning framework. To make the model more discriminative for diverse target objects, we investigate a novel hierarchical method, and a task-specific fine-to-coarse strategy is designed to make the multicontext modeling more suitable for adult object recognition. Furthermore, some recently proposed deep models are investigated. Our approach is extensively evaluated on four different datasets. One dataset is used for ablation experiments, whereas others are used for generalization experiments. Results show significant and consistent improvements over the state-of-the-art methods.<\/jats:p>","DOI":"10.1145\/3057733","type":"journal-article","created":{"date-parts":[[2017,7,13]],"date-time":"2017-07-13T14:29:57Z","timestamp":1499956197000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":33,"title":["Adult Image and Video Recognition by a Deep Multicontext Network and Fine-to-Coarse Strategy"],"prefix":"10.1145","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2905-426X","authenticated-orcid":false,"given":"Xinyu","family":"Ou","sequence":"first","affiliation":[{"name":"Huazhong University of Science and Technology, Chinese Academy of Sciences, Yunnan Open University, Kunming, China"}]},{"given":"Hefei","family":"Ling","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology, Wuhan, China"}]},{"given":"Han","family":"Yu","sequence":"additional","affiliation":[{"name":"Chinese Academy of Sciences, Beijing, China"}]},{"given":"Ping","family":"Li","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology, Wuhan, 
China"}]},{"given":"Fuhao","family":"Zou","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology, Wuhan, China"}]},{"given":"Si","family":"Liu","sequence":"additional","affiliation":[{"name":"Chinese Academy of Sciences, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2017,7,12]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCI.2015.2405317"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2003.10.007"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.5555\/1888089.1888123"},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the 4th European Conference on Computer Vision (ECCV\u201996)","author":"Bregler Christoph","unstructured":"Christoph Bregler , Margaret M. Fleck , and David A. Forsyth . 1996. Finding naked people . In Proceedings of the 4th European Conference on Computer Vision (ECCV\u201996) . 593--602. Christoph Bregler, Margaret M. Fleck, and David A. Forsyth. 1996. Finding naked people. In Proceedings of the 4th European Conference on Computer Vision (ECCV\u201996). 593--602."},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the 22nd European Signal Processing Conference. 1681--1685","author":"Caetano Carlos","year":"2014","unstructured":"Carlos Caetano , Sandra Eliza Fontes de Avila , Silvio Jamil Ferzoli Guimar\u00e3es , and Arnaldo de Albuquerque Ara\u00fajo . 2014 . Pornography detection using BossaNova video descriptor . In Proceedings of the 22nd European Signal Processing Conference. 1681--1685 . Carlos Caetano, Sandra Eliza Fontes de Avila, Silvio Jamil Ferzoli Guimar\u00e3es, and Arnaldo de Albuquerque Ara\u00fajo. 2014. Pornography detection using BossaNova video descriptor. In Proceedings of the 22nd European Signal Processing Conference. 1681--1685."},{"key":"e_1_2_1_6_1","volume-title":"Criminal Law of the People\u2019s Republic of China. 
Retrieved","author":"PRC.","year":"2017","unstructured":"China PRC. 2015. Criminal Law of the People\u2019s Republic of China. Retrieved April 1, 2017 , from http:\/\/www.lawtime.cn\/faguizt\/23.html. ChinaPRC. 2015. Criminal Law of the People\u2019s Republic of China. Retrieved April 1, 2017, from http:\/\/www.lawtime.cn\/faguizt\/23.html."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/2354409.2354694"},{"key":"e_1_2_1_8_1","unstructured":"Jifeng Dai Yi Li Kaiming He and Jian Sun. 2016. R-FCN: Object detection via region-based fully convolutional networks. arXiv:1605.06409.  Jifeng Dai Yi Li Kaiming He and Jian Sun. 2016. R-FCN: Object detection via region-based fully convolutional networks. arXiv:1605.06409."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2012.09.007"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/1888150.1888157"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2008.4761366"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICNC.2014.6975895"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-009-0275-4"},{"key":"e_1_2_1_15_1","doi-asserted-by":"crossref","unstructured":"C. Fellbaum (Ed.). 1998. WordNet: An Electronic Lexical Database. MIT Press Cambridge MA.  C. Fellbaum (Ed.). 1998. WordNet: An Electronic Lexical Database. 
MIT Press Cambridge MA.","DOI":"10.7551\/mitpress\/7287.001.0001"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.5555\/1886063.1886121"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2629673"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1008145029462"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.5555\/794189.794448"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.5555\/646469.691882"},{"key":"e_1_2_1_21_1","volume-title":"Proceedings of the 7th IEEE International Conference on Computer Vision (ICCV\u201999)","author":"David","unstructured":"David A. Forsyth and Sergey Ioffe. 1999. Finding people by sampling . In Proceedings of the 7th IEEE International Conference on Computer Vision (ICCV\u201999) . 1092--1097. David A. Forsyth and Sergey Ioffe. 1999. Finding people by sampling. In Proceedings of the 7th IEEE International Conference on Computer Vision (ICCV\u201999). 1092--1097."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1011179004708"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFOCOM.2016.7524606"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.81"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2008.4587410"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2389824"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_2_1_29_1","unstructured":"Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016b. Identity mappings in deep residual networks. arXiv:1603.05027.  Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016b. Identity mappings in deep residual networks. 
arXiv:1603.05027."},{"key":"e_1_2_1_30_1","volume-title":"Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems II. 782--788","author":"Ioffe Sergey","unstructured":"Sergey Ioffe and David A. Forsyth . 1998. Learning to find pictures of people . In Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems II. 782--788 . Sergey Ioffe and David A. Forsyth. 1998. Learning to find pictures of people. In Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems II. 782--788."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2647868.2654889"},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the 2001 International Conferences on Info-Tech and Info-Net","volume":"3","author":"Jiao Feng","year":"2001","unstructured":"Feng Jiao , Wen Gao , Lijuan Duan , and Guoqin Cui . 2001 . Detecting adult image using multiple features . In Proceedings of the 2001 International Conferences on Info-Tech and Info-Net , Vol. 3 . 378--383. Feng Jiao, Wen Gao, Lijuan Duan, and Guoqin Cui. 2001. Detecting adult image using multiple features. In Proceedings of the 2001 International Conferences on Info-Tech and Info-Net, Vol. 3. 378--383."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.223"},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the 2012 Conference on Advances in Neural Information Processing Systems. 1106--1114","author":"Krizhevsky Alex","unstructured":"Alex Krizhevsky , Ilya Sutskever , and Geoffrey E. Hinton . 2012. ImageNet classification with deep convolutional neural networks . In Proceedings of the 2012 Conference on Advances in Neural Information Processing Systems. 1106--1114 . Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In Proceedings of the 2012 Conference on Advances in Neural Information Processing Systems. 
1106--1114."},{"key":"e_1_2_1_35_1","unstructured":"LegalDictionary. 2015. Pornography. Retrieved April 1 2017 from http:\/\/legal-dictionary.thefreedictionary.com\/pornography.  LegalDictionary. 2015. Pornography. Retrieved April 1 2017 from http:\/\/legal-dictionary.thefreedictionary.com\/pornography."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2014.2381872"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2819000"},{"key":"e_1_2_1_38_1","volume-title":"Proceedings of the 2004 Asian Conference on Computer Vision.","author":"Liang K. M.","unstructured":"K. M. Liang , S. D. Scott , and M. Waqas . 2004. Detecting pornographic images . In Proceedings of the 2004 Asian Conference on Computer Vision. K. M. Liang, S. D. Scott, and M. Waqas. 2004. Detecting pornographic images. In Proceedings of the 2004 Asian Conference on Computer Vision."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2015.7301269"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2015.7301269"},{"key":"e_1_2_1_41_1","unstructured":"Min Lin Qiang Chen and Shuicheng Yan. 2013. Network in network. arXiv:1312.4400.  Min Lin Qiang Chen and Shuicheng Yan. 2013. Network in network. arXiv:1312.4400."},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.170"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2013.2285526"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298748"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/2393347.2396470"},{"key":"e_1_2_1_46_1","volume-title":"Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI\u201916)","author":"Liu Si","year":"2016","unstructured":"Si Liu , Xinyu Ou , Ruihe Qian , Wei Wang , and Xiaochun Cao . 2016 . Makeup like a superstar: Deep localized makeup transfer network . 
In Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI\u201916) . 2568--2575. Si Liu, Xinyu Ou, Ruihe Qian, Wei Wang, and Xiaochun Cao. 2016. Makeup like a superstar: Deep localized makeup transfer network. In Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI\u201916). 2568--2575."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.5555\/2354409.2354954"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"e_1_2_1_49_1","unstructured":"Mohamed Moustafa. 2015. Applying deep learning to classify pornographic images and videos. arXiv:1511.08899.  Mohamed Moustafa. 2015. Applying deep learning to classify pornographic images and videos. arXiv:1511.08899."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/TBC.2016.2580920"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2014.11.006"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1013200319198"},{"key":"e_1_2_1_53_1","volume-title":"Proceedings of the 2015 Conference on Advances in Neural Information Processing Systems. 91--99","author":"Ren Shaoqing","year":"2015","unstructured":"Shaoqing Ren , Kaiming He , Ross B. Girshick , and Jian Sun . 2015 . Faster R-CNN: Towards real-time object detection with region proposal networks . In Proceedings of the 2015 Conference on Advances in Neural Information Processing Systems. 91--99 . Shaoqing Ren, Kaiming He, Ross B. Girshick, and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In Proceedings of the 2015 Conference on Advances in Neural Information Processing Systems. 91--99."},{"key":"e_1_2_1_54_1","volume-title":"Proceedings of the 1st International Conference on Computer Vision Theory and Applications (VISAPP\u201906)","author":"Rowley Henry A.","year":"2006","unstructured":"Henry A. Rowley , Yushi Jing , and Shumeet Baluja . 
2006 . Large scale image-based adult-content filtering . In Proceedings of the 1st International Conference on Computer Vision Theory and Applications (VISAPP\u201906) . 290--296. Henry A. Rowley, Yushi Jing, and Shumeet Baluja. 2006. Large scale image-based adult-content filtering. In Proceedings of the 1st International Conference on Computer Vision Theory and Applications (VISAPP\u201906). 290--296."},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0816-y"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2014.7026079"},{"key":"e_1_2_1_57_1","volume-title":"Proceedings of the 2nd International Conference on Distributed Computing Systems. 509--512","author":"Saltzer Jerome H.","unstructured":"Jerome H. Saltzer , David P. Reed , and David D. Clark . 1981. End-to-end arguments in system design . In Proceedings of the 2nd International Conference on Distributed Computing Systems. 509--512 . Jerome H. Saltzer, David P. Reed, and David D. Clark. 1981. End-to-end arguments in system design. In Proceedings of the 2nd International Conference on Distributed Computing Systems. 509--512."},{"key":"e_1_2_1_58_1","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556.  Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556."},{"key":"e_1_2_1_59_1","unstructured":"Rupesh Kumar Srivastava Klaus Greff and J\u00fcrgen Schmidhuber. 2015. Highway networks. arXiv:1505.00387.  Rupesh Kumar Srivastava Klaus Greff and J\u00fcrgen Schmidhuber. 2015. Highway networks. 
arXiv:1505.00387."},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/2735952"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.128"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigMM.2015.36"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0140-3664(98)00203-5"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2016.2590944"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2016.01.022"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1109\/DMDCM.2011.36"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2014.2307862"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2016.2591583"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2016.2636090"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICACT.2014.6779041"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1109\/TGRS.2014.2357078"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/TGRS.2015.2488681"},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1109\/TGRS.2016.2569141"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298731"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICNIDC.2010.5657916"},{"key":"e_1_2_1_77_1","volume-title":"Proceedings of the 2004 IEEE International Conference on Multimedia and Expo (ICME\u201904)","author":"Zheng Huicheng","year":"2004","unstructured":"Huicheng Zheng , Hongmei Liu , and Mohamed Daoudi . 2004 . Blocking objectionable images: Adult images and harmful symbols . In Proceedings of the 2004 IEEE International Conference on Multimedia and Expo (ICME\u201904) . 1223--1226. Huicheng Zheng, Hongmei Liu, and Mohamed Daoudi. 2004. 
Blocking objectionable images: Adult images and harmful symbols. In Proceedings of the 2004 IEEE International Conference on Multimedia and Expo (ICME\u201904). 1223--1226."},{"key":"e_1_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.3233\/IFS-141378"},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2016.2601065"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3057733","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3057733","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:24:02Z","timestamp":1750220642000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3057733"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,7,12]]},"references-count":79,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2017,9,30]]}},"alternative-id":["10.1145\/3057733"],"URL":"https:\/\/doi.org\/10.1145\/3057733","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"value":"2157-6904","type":"print"},{"value":"2157-6912","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,7,12]]},"assertion":[{"value":"2016-10-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-02-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-07-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}