{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:26:00Z","timestamp":1750307160417,"version":"3.41.0"},"reference-count":51,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2011,10,1]],"date-time":"2011-10-01T00:00:00Z","timestamp":1317427200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["60825204, 60935002"],"award-info":[{"award-number":["60825204, 60935002"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2011,10]]},"abstract":"<jats:p>In this article, we develop an integrated adult-content recognition system which can detect adult images, adult videos, and adult Web page bags, where a Web page bag consists of a Web page and a predefined number of Web pages linked to it through hyperlinks. In our adult image-recognition algorithm, we model skin patches rather than skin pixels, resulting in better results than state-of-the-art algorithms which model skin pixels. In our adult video-recognition algorithm, information from the accompanying audio section around an image in an adult video is used to obtain a prior classification of the image. The algorithm achieves a better performance than the ones which use image information alone or audio information alone. The adult Web page bag recognition is carried out using multi-instance learning based on the combination of classifying texts, images and videos in Web pages. Both the speed and the accuracy for recognizing the Web adult content are increased, in contrast to recognizing Web pages one-by-one.<\/jats:p>","DOI":"10.1145\/2037676.2037685","type":"journal-article","created":{"date-parts":[[2011,11,8]],"date-time":"2011-11-08T13:32:01Z","timestamp":1320759121000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Recognition of adult images, videos, and web page bags"],"prefix":"10.1145","volume":"7S","author":[{"given":"Weiming","family":"Hu","sequence":"first","affiliation":[{"name":"Institute of Automation, Chinese Academy of Sciences"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haiqiang","family":"Zuo","sequence":"additional","affiliation":[{"name":"Institute of Automation, Chinese Academy of Sciences"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ou","family":"Wu","sequence":"additional","affiliation":[{"name":"Institute of Automation, Chinese Academy of Sciences"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yunfei","family":"Chen","sequence":"additional","affiliation":[{"name":"Institute of Automation, Chinese Academy of Sciences"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhongfei","family":"Zhang","sequence":"additional","affiliation":[{"name":"State University of New York"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Suter","sequence":"additional","affiliation":[{"name":"University of Adelaide"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2011,11,4]]},"reference":[{"volume-title":"Proceedings of the Neural Information Processing Systems Conference. MIT Press, 561--568","author":"Andrews S.","key":"e_1_2_1_1_1","unstructured":"Andrews , S. , Tsochantaridis , I. , and Hofmann , T . 2003. Support Vector machines for multiple-instance learning . In Proceedings of the Neural Information Processing Systems Conference. MIT Press, 561--568 . Andrews, S., Tsochantaridis, I., and Hofmann, T. 2003. Support Vector machines for multiple-instance learning. In Proceedings of the Neural Information Processing Systems Conference. MIT Press, 561--568."},{"key":"e_1_2_1_2_1","volume-title":"Electronic Imaging: Real-Time Image Processing.","author":"Aragon C. R.","year":"2007","unstructured":"Aragon , C. R. and Aragon , D. B . 2007 . A fast contour descriptor algorithm for supernova image classification. In Proceedings of SPIE Annual Symposium on Electronic Imaging: Real-Time Image Processing. Vol. 6496 , 649607.1-649607.12. Aragon, C. R. and Aragon, D. B. 2007. A fast contour descriptor algorithm for supernova image classification. In Proceedings of SPIE Annual Symposium on Electronic Imaging: Real-Time Image Processing. Vol. 6496, 649607.1-649607.12."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2003.10.007"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2010.189"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"volume-title":"Proceedings of the British Machine Vision Conference. 491--500","author":"Brown D.","key":"e_1_2_1_6_1","unstructured":"Brown , D. , Craw , I. , and Lewthwaite , J . 2001. A SOM based approach to skin detection with application in real time systems . In Proceedings of the British Machine Vision Conference. 491--500 . Brown, D., Craw, I., and Lewthwaite, J. 2001. A SOM based approach to skin detection with application in real time systems. In Proceedings of the British Machine Vision Conference. 491--500."},{"volume-title":"Semantic-based audio recognition and retrieval. Master Thesis","author":"Buchanan C. R.","key":"e_1_2_1_7_1","unstructured":"Buchanan , C. R. 2005. Semantic-based audio recognition and retrieval. Master Thesis , University of Edinburgh. Buchanan, C. R. 2005. Semantic-based audio recognition and retrieval. Master Thesis, University of Edinburgh."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.177"},{"volume-title":"Proceedings of International Conference on Pattern Recognition. 1--4.","author":"Deselaers T.","key":"e_1_2_1_9_1","unstructured":"Deselaers , T. , Pimenidis , L. , and Ney , H . 2008. Bag-of-visual-words models for adult image classification and filtering . In Proceedings of International Conference on Pattern Recognition. 1--4. Deselaers, T., Pimenidis, L., and Ney, H. 2008. Bag-of-visual-words models for adult image classification and filtering. In Proceedings of International Conference on Pattern Recognition. 1--4."},{"volume-title":"Proceedings of the IEEE International Conference on Networks. 325--330","author":"Du R.","key":"e_1_2_1_10_1","unstructured":"Du , R. , Safavi-Naini , R. , and Susilo , W . 2003. Web filtering using text classification . In Proceedings of the IEEE International Conference on Networks. 325--330 . Du, R., Safavi-Naini, R., and Susilo, W. 2003. Web filtering using text classification. In Proceedings of the IEEE International Conference on Networks. 325--330."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/AIPR.2008.4906438"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000022288.19776.77"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1008145029462"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2006.34"},{"key":"e_1_2_1_15_1","first-page":"1","article-title":"Multi-Layer objectionable video classification system using local-global information","volume":"49","author":"Han S.","year":"2005","unstructured":"Han , S. , Jeong , C. , and Nam , T. 2005 . Multi-Layer objectionable video classification system using local-global information . In Proceedings of the WSEAS International Conference on Computers. 49 , 1 -- 5 . Han, S., Jeong, C., and Nam, T. 2005. Multi-Layer objectionable video classification system using local-global information. In Proceedings of the WSEAS International Conference on Computers. 49, 1--5.","journal-title":"Proceedings of the WSEAS International Conference on Computers."},{"key":"e_1_2_1_16_1","first-page":"4792","article-title":"Statistical and structural approaches to filtering internet pornography","volume":"5","author":"Ho W. H.","year":"2004","unstructured":"Ho , W. H. and Watters , P. A. 2004 . Statistical and structural approaches to filtering internet pornography . In Proceedings of the IEEE International Conference on System, Man and Cybernetics. 5 , 4792 -- 4798 . Ho, W. H. and Watters, P. A. 2004. Statistical and structural approaches to filtering internet pornography. In Proceedings of the IEEE International Conference on System, Man and Cybernetics. 5, 4792--4798.","journal-title":"Proceedings of the IEEE International Conference on System, Man and Cybernetics."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.1000242"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2007.1133"},{"volume-title":"Proceedings of the IEEE International Conference on Computer Vision. 1092--1097","author":"Ioffe S.","key":"e_1_2_1_19_1","unstructured":"Ioffe , S. and Forsyth , D . 1999. Finding people by sampling . In Proceedings of the IEEE International Conference on Computer Vision. 1092--1097 . Ioffe, S. and Forsyth, D. 1999. Finding people by sampling. In Proceedings of the IEEE International Conference on Computer Vision. 1092--1097."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1011179004708"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1631272.1631366"},{"volume-title":"Proceedings of the Indian Conference on Computer Vision, Graphics and Image Processing. 276--281","author":"Jedynak B.","key":"e_1_2_1_22_1","unstructured":"Jedynak , B. , Zheng , H. , Daoudi , M. , and Barret , D . 2002. Maximum entropy models for skin detection . In Proceedings of the Indian Conference on Computer Vision, Graphics and Image Processing. 276--281 . Jedynak, B., Zheng, H., Daoudi, M., and Barret, D. 2002. Maximum entropy models for skin detection. In Proceedings of the Indian Conference on Computer Vision, Graphics and Image Processing. 276--281."},{"key":"e_1_2_1_23_1","first-page":"378","article-title":"Detecting adult image using multiple features","volume":"3","author":"Jiao F.","year":"2001","unstructured":"Jiao , F. , Gao , W. , Duan , L. , and Cui , G. 2001 . Detecting adult image using multiple features . In Proceedings of the IEEE International Conference on Info-Tech and Info-Net. Vol. 3 , 378 -- 383 . Jiao, F., Gao, W., Duan, L., and Cui, G. 2001. Detecting adult image using multiple features. In Proceedings of the IEEE International Conference on Info-Tech and Info-Net. Vol. 3, 378--383.","journal-title":"Proceedings of the IEEE International Conference on Info-Tech and Info-Net."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1013200319198"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the International Conference on Computer as a Tool.","volume":"2","author":"Kovac J.","unstructured":"Kovac , J. , Peer , P. , and Solina , F . 2003. Human skin colour clustering for face detection . In Proceedings of the International Conference on Computer as a Tool. Vol. 2 , 144--148. Kovac, J., Peer, P., and Solina, F. 2003. Human skin colour clustering for face detection. In Proceedings of the International Conference on Computer as a Tool. Vol. 2, 144--148."},{"volume-title":"Proceedings of the International Conference on Imaging Science, Systems and Technology.","author":"Lee J. Y.","key":"e_1_2_1_26_1","unstructured":"Lee , J. Y. and Yoo , S. I . 2002. An elliptical boundary model for skin color detection . In Proceedings of the International Conference on Imaging Science, Systems and Technology. Lee, J. Y. and Yoo, S. I. 2002. An elliptical boundary model for skin color detection. In Proceedings of the International Conference on Imaging Science, Systems and Technology."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2005.858414"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCE.2009.5174439"},{"volume-title":"Proceedings of the Asian Conference on Computer Vision. 497--502","author":"Liang K. M.","key":"e_1_2_1_29_1","unstructured":"Liang , K. M. , Scott , S. D. , and Waqas , M . 2004. Detecting pornographic images . In Proceedings of the Asian Conference on Computer Vision. 497--502 . Liang, K. M., Scott, S. D., and Waqas, M. 2004. Detecting pornographic images. In Proceedings of the Asian Conference on Computer Vision. 497--502."},{"volume-title":"Proceedings of the IEEE International Conference on Multimedia and Expo. 1472--1475","author":"Lienhart R.","key":"e_1_2_1_30_1","unstructured":"Lienhart , R. and Hauke , R . 2009. Filtering adult image content with topic models . In Proceedings of the IEEE International Conference on Multimedia and Expo. 1472--1475 . Lienhart, R. and Hauke, R. 2009. Filtering adult image content with topic models. In Proceedings of the IEEE International Conference on Multimedia and Expo. 1472--1475."},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of the IEEE International Conference on Image Processing.","volume":"1","author":"Lienhart R.","unstructured":"Lienhart , R. and Maydt , J . 2002. An extended set of haar-like features for rapid object detection . In Proceedings of the IEEE International Conference on Image Processing. Vol. 1 , I-900-I-903. Lienhart, R. and Maydt, J. 2002. An extended set of haar-like features for rapid object detection. In Proceedings of the IEEE International Conference on Image Processing. Vol. 1, I-900-I-903."},{"volume-title":"Proceedings of the European Signal Processing Conference. 1552--1556","author":"Lopes A. P. B.","key":"e_1_2_1_32_1","unstructured":"Lopes , A. P. B. , Avila 1, S. E. F. de., Peixoto , A. N. A. , Oliveira 1, R. S., and Araujo , A . de A. 2009. A bag-of-features approach based on hue-sift descriptor for nude detection . In Proceedings of the European Signal Processing Conference. 1552--1556 . Lopes, A. P. B., Avila1, S. E. F. de., Peixoto, A. N. A., Oliveira1, R. S., and Araujo, A. de A. 2009. A bag-of-features approach based on hue-sift descriptor for nude detection. In Proceedings of the European Signal Processing Conference. 1552--1556."},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the International Conference on Computer Vision Theory and Applications.","volume":"1","author":"Rowley H. A.","unstructured":"Rowley , H. A. , Jing , Y. S. , and Baluja , S . 2006. Large scale image-based adult-content filtering . In Proceedings of the International Conference on Computer Vision Theory and Applications. Vol. 1 , 290--296. Rowley, H. A., Jing, Y. S., and Baluja, S. 2006. Large scale image-based adult-content filtering. In Proceedings of the International Conference on Computer Vision Theory and Applications. Vol. 1, 290--296."},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the Workshop on Image Analysis for Multimedia Interactive Services.","author":"Sadka A. H.","year":"2004","unstructured":"Sadka , A. H. 2004 . Visnet: NoE on networked audiovisual media technologies . In Proceedings of the Workshop on Image Analysis for Multimedia Interactive Services. Sadka, A. H. 2004. Visnet: NoE on networked audiovisual media technologies. In Proceedings of the Workshop on Image Analysis for Multimedia Interactive Services."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90021-0"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2007.08.002"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2006.134"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.70"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:MACH.0000008084.60811.49"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.75"},{"key":"e_1_2_1_41_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.","volume":"1","author":"Viola P.","unstructured":"Viola , P. and Jones , M. J . 2001. Rapid object detection using a boosted cascade of simple features . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Vol. 1 , 511--518. Viola, P. and Jones, M. J. 2001. Rapid object detection using a boosted cascade of simple features. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Vol. 1, 511--518."},{"volume-title":"Proceedings of the International Conference on Machine Learning. 1119--1125","author":"Wang J.","key":"e_1_2_1_42_1","unstructured":"Wang , J. and Zuchker , J . -D. 2000. Solving the multiple-instance problem: A lazy learning approach . In Proceedings of the International Conference on Machine Learning. 1119--1125 . Wang, J. and Zuchker, J.-D. 2000. Solving the multiple-instance problem: A lazy learning approach. In Proceedings of the International Conference on Machine Learning. 1119--1125."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0140-3664(98)00203-5"},{"key":"e_1_2_1_44_1","volume-title":"Plagemann and V. Goebel Eds.","volume":"1483","author":"Wang J. Z.","unstructured":"Wang , J. Z. , Li , J. , Wiederhold , G. , and Firschein , O . 1998. Classifying objectionable websites based on image content. In Lecture Notes in Computer Science, Special Issue on Interactive Distributed Multimedia Systems and Telecommunication Services, T . Plagemann and V. Goebel Eds. , vol. 1483 , 113--124. Wang, J. Z., Li, J., Wiederhold, G., and Firschein, O. 1998. Classifying objectionable websites based on image content. In Lecture Notes in Computer Science, Special Issue on Interactive Distributed Multimedia Systems and Telecommunication Services, T. Plagemann and V. Goebel Eds., vol. 1483, 113--124."},{"key":"e_1_2_1_45_1","first-page":"49","article-title":"Applications of computational verbs to effective and realtime image understanding","volume":"4","author":"Yang T.","year":"2006","unstructured":"Yang , T. 2006 . Applications of computational verbs to effective and realtime image understanding . Int. J. Comput. Cognition , 4 , 1, 49 -- 67 . Yang, T. 2006. Applications of computational verbs to effective and realtime image understanding. Int. J. Comput. Cognition, 4, 1, 49--67.","journal-title":"Int. J. Comput. Cognition"},{"key":"e_1_2_1_46_1","volume-title":"Proceedings of the International Conference on Machine Learning. 410--420","author":"Yang Y.","year":"1997","unstructured":"Yang , Y. 1997 . A comparative study on feature selection in text categorization . In Proceedings of the International Conference on Machine Learning. 410--420 . Yang, Y. 1997. A comparative study on feature selection in text categorization. In Proceedings of the International Conference on Machine Learning. 410--420."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2008.27"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.5565\/rev\/elcvia.78"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10489-005-5602-z"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/1772690.1772887"},{"volume-title":"Proceedings of the IEEE International Conference on Multimedia and Expo. 37--40","author":"Zuo H.","key":"e_1_2_1_51_1","unstructured":"Zuo , H. , Wu , O. , Hu , W. , and Xu , B . 2008. Recognition of blue movies by fusion of audio and video . In Proceedings of the IEEE International Conference on Multimedia and Expo. 37--40 . Zuo, H., Wu, O., Hu, W., and Xu, B. 2008. Recognition of blue movies by fusion of audio and video. In Proceedings of the IEEE International Conference on Multimedia and Expo. 37--40."}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2037676.2037685","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2037676.2037685","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T09:54:28Z","timestamp":1750240468000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2037676.2037685"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,10]]},"references-count":51,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,10]]}},"alternative-id":["10.1145\/2037676.2037685"],"URL":"https:\/\/doi.org\/10.1145\/2037676.2037685","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2011,10]]},"assertion":[{"value":"2010-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2011-07-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2011-11-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}