{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:23:27Z","timestamp":1750220607115,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,11,30]],"date-time":"2020-11-30T00:00:00Z","timestamp":1606694400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,11,30]]},"DOI":"10.1145\/3428658.3431079","type":"proceedings-article","created":{"date-parts":[[2020,11,25]],"date-time":"2020-11-25T17:17:29Z","timestamp":1606324649000},"page":"113-120","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Evaluating Early Fusion Operators at Mid-Level Feature Space"],"prefix":"10.1145","author":[{"given":"Antonio A. 
R.","family":"Beserra","sequence":"first","affiliation":[{"name":"University of S\u00e3o Paulo, Instituto de Ci\u00eancias Matem\u00e1ticas e de Computa\u00e7\u00e3o, S\u00e3o Carlos, SP, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rodrigo M.","family":"Kishi","sequence":"additional","affiliation":[{"name":"Federal University of Mato Grosso do Sul, Tr\u00eas Lagoas Campus, Tr\u00eas Lagoas, MS, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rudinei","family":"Goularte","sequence":"additional","affiliation":[{"name":"University of S\u00e3o Paulo, Instituto de Ci\u00eancias Matem\u00e1ticas e de Computa\u00e7\u00e3o, S\u00e3o Carlos, SP, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,11,30]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"CSIFT: A SIFT Descriptor with Color Invariant Characteristics. In 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06)","volume":"2","author":"Abdel-Hakim A. E.","year":"1978","unstructured":"A. E. Abdel-Hakim and A. A. Farag . 2006 . CSIFT: A SIFT Descriptor with Color Invariant Characteristics. In 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06) , Vol. 2 . 1978 -1983. A. E. Abdel-Hakim and A. A. Farag. 2006. CSIFT: A SIFT Descriptor with Color Invariant Characteristics. In 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06), Vol. 2. 1978-1983."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2733373.2806316"},{"key":"e_1_3_2_1_4_1","volume-title":"Pattern Recognition and Image Analysis","author":"Baraldi Lorenzo","year":"1939","unstructured":"Lorenzo Baraldi , Costantino Grana , and Rita Cucchiara . 2015. Measuring Scene Detection Performance . In Pattern Recognition and Image Analysis . Springer International Publishing , 395--403. 
https:\/\/doi.org\/10.1007\/978-3-319- 1939 0-8_45 10.1007\/978-3-319-19390-8_45 Lorenzo Baraldi, Costantino Grana, and Rita Cucchiara. 2015. Measuring Scene Detection Performance. In Pattern Recognition and Image Analysis. Springer International Publishing, 395--403. https:\/\/doi.org\/10.1007\/978-3-319-19390-8_45"},{"volume-title":"Database Theory --- ICDT'99","author":"Beyer Kevin","key":"e_1_3_2_1_5_1","unstructured":"Kevin Beyer , Jonathan Goldstein , Raghu Ramakrishnan , and Uri Shaft . 1999. When Is \" Nearest Neighbor\" Meaningful?. In Database Theory --- ICDT'99 . Springer Berlin Heidelberg , Berlin, Heidelberg , 217--235. Kevin Beyer, Jonathan Goldstein, Raghu Ramakrishnan, and Uri Shaft. 1999. When Is \"Nearest Neighbor\" Meaningful?. In Database Theory --- ICDT'99. Springer Berlin Heidelberg, Berlin, Heidelberg, 217--235."},{"key":"e_1_3_2_1_6_1","volume-title":"Davide Del Testa","author":"Bojarski Mariusz","year":"2016","unstructured":"Mariusz Bojarski , Davide Del Testa , Daniel Dworakowski, Bernhard Firner , Beat Flepp, Prasoon Goyal, Lawrence D. Jackel, Mathew Monfort, Urs Muller, Jiakai Zhang, Xin Zhang, Jake Zhao, and Karol Zieba. 2016 . End to End Learning for Self-Driving Cars . arXiv:cs.CV\/1604.07316 Mariusz Bojarski, Davide Del Testa, Daniel Dworakowski, Bernhard Firner, Beat Flepp, Prasoon Goyal, Lawrence D. Jackel, Mathew Monfort, Urs Muller, Jiakai Zhang, Xin Zhang, Jake Zhao, and Karol Zieba. 2016. End to End Learning for Self-Driving Cars. arXiv:cs.CV\/1604.07316"},{"key":"e_1_3_2_1_7_1","volume-title":"Cisco Visual Networking Index: Forecast and Trends","author":"CISCO.","year":"2017","unstructured":"CISCO. 2018. Cisco Visual Networking Index: Forecast and Trends , 2017 --2022. https:\/\/www.cisco.com\/c\/en\/us\/solutions\/collateral\/executive-perspectives\/annual-internet-report\/white-paper-c11-741490.html. [Online; accessed 25-May-2020]. CISCO. 2018. Cisco Visual Networking Index: Forecast and Trends, 2017--2022. 
https:\/\/www.cisco.com\/c\/en\/us\/solutions\/collateral\/executive-perspectives\/annual-internet-report\/white-paper-c11-741490.html. [Online; accessed 25-May-2020]."},{"key":"e_1_3_2_1_8_1","volume-title":"Workshop on statistical learning in computer vision, ECCV","volume":"1","author":"Csurka Gabriella","year":"2004","unstructured":"Gabriella Csurka , Christopher Dance , Lixin Fan , Jutta Willamowski , and C\u00e9dric Bray . 2004 . Visual categorization with bags of keypoints . In Workshop on statistical learning in computer vision, ECCV , Vol. 1 . Prague, 1--22. Gabriella Csurka, Christopher Dance, Lixin Fan, Jutta Willamowski, and C\u00e9dric Bray. 2004. Visual categorization with bags of keypoints. In Workshop on statistical learning in computer vision, ECCV, Vol. 1. Prague, 1--22."},{"key":"e_1_3_2_1_9_1","volume-title":"Vols. I and II. The ANNALS of the American Academy of Political and Social Science 360, 1","author":"Gross Bertram M.","year":"1965","unstructured":"Bertram M. Gross . 1965. The Managing of Organizations: The Administrative Struggle , Vols. I and II. The ANNALS of the American Academy of Political and Social Science 360, 1 ( 1965 ), 197--198. https:\/\/doi.org\/10.1177\/000271626536000140 10.1177\/000271626536000140 Bertram M. Gross. 1965. The Managing of Organizations: The Administrative Struggle, Vols. I and II. The ANNALS of the American Academy of Political and Social Science 360, 1 (1965), 197--198. https:\/\/doi.org\/10.1177\/000271626536000140"},{"key":"e_1_3_2_1_10_1","first-page":"1","article-title":"Multi-modal video event recognition based on association rules and decision fusion","volume":"24","author":"G\u00fcder Mennan","year":"2017","unstructured":"Mennan G\u00fcder and Nihan Kesim \u00c7i\u00e7ekli . 2017 . Multi-modal video event recognition based on association rules and decision fusion . Multimedia Systems 24 , 1 (Feb. 2017), 55--72. 
https:\/\/doi.org\/10.1007\/s00530-017-0535-z 10.1007\/s00530-017-0535-z Mennan G\u00fcder and Nihan Kesim \u00c7i\u00e7ekli. 2017. Multi-modal video event recognition based on association rules and decision fusion. Multimedia Systems 24, 1 (Feb. 2017), 55--72. https:\/\/doi.org\/10.1007\/s00530-017-0535-z","journal-title":"Multimedia Systems"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2011.6012001"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00138-013-0567-0"},{"key":"e_1_3_2_1_13_1","volume-title":"C (Jul","author":"Ji Zhong","year":"2018","unstructured":"Zhong Ji , Yuanyuan Zhang , Yanwei Pang , and Xuelong Li. 2018. Hypergraph Dominant Set Based Multi-video Summarization. Signal Processing 148 , C (Jul 2018 ), 114--123. https:\/\/doi.org\/10.1016\/j.sigpro.2018.01.028 10.1016\/j.sigpro.2018.01.028 Zhong Ji, Yuanyuan Zhang, Yanwei Pang, and Xuelong Li. 2018. Hypergraph Dominant Set Based Multi-video Summarization. Signal Processing 148, C (Jul 2018), 114--123. https:\/\/doi.org\/10.1016\/j.sigpro.2018.01.028"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2018.2823900"},{"volume-title":"Proceedings of ACM International Conference on Multimedia Retrieval (ICMR), oral session.","author":"Jiang Yu-Gang","key":"e_1_3_2_1_15_1","unstructured":"Yu-Gang Jiang , Guangnan Ye , Shih-Fu Chang , Daniel Ellis , and Alexander C. Loui . 2011. Consumer Video Understanding: A Benchmark Database and An Evaluation of Human and Machine Performance . In Proceedings of ACM International Conference on Multimedia Retrieval (ICMR), oral session. Yu-Gang Jiang, Guangnan Ye, Shih-Fu Chang, Daniel Ellis, and Alexander C. Loui. 2011. Consumer Video Understanding: A Benchmark Database and An Evaluation of Human and Machine Performance. 
In Proceedings of ACM International Conference on Multimedia Retrieval (ICMR), oral session."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-018-6959-4"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Irena Koprinska and Sergio Carrato. 2001. Temporal video segmentation: A survey. In Signal Processing: Image Communication. 477--500.  Irena Koprinska and Sergio Carrato. 2001. Temporal video segmentation: A survey. In Signal Processing: Image Communication. 477--500.","DOI":"10.1016\/S0923-5965(00)00011-4"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-30577-2_17"},{"key":"e_1_3_2_1_20_1","volume-title":"Video Scene Detection by Multimodal Bag of Features. Journal of Information and Data Management 5 (06","author":"Lopes Bruno","year":"2014","unstructured":"Bruno Lopes , Tiago Trojahn , and Rudinei Goularte . 2014. Video Scene Detection by Multimodal Bag of Features. Journal of Information and Data Management 5 (06 2014 ), 1. Bruno Lopes, Tiago Trojahn, and Rudinei Goularte. 2014. Video Scene Detection by Multimodal Bag of Features. Journal of Information and Data Management 5 (06 2014), 1."},{"key":"e_1_3_2_1_21_1","volume-title":"Content Creators, & The YouTube Community. https:\/\/mediakix.com\/blog\/youtuber-statistics-content-creators-demographics\/. [Online","author":"Media","year":"2020","unstructured":"Media kix. 2018. The 11 Biggest Statistics To Know About YouTubers , Content Creators, & The YouTube Community. https:\/\/mediakix.com\/blog\/youtuber-statistics-content-creators-demographics\/. [Online ; accessed 25- May - 2020 ]. Media kix. 2018. The 11 Biggest Statistics To Know About YouTubers, Content Creators, & The YouTube Community. https:\/\/mediakix.com\/blog\/youtuber-statistics-content-creators-demographics\/. 
[Online; accessed 25-May-2020]."},{"volume-title":"MultiMedia Modeling","author":"M\u00fcnzer Bernd","key":"e_1_3_2_1_22_1","unstructured":"Bernd M\u00fcnzer and Klaus Schoeffmann . 2018. Video Browsing on a Circular Timeline . In MultiMedia Modeling . Springer International Publishing , Cham , 395--399. Bernd M\u00fcnzer and Klaus Schoeffmann. 2018. Video Browsing on a Circular Timeline. In MultiMedia Modeling. Springer International Publishing, Cham, 395--399."},{"key":"e_1_3_2_1_23_1","unstructured":"Eunsoo Park Xuenan Cui Weonjin Kim and Hakil Kim. 2018. End-to-End Fingerprints Liveness Detection using Convolutional Networks with Gram module. arXiv:cs.CV\/1803.07830  Eunsoo Park Xuenan Cui Weonjin Kim and Hakil Kim. 2018. End-to-End Fingerprints Liveness Detection using Convolutional Networks with Gram module. arXiv:cs.CV\/1803.07830"},{"volume-title":"XIV Workshop de Vis\u00e3o Computacional.","author":"Jr Osmando Pereira","key":"e_1_3_2_1_24_1","unstructured":"Osmando Pereira Jr , C. T. Ferraz , and A. Gonzaga . 2018. Image correspondence using a fusion of local region descriptors . In XIV Workshop de Vis\u00e3o Computacional. Osmando Pereira Jr, C. T. Ferraz, and A. Gonzaga. 2018. Image correspondence using a fusion of local region descriptors. In XIV Workshop de Vis\u00e3o Computacional."},{"key":"e_1_3_2_1_25_1","volume-title":"Information Retrieval","author":"Van Rijsbergen C. J.","unstructured":"C. J. Van Rijsbergen . 1979. Information Retrieval ( 2 nd ed.). Butterworth-Heinemann , USA. C. J. Van Rijsbergen. 1979. Information Retrieval (2nd ed.). Butterworth-Heinemann, USA.","edition":"2"},{"volume-title":"Advances in Design for Inclusion","author":"Rothfuss Damaris","key":"e_1_3_2_1_26_1","unstructured":"Damaris Rothfuss , Patrick M\u00fcnster , and Gottfried Zimmermann . 2019. Design Guidelines for Adaptable Videos and Video Players on the Web . In Advances in Design for Inclusion . Springer International Publishing , Cham , 229--240. 
Damaris Rothfuss, Patrick M\u00fcnster, and Gottfried Zimmermann. 2019. Design Guidelines for Adaptable Videos and Video Players on the Web. In Advances in Design for Inclusion. Springer International Publishing, Cham, 229--240."},{"volume-title":"2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP). 1--6.","author":"Rotman D.","key":"e_1_3_2_1_27_1","unstructured":"D. Rotman , D. Porat , and G. Ashour . 2017. Robust video scene detection using multimodal fusion of optimally grouped features . In 2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP). 1--6. D. Rotman, D. Porat, and G. Ashour. 2017. Robust video scene detection using multimodal fusion of optimally grouped features. In 2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP). 1--6."},{"key":"e_1_3_2_1_28_1","volume-title":"McGill","author":"Salton Gerard","year":"1986","unstructured":"Gerard Salton and Michael J . McGill . 1986 . Introduction to Modern Information Retrieval. McGraw-Hill , Inc., USA. Gerard Salton and Michael J. McGill. 1986. Introduction to Modern Information Retrieval. McGraw-Hill, Inc., USA."},{"key":"e_1_3_2_1_29_1","volume-title":"1997 IEEE International Conference on Acoustics, Speech, and Signal Processing","volume":"4","author":"Saraceno C.","unstructured":"C. Saraceno and R. Leonardi . 1997. Audio as a support to scene change detection and characterization of video sequences . In 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing , Vol. 4 . 2597-2600 vol.4. C. Saraceno and R. Leonardi. 1997. Audio as a support to scene change detection and characterization of video sequences. In 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 4. 
2597-2600 vol.4."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/CBMI.2019.8877397"},{"volume-title":"Audio Processing and Speech Recognition","author":"Sen Soumya","key":"e_1_3_2_1_31_1","unstructured":"Soumya Sen , Anjan Dutta , and Nilanjan Dey . 2019. Audio Processing and Speech Recognition . Springer Singapore . https:\/\/doi.org\/10.1007\/978-981-13-6098-5 10.1007\/978-981-13-6098-5 Soumya Sen, Anjan Dutta, and Nilanjan Dey. 2019. Audio Processing and Speech Recognition. Springer Singapore. https:\/\/doi.org\/10.1007\/978-981-13-6098-5"},{"volume-title":"2016 IEEE Winter Conference on Applications of Computer Vision (WACV). 1--9.","author":"Singh K. K.","key":"e_1_3_2_1_32_1","unstructured":"K. K. Singh , K. Fatahalian , and A. A. Efros . 2016. KrishnaCam: Using a longitudinal, single-person, egocentric dataset for scene understanding tasks . In 2016 IEEE Winter Conference on Applications of Computer Vision (WACV). 1--9. K. K. Singh, K. Fatahalian, and A. A. Efros. 2016. KrishnaCam: Using a longitudinal, single-person, egocentric dataset for scene understanding tasks. In 2016 IEEE Winter Conference on Applications of Computer Vision (WACV). 1--9."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.895972"},{"key":"e_1_3_2_1_34_1","volume-title":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"2","author":"Snoek C. G. M.","unstructured":"C. G. M. Snoek and M. Worring . 2002. A review on multimodal video indexing . In Proceedings. IEEE International Conference on Multimedia and Expo , Vol. 2 . 21--24 vol.2. C. G. M. Snoek and M. Worring. 2002. A review on multimodal video indexing. In Proceedings. IEEE International Conference on Multimedia and Expo, Vol. 2. 21--24 vol.2."},{"volume-title":"Proceedings of the 13th Annual ACM International Conference on Multimedia (MULTIMEDIA '05)","author":"Snoek Cees G. M.","key":"e_1_3_2_1_35_1","unstructured":"Cees G. M. 
Snoek , Marcel Worring , and Arnold W. M. Smeulders . 2005. Early versus Late Fusion in Semantic Video Analysis . In Proceedings of the 13th Annual ACM International Conference on Multimedia (MULTIMEDIA '05) . Association for Computing Machinery, New York, NY, USA, 399--402. https:\/\/doi.org\/10.1145\/1101149.1101236 10.1145\/1101149.1101236 Cees G. M. Snoek, Marcel Worring, and Arnold W. M. Smeulders. 2005. Early versus Late Fusion in Semantic Video Analysis. In Proceedings of the 13th Annual ACM International Conference on Multimedia (MULTIMEDIA '05). Association for Computing Machinery, New York, NY, USA, 399--402. https:\/\/doi.org\/10.1145\/1101149.1101236"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2020.103557"},{"volume-title":"Intelligent Computing, Networking, and Informatics","author":"Thounaojam Dalton Meitei","key":"e_1_3_2_1_37_1","unstructured":"Dalton Meitei Thounaojam , Amit Trivedi , Kh. Manglem Singh , and Sudipta Roy . 2014. A Survey on Video Segmentation . In Intelligent Computing, Networking, and Informatics . Springer India , New Delhi , 903--912. Dalton Meitei Thounaojam, Amit Trivedi, Kh. Manglem Singh, and Sudipta Roy. 2014. A Survey on Video Segmentation. In Intelligent Computing, Networking, and Informatics. Springer India, New Delhi, 903--912."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2010.2091400"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6638342"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2002.802021"},{"key":"e_1_3_2_1_41_1","volume-title":"Big Data Analytics for Large-Scale Multimedia Search","author":"Vrochidis Stefanos","year":"1937","unstructured":"Stefanos Vrochidis , Benoit Huet , Edward Chang , and Ioannis Kompatsiaris . 2019. Big Data Analytics for Large-Scale Multimedia Search . Wiley . 
https:\/\/doi.org\/10.1002\/97811 1937 6996 10.1002\/9781119376996 Stefanos Vrochidis, Benoit Huet, Edward Chang, and Ioannis Kompatsiaris. 2019. Big Data Analytics for Large-Scale Multimedia Search. Wiley. https:\/\/doi.org\/10.1002\/9781119376996"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2017.01.034"},{"key":"e_1_3_2_1_43_1","first-page":"931","article-title":"Multimodal Biometric System Using Face-Iris Fusion Feature","volume":"6","author":"Wang Zhifang","year":"2011","unstructured":"Zhifang Wang , Erfu Wang , Shuangshuang Wang , and Qun Ding . 2011 . Multimodal Biometric System Using Face-Iris Fusion Feature . JCP 6 (2011), 931 -- 938 . Zhifang Wang, Erfu Wang, Shuangshuang Wang, and Qun Ding. 2011. Multimodal Biometric System Using Face-Iris Fusion Feature. JCP 6 (2011), 931--938.","journal-title":"JCP"},{"key":"#cr-split#-e_1_3_2_1_44_1.1","doi-asserted-by":"crossref","unstructured":"H. Yang J. Liu M. Zhang and J. Zeng. 2018. Face recognition algorithm based on orthogonal gradient difference local directional pattern. Laser and Optoelectronics Progress 55 4 (2018). https:\/\/doi.org\/10.3788\/LOP55.041008 10.3788\/LOP55.041008","DOI":"10.3788\/LOP55.041008"},{"key":"#cr-split#-e_1_3_2_1_44_1.2","doi-asserted-by":"crossref","unstructured":"H. Yang J. Liu M. Zhang and J. Zeng. 2018. Face recognition algorithm based on orthogonal gradient difference local directional pattern. Laser and Optoelectronics Progress 55 4 (2018). https:\/\/doi.org\/10.3788\/LOP55.041008","DOI":"10.3788\/LOP55.041008"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1006\/cviu.1997.0628"},{"key":"e_1_3_2_1_46_1","first-page":"3","article-title":"Deep Fusion of Multiple Semantic Cues for Complex Event Recognition","volume":"25","author":"Zhang Xishan","year":"2016","unstructured":"Xishan Zhang , Hanwang Zhang , Yongdong Zhang , Yang Yang , Meng Wang , Huanbo Luan , Jintao Li , and Tat-Seng Chua . 2016 . 
Deep Fusion of Multiple Semantic Cues for Complex Event Recognition . IEEE Transactions on Image Processing 25 , 3 (March 2016), 1033--1046. https:\/\/doi.org\/10.1109\/tip.2015.2511585 10.1109\/tip.2015.2511585 Xishan Zhang, Hanwang Zhang, Yongdong Zhang, Yang Yang, Meng Wang, Huanbo Luan, Jintao Li, and Tat-Seng Chua. 2016. Deep Fusion of Multiple Semantic Cues for Complex Event Recognition. IEEE Transactions on Image Processing 25, 3 (March 2016), 1033--1046. https:\/\/doi.org\/10.1109\/tip.2015.2511585","journal-title":"IEEE Transactions on Image Processing"}],"event":{"name":"WebMedia '20: Brazilian Symposium on Multimedia and the Web","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGMM ACM Special Interest Group on Multimedia","SBC Brazilian Computer Society","CNPq Conselho Nacional de Desenvolvimento Cientifico e Tecn","CGIBR Comite Gestor da Internet no Brazil","CAPES Brazilian Higher Education Funding Council"],"location":"S\u00e3o Lu\u00eds Brazil","acronym":"WebMedia '20"},"container-title":["Proceedings of the Brazilian Symposium on Multimedia and the 
Web"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3428658.3431079","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3428658.3431079","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:32:08Z","timestamp":1750195928000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3428658.3431079"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,30]]},"references-count":45,"alternative-id":["10.1145\/3428658.3431079","10.1145\/3428658"],"URL":"https:\/\/doi.org\/10.1145\/3428658.3431079","relation":{},"subject":[],"published":{"date-parts":[[2020,11,30]]},"assertion":[{"value":"2020-11-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}