{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,7]],"date-time":"2026-05-07T16:12:57Z","timestamp":1778170377106,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":25,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,6,29]],"date-time":"2022-06-29T00:00:00Z","timestamp":1656460800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,6,29]]},"DOI":"10.1145\/3530190.3534837","type":"proceedings-article","created":{"date-parts":[[2022,6,24]],"date-time":"2022-06-24T16:21:57Z","timestamp":1656087717000},"page":"632-637","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":12,"title":["Note: Towards Devising an Efficient VQA in the Bengali Language"],"prefix":"10.1145","author":[{"given":"S M Shahriar","family":"Islam","sequence":"first","affiliation":[{"name":"Computer Science And Engineering, BRAC University, Bangladesh"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Riyad Ahsan","family":"Auntor","sequence":"additional","affiliation":[{"name":"Computer Science And Engineering, BRAC University, Bangladesh"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Minhajul","family":"Islam","sequence":"additional","affiliation":[{"name":"Computer Science And Engineering, BRAC University, Bangladesh"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mohammad Yousuf Hossain","family":"Anik","sequence":"additional","affiliation":[{"name":"Computer Science And Engineering, BRAC University, Bangladesh"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"A. B. M. Alim Al","family":"Islam","sequence":"additional","affiliation":[{"name":"Computer Science And Engineering, Bangladesh University of Engineering and Technology, Bangladesh"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jannatun","family":"Noor","sequence":"additional","affiliation":[{"name":"Computer Science And Engineering, BRAC University, Bangladesh"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,6,29]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Development and evaluation of bidirectional LSTM freeway traffic forecasting models using simulation data. Scientific Reports 11 (12","author":"Abduljabbar Rusul","year":"2021","unstructured":"Rusul Abduljabbar , Hussein Dia , and Pei-Wei Tsai . 2021. Development and evaluation of bidirectional LSTM freeway traffic forecasting models using simulation data. Scientific Reports 11 (12 2021 ). https:\/\/doi.org\/10.1038\/s41598-021-03282-z 10.1038\/s41598-021-03282-z Rusul Abduljabbar, Hussein Dia, and Pei-Wei Tsai. 2021. Development and evaluation of bidirectional LSTM freeway traffic forecasting models using simulation data. Scientific Reports 11 (12 2021). https:\/\/doi.org\/10.1038\/s41598-021-03282-z"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D16-1203"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-016-0981-7"},{"key":"e_1_3_2_2_4_1","volume-title":"VQA: Visual Question Answering. In 2015 IEEE International Conference on Computer Vision (ICCV). 2425\u20132433","author":"Antol Stanislaw","year":"2015","unstructured":"Stanislaw Antol , Aishwarya Agrawal , Jiasen Lu , Margaret Mitchell , Dhruv Batra , C.\u00a0 Lawrence Zitnick , and Devi Parikh . 2015 . VQA: Visual Question Answering. In 2015 IEEE International Conference on Computer Vision (ICCV). 2425\u20132433 . https:\/\/doi.org\/10.1109\/ICCV.2015.279 10.1109\/ICCV.2015.279 Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C.\u00a0Lawrence Zitnick, and Devi Parikh. 2015. VQA: Visual Question Answering. In 2015 IEEE International Conference on Computer Vision (ICCV). 2425\u20132433. https:\/\/doi.org\/10.1109\/ICCV.2015.279"},{"key":"e_1_3_2_2_5_1","volume-title":"Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6325\u20136334","author":"Goyal Yash","year":"2017","unstructured":"Yash Goyal , Tejas Khot , Douglas Summers-Stay , Dhruv Batra , and Devi Parikh . 2017 . Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6325\u20136334 . https:\/\/doi.org\/10.1109\/CVPR.2017.670 10.1109\/CVPR.2017.670 Yash Goyal, Tejas Khot, Douglas Summers-Stay, Dhruv Batra, and Devi Parikh. 2017. Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6325\u20136334. https:\/\/doi.org\/10.1109\/CVPR.2017.670"},{"key":"#cr-split#-e_1_3_2_2_6_1.1","doi-asserted-by":"crossref","unstructured":"Zhongzhe Hu Junmin Xiao Zhongbo Tian Xiaoyang Zhang Hongrui Zhu Chengji Yao Ninghui Sun and Guangming Tan. 2019. A Variable Batch Size Strategy for Large Scale Distributed DNN Training. In 2019 IEEE Intl Conf on Parallel Distributed Processing with Applications Big Data Cloud Computing Sustainable Computing Communications Social Computing Networking (ISPA\/BDCloud\/SocialCom\/SustainCom). 476-485. https:\/\/doi.org\/10.1109\/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00074 10.1109\/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00074","DOI":"10.1109\/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00074"},{"key":"#cr-split#-e_1_3_2_2_6_1.2","doi-asserted-by":"crossref","unstructured":"Zhongzhe Hu Junmin Xiao Zhongbo Tian Xiaoyang Zhang Hongrui Zhu Chengji Yao Ninghui Sun and Guangming Tan. 2019. A Variable Batch Size Strategy for Large Scale Distributed DNN Training. In 2019 IEEE Intl Conf on Parallel Distributed Processing with Applications Big Data Cloud Computing Sustainable Computing Communications Social Computing Networking (ISPA\/BDCloud\/SocialCom\/SustainCom). 476-485. https:\/\/doi.org\/10.1109\/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00074","DOI":"10.1109\/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00074"},{"key":"#cr-split#-e_1_3_2_2_7_1.1","doi-asserted-by":"crossref","unstructured":"Zhongzhe Hu Junmin Xiao Zhongbo Tian Xiaoyang Zhang Hongrui Zhu Chengji Yao Ninghui Sun and Guangming Tan. 2019. A Variable Batch Size Strategy for Large Scale Distributed DNN Training. In 2019 IEEE Intl Conf on Parallel Distributed Processing with Applications Big Data Cloud Computing Sustainable Computing Communications Social Computing Networking (ISPA\/BDCloud\/SocialCom\/SustainCom). 476-485. https:\/\/doi.org\/10.1109\/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00074 10.1109\/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00074","DOI":"10.1109\/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00074"},{"key":"#cr-split#-e_1_3_2_2_7_1.2","doi-asserted-by":"crossref","unstructured":"Zhongzhe Hu Junmin Xiao Zhongbo Tian Xiaoyang Zhang Hongrui Zhu Chengji Yao Ninghui Sun and Guangming Tan. 2019. A Variable Batch Size Strategy for Large Scale Distributed DNN Training. In 2019 IEEE Intl Conf on Parallel Distributed Processing with Applications Big Data Cloud Computing Sustainable Computing Communications Social Computing Networking (ISPA\/BDCloud\/SocialCom\/SustainCom). 476-485. https:\/\/doi.org\/10.1109\/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00074","DOI":"10.1109\/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00074"},{"key":"e_1_3_2_2_8_1","volume-title":"CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1988\u20131997","author":"Johnson Justin","year":"2017","unstructured":"Justin Johnson , Bharath Hariharan , Laurens van\u00a0der Maaten , Li Fei-Fei , C.\u00a0 Lawrence Zitnick , and Ross Girshick . 2017 . CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1988\u20131997 . https:\/\/doi.org\/10.1109\/CVPR.2017.215 10.1109\/CVPR.2017.215 Justin Johnson, Bharath Hariharan, Laurens van\u00a0der Maaten, Li Fei-Fei, C.\u00a0Lawrence Zitnick, and Ross Girshick. 2017. CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1988\u20131997. https:\/\/doi.org\/10.1109\/CVPR.2017.215"},{"key":"#cr-split#-e_1_3_2_2_9_1.1","doi-asserted-by":"crossref","unstructured":"Kushal Kafle and Christopher Kanan. 2017. An Analysis of Visual Question Answering Algorithms. 1983-1991. https:\/\/doi.org\/10.1109\/ICCV.2017.217 10.1109\/ICCV.2017.217","DOI":"10.1109\/ICCV.2017.217"},{"key":"#cr-split#-e_1_3_2_2_9_1.2","doi-asserted-by":"crossref","unstructured":"Kushal Kafle and Christopher Kanan. 2017. An Analysis of Visual Question Answering Algorithms. 1983-1991. https:\/\/doi.org\/10.1109\/ICCV.2017.217","DOI":"10.1109\/ICCV.2017.217"},{"key":"e_1_3_2_2_10_1","volume-title":"2017 IEEE International Conference on Big Data and Smart Computing (BigComp). 358\u2013362","author":"Ko ByungSoo","year":"2017","unstructured":"ByungSoo Ko , Han-Gyu Kim , Kyo-Joong Oh , and Ho-Jin Choi . 2017 . Controlled dropout: A different approach to using dropout on deep neural network . In 2017 IEEE International Conference on Big Data and Smart Computing (BigComp). 358\u2013362 . https:\/\/doi.org\/10.1109\/BIGCOMP.2017.7881693 10.1109\/BIGCOMP.2017.7881693 ByungSoo Ko, Han-Gyu Kim, Kyo-Joong Oh, and Ho-Jin Choi. 2017. Controlled dropout: A different approach to using dropout on deep neural network. In 2017 IEEE International Conference on Big Data and Smart Computing (BigComp). 358\u2013362. https:\/\/doi.org\/10.1109\/BIGCOMP.2017.7881693"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_3_2_2_12_1","volume-title":"KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA. In 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 14106\u201314116","author":"Marino Kenneth","year":"2021","unstructured":"Kenneth Marino , Xinlei Chen , Devi Parikh , Abhinav Gupta , and Marcus Rohrbach . 2021 . KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA. In 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 14106\u201314116 . https:\/\/doi.org\/10.1109\/CVPR46437.2021.01389 10.1109\/CVPR46437.2021.01389 Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, and Marcus Rohrbach. 2021. KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA. In 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 14106\u201314116. https:\/\/doi.org\/10.1109\/CVPR46437.2021.01389"},{"key":"e_1_3_2_2_13_1","volume-title":"OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge. In 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 3190\u20133199","author":"Marino Kenneth","year":"2019","unstructured":"Kenneth Marino , Mohammad Rastegari , Ali Farhadi , and Roozbeh Mottaghi . 2019 . OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge. In 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 3190\u20133199 . https:\/\/doi.org\/10.1109\/CVPR.2019.00331 10.1109\/CVPR.2019.00331 Kenneth Marino, Mohammad Rastegari, Ali Farhadi, and Roozbeh Mottaghi. 2019. OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge. In 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 3190\u20133199. https:\/\/doi.org\/10.1109\/CVPR.2019.00331"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1080\/24751839.2020.1833136"},{"key":"e_1_3_2_2_15_1","volume-title":"Deep learning-based convolutional neural network for intra-modality brain MRI synthesis. Journal of Applied Clinical Medical Physics (01","author":"Osman Alexander","year":"2022","unstructured":"Alexander Osman and Nissren Tamam . 2022. Deep learning-based convolutional neural network for intra-modality brain MRI synthesis. Journal of Applied Clinical Medical Physics (01 2022 ), 1\u201311. https:\/\/doi.org\/10.1002\/acm2.13530 10.1002\/acm2.13530 Alexander Osman and Nissren Tamam. 2022. Deep learning-based convolutional neural network for intra-modality brain MRI synthesis. Journal of Applied Clinical Medical Physics (01 2022), 1\u201311. https:\/\/doi.org\/10.1002\/acm2.13530"},{"key":"e_1_3_2_2_16_1","unstructured":"Md\u00a0Aminul\u00a0Haque Palash M.\u00a0D. Abdullah\u00a0Al Nasim Sourav Saha Faria Afrin Raisa Mallik and Sathishkumar Samiappan. 2021. Bangla Image Caption Generation through CNN-Transformer based Encoder-Decoder Network. CoRR abs\/2110.12442(2021). Md\u00a0Aminul\u00a0Haque Palash M.\u00a0D. Abdullah\u00a0Al Nasim Sourav Saha Faria Afrin Raisa Mallik and Sathishkumar Samiappan. 2021. Bangla Image Caption Generation through CNN-Transformer based Encoder-Decoder Network. CoRR abs\/2110.12442(2021)."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"crossref","unstructured":"S. Preethi A. Arun\u00a0Prakash and R. Thangarajan. 2021. Plant Disease Recognition from Leaf Images Using Convolutional Neural Network. In Intelligent Systems Siba\u00a0K. Udgata Srinivas Sethi and Satish\u00a0N. Srirama (Eds.). Springer Singapore Singapore 169\u2013176. S. Preethi A. Arun\u00a0Prakash and R. Thangarajan. 2021. Plant Disease Recognition from Leaf Images Using Convolutional Neural Network. In Intelligent Systems Siba\u00a0K. Udgata Srinivas Sethi and Satish\u00a0N. Srirama (Eds.). Springer Singapore Singapore 169\u2013176.","DOI":"10.1007\/978-981-33-6081-5_15"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2019.06.100"},{"key":"#cr-split#-e_1_3_2_2_19_1.1","doi-asserted-by":"crossref","unstructured":"Mark Sandler Andrew Howard Menglong Zhu Andrey Zhmoginov and Liang-Chieh Chen. 2018. MobileNetV2: Inverted Residuals and Linear Bottlenecks. 4510-4520. https:\/\/doi.org\/10.1109\/CVPR.2018.00474 10.1109\/CVPR.2018.00474","DOI":"10.1109\/CVPR.2018.00474"},{"key":"#cr-split#-e_1_3_2_2_19_1.2","doi-asserted-by":"crossref","unstructured":"Mark Sandler Andrew Howard Menglong Zhu Andrey Zhmoginov and Liang-Chieh Chen. 2018. MobileNetV2: Inverted Residuals and Linear Bottlenecks. 4510-4520. https:\/\/doi.org\/10.1109\/CVPR.2018.00474","DOI":"10.1109\/CVPR.2018.00474"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33018876"},{"key":"e_1_3_2_2_21_1","volume-title":"Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding. (10","author":"Yi Kexin","year":"2018","unstructured":"Kexin Yi , Jiajun Wu , Chuang Gan , Antonio Torralba , Pushmeet Kohli , and Joshua\u00a0 B. Tenenbaum . 2018. Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding. (10 2018 ). http:\/\/nsvqa.csail.mit.edu Kexin Yi, Jiajun Wu, Chuang Gan, Antonio Torralba, Pushmeet Kohli, and Joshua\u00a0B. Tenenbaum. 2018. Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding. (10 2018). http:\/\/nsvqa.csail.mit.edu"}],"event":{"name":"COMPASS '22: ACM SIGCAS\/SIGCHI Conference on Computing and Sustainable Societies","location":"Seattle WA USA","acronym":"COMPASS '22","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction","SIGCAS ACM Special Interest Group on Computers and Society"]},"container-title":["ACM SIGCAS\/SIGCHI Conference on Computing and Sustainable Societies (COMPASS)"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3530190.3534837","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3530190.3534837","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:09:24Z","timestamp":1750183764000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3530190.3534837"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,29]]},"references-count":25,"alternative-id":["10.1145\/3530190.3534837","10.1145\/3530190"],"URL":"https:\/\/doi.org\/10.1145\/3530190.3534837","relation":{},"subject":[],"published":{"date-parts":[[2022,6,29]]},"assertion":[{"value":"2022-06-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}