{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,24]],"date-time":"2026-02-24T07:59:48Z","timestamp":1771919988146,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,4,19]],"date-time":"2021-04-19T00:00:00Z","timestamp":1618790400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,4,19]]},"DOI":"10.1145\/3442381.3449950","type":"proceedings-article","created":{"date-parts":[[2021,6,3]],"date-time":"2021-06-03T19:34:17Z","timestamp":1622748857000},"page":"633-645","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":35,"title":["Mitigating Gender Bias in Captioning Systems"],"prefix":"10.1145","author":[{"given":"Ruixiang","family":"Tang","sequence":"first","affiliation":[{"name":"Texas A&amp;M University, USA"}]},{"given":"Mengnan","family":"Du","sequence":"additional","affiliation":[{"name":"Texas A&amp;M University, USA"}]},{"given":"Yuening","family":"Li","sequence":"additional","affiliation":[{"name":"Texas A&amp;M University, USA"}]},{"given":"Zirui","family":"Liu","sequence":"additional","affiliation":[{"name":"Texas A&amp;M University, USA"}]},{"given":"Na","family":"Zou","sequence":"additional","affiliation":[{"name":"Texas A&amp;M University, USA"}]},{"given":"Xia","family":"Hu","sequence":"additional","affiliation":[{"name":"Texas A&amp;M University, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,6,3]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00522"},{"key":"e_1_3_2_1_2_1","volume-title":"Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering. In 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 6077\u20136086","author":"Anderson Peter","year":"2018","unstructured":"Peter Anderson , Xiaodong He , Chris Buehler , Damien Teney , Mark Johnson , Stephen Gould , and Lei Zhang . 2018 . Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering. In 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 6077\u20136086 . Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, and Lei Zhang. 2018. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering. In 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 6077\u20136086."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00041"},{"key":"e_1_3_2_1_5_1","unstructured":"Tolga Bolukbasi Kai-Wei Chang James\u00a0Y Zou Venkatesh Saligrama and Adam\u00a0T Kalai. 2016. Man is to computer programmer as woman is to homemaker? debiasing word embeddings. In Advances in neural information processing systems. 4349\u20134357.  Tolga Bolukbasi Kai-Wei Chang James\u00a0Y Zou Venkatesh Saligrama and Adam\u00a0T Kalai. 2016. Man is to computer programmer as woman is to homemaker? debiasing word embeddings. In Advances in neural information processing systems. 4349\u20134357."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-3002"},{"key":"e_1_3_2_1_7_1","volume-title":"Understanding the Origins of Bias in Word Embeddings. In International Conference on Machine Learning. 803\u2013811","author":"Brunet Marc-Etienne","year":"2019","unstructured":"Marc-Etienne Brunet , Colleen Alkalay-Houlihan , Ashton Anderson , and Richard Zemel . 2019 . Understanding the Origins of Bias in Word Embeddings. In International Conference on Machine Learning. 803\u2013811 . Marc-Etienne Brunet, Colleen Alkalay-Houlihan, Ashton Anderson, and Richard Zemel. 2019. Understanding the Origins of Bias in Word Embeddings. In International Conference on Machine Learning. 803\u2013811."},{"key":"e_1_3_2_1_8_1","volume-title":"Conference on fairness, accountability and transparency. 77\u201391","author":"Buolamwini Joy","year":"2018","unstructured":"Joy Buolamwini and Timnit Gebru . 2018 . Gender shades: Intersectional accuracy disparities in commercial gender classification . In Conference on fairness, accountability and transparency. 77\u201391 . Joy Buolamwini and Timnit Gebru. 2018. Gender shades: Intersectional accuracy disparities in commercial gender classification. In Conference on fairness, accountability and transparency. 77\u201391."},{"key":"e_1_3_2_1_9_1","volume-title":"Proceedings of the sixth workshop on statistical machine translation. 85\u201391","author":"Denkowski Michael","year":"2011","unstructured":"Michael Denkowski and Alon Lavie . 2011 . Meteor 1.3: Automatic metric for reliable optimization and evaluation of machine translation systems . In Proceedings of the sixth workshop on statistical machine translation. 85\u201391 . Michael Denkowski and Alon Lavie. 2011. Meteor 1.3: Automatic metric for reliable optimization and evaluation of machine translation systems. In Proceedings of the sixth workshop on statistical machine translation. 85\u201391."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298878"},{"key":"e_1_3_2_1_11_1","volume-title":"Fairness in deep learning: A computational perspective","author":"Du Mengnan","year":"2020","unstructured":"Mengnan Du , Fan Yang , Na Zou , and Xia Hu. 2020. Fairness in deep learning: A computational perspective . IEEE Intelligent Systems( 2020 ). Mengnan Du, Fan Yang, Na Zou, and Xia Hu. 2020. Fairness in deep learning: A computational perspective. IEEE Intelligent Systems(2020)."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298754"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W19-3821"},{"key":"e_1_3_2_1_14_1","unstructured":"Robert Geirhos J\u00f6rn-Henrik Jacobsen Claudio Michaelis Richard Zemel Wieland Brendel Matthias Bethge and Felix\u00a0A Wichmann. 2020. Shortcut Learning in Deep Neural Networks. arXiv preprint arXiv:2004.07780(2020).  Robert Geirhos J\u00f6rn-Henrik Jacobsen Claudio Michaelis Richard Zemel Wieland Brendel Matthias Bethge and Felix\u00a0A Wichmann. 2020. Shortcut Learning in Deep Neural Networks. arXiv preprint arXiv:2004.07780(2020)."},{"key":"e_1_3_2_1_15_1","unstructured":"Moritz Hardt Eric Price and Nati Srebro. 2016. Equality of opportunity in supervised learning. In Advances in neural information processing systems. 3315\u20133323.  Moritz Hardt Eric Price and Nati Srebro. 2016. Equality of opportunity in supervised learning. In Advances in neural information processing systems. 3315\u20133323."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01219-9_47"},{"key":"e_1_3_2_1_17_1","volume-title":"Long short-term memory. Neural computation 9, 8","author":"Hochreiter Sepp","year":"1997","unstructured":"Sepp Hochreiter and J\u00fcrgen Schmidhuber . 1997. Long short-term memory. Neural computation 9, 8 ( 1997 ), 1735\u20131780. Sepp Hochreiter and J\u00fcrgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735\u20131780."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3295748"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298932"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1287\/mnsc.2018.3093"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00960"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/3298023.3298174"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.345"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00754"},{"key":"e_1_3_2_1_26_1","unstructured":"Junhua Mao Wei Xu Yi Yang Jiang Wang Zhiheng Huang and Alan Yuille. 2014. Deep captioning with multimodal recurrent neural networks (m-rnn). arXiv preprint arXiv:1412.6632(2014).  Junhua Mao Wei Xu Yi Yang Jiang Wang Zhiheng Huang and Alan Yuille. 2014. Deep captioning with multimodal recurrent neural networks (m-rnn). arXiv preprint arXiv:1412.6632(2014)."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240508.3240712"},{"key":"e_1_3_2_1_28_1","volume-title":"Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311\u2013318","author":"Papineni Kishore","year":"2002","unstructured":"Kishore Papineni , Salim Roukos , Todd Ward , and Wei-Jing Zhu . 2002 . BLEU: a method for automatic evaluation of machine translation . In Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311\u2013318 . Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311\u2013318."},{"key":"e_1_3_2_1_29_1","unstructured":"Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems. 91\u201399.  Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems. 91\u201399."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.131"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1437"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1177\/0163443711418272"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-2002"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.74"},{"key":"e_1_3_2_1_35_1","unstructured":"K Simonyan A Vedaldi and A Zisserman. 2014. Deep inside convolutional networks: visualising image classification models and saliency maps. (2014).  K Simonyan A Vedaldi and A Zisserman. 2014. Deep inside convolutional networks: visualising image classification models and saliency maps. (2014)."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01231-1_31"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"crossref","unstructured":"Tony Sun Andrew Gaut Shirlyn Tang Yuxin Huang Mai ElSherief Jieyu Zhao Diba Mirza Elizabeth Belding Kai-Wei Chang and William\u00a0Yang Wang. 2019. Mitigating gender bias in natural language processing: Literature review. arXiv preprint arXiv:1906.08976(2019).  Tony Sun Andrew Gaut Shirlyn Tang Yuxin Huang Mai ElSherief Jieyu Zhao Diba Mirza Elizabeth Belding Kai-Wei Chang and William\u00a0Yang Wang. 2019. Mitigating gender bias in natural language processing: Literature review. arXiv preprint arXiv:1906.08976(2019).","DOI":"10.18653\/v1\/P19-1159"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1334"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299087"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298935"},{"key":"e_1_3_2_1_41_1","volume-title":"Online Encyclopedia. In International AAAI Conference on Weblogs and Social Media. USA, 454\u2013463","author":"Wagner Claudia","year":"2015","unstructured":"Claudia Wagner , David Garcia , Mohsen Jadidi , and Markus Strohmaier . 2015 . It\u2019s a Man\u2019s Wikipedia? Assessing Gender Inequality in an Online Encyclopedia. In International AAAI Conference on Weblogs and Social Media. USA, 454\u2013463 . Claudia Wagner, David Garcia, Mohsen Jadidi, and Markus Strohmaier. 2015. It\u2019s a Man\u2019s Wikipedia? Assessing Gender Inequality in an Online Encyclopedia. In International AAAI Conference on Weblogs and Social Media. USA, 454\u2013463."},{"key":"e_1_3_2_1_42_1","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","volume":"1","author":"Wiegand Michael","year":"2019","unstructured":"Michael Wiegand , Josef Ruppenhofer , and Thomas Kleinbauer . 2019 . Detection of abusive language: the problem of biased datasets . In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , Volume 1 (Long and Short Papers). 602\u2013608. Michael Wiegand, Josef Ruppenhofer, and Thomas Kleinbauer. 2019. Detection of abusive language: the problem of biased datasets. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 602\u2013608."},{"key":"e_1_3_2_1_43_1","volume-title":"International conference on machine learning. 2048\u20132057","author":"Xu Kelvin","year":"2015","unstructured":"Kelvin Xu , Jimmy Ba , Ryan Kiros , Kyunghyun Cho , Aaron Courville , Ruslan Salakhudinov , Rich Zemel , and Yoshua Bengio . 2015 . Show, attend and tell: Neural image caption generation with visual attention . In International conference on machine learning. 2048\u20132057 . Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, and Yoshua Bengio. 2015. Show, attend and tell: Neural image caption generation with visual attention. In International conference on machine learning. 2048\u20132057."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-017-1059-x"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1064"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1323"}],"event":{"name":"WWW '21: The Web Conference 2021","location":"Ljubljana Slovenia","acronym":"WWW '21","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"]},"container-title":["Proceedings of the Web Conference 2021"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3442381.3449950","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3442381.3449950","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:32Z","timestamp":1750195472000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3442381.3449950"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,19]]},"references-count":45,"alternative-id":["10.1145\/3442381.3449950","10.1145\/3442381"],"URL":"https:\/\/doi.org\/10.1145\/3442381.3449950","relation":{},"subject":[],"published":{"date-parts":[[2021,4,19]]},"assertion":[{"value":"2021-06-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}