{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,15]],"date-time":"2026-04-15T17:52:54Z","timestamp":1776275574240,"version":"3.50.1"},"reference-count":41,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2020,6,1]],"date-time":"2020-06-01T00:00:00Z","timestamp":1590969600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61772246, 61876074, and 61673290"],"award-info":[{"award-number":["61772246, 61876074, and 61673290"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Jiangxi Science and Technology Plan","award":["20192ACBL21030"],"award-info":[{"award-number":["20192ACBL21030"]}]},{"name":"Humanities and Social Sciences Projects","award":["YY17211"],"award-info":[{"award-number":["YY17211"]}]},{"name":"Social Science Planning","award":["17YY05"],"award-info":[{"award-number":["17YY05"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2020,9,30]]},"abstract":"<jats:p>Language discrimination among similar languages, varieties, and dialects is a challenging natural language processing task. The traditional text-driven focus leads to poor results. In this article, we explore the effectiveness of speech-driven features toward language discrimination among Chinese dialects. First, we systematically explore the appropriateness of speech-driven MFCC features toward CNN-based language discrimination. Then, we design an end-to-end speech recognition model based on HMM-DNN to predict Chinese dialect words. We adopt attention mechanism to extract the discriminative words related to different Chinese dialects. Finally, through a CNN, we combine the word-level embedding and the MFCC-based features. Evaluation of two benchmark Chinese dialect corpora shows the appropriateness and effectiveness of the proposed speech-driven approach to fine-grained Chinese dialect discrimination compared to the state-of-the-art methods.<\/jats:p>","DOI":"10.1145\/3389021","type":"journal-article","created":{"date-parts":[[2020,6,1]],"date-time":"2020-06-01T10:14:13Z","timestamp":1591006453000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Speech-Driven End-to-End Language Discrimination toward Chinese Dialects"],"prefix":"10.1145","volume":"19","author":[{"given":"Fan","family":"Xu","sequence":"first","affiliation":[{"name":"Jiangxi Normal University, Nanchang, China"}]},{"given":"Jian","family":"Luo","sequence":"additional","affiliation":[{"name":"Jiangxi Normal University, Nanchang, China"}]},{"given":"Mingwen","family":"Wang","sequence":"additional","affiliation":[{"name":"Jiangxi Normal University, Nanchang, China"}]},{"given":"Guodong","family":"Zhou","sequence":"additional","affiliation":[{"name":"Soochow University, China"}]}],"member":"320","published-online":{"date-parts":[[2020,6]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (COLING-ACL'06)","author":"Ayan Necip Fazil"},{"key":"e_1_2_1_2_1","unstructured":"Dzmitry Bahdanau Kyunghyun Cho and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).  Dzmitry Bahdanau Kyunghyun Cho and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014)."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1155\/2015\/797083"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2016.07.005"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the 3rd Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial\u201916)","author":"\u00c7\u00f6ltekin \u00c7a\u011fr\u0131","year":"2016"},{"key":"e_1_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Steven B. Davis and Paul Mermelstein. 1990. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. In Readings in Speech Recognition. Elsevier 65--74.  Steven B. Davis and Paul Mermelstein. 1990. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. In Readings in Speech Recognition. Elsevier 65--74.","DOI":"10.1016\/B978-0-08-051584-7.50010-3"},{"key":"e_1_2_1_7_1","volume-title":"12th Annual Conference of the International Speech Communication Association (Interspeech'11)","author":"Dehak Najim","year":"2011"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6639345"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL'13)","author":"Elfardy Heba","year":"2013"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-1217"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-5316"},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of International Conference on Statistical Analysis of Textual Data","volume":"95","author":"Grefenstette Gregory","year":"1995"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/2500887"},{"key":"e_1_2_1_14_1","unstructured":"IFLYTEK. 2018. IFLYTEK world-wide contest for dialect discrimination: A baseline system. Retrieved from http:\/\/challenge.xfyun.cn\/aicompetition\/mobile\/techDetail --&gt;http:\/\/challenge.xfyun.cn\/aicompetition\/mobile\/techDetail.  IFLYTEK. 2018. IFLYTEK world-wide contest for dialect discrimination: A baseline system. Retrieved from http:\/\/challenge.xfyun.cn\/aicompetition\/mobile\/techDetail --&gt;http:\/\/challenge.xfyun.cn\/aicompetition\/mobile\/techDetail."},{"key":"e_1_2_1_15_1","unstructured":"Armand Joulin Edouard Grave Piotr Bojanowski and Tomas Mikolov. 2016. Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759 (2016).  Armand Joulin Edouard Grave Piotr Bojanowski and Tomas Mikolov. 2016. Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759 (2016)."},{"key":"e_1_2_1_16_1","unstructured":"Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).  Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-6393(01)00041-3"},{"key":"e_1_2_1_18_1","first-page":"1","article-title":"Discriminating between closely related languages on Twitter","volume":"39","author":"Ljube\u0161i\u0107 Nikola","year":"2015","journal-title":"Informatica"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the Australasian Language Technology Association Workshop 2013 (ALTA\u201913)","author":"Lui Marco","year":"2013"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-4204"},{"key":"e_1_2_1_21_1","volume-title":"Proceedings of International Conference of the Pacific Association for Computational Linguistics. 59--64","author":"Malmasi Shervin","year":"2015"},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the 3rd Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial\u201916)","author":"Malmasi Shervin","year":"2016"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118905.1118906"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1080\/09296170500500694"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.37936\/ecti-cit.200622.53288"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-2125"},{"key":"e_1_2_1_27_1","volume-title":"3rd Symposium on Languages, Applications and Technologies. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.","author":"Sim\u00f5es Alberto"},{"key":"e_1_2_1_28_1","doi-asserted-by":"crossref","unstructured":"David Snyder Daniel Garcia-Romero Alan McCree Gregory Sell Daniel Povey and Sanjeev Khudanpur. 2018. Spoken language recognition using X-vectors. In Odyssey. 105--111.  David Snyder Daniel Garcia-Romero Alan McCree Gregory Sell Daniel Povey and Sanjeev Khudanpur. 2018. Spoken language recognition using X-vectors. In Odyssey. 105--111.","DOI":"10.21437\/Odyssey.2018-15"},{"key":"e_1_2_1_29_1","volume-title":"7th International Conference on Spoken Language Processing.","author":"Stolcke Andreas","year":"2002"},{"key":"e_1_2_1_30_1","volume-title":"International Conference on Language Resources and Evaluation (LREC'02)","author":"Takezawa Toshiyuki","year":"2002"},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of International Conference on Computational Linguistics (COLING'12)","author":"Tiedemann J\u00f6rg","year":"2012"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-5313"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/APSIPA.2016.7820796"},{"key":"e_1_2_1_34_1","unstructured":"Dong Wang and Xuewei Zhang. 2015. Thchs-30: A free Chinese speech corpus. arXiv preprint arXiv:1512.01882 (2015).  Dong Wang and Xuewei Zhang. 2015. Thchs-30: A free Chinese speech corpus. arXiv preprint arXiv:1512.01882 (2015)."},{"key":"e_1_2_1_35_1","volume-title":"International Conference on Language Resources and Evaluation (LREC'18)","author":"Xu Fan","year":"2018"},{"key":"e_1_2_1_36_1","volume-title":"Proceedings of the Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects (VarDial'15)","author":"Xu Fan","year":"2015"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00169"},{"key":"e_1_2_1_38_1","volume-title":"The 11th Conference on Natural Language Processing (KONVENS\u201912)","author":"Zampieri Marcos","year":"2012"},{"key":"e_1_2_1_39_1","doi-asserted-by":"crossref","unstructured":"Marcos Zampieri Shervin Malmasi Nikola Ljube\u0161i\u0107 Preslav Nakov Ahmed Ali J\u00f6rg Tiedemann Yves Scherrer and No\u00ebmi Aepli. 2017. Findings of the VarDial evaluation campaign 2017 (2017) 1--15.  Marcos Zampieri Shervin Malmasi Nikola Ljube\u0161i\u0107 Preslav Nakov Ahmed Ali J\u00f6rg Tiedemann Yves Scherrer and No\u00ebmi Aepli. 2017. Findings of the VarDial evaluation campaign 2017 (2017) 1--15.","DOI":"10.18653\/v1\/W17-1201"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-5307"},{"key":"e_1_2_1_41_1","volume-title":"Proceedings of the Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects (VarDial'15)","author":"Zampieri Marcos","year":"2015"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3389021","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3389021","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:33:02Z","timestamp":1750199582000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3389021"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6]]},"references-count":41,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2020,9,30]]}},"alternative-id":["10.1145\/3389021"],"URL":"https:\/\/doi.org\/10.1145\/3389021","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,6]]},"assertion":[{"value":"2019-01-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-06-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}