{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:17:45Z","timestamp":1750220265088,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":22,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,2,24]],"date-time":"2022-02-24T00:00:00Z","timestamp":1645660800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,2,24]]},"DOI":"10.1145\/3524304.3524314","type":"proceedings-article","created":{"date-parts":[[2022,6,6]],"date-time":"2022-06-06T16:13:59Z","timestamp":1654532039000},"page":"69-73","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["G2P based on multi-sensing field and modified focus loss"],"prefix":"10.1145","author":[{"given":"Jinbo","family":"Zhang","sequence":"first","affiliation":[{"name":"Artificial Intelligence College, GuangXi University for Nationalities, China"}]},{"given":"Donghong","family":"Qin","sequence":"additional","affiliation":[{"name":"Artificial Intelligence College, GuangXi University for Nationalities, China"}]},{"given":"Yang","family":"Li","sequence":"additional","affiliation":[{"name":"Artificial Intelligence College, GuangXi University for Nationalities, China"}]},{"given":"Xiao","family":"Liang","sequence":"additional","affiliation":[{"name":"Artificial Intelligence College, GuangXi University for Nationalities, China"}]}],"member":"320","published-online":{"date-parts":[[2022,6,6]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1016\/S1007-0214(09)70124-5"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CHINSL.2004.1409612"},{"key":"e_1_3_2_2_3_1","first-page":"1","volume-title":"The First International Workshop on MultiMedia Annotation (MMA2001)","volume":"1","author":"Zhang H.","year":"2001","unstructured":"H. Zhang , J. Yu , W. Zhan , and S. Yu , \u201c Disambiguation of Chinese Polyphonic Characters ,\u201d in The First International Workshop on MultiMedia Annotation (MMA2001) , vol. 1 , pp. 30\u2013 1 , 2001 . H. Zhang, J. Yu, W. Zhan, and S. Yu, \u201cDisambiguation of Chinese Polyphonic Characters,\u201d in The First International Workshop on MultiMedia Annotation (MMA2001), vol. 1, pp. 30\u20131, 2001."},{"key":"e_1_3_2_2_4_1","volume-title":"Polyphonic Word Disambiguation with Machine Learning Approaches","author":"Liu J.","year":"2010","unstructured":"J. Liu , W. Qu , X. Tang , Y. Zhang , and Y. Sun , \u201c Polyphonic Word Disambiguation with Machine Learning Approaches ,\u201d in 2010 . J. Liu, W. Qu, X. Tang, Y. Zhang, and Y. Sun, \u201cPolyphonic Word Disambiguation with Machine Learning Approaches,\u201d in 2010."},{"key":"e_1_3_2_2_5_1","volume-title":"The First Interna-tional Workshop on MultiMedia Annotation (MMA2001)","author":"Hong Z.","year":"2001","unstructured":"Z. Hong , Y. Jiangsheng , Z. Weidong , and Y. Shiwen , \u201c Disam-biguation of chinese polyphonic characters ,\u201d in The First Interna-tional Workshop on MultiMedia Annotation (MMA2001) , 2001 . Z. Hong, Y. Jiangsheng, Z. Weidong, and Y. Shiwen, \u201cDisam-biguation of chinese polyphonic characters,\u201d in The First Interna-tional Workshop on MultiMedia Annotation (MMA2001), 2001."},{"key":"e_1_3_2_2_6_1","volume-title":"2002 Interna-tional Symposium on Chinese Spoken Language Processing (ISC-SLP)","author":"Zirong Z.","year":"2002","unstructured":"Z. Zirong , C. Min , and C. Eric , \u201c An efficient way to learn rules for grapheme-to-phoneme conversion in chinese ,\u201d in 2002 Interna-tional Symposium on Chinese Spoken Language Processing (ISC-SLP) , 2002 . Z. Zirong, C. Min, and C. Eric, \u201cAn efficient way to learn rules for grapheme-to-phoneme conversion in chinese,\u201d in 2002 Interna-tional Symposium on Chinese Spoken Language Processing (ISC-SLP), 2002."},{"key":"e_1_3_2_2_7_1","volume-title":"INTERSPEECH","author":"Cai Z.","year":"2019","unstructured":"Z. Cai , Y. Yang , C. Zhang , X. Qin , and M. Li , \u201c Polyphone disam-biguation for mandarin chinese using conditional neural network with multi-level embedding features ,\u201d in INTERSPEECH , 2019 . Z. Cai, Y. Yang, C. Zhang, X. Qin, and M. Li, \u201cPolyphone disam-biguation for mandarin chinese using conditional neural network with multi-level embedding features,\u201d in INTERSPEECH, 2019."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"crossref","unstructured":"S. Hochreiter and J. Schmidhuber \u201cLong short-term memory \u201dNeural computation 1997.  S. Hochreiter and J. Schmidhuber \u201cLong short-term memory \u201dNeural computation 1997.","DOI":"10.1162\/neco.1997.9.8.1735"},{"volume-title":"Bae J. Hifi-gan: Generative adversarial networks for efficient and high fidelity speech synthesis[J]. arXiv preprint arXiv:2010.05646","year":"2020","key":"e_1_3_2_2_9_1","unstructured":"Kong J, Kim J , Bae J. Hifi-gan: Generative adversarial networks for efficient and high fidelity speech synthesis[J]. arXiv preprint arXiv:2010.05646 , 2020 . Kong J, Kim J, Bae J. Hifi-gan: Generative adversarial networks for efficient and high fidelity speech synthesis[J]. arXiv preprint arXiv:2010.05646, 2020."},{"key":"e_1_3_2_2_10_1","first-page":"2980","article-title":"Focal loss for dense object detection","author":"Lin T.","year":"2017","unstructured":"T. Lin , P. Goyal , R. Girshich , K. He , and P Doll\u00e1r , \" Focal loss for dense object detection ,\" in Proceedings of the IEEE interna-tional conference on computer vision , pp. 2980 - 2988 , 2017 . T. Lin, P. Goyal, R. Girshich, K. He, and P Doll\u00e1r, \"Focal loss for dense object detection,\" in Proceedings of the IEEE interna-tional conference on computer vision, pp. 2980-2988, 2017.","journal-title":"Proceedings of the IEEE interna-tional conference on computer vision"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"crossref","unstructured":"Zhang Haiteng Huashan Pan and Xiulin Li. \"A Mask-Based Model for Mandarin Chinese Polyphone Disambiguation.\"\u00a0INTERSPEECH. 2020.  Zhang Haiteng Huashan Pan and Xiulin Li. \"A Mask-Based Model for Mandarin Chinese Polyphone Disambiguation.\"\u00a0INTERSPEECH. 2020.","DOI":"10.21437\/Interspeech.2020-1142"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICGEC.2010.67"},{"key":"e_1_3_2_2_13_1","volume-title":"Key Engineering Materials","author":"Liu F. Z.","year":"2011","unstructured":"F. Z. Liu and Y. Zhou , \u201c Polyphone disambiguation based on max-imum entropy model in mandarin grapheme-to-phoneme conver-sion ,\u201d Key Engineering Materials , 2011 . F. Z. Liu and Y. Zhou, \u201cPolyphone disambiguation based on max-imum entropy model in mandarin grapheme-to-phoneme conver-sion,\u201d Key Engineering Materials, 2011."},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2007.367010"},{"key":"e_1_3_2_2_15_1","volume-title":"Lee K","author":"M","year":"1810","unstructured":"Devlin J, Chang M W , Lee K , Bert : Pre-training of de ep bidirectional transformers for language understanding[J]. arXiv preprint arXiv: 1810 .04805, 2018. Devlin J, Chang M W, Lee K, Bert: Pre-training of deep bidirectional transformers for language understanding[J]. arXiv preprint arXiv:1810.04805, 2018."},{"volume-title":"Language models are unsupervised multitask learners[J]. OpenAI blog","year":"2019","key":"e_1_3_2_2_16_1","unstructured":"Radford A, Wu J, Child R , Language models are unsupervised multitask learners[J]. OpenAI blog , 2019 , 1(8): 9. Radford A, Wu J, Child R, Language models are unsupervised multitask learners[J]. OpenAI blog, 2019, 1(8): 9."},{"key":"e_1_3_2_2_17_1","first-page":"2090","article-title":"Disam-biguation of Chinese Polyphones in an End-to-End Frame-work with Semantic Features Extracted by Pre-Trained BERT","author":"Dai D.","year":"2019","unstructured":"D. Dai , Z. Wu , S. Kang , X. Wu , J. Jia , D. Su , and H. Meng , \" Disam-biguation of Chinese Polyphones in an End-to-End Frame-work with Semantic Features Extracted by Pre-Trained BERT , \" Proceedings of the Interspeech 2019 , pp. 2090 - 2094 ,2019. D. Dai, Z. Wu, S. Kang, X. Wu, J. Jia, D. Su, and H. Meng, \"Disam-biguation of Chinese Polyphones in an End-to-End Frame-work with Semantic Features Extracted by Pre-Trained BERT, \" Proceedings of the Interspeech 2019, pp. 2090-2094,2019.","journal-title":"Proceedings of the Interspeech"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"crossref","unstructured":"M. Yu H. D. Nguyen A. Sokolov J. Lepird K. M. Sathyen-dra S. Choudhary A. Mouchtaris and S. Kunzmann \u201cMultilin-gual grapheme-to-phoneme conversion with byte representation \u201d2020.  M. Yu H. D. Nguyen A. Sokolov J. Lepird K. M. Sathyen-dra S. Choudhary A. Mouchtaris and S. Kunzmann \u201cMultilin-gual grapheme-to-phoneme conversion with byte representation \u201d2020.","DOI":"10.1109\/ICASSP40776.2020.9054696"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2019-3176"},{"volume-title":"Bae J. Hifi-gan: Generative adversarial networks for efficient and high fidelity speech synthesis[J]. arXiv preprint arXiv:2010.05646","year":"2020","key":"e_1_3_2_2_20_1","unstructured":"Kong J, Kim J , Bae J. Hifi-gan: Generative adversarial networks for efficient and high fidelity speech synthesis[J]. arXiv preprint arXiv:2010.05646 , 2020 . Kong J, Kim J, Bae J. Hifi-gan: Generative adversarial networks for efficient and high fidelity speech synthesis[J]. arXiv preprint arXiv:2010.05646, 2020."},{"key":"e_1_3_2_2_21_1","unstructured":"A. F. Agarap \u201cDeep learning using rectified linear units (relu) \u201darXiv preprint arXiv:1803.08375 2018.  A. F. Agarap \u201cDeep learning using rectified linear units (relu) \u201darXiv preprint arXiv:1803.08375 2018."},{"key":"e_1_3_2_2_22_1","volume-title":"Adam: A method for stochastic opti-mization","author":"Kingma D. P.","year":"2014","unstructured":"D. P. Kingma and J. Ba , \u201c Adam: A method for stochastic opti-mization ,\u201d arXiv preprint arXiv:1412.6980, 2014 . D. P. Kingma and J. Ba, \u201cAdam: A method for stochastic opti-mization,\u201d arXiv preprint arXiv:1412.6980, 2014."}],"event":{"name":"ICSCA 2022: 2022 11th International Conference on Software and Computer Applications","acronym":"ICSCA 2022","location":"Melaka Malaysia"},"container-title":["2022 11th International Conference on Software and Computer Applications"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3524304.3524314","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3524304.3524314","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:57Z","timestamp":1750188657000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3524304.3524314"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,24]]},"references-count":22,"alternative-id":["10.1145\/3524304.3524314","10.1145\/3524304"],"URL":"https:\/\/doi.org\/10.1145\/3524304.3524314","relation":{},"subject":[],"published":{"date-parts":[[2022,2,24]]},"assertion":[{"value":"2022-06-06","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}