{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T23:34:42Z","timestamp":1772580882113,"version":"3.50.1"},"reference-count":74,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2023,5,18]],"date-time":"2023-05-18T00:00:00Z","timestamp":1684368000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Center for Open Intelligent Connectivity from The Featured Areas Research Center Program"},{"name":"Higher Education Sprout Project by the Ministry of Education in Taiwan"},{"DOI":"10.13039\/501100004663","name":"Ministry of Science and Technology in Taiwan","doi-asserted-by":"crossref","award":["108-2221-E-009-047, 107-2221- E-197-006-MY3, 108-2321-B-197-003, 109-2221-E-027-108"],"award-info":[{"award-number":["108-2221-E-009-047, 107-2221- E-197-006-MY3, 108-2321-B-197-003, 109-2221-E-027-108"]}],"id":[{"id":"10.13039\/501100004663","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Ministry of Economic Affairs in Taiwan","award":["107-EC-17-A-02-S5-008, 108-EC-17-A-02-S5-008, 109-EC-17-A-02-S5-008"],"award-info":[{"award-number":["107-EC-17-A-02-S5-008, 108-EC-17-A-02-S5-008, 109-EC-17-A-02-S5-008"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Internet Technol."],"published-print":{"date-parts":[[2023,5,31]]},"abstract":"<jats:p>The voice-based Internet of Multimedia Things (IoMT) is the combination of IoT interfaces and protocols with associated voice-related information, which enables advanced applications based on human-to-device interactions. An example is Automatic Speech Recognition (ASR) for live captioning and voice translation. Three major issues of ASR for IoMT are IoT development cost, speech recognition accuracy, and execution time complexity. For the first issue, most non-voice IoT applications are upgraded with the ASR feature through hard coding, which are error prone. For the second issue, recognition accuracy must be improved for ASR. For the third issue, many multimedia IoT services are real-time applications and, therefore, the ASR delay must be short.<\/jats:p>\n          <jats:p>\n            This article elaborates on the above issues based on an IoT platform called VoiceTalk. We built the largest Taiwanese spoken corpus to train\n            <jats:bold>VoiceTalk ASR (VT-ASR)<\/jats:bold>\n            and show how the VT-ASR mechanism can be transparently integrated with existing IoT applications. We consider two performance measures for VoiceTalk: speech recognition accuracy and VT-ASR delay. For the acoustic tests of PAL-Labs, VT-ASR's accuracy is 96.47%, while Google's accuracy is 94.28%. We are the first to develop an analytic model to investigate the probability that the VT-ASR delay for the first speaker is complete before the second speaker starts talking. From the measurements and analytic modeling, we show that the VT-ASR delay is short enough to result in a very good user experience. Our solution has won several important government and commercial TV contracts in Taiwan. VT-ASR has demonstrated better Taiwanese Mandarin speech recognition accuracy than famous commercial products (including Google and Iflytek) in Formosa Speech Recognition Challenge 2018 (FSR-2018) and was the best among all participating ASR systems for Taiwanese recognition accuracy in FSR-2020.\n          <\/jats:p>","DOI":"10.1145\/3543854","type":"journal-article","created":{"date-parts":[[2022,6,14]],"date-time":"2022-06-14T09:58:09Z","timestamp":1655200689000},"page":"1-30","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["VoiceTalk: Multimedia-IoT Applications for Mixing Mandarin, Taiwanese, and English"],"prefix":"10.1145","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6841-4718","authenticated-orcid":false,"given":"Yi-Bing","family":"Lin","sequence":"first","affiliation":[{"name":"College of Artificial Intelligence and Green Energy, National Yang Ming Chiao Tung University, China Medicine University, Department of Computer Science and Information Engineering, Asia University, College of Humanities and Sciences, Miin Wu School of Computing, National Cheng Kung University"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0191-2178","authenticated-orcid":false,"given":"Yuan-Fu","family":"Liao","sequence":"additional","affiliation":[{"name":"Department of Electronic\u00a0Engineering, National Taipei University of\u00a0Technology"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9820-2318","authenticated-orcid":false,"given":"Sin-Horng","family":"Chen","sequence":"additional","affiliation":[{"name":"Dept. of Electrical and Computer Engineering,\u00a0National Yang Ming Chiao Tung University"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1358-0502","authenticated-orcid":false,"given":"Shaw-Hwa","family":"Hwang","sequence":"additional","affiliation":[{"name":"Dept. of Electrical and Computer Engineering,\u00a0National Yang Ming Chiao Tung University"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4483-1418","authenticated-orcid":false,"given":"Yih-Ru","family":"Wang","sequence":"additional","affiliation":[{"name":"Dept. of Electrical and Computer Engineering,\u00a0National Yang Ming Chiao Tung University"}]}],"member":"320","published-online":{"date-parts":[[2023,5,18]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.3390\/s20082334"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2020.3039518"},{"key":"e_1_3_2_4_2","first-page":"471","volume-title":"Proceedings of the International Conference on Advances in Computing and Data Sciences","author":"Yadav D. K.","year":"2016","unstructured":"D. K. Yadav, K. Singh, and S. Kumari. 2016. Challenging issues of video surveillance system using internet of things in cloud environment. In Proceedings of the International Conference on Advances in Computing and Data Sciences, 471\u2013481. 10.1007\/978-981-10-5427-3_49"},{"key":"e_1_3_2_5_2","doi-asserted-by":"crossref","first-page":"426","DOI":"10.1145\/2789168.2790123","volume-title":"Proceedings of the 21st Annual International Conference on Mobile Computing and Networking (MobiCom\u201915)","author":"Zhang Tan","year":"2015","unstructured":"Tan Zhang, Aakanksha Chowdhery, Paramvir (Victor) Bahl, Kyle Jamieson, and Suman Banerjee. 2015. The design and implementation of a wireless video surveillance system. In Proceedings of the 21st Annual International Conference on Mobile Computing and Networking (MobiCom\u201915). Association for Computing Machinery, New York, NY, 426\u2013438. 10.1145\/2789168.2790123"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2020.2979670"},{"key":"e_1_3_2_7_2","unstructured":"M. Sheng A. K. Sangaiah and A. Chaudhary. CALL FOR PAPERS: Special Issue on Applications of Computational Linguistics in Multimedia IoT Services ACM Transactions on Internet Technology (TOIT) Retrieved from https:\/\/dl.acm.org\/pb-assets\/static_journal_pages\/toit\/pdf\/ACM-TOIT-CFP-IoMT-Jan21-1610575659947.pdf."},{"key":"e_1_3_2_8_2","unstructured":"The Association for Computational Linguistics and Chinese Language Processing: Database. Retrieved from http:\/\/www.aclclp.org.tw\/corp.php."},{"key":"e_1_3_2_9_2","unstructured":"The Association for Computational Linguistics and Chinese Language Processing. Retrieved from http:\/\/www.aclclp.org.tw\/."},{"key":"e_1_3_2_10_2","unstructured":"Linguistic Data Consortium University of Pennsylvania. Retrieved from https:\/\/www.ldc.upenn.edu\/."},{"key":"e_1_3_2_11_2","unstructured":"The European Language Resources Association. Retrieved from http:\/\/www.elra.info\/en\/."},{"key":"e_1_3_2_12_2","unstructured":"SpeechOcean: Speech Data Services Text Data and Image Data Services Speech Datasets Database. Retrieved from http:\/\/en.speechocean.com\/."},{"key":"e_1_3_2_13_2","unstructured":"Google: Cloud Speech-to-text. Retrieved from https:\/\/cloud.google.com\/speech-to-text\/."},{"key":"e_1_3_2_14_2","unstructured":"IFlyTek: iFLYTEK Open Platform\u2014China's First Artificial Intelligence Open Platform for Mobile Internet and Intelligent Hardware Developers. Retrieved from http:\/\/global.xfyun.cn\/."},{"key":"e_1_3_2_15_2","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1145\/1143844.1143891","volume-title":"Proceedings of the 23rd International Conference on Machine Learning (ICM\u201906)","author":"Graves Alex","year":"2006","unstructured":"Alex Graves, Santiago Fern\u00e1ndez, Faustino Gomez, and J\u00fcrgen Schmidhuber. 2006. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In Proceedings of the 23rd International Conference on Machine Learning (ICM\u201906). Association for Computing Machinery, New York, NY, 369\u2013376. 10.1145\/1143844.1143891"},{"key":"e_1_3_2_16_2","unstructured":"D. Amodei et al. 2016. Deep Speech 2: End-to-End speech recognition in english and mandarin. arXiv:1512.02595. Retrieved from https:\/\/arxiv.org\/abs\/1512.02595."},{"key":"e_1_3_2_17_2","volume-title":"Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201919)","author":"Karita S.","year":"2019","unstructured":"S. Karita, N. E. Y. Soplin, S. Watanabe, M. Delcroix, A. Ogawa, and T. Nakatani. 2019. Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration. In Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201919). 10.21437\/Interspeech.2019-1938"},{"key":"e_1_3_2_18_2","doi-asserted-by":"crossref","unstructured":"N. Q. Pham T. S. Nguyen J. Niehues M. M\u00fcller S. St\u00fcker and A. Waibel. 2019. Very deep self-attention networks for end-to-end speech recognition. arXiv:1904.13377. Retrieved from https:\/\/arxiv.org\/abs\/1904.13377.","DOI":"10.21437\/Interspeech.2019-2702"},{"key":"e_1_3_2_19_2","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201916)","author":"Chan W.","year":"2016","unstructured":"W. Chan, N. Jaitly, Q. Le, and O. Vinyals. 2016. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201916). 10.1109\/ICASSP.2016.7472621"},{"key":"e_1_3_2_20_2","volume-title":"Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201920)","author":"Gulati A.","year":"2020","unstructured":"A. Gulati et al. 2020. Conformer: Convolution-augmented transformer for speech recognition. In Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201920). 10.21437\/Interspeech.2020-3015"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2012.2205597"},{"key":"e_1_3_2_22_2","unstructured":"W. Xiong et al. 2016. Achieving human parity in conversational speech recognition. arXiv:1610.05256. Retrieved from https:\/\/arxiv.org\/abs\/1610.05256"},{"key":"e_1_3_2_23_2","volume-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP\u20192020)","author":"Zhou W.","year":"2020","unstructured":"W. Zhou, W. Michel, K. Irie, M. Kitza, R. Schluter, and H. Ney. 2020. The RWTH ASR system for TED-LIUM Release 2: Improving hybrid HMM with SpecAugment. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP\u20192020). 10.48550\/arXiv.2004.00960"},{"key":"e_1_3_2_24_2","volume-title":"Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201919)","author":"L\u00fcscher C.","year":"2019","unstructured":"C. L\u00fcscher et al. 2019. RWTH ASR systems for Librispeech: Hybrid vs Attention - w\/o Data Augmentation. In Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201919)."},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2339736"},{"key":"e_1_3_2_26_2","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201913)","author":"Graves A.","year":"2013","unstructured":"A. Graves, A. Mohamed, and G. Hinton, 2013. Speech recognition with deep recurrent neural networks. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201913). 10.1109\/ICASSP.2013.6638947"},{"key":"e_1_3_2_27_2","volume-title":"Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201915)","author":"Peddinti V.","year":"2015","unstructured":"V. Peddinti, D. Povey, and S. Khudanpur. 2015. A time delay neural network architecture for efficient modeling of long temporal contexts. In Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201915). 10.21437\/Interspeech.2015-647"},{"key":"e_1_3_2_28_2","volume-title":"Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201918)","author":"Povey D.","year":"2018","unstructured":"D. Povey et al. 2018. Semi-Orthogonal low-rank matrix factorization for deep neural networks. In Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201918). 10.21437\/Interspeech.2018-1417"},{"key":"e_1_3_2_29_2","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201919)","author":"Manohar V.","year":"2019","unstructured":"V. Manohar, S. Chen, Z. Wang, Y. Fujita, S. Watanabe, and S. Khudanpur. 2019. Acoustic modeling for overlapping speech recognition: Jhu Chime-5 challenge system. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201919). 10.1109\/ICASSP.2019.8682556"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11265-019-01483-4"},{"key":"e_1_3_2_31_2","unstructured":"TCC300 Corpus. Retrieved April 6 2021 from http:\/\/www.aclclp.org.tw\/use_mat.php#tcc300edu\u2019."},{"key":"e_1_3_2_32_2","volume-title":"Proceedings of the ISCA & IEEE Workshop on Spontaneous Speech Process. Recognition (SSPR\u201903)","author":"Wang H.-M.","year":"2003","unstructured":"H.-M. Wang. 2003. MATBN 2002: A mandarin Chinese broadcast news corpus. In Proceedings of the ISCA & IEEE Workshop on Spontaneous Speech Process. Recognition (SSPR\u201903)."},{"key":"e_1_3_2_33_2","volume-title":"Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I\/O Systems and Assessment (O-COCOSDA\u201917)","author":"Bu H.","year":"2017","unstructured":"H. Bu, J. Du, X. Na, B. Wu, and H. Zheng. 2017. AISHELL-1: An open-source Mandarin speech corpus and a speech recognition baseline. In Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I\/O Systems and Assessment (O-COCOSDA\u201917). 10.1109\/ICSDA.2017.8384449"},{"key":"e_1_3_2_34_2","unstructured":"D. Wang and X. Zhang. 2015. THCHS-30: A free chinese speech corpus. arXiv:1512.01882. Retrieved from https:\/\/arxiv.org\/abs\/1512.01882."},{"key":"e_1_3_2_35_2","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201915)","author":"Panayotov V.","year":"2015","unstructured":"V. Panayotov, G. Chen, D. Povey, and S. Khudanpur. 2015. Librispeech: An ASR corpus based on public domain audio books. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201915). 10.1109\/ICASSP.2015.7178964"},{"key":"e_1_3_2_36_2","volume-title":"Proceedings of the Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA\u201916)","author":"Wang D.","year":"2016","unstructured":"D. Wang, Z. Tang, D. Tang, and Q. Chen. 2016. OC16-CE80: A Chinese-English mixlingual database and a speech recognition baseline. In Proceedings of the Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA\u201916). 10.1109\/icsda.2016.7918989"},{"key":"e_1_3_2_37_2","volume-title":"Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201910)","author":"Lyu D.-C.","year":"2010","unstructured":"D.-C. Lyu, T.-P. Tan, E.-S. Chng, and H. Li. 2010. SEAME: A mandarin-english code-switching speech corpus in South-East Asia. In Proceedings of the Conference of the International Speech Communication Association (Interspeech\u201910)."},{"key":"e_1_3_2_38_2","unstructured":"C. Huang. 2009. Tagged Chinese gigaword corpus 2.0. linguistic data consortium philadelphia. 10.35111\/9bhh-2s82"},{"key":"e_1_3_2_39_2","unstructured":"J. Du X. Na X. Liu and H. Bu. 2018. AISHELL-2: Transforming mandarin ASR research into industrial scale. arXiv:1808.10583. Retrieved from http:\/\/arxiv.org\/abs\/1808.10583."},{"key":"e_1_3_2_40_2","volume-title":"Proceedings of the IEEE Pune Section International Conference (PuneCon\u201919)","author":"Kavre M.S.","year":"2019","unstructured":"M.S. Kavre, A. Gadekar, and Y. Gadhade. 2019. Internet of things (IoT): A survey. In Proceedings of the IEEE Pune Section International Conference (PuneCon\u201919)."},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jksuci.2016.10.003"},{"key":"e_1_3_2_42_2","doi-asserted-by":"crossref","unstructured":"G. Marques N. Garcia and N. Pombo. 2017. A survey on IoT: Architectures elements applications QoS platforms and security concepts. In Advances in Mobile Cloud Computing and Big Data in the 5G Era Studies in Big Data Vol. 22 C. Mavromoustakis G. Mastorakis and C. Dobre (Eds.). Springer Cham. 10.1007\/978-3-319-45145-9_5","DOI":"10.1007\/978-3-319-45145-9_5"},{"key":"e_1_3_2_43_2","volume-title":"Proceedings of the 5th Conference on Mobile and Secure Services (MobiSecServ\u201919)","author":"Godwin S.","year":"2019","unstructured":"S. Godwin, B. Glendenning, and K. Gagneja. 2019. Future security of smart speaker and IoT smart home devices. In Proceedings of the 5th Conference on Mobile and Secure Services (MobiSecServ\u201919). 10.1109\/MOBISECSERV.2019.8686545."},{"key":"e_1_3_2_44_2","volume-title":"Proceedings of the IEEE Security and Privacy Workshops (SPW\u201919)","author":"Cheng P.","year":"2019","unstructured":"P. Cheng, I. E. Bagci, J. Yan, and U. Roedig. 2019. Smart speaker privacy control\u2014Acoustic tagging for personal voice assistants. In Proceedings of the IEEE Security and Privacy Workshops (SPW\u201919). 10.1109\/SPW.2019.00035"},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/MCE.2012.2207158"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2017.2715859"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/65.912720"},{"key":"e_1_3_2_48_2","volume-title":"Proceedings of the 15th International Carpathian Control Conference (ICCC\u201914)","author":"N\u00e1dvorn\u00edk J.","year":"2014","unstructured":"J. N\u00e1dvorn\u00edk and P. Smutn\u00fd. 2014. Remote control robot using Android mobile device. In Proceedings of the 15th International Carpathian Control Conference (ICCC\u201914). 10.1109\/CarpathianCC.2014.6843630"},{"key":"e_1_3_2_49_2","volume-title":"Proceedings of the IEEE International Conference on Consumer Electronics\u2013Taiwan (ICCE-TW\u20192017)","author":"Sullivan D.","year":"2017","unstructured":"D. Sullivan, W. Chen, and A. Pandya. 2017. Design of remote control of home appliances via Bluetooth and Android smart phones. In Proceedings of the IEEE International Conference on Consumer Electronics\u2013Taiwan (ICCE-TW\u20192017). 10.1109\/ICCE-China.2017.7991150"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3058364"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2017.2682100"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2018.2832222"},{"key":"e_1_3_2_53_2","unstructured":"Swing Light Pole Interactive Art Demo. Retrieved March 2021 from https:\/\/youtu.be\/wZ99kc-4aAo."},{"key":"e_1_3_2_54_2","unstructured":"Hollow Light Globe Demo. Retrieved March 2021 from https:\/\/youtu.be\/ZICUCOjQ4iA."},{"key":"e_1_3_2_55_2","doi-asserted-by":"publisher","DOI":"10.3390\/s19081763"},{"key":"e_1_3_2_56_2","unstructured":"Smart Plantbox Demo. Retrieved March 2021 from https:\/\/youtu.be\/pyVbYxOEZWo."},{"key":"e_1_3_2_57_2","unstructured":"Smart Toilet Demo. Retrieved March 2021 from https:\/\/youtu.be\/Pr15OyC7fNc."},{"key":"e_1_3_2_58_2","unstructured":"Smart Robot Demo. Retrieved March 2021 from https:\/\/youtu.be\/kPMIJ2TxfIg."},{"key":"e_1_3_2_59_2","volume-title":"Proceedings of the 11th International Symposium on Chinese Spoken Language Processing (ISCSLP\u201918)","author":"Liao Y.-F.","year":"2018","unstructured":"Y.-F. Liao et al. 2018. Formosa speech recognition challenge 2018: Data, plan and baselines. In Proceedings of the 11th International Symposium on Chinese Spoken Language Processing (ISCSLP\u201918). 10.1109\/ISCSLP.2018.8706700"},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11265-019-01483-4"},{"key":"e_1_3_2_61_2","unstructured":"Taiwan's National Education Radio (NER) Corpus. Retrieved from http:\/\/www.aclclp.org.tw\/use_mat_c.php#ner."},{"key":"e_1_3_2_62_2","volume-title":"Proceedings of 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA\u201920)","author":"Liao Y.-F.","year":"2020","unstructured":"Y.-F. Liao et al. 2020. Formosa speech recognition challenge 2020 and taiwanese across Taiwan Corpus. In Proceedings of 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA\u201920). 10.1109\/O-COCOSDA50338.2020.9295019"},{"key":"e_1_3_2_63_2","unstructured":"Y.-F. Liao. 2018. Formosa speech recognition challenge 2018. Retrieved from https:\/\/sites.google.com\/speech.ntut.edu.tw\/fsw\/home\/challenge."},{"key":"e_1_3_2_64_2","unstructured":"Y.-F. Liao. Formosa Speech Recognition Challenge 2020. Retrieved from https:\/\/sites.google.com\/speech.ntut.edu.tw\/fsw\/home\/challenge-2020."},{"key":"e_1_3_2_65_2","unstructured":"2020 Presidential Election-Television Debate (Live Subtitling). 2020. Retrieved from https:\/\/youtu.be\/zcrIoO_8ZbU."},{"key":"e_1_3_2_66_2","unstructured":"List of 10th Legislative Councilors of Legislative Yuan in Taiwan 2021. Retrieved from https:\/\/www.ly.gov.tw\/Pages\/List.aspx?nodeid=109."},{"key":"e_1_3_2_67_2","unstructured":"PAL Acoustics Technology Ltd 2021. Retrieved from http:\/\/www.pal-acoustics.com\/."},{"key":"e_1_3_2_68_2","unstructured":"Public Television Service PBS Talk. Retrieved 2021 from https:\/\/www.youtube.com\/user\/PTSTalk\/videos."},{"key":"e_1_3_2_69_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSYST.2017.2773077"},{"key":"e_1_3_2_70_2","doi-asserted-by":"publisher","DOI":"10.1109\/TVT.2003.819616"},{"key":"e_1_3_2_71_2","doi-asserted-by":"publisher","DOI":"10.1109\/35.536558"},{"key":"e_1_3_2_72_2","doi-asserted-by":"publisher","DOI":"10.1109\/4234.752903"},{"key":"e_1_3_2_73_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.wocn.2010.08.002"},{"key":"e_1_3_2_74_2","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyg.2015.00731"},{"key":"e_1_3_2_75_2","first-page":"258","volume-title":"Proceedings of the IEEE Wireless Communications and Networking Conference","volume":"1","author":"Yin L.","unstructured":"L. Yin, B. Li, Z. Zhang, and Y.-B. Lin. Performance analysis of a dual-threshold reservation (DTR) scheme for voice\/data integrated mobile wireless networks. In Proceedings of the IEEE Wireless Communications and Networking Conference, Vol. 1. 258\u2013262. DOI:10.10.1109\/WCNC.2000.904638"}],"container-title":["ACM Transactions on Internet Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3543854","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3543854","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T17:49:39Z","timestamp":1750268979000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3543854"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,18]]},"references-count":74,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,5,31]]}},"alternative-id":["10.1145\/3543854"],"URL":"https:\/\/doi.org\/10.1145\/3543854","relation":{},"ISSN":["1533-5399","1557-6051"],"issn-type":[{"value":"1533-5399","type":"print"},{"value":"1557-6051","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,5,18]]},"assertion":[{"value":"2021-04-15","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-06-03","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-05-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}