{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T04:38:57Z","timestamp":1770698337519,"version":"3.49.0"},"reference-count":38,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2017,10,18]],"date-time":"2017-10-18T00:00:00Z","timestamp":1508284800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2018,3,31]]},"abstract":"<jats:p>Language analysis is very important for the native speaker to connect with the digital world. Assamese is a relatively unexplored language. In this report, we analyze different aspects of speech-to-text processing, starting from building a speech corpus, defining syllable rules, and finally developing a speech search engine of Assamese. We have collected about 20 hours of speech in three (viz., read, extempore, and conversation) modes and transcribed it. We also discuss some issues and challenges faced during development of the corpus. We have developed an automatic syllabification model with 11 rules for the Assamese language and found an accuracy of more than 95% in our result. We found 12 different syllable patterns where 5 are found most frequent. The maximum length of a syllable found is four letters. With the help of Hidden Markov Model Toolkit (HTK) 3.5, we used deep learning based neural network for our speech recognition model, where we obtained 78.05% accuracy for automatic transcription of Assamese speech.<\/jats:p>","DOI":"10.1145\/3137055","type":"journal-article","created":{"date-parts":[[2017,10,19]],"date-time":"2017-10-19T12:27:47Z","timestamp":1508416067000},"page":"1-14","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Development and Analysis of Speech Recognition Systems for Assamese Language Using HTK"],"prefix":"10.1145","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5630-1054","authenticated-orcid":false,"given":"Himangshu","family":"Sarma","sequence":"first","affiliation":[{"name":"Indian Institute of Information Technology Manipur, Imphal, Manipur, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Navanath","family":"Saharia","sequence":"additional","affiliation":[{"name":"Indian Institute of Information Technology Manipur, Imphal, Manipur, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Utpal","family":"Sharma","sequence":"additional","affiliation":[{"name":"Tezpur University, Tezpur, Assam, India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2017,10,18]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-03784-9_17"},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of ACL-08: HLT. Association for Computational Linguistics, 568--576","author":"Bartlett Susan","year":"2008"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-015-9311-7"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324903003073"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the INTERSPEECH","author":"Chang Shuangyu","year":"2000"},{"key":"e_1_2_1_6_1","volume-title":"Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on. IEEE","author":"Chen Xie"},{"key":"e_1_2_1_7_1","unstructured":"P. Coxhead. 2007. Phones and Phonemes. (2007).  P. Coxhead. 2007. Phones and Phonemes. (2007)."},{"key":"e_1_2_1_8_1","volume-title":"Some issues in metrical phonology of Bangla: The indigenous research tradition. Unpublished Ph. D. Dissertation","author":"Dan Mina","year":"1992"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1080\/09296174.2013.773136"},{"key":"e_1_2_1_10_1","volume-title":"Rath","author":"Gales Mark J. F.","year":"2014"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2012.6256337"},{"key":"e_1_2_1_12_1","volume-title":"Syllable analysis to build a dictation system in Telugu language. arXiv preprint arXiv:1001.2263","author":"Kalyani N.","year":"2010"},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the International Conference on Natutal Language Processing 2002 (ICON\u201902)","author":"Kishore S. P."},{"key":"e_1_2_1_15_1","doi-asserted-by":"crossref","volume-title":"Elements of Acoustic Phonetics","author":"Ladefoged Peter","DOI":"10.7208\/chicago\/9780226191010.001.0001"},{"key":"e_1_2_1_16_1","unstructured":"Peter Ladefoged and Keith Johnstone. 2011. A Course in Phonetics. CengageBrain. com.  Peter Ladefoged and Keith Johnstone. 2011. A Course in Phonetics. CengageBrain. com."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2009.4960571"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1984.1172426"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1989.266458"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2007.367175"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324905004043"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSCN.2008.4447161"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISACC.2015.7377331"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the 10th WSEAS International Conference on Applied Computer and Applied Computational Science (ACACOS\u201911)","author":"Musa Hafiz"},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of EUSIPCO","author":"Nagarajan T."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-54903-8_45"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/IALP.2012.59"},{"key":"e_1_2_1_29_1","volume-title":"Gales","author":"Ragni Anton","year":"2014"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.3115\/1667583.1667595"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2629670"},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the 4th Corpus Linguistics Conference.","author":"Saimaiti Maimaitimin","year":"2007"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12046-009-0006-0"},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of National Seminar cum Conference on Recent threads and Techniques in Computer Sciences.","author":"Sarma Himangshu","year":"2013"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/1386869.1386871"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/ComputationWorld.2009.59"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/SPED.2009.5156177"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1016\/B0-08-044854-2\/00014-6"},{"key":"e_1_2_1_39_1","volume-title":"The HTK book (HTK version 3.5) (version 3.5 ed.)","author":"Young Steve"},{"key":"e_1_2_1_40_1","volume-title":"Proc. Interspeech\u201915","author":"Zhang C."}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3137055","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3137055","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:11:18Z","timestamp":1750212678000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3137055"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,10,18]]},"references-count":38,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2018,3,31]]}},"alternative-id":["10.1145\/3137055"],"URL":"https:\/\/doi.org\/10.1145\/3137055","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,10,18]]},"assertion":[{"value":"2016-06-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-10-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}