{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:41:58Z","timestamp":1750308118201,"version":"3.41.0"},"reference-count":13,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2005,3,1]],"date-time":"2005-03-01T00:00:00Z","timestamp":1109635200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Transactions on Asian Language Information Processing"],"published-print":{"date-parts":[[2005,3]]},"abstract":"<jats:p>The feasibility of converting text into speech using an inexpensive computer with minimal memory is of great interest. Speech synthesizers have been developed for many popular languages (e.g., English, Chinese, Spanish, French, etc.), but designing a speech synthesizer for a language is largely dependent on the language structure. In this article, we develop a Persian synthesizer that includes an innovative text analyzer module. In the synthesizer, the text is segmented into words and after preprocessing, a neural network is passed over each word. In addition to preprocessing, a new model (SEHMM) is used as a postprocessor to compensate for errors generated by the neural network. 
The performance of the proposed model is verified and the intelligibility of the synthetic speech is assessed via listening tests.<\/jats:p>","DOI":"10.1145\/1066078.1066081","type":"journal-article","created":{"date-parts":[[2005,8,3]],"date-time":"2005-08-03T08:30:55Z","timestamp":1123057855000},"page":"38-52","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["A speech synthesizer for Persian text using a neural network with a smooth ergodic HMM"],"prefix":"10.1145","volume":"4","author":[{"given":"F.","family":"Hendessi","sequence":"first","affiliation":[{"name":"Isfahan University of Technology, Isfahan, Iran"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"A.","family":"Ghayoori","sequence":"additional","affiliation":[{"name":"Isfahan University of Technology, Isfahan, Iran"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"T. A.","family":"Gulliver","sequence":"additional","affiliation":[{"name":"University of Victoria, Victoria, B.C., Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2005,3]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"crossref","first-page":"288","DOI":"10.1109\/TAU.1973.1162452","article-title":"A system for converting English text into speech","volume":"21","author":"Ainsworth W. A.","year":"1973","journal-title":"IEEE Trans. Audio and Electroacoustics"},{"key":"e_1_2_1_2_1","first-page":"119","article-title":"Phonemic transcription by analogy in text-to-speech synthesis: Novel word pronunciation and lexicon compression","volume":"12","author":"Bagshaw P. C.","year":"1998","journal-title":"Computational Linguistics"},{"key":"e_1_2_1_3_1","doi-asserted-by":"crossref","first-page":"1829","DOI":"10.1109\/29.45531","article-title":"An unrestricted vocabulary Arabic speech synthesis system","volume":"37","author":"El-Imam Y. A.","year":"1989","journal-title":"IEEE Trans. 
Acoustic, Speech and Signal Processing"},{"volume-title":"Proceedings of the IEEE Systems, Man and Cybernetics Conference. IEEE Society","year":"2000","author":"Embrechts M. J.","key":"e_1_2_1_4_1"},{"key":"e_1_2_1_5_1","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1109\/89.232612","article-title":"Improved tone concatenation rules in a formant-based Chinese text-to-speech system","volume":"1","author":"Lee L.-S.","year":"1993","journal-title":"IEEE Trans. Speech and Audio Processing"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/0167-6393(90)90021-Z"},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1109\/TASSP.1977.1162905","article-title":"On the use of autocorrelation analysis for pitch detection","volume":"25","author":"Rabiner L. R.","year":"1977","journal-title":"IEEE Trans. Acoustic, Speech and Signal Processing"},{"key":"e_1_2_1_8_1","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1109\/5.18626","article-title":"A tutorial on hidden Markov models and selected applications in speech recognition","volume":"77","author":"Rabiner L. R.","year":"1989","journal-title":"Proceedings of the IEEE"},{"key":"e_1_2_1_9_1","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1109\/TASSP.1976.1162846","article-title":"A comparative performance study of several pitch detection algorithms","volume":"24","author":"Rabiner L. R.","year":"1976","journal-title":"IEEE Trans. Acoustic, Speech, and Signal Processing"},{"key":"e_1_2_1_10_1","first-page":"145","article-title":"NETtalk: Parallel networks that learn to pronounce English text","volume":"1","author":"Sejnowski T. J.","year":"1987","journal-title":"Complex Systems"},{"key":"e_1_2_1_11_1","unstructured":"Selim H. and Anbar T. 1986. A phonetic transcription system of Arabic text. IBM Cairo Scientific Center Tech. Rep. 25."},{"volume-title":"Proceedings of the IEEE Workshop on Multimedia Signal Processing","year":"1998","author":"Sproat R.","key":"e_1_2_1_12_1"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/30.628698"}],"container-title":["ACM Transactions on Asian Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1066078.1066081","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1066078.1066081","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T16:08:17Z","timestamp":1750262897000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1066078.1066081"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,3]]},"references-count":13,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2005,3]]}},"alternative-id":["10.1145\/1066078.1066081"],"URL":"https:\/\/doi.org\/10.1145\/1066078.1066081","relation":{},"ISSN":["1530-0226","1558-3430"],"issn-type":[{"type":"print","value":"1530-0226"},{"type":"electronic","value":"1558-3430"}],"subject":[],"published":{"date-parts":[[2005,3]]},"assertion":[{"value":"2005-03-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}