{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,1,2]],"date-time":"2023-01-02T12:28:42Z","timestamp":1672662522544},"reference-count":34,"publisher":"Association for Computing Machinery (ACM)","issue":"3","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Transactions on Asian Language Information Processing"],"published-print":{"date-parts":[[2006,9]]},"abstract":"<jats:p>Machine transliteration is an automatic method for converting words in one language into phonetically equivalent ones in another language. There has been growing interest in the use of machine transliteration to assist machine translation and information retrieval. Three types of machine transliteration models---grapheme-based, phoneme-based, and hybrid---have been proposed. Surprisingly, there have been few reports of efforts to utilize the correspondence between source graphemes and source phonemes, although this correspondence plays an important role in machine transliteration. Furthermore, little work has been reported on ways to dynamically handle source graphemes and phonemes. In this paper, we propose a transliteration model that dynamically uses both graphemes and phonemes, particularly the correspondence between them. With this model, we have achieved better performance---improvements of about 15 to 41% in English-to-Korean transliteration and about 16 to 44% in English-to-Japanese transliteration---than has been reported for other models.<\/jats:p>","DOI":"10.1145\/1194936.1194938","type":"journal-article","created":{"date-parts":[[2007,1,16]],"date-time":"2007-01-16T19:38:29Z","timestamp":1168976309000},"page":"185-208","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["A machine transliteration model based on correspondence between graphemes and phonemes"],"prefix":"10.1145","volume":"5","author":[{"given":"Jong-Hoon","family":"Oh","sequence":"first","affiliation":[{"name":"National Institute of Information and Communications Technology, Soraku-gun, Kyoto, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Key-Sun","family":"Choi","sequence":"additional","affiliation":[{"name":"Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hitoshi","family":"Isahara","sequence":"additional","affiliation":[{"name":"National Institute of Information and Communications Technology, Soraku-gun, Kyoto, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2006,9]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1006538427943"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1022689900470"},{"key":"e_1_2_1_3_1","volume-title":"Proceedings of ACL","author":"Al-Onaizan Y.","year":"2002","unstructured":"Al-Onaizan , Y. and Knight , K . 2002. Translating named entities using monolingual and bilingual resources . In Proceedings of ACL 2002 . 400--408. 10.3115\/1073083.1073150 Al-Onaizan, Y. and Knight, K. 2002. Translating named entities using monolingual and bilingual resources. In Proceedings of ACL 2002. 400--408. 10.3115\/1073083.1073150"},{"key":"e_1_2_1_4_1","first-page":"39","article-title":"A maximum entropy approach to natural language processing","volume":"22","author":"Berger A. L.","year":"1996","unstructured":"Berger , A. L. , Pietra , S. D. , and Pietra , V. J. D. 1996 . A maximum entropy approach to natural language processing . Computational Linguistics 22 , 1, 39 -- 71 . Berger, A. L., Pietra, S. D., and Pietra, V. J. D. 1996. A maximum entropy approach to natural language processing. Computational Linguistics 22, 1, 39--71.","journal-title":"Computational Linguistics"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of IJCNLP","author":"Bilac S.","year":"2004","unstructured":"Bilac , S. and Tanaka , H . 2004. Improving back-transliteration by combining information sources . In Proceedings of IJCNLP 2004 . 542--547. Bilac, S. and Tanaka, H. 2004. Improving back-transliteration by combining information sources. In Proceedings of IJCNLP 2004. 542--547."},{"key":"e_1_2_1_6_1","volume-title":"the Electronic Dictionary Research and Development Group","author":"Breen J.","unstructured":"Breen , J. 2003. EDICT Japanese \/English dictionary. le. the Electronic Dictionary Research and Development Group , Monash University . Breen, J. 2003. EDICT Japanese\/English dictionary. le. the Electronic Dictionary Research and Development Group, Monash University."},{"key":"e_1_2_1_7_1","volume-title":"Proceedings of the Natural Language Processing Pacific Rim Symposium","author":"Brill E.","year":"2001","unstructured":"Brill , E. , Kacmarcik , G. , and Brockett , C . 2001. Automatically harvesting Katakana-English term pairs from search engine query logs . In Proceedings of the Natural Language Processing Pacific Rim Symposium 2001 . 393--399. Brill, E., Kacmarcik, G., and Brockett, C. 2001. Automatically harvesting Katakana-English term pairs from search engine query logs. In Proceedings of the Natural Language Processing Pacific Rim Symposium 2001. 393--399."},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the Natural Language Processing Pacific Rim Symposium","author":"Collier N.","year":"1997","unstructured":"Collier , N. , Kumano , A. , and Hirakawa , H . 1997. Acquisition of English-Japanese proper nouns from noisy-parallel newswire articles using Katakana matching . In Proceedings of the Natural Language Processing Pacific Rim Symposium 1997 . 309--314. Collier, N., Kumano, A., and Hirakawa, H. 1997. Acquisition of English-Japanese proper nouns from noisy-parallel newswire articles using Katakana matching. In Proceedings of the Natural Language Processing Pacific Rim Symposium 1997. 309--314."},{"key":"e_1_2_1_9_1","doi-asserted-by":"crossref","DOI":"10.1109\/TIT.1967.1053964","article-title":"Nearest neighbor pattern classification","volume":"13","author":"Cover T. M.","year":"1967","unstructured":"Cover , T. M. and Hart , P. E. 1967 . Nearest neighbor pattern classification . Institute of Electrical and Electronics Engineers Transactions on Information Theory 13 , 2127, 57--67. Cover, T. M. and Hart, P. E. 1967. Nearest neighbor pattern classification. Institute of Electrical and Electronics Engineers Transactions on Information Theory 13, 2127, 57--67.","journal-title":"Institute of Electrical and Electronics Engineers Transactions on Information Theory"},{"key":"e_1_2_1_10_1","unstructured":"Daelemans W. Zavrel J. Sloot K. V. D. and Bosch A. V. D. 2003. TiMBL: Tilburg Memory-Based Learner-version 4.3 reference guide.  Daelemans W. Zavrel J. Sloot K. V. D. and Bosch A. V. D. 2003. TiMBL: Tilburg Memory-Based Learner-version 4.3 reference guide."},{"key":"e_1_2_1_11_1","volume-title":"Pattern recognition: A statistical approach","author":"Devijver P. A.","unstructured":"Devijver , P. A. and Kittler ., J. 1982. Pattern recognition: A statistical approach . Prentice-Hall , Englewood Cliffs, NJ . Devijver, P. A. and Kittler., J. 1982. Pattern recognition: A statistical approach. Prentice-Hall, Englewood Cliffs, NJ."},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the IEEE.","volume":"61","author":"Forney G. D.","year":"1973","unstructured":"Forney , G. D. 1973 . The Viterbi algorithm . In Proceedings of the IEEE. Vol. 61 . 268--278. Forney, G. D. 1973. The Viterbi algorithm. In Proceedings of the IEEE. Vol. 61. 268--278."},{"key":"e_1_2_1_13_1","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1023\/A:1011856202986","article-title":"Japanese\/English cross-language information retrieval: Exploration of query translation and transliteration","volume":"35","author":"Fujii A.","year":"2001","unstructured":"Fujii , A. and Tetsuya , I. 2001 . Japanese\/English cross-language information retrieval: Exploration of query translation and transliteration . Computers and the Humanities 35 , 4, 389 -- 420 . Fujii, A. and Tetsuya, I. 2001. Japanese\/English cross-language information retrieval: Exploration of query translation and transliteration. Computers and the Humanities 35, 4, 389--420.","journal-title":"Computers and the Humanities"},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of MT-Summit IX. 125--132","author":"Goto I.","unstructured":"Goto , I. , Kato , N. , Uratani , N. , and Ehara , T . 2003. Transliteration considering context information based on the maximum entropy method . In Proceedings of MT-Summit IX. 125--132 . Goto, I., Kato, N., Uratani, N., and Ehara, T. 2003. Transliteration considering context information based on the maximum entropy method. In Proceedings of MT-Summit IX. 125--132."},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition. 11--44","author":"Kando N.","unstructured":"Kando , N. , Kuriyama , K. , Nozue , T. , Eguchi , K. , Kato , H. , and Hidaka , S . 1999. Overview of IR tasks . In Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition. 11--44 . Kando, N., Kuriyama, K., Nozue, T., Eguchi, K., Kato, H., and Hidaka, S. 1999. Overview of IR tasks. In Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition. 11--44."},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the 2nd International Conference on Language Resources and Evaluation. 1135--1411","author":"Kang B. J.","unstructured":"Kang , B. J. and Choi , K. S . 2000. Automatic transliteration and back-transliteration by decision tree learning . In Proceedings of the 2nd International Conference on Language Resources and Evaluation. 1135--1411 . Kang, B. J. and Choi, K. S. 2000. Automatic transliteration and back-transliteration by decision tree learning. In Proceedings of the 2nd International Conference on Language Resources and Evaluation. 1135--1411."},{"key":"e_1_2_1_18_1","volume-title":"Proceedings of the 18th International Conference on Computational Linguistics. 418--424","author":"Kang I. H.","unstructured":"Kang , I. H. and Kim , G. C . 2000. English-to-Korean transliteration using multiple unbounded overlapping phoneme chunks . In Proceedings of the 18th International Conference on Computational Linguistics. 418--424 . 10.3115\/990820.990881 Kang, I. H. and Kim, G. C. 2000. English-to-Korean transliteration using multiple unbounded overlapping phoneme chunks. In Proceedings of the 18th International Conference on Computational Linguistics. 418--424. 10.3115\/990820.990881"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of Korea Cognitive Science Association. 247--252","author":"Kim J. J.","unstructured":"Kim , J. J. , Lee , J. S. , and Choi , K. S . 1999. Pronunciation unit based automatic English-Korean transliteration model using neural network . In Proceedings of Korea Cognitive Science Association. 247--252 . Kim, J. J., Lee, J. S., and Choi, K. S. 1999. Pronunciation unit based automatic English-Korean transliteration model using neural network. In Proceedings of Korea Cognitive Science Association. 247--252."},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the 35th Annual Meetings of the Association for Computational Linguistics. 128--135","author":"Knight K.","unstructured":"Knight , K. and Graehl , J . 1997. Machine transliteration . In Proceedings of the 35th Annual Meetings of the Association for Computational Linguistics. 128--135 . 10.3115\/976909.979634 Knight, K. and Graehl, J. 1997. Machine transliteration. In Proceedings of the 35th Annual Meetings of the Association for Computational Linguistics. 128--135. 10.3115\/976909.979634"},{"key":"e_1_2_1_21_1","unstructured":"Korea Ministry of Culture & Tourism. 1995. English to Korean standard conversion rules.  Korea Ministry of Culture & Tourism. 1995. English to Korean standard conversion rules."},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of PACLIC 18","author":"Kuo J.-S.","unstructured":"Kuo , J.-S. and Yang , Y . -K. 2004. Generating paired transliterated-cognates using multiple pronunciation characteristics from web corpora . In Proceedings of PACLIC 18 . 275--282. Kuo, J.-S. and Yang, Y.-K. 2004. Generating paired transliterated-cognates using multiple pronunciation characteristics from web corpora. In Proceedings of PACLIC 18. 275--282."},{"key":"e_1_2_1_24_1","first-page":"17","article-title":"English to Korean statistical transliteration for information retrieval","volume":"12","author":"Lee J. S.","year":"1998","unstructured":"Lee , J. S. and Choi , K. S. 1998 . English to Korean statistical transliteration for information retrieval . Computer Processing of Oriental Languages 12 , 1, 17 -- 37 . Lee, J. S. and Choi, K. S. 1998. English to Korean statistical transliteration for information retrieval. Computer Processing of Oriental Languages 12, 1, 17--37.","journal-title":"Computer Processing of Oriental Languages"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings. of ACL","author":"Li H.","year":"2004","unstructured":"Li , H. , Zhang , M. , and Su , J . 2004. A joint source-channel model for machine transliteration . In Proceedings. of ACL 2004 . 160--167. Li, H., Zhang, M., and Su, J. 2004. A joint source-channel model for machine transliteration. In Proceedings. of ACL 2004. 160--167."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the Sixth Conference on Natural Language Learning (CoNLL). 139--145","author":"Lin W. H.","year":"1885","unstructured":"Lin , W. H. and Chen , H. H . 2002. Backward machine transliteration by learning phonetic similarity . In Proceedings of the Sixth Conference on Natural Language Learning (CoNLL). 139--145 . 10.3115\/11 1885 3.1118870 Lin, W. H. and Chen, H. H. 2002. Backward machine transliteration by learning phonetic similarity. In Proceedings of the Sixth Conference on Natural Language Learning (CoNLL). 139--145. 10.3115\/1118853.1118870"},{"key":"e_1_2_1_27_1","unstructured":"Manning C. and Schutze H. 1999. Foundations of Statistical Natural Language Processing. MIT Press Cambridge MA.   Manning C. and Schutze H. 1999. Foundations of Statistical Natural Language Processing. MIT Press Cambridge MA."},{"key":"e_1_2_1_28_1","volume-title":"Machine Learning","author":"Mitchell T. M.","unstructured":"Mitchell , T. M. 1997. Machine Learning . McGraw-Hill , New- York . Mitchell, T. M. 1997. Machine Learning. McGraw-Hill, New-York."},{"key":"e_1_2_1_29_1","volume-title":"Proceedings of Human Language Technology Conference. 292--297","author":"Miyao Y.","unstructured":"Miyao , Y. and Tsuji , J . 2002. Maximum entropy estimation for feature forests . In Proceedings of Human Language Technology Conference. 292--297 . Miyao, Y. and Tsuji, J. 2002. Maximum entropy estimation for feature forests. In Proceedings of Human Language Technology Conference. 292--297."},{"key":"e_1_2_1_30_1","unstructured":"Nam Y. S. 1997. Foreign dictionary. Sung An Dang.  Nam Y. S. 1997. Foreign dictionary. Sung An Dang."},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of Special Interest Group on Artificial Intelligence of Korea Information Science Society Spring Conference. 59--65","author":"Park Y.","unstructured":"Park , Y. , Choi , K.-S. , Kim , J. , and Kim , Y . 1996. Development of the data collection Ver. 2.0 (KTSET 2.0) for Korean information retrieval studies . In Proceedings of Special Interest Group on Artificial Intelligence of Korea Information Science Society Spring Conference. 59--65 . Park, Y., Choi, K.-S., Kim, J., and Kim, Y. 1996. Development of the data collection Ver. 2.0 (KTSET 2.0) for Korean information retrieval studies. In Proceedings of Special Interest Group on Artificial Intelligence of Korea Information Science Society Spring Conference. 59--65."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1022643204877"},{"key":"e_1_2_1_33_1","volume-title":"Programs for Machine Learning. Morgan Kaufmann","author":"Quinlan J. R.","unstructured":"Quinlan , J. R. 1993. C4.5 : Programs for Machine Learning. Morgan Kaufmann , San Mateo, CA . Quinlan, J. R. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA."},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of IEEE International Conference on Acoustic Speech and Signal Processing. 546--549","author":"Soong K. F.","unstructured":"Soong , K. F. and Huang , E. F . 1991. A tree-trellis based fast search for finding the n best sentence hypotheses in continuous speech recognition . In Proceedings of IEEE International Conference on Acoustic Speech and Signal Processing. 546--549 . Soong, K. F. and Huang, E. F. 1991. A tree-trellis based fast search for finding the n best sentence hypotheses in continuous speech recognition. In Proceedings of IEEE International Conference on Acoustic Speech and Signal Processing. 546--549."},{"key":"e_1_2_1_35_1","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1142\/S0219427902000649","article-title":"Automatic extraction of translational Japanese-Katakana and English word pairs from bilingual corpora","volume":"15","author":"Tsujii K.","year":"2002","unstructured":"Tsujii , K. 2002 . Automatic extraction of translational Japanese-Katakana and English word pairs from bilingual corpora . International Journal of Computer Processing of Oriental Languages 15 , 3, 261 -- 279 . Tsujii, K. 2002. Automatic extraction of translational Japanese-Katakana and English word pairs from bilingual corpora. International Journal of Computer Processing of Oriental Languages 15, 3, 261--279.","journal-title":"International Journal of Computer Processing of Oriental Languages"},{"key":"e_1_2_1_36_1","unstructured":"Zhang L. 2004. Maximum entropy modeling toolkit for python and C&plus;&plus;.  Zhang L. 2004. Maximum entropy modeling toolkit for python and C&plus;&plus;."}],"container-title":["ACM Transactions on Asian Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1194936.1194938","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T21:42:01Z","timestamp":1672263721000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1194936.1194938"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,9]]},"references-count":34,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2006,9]]}},"alternative-id":["10.1145\/1194936.1194938"],"URL":"https:\/\/doi.org\/10.1145\/1194936.1194938","relation":{},"ISSN":["1530-0226","1558-3430"],"issn-type":[{"value":"1530-0226","type":"print"},{"value":"1558-3430","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,9]]},"assertion":[{"value":"2006-09-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}