{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,8,18]],"date-time":"2023-08-18T16:46:21Z","timestamp":1692377181861},"reference-count":48,"publisher":"Cambridge University Press (CUP)","issue":"2","license":[{"start":{"date-parts":[[2010,3,24]],"date-time":"2010-03-24T00:00:00Z","timestamp":1269388800000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2010,4]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Electronic written texts used in computer-mediated interactions (emails, blogs, chats, and the like) contain significant deviations from the norm of the language. This paper presents the detail of a system aiming at normalizing the orthography of French SMS messages: after discussing the linguistic peculiarities of these messages and possible approaches to their automatic normalization, we present, compare, and evaluate various instantiations of a normalization device based on weighted finite-state transducers. 
These experiments show that using an intermediate phonemic representation and training, our system outperforms an alternative normalization system based on phrase-based statistical machine translation techniques.<\/jats:p>","DOI":"10.1017\/s1351324909990258","type":"journal-article","created":{"date-parts":[[2010,3,24]],"date-time":"2010-03-24T14:22:15Z","timestamp":1269440535000},"page":"133-159","source":"Crossref","is-referenced-by-count":6,"title":["Rewriting the orthography of SMS messages"],"prefix":"10.1017","volume":"16","author":[{"given":"FRAN\u00c7OIS","family":"YVON","sequence":"first","affiliation":[]}],"member":"56","published-online":{"date-parts":[[2010,3,24]]},"reference":[{"key":"S1351324909990258_ref30","doi-asserted-by":"publisher","DOI":"10.1145\/146370.146380"},{"key":"S1351324909990258_ref23","unstructured":"Jansche M. 2003. Inference of string mappings for language technology. PhD thesis, Ohio State University."},{"key":"S1351324909990258_ref25","first-page":"331","article-title":"Regular models of phonological rule systems","volume":"20","author":"Kaplan","year":"1994","journal-title":"Computational Linguistics"},{"key":"S1351324909990258_ref44","first-page":"901","volume-title":"Proceedings of the International Conference on Spoken Language Processing (ICSLP)","author":"Stolcke","year":"2002"},{"key":"S1351324909990258_ref48","first-page":"688","volume-title":"Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics","author":"Zhu","year":"2007"},{"key":"S1351324909990258_ref19","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1989.266481"},{"key":"S1351324909990258_ref10","doi-asserted-by":"publisher","DOI":"10.3115\/981863.981904"},{"key":"S1351324909990258_ref21","first-page":"71","volume-title":"the Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics","author":"Golding","year":"1996"},{"key":"S1351324909990258_ref28","volume-title":"Proceedings of the Annual Meeting 
of the Association for Computational Linguistics, Demonstration Session","author":"Koehn","year":"2007"},{"key":"S1351324909990258_ref34","volume-title":"Finite State Natural Language Processing","author":"Mohri","year":"1997"},{"key":"S1351324909990258_ref42","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324901002650"},{"key":"S1351324909990258_ref6","first-page":"47","article-title":"LIA_PHON: Un syst\u00e8me complet de phon\u00e9tisation de textes","volume":"42","author":"B\u00e9chet","year":"2001","journal-title":"Traitement Automatique des Langues"},{"key":"S1351324909990258_ref22","first-page":"123","volume-title":"Actes de la Conf\u00e9rence sur le Traitement Automatique des Langues (TALN'07)","author":"Guimier de Neef","year":"2007"},{"key":"S1351324909990258_ref26","doi-asserted-by":"publisher","DOI":"10.3115\/1599081.1599137"},{"key":"S1351324909990258_ref9","first-page":"79","article-title":"A statistical approach to machine translation","volume":"16","author":"Brown","year":"1990","journal-title":"Computational Linguistics"},{"key":"S1351324909990258_ref1","doi-asserted-by":"publisher","DOI":"10.1142\/S0129054105003066"},{"key":"S1351324909990258_ref2","volume-title":"Parlez-vous texto? 
Guide des nouveaux langages du r\u00e9seau","author":"Anis","year":"2001"},{"key":"S1351324909990258_ref3","first-page":"33","volume-title":"Proceedings of COLING\/Association for Computational Linguistics","author":"Aw","year":"2006"},{"key":"S1351324909990258_ref4","first-page":"401","volume-title":"Proceedings of the International Conference on Spoken Language Processing (ICSLP)","author":"Bazzi","year":"2000"},{"key":"S1351324909990258_ref8","first-page":"286","volume-title":"Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics","author":"Brill","year":"2000"},{"key":"S1351324909990258_ref5","first-page":"55","volume-title":"Actes des Journ\u00e9es Internationales de l'Analyse des Donn\u00e9es Textuelles (JADT)","author":"Beaufort","year":"2008"},{"key":"S1351324909990258_ref7","first-page":"273","volume-title":"Proceedings of the 2nd Language Resources Engineering Conference (LREC)","author":"Boula de Mare\u00fcil","year":"2000"},{"key":"S1351324909990258_ref20","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007545901558"},{"key":"S1351324909990258_ref40","doi-asserted-by":"crossref","unstructured":"Papineni K. , Roukos S. , Ward T. , and Zhu W.-J. 2001. Bleu: a method for automatic evaluation of machine translation. Technical Report RC22176 (W0109-022), IBM Research Division, Thomas J. 
Watson Research Center.","DOI":"10.3115\/1073083.1073135"},{"key":"S1351324909990258_ref12","doi-asserted-by":"publisher","DOI":"10.1007\/BF01889984"},{"key":"S1351324909990258_ref13","volume-title":"Proceedings of Workshop on Shallow Processing of Large Corpora","author":"Clark","year":"2003"},{"key":"S1351324909990258_ref11","first-page":"63","volume-title":"Proceedings of the IJCAI Workshop on \u2018Analytics for Noisy Unstructured Text Data\u2019","author":"Choudhury","year":"2007"},{"key":"S1351324909990258_ref14","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9781139164771"},{"key":"S1351324909990258_ref17","volume-title":"Le langage SMS","author":"Fairon","year":"2006"},{"key":"S1351324909990258_ref27","first-page":"128","volume-title":"Actes de la Conf\u00e9rence sur le Traitement Automatique des Langues (TALN'08)","author":"Kobus","year":"2008"},{"key":"S1351324909990258_ref15","first-page":"495","article-title":"Algorithm for grapheme-to-phoneme translation for French and English: applications","volume":"23","author":"Divay","year":"1997","journal-title":"Computational Linguistics"},{"key":"S1351324909990258_ref16","first-page":"1","volume-title":"Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics","author":"Eisner","year":"2002"},{"key":"S1351324909990258_ref18","volume-title":"Proceedings of LREC 2006","author":"Fairon","year":"2006"},{"key":"S1351324909990258_ref24","doi-asserted-by":"publisher","DOI":"10.1016\/B978-0-08-051584-7.50045-0"},{"key":"S1351324909990258_ref32","doi-asserted-by":"publisher","DOI":"10.1145\/345508.345564"},{"key":"S1351324909990258_ref29","first-page":"127","volume-title":"Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics","author":"Koehn","year":"2003"},{"key":"S1351324909990258_ref31","first-page":"152","volume-title":"Proceedings of the 41st Annual Meeting on Association for Computational 
Linguistics","author":"Lita","year":"2003"},{"key":"S1351324909990258_ref35","first-page":"269","article-title":"Transducers in language and speech","volume":"23","author":"Mohri","year":"1997","journal-title":"Computational Linguistics"},{"key":"S1351324909990258_ref36","doi-asserted-by":"crossref","unstructured":"Mohri M. , Pereira F. , and Riley M. 2000. The design principles of a weighted finite-state transducer library. Theoretical Computer Science (231): 17\u201332.","DOI":"10.1016\/S0304-3975(99)00014-6"},{"key":"S1351324909990258_ref37","first-page":"231","volume-title":"Proceedings of the annual Meeting of the Association for Computational Linguistics","author":"Mohri","year":"1996"},{"key":"S1351324909990258_ref39","doi-asserted-by":"publisher","DOI":"10.1162\/089120103321337421"},{"key":"S1351324909990258_ref41","doi-asserted-by":"crossref","first-page":"1","DOI":"10.7551\/mitpress\/3007.001.0001","volume-title":"Finite State Natural Language Processing","author":"Roche","year":"1997"},{"key":"S1351324909990258_ref38","first-page":"160","volume-title":"Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics","author":"Och","year":"2003"},{"key":"S1351324909990258_ref43","doi-asserted-by":"publisher","DOI":"10.1006\/csla.2001.0169"},{"key":"S1351324909990258_ref45","first-page":"144","volume-title":"Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics","author":"Toutanova","year":"2002"},{"key":"S1351324909990258_ref46","first-page":"227","volume-title":"Compr\u00e9hension automatique des langues et interaction","author":"V\u00e9ronis","year":"2006"},{"key":"S1351324909990258_ref33","first-page":"4","article-title":"Spellchecking by computer","volume":"20","author":"Mitton","year":"1996","journal-title":"Journal of the Simplified Spelling Society"},{"key":"S1351324909990258_ref47","doi-asserted-by":"publisher","DOI":"10.1006\/csla.1998.0104"}],"container-title":["Natural Language 
Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324909990258","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,6,3]],"date-time":"2020-06-03T07:52:00Z","timestamp":1591170720000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324909990258\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,3,24]]},"references-count":48,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2010,4]]}},"alternative-id":["S1351324909990258"],"URL":"https:\/\/doi.org\/10.1017\/s1351324909990258","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"value":"1351-3249","type":"print"},{"value":"1469-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,3,24]]}}}