{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T11:23:31Z","timestamp":1772969011888,"version":"3.50.1"},"reference-count":36,"publisher":"Oxford University Press (OUP)","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Digital Scholarship Humanities"],"DOI":"10.1093\/llc\/fqw049","type":"journal-article","created":{"date-parts":[[2016,11,19]],"date-time":"2016-11-19T02:20:56Z","timestamp":1479522056000},"page":"fqw049","source":"Crossref","is-referenced-by-count":1,"title":["A digital corpus resource of authentic anonymized French text messages: 88milSMS\u2014What about transcoding and linguistic annotation?"],"prefix":"10.1093","author":[{"given":"Rachel","family":"Panckhurst","sequence":"first","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2016,11,17]]},"reference":[{"key":"2016111818202994000_fqw049v1.1","doi-asserted-by":"crossref","unstructured":"Accorsi P. Patel N. Lopez C. Panckhurst R. Roche M. (2014). Seek&Hide: anonymising a French SMS corpus using natural language processing techniques, In Cougnon L.-A. Fairon C. (eds), SMS Communication. A Linguistic Approach. Amsterdam\/Philadelphia: John Benjamins, pp. 11\u201328.","DOI":"10.1075\/bct.61.03acc"},{"key":"2016111818202994000_fqw049v1.2","unstructured":"Antoniadis G. Chabert G. Zampa V. (2011). Alpes4science: Constitution d\u2019un corpus de SMS r\u00e9els en France m\u00e9tropolitaine, talk, May 9\u201310, Sherbrooke: 79th Acfas colloquium."},{"key":"2016111818202994000_fqw049v1.3","doi-asserted-by":"crossref","unstructured":"Aw A. Zhang M. Xiao J. Su J. (2006). A phrase-based statistical model for SMS text normalization. In Proceedings of the COLING\/ACL on Main conference poster sessions, Sydney, pp 33\u201340. Association for Computational Linguistics. http:\/\/anthology.aclweb.org\/P\/P06\/P06-2.pdf#page=43.","DOI":"10.3115\/1273073.1273078"},{"key":"2016111818202994000_fqw049v1.4","first-page":"155","article-title":"D\u00e9finition d'un syst\u00e8me d'alignement SMS\/fran\u00e7ais standard \u00e0 l'aide d'un filtre de composition","volume":"2008","author":"Beaufort","year":"2008","journal-title":"Proceedings, JADT"},{"key":"2016111818202994000_fqw049v1.5","unstructured":"Beaufort R. Roekhaut S. Cougnon L.-A. Fairon C. (2010). A hybrid rule\/model-based finite-state framework for normalizing SMS messages, In Haji\u010d Jan (eds), Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Uppsala: Sweden, July 11\u201316, 2010. \u00a9 2010 Association for Computational Linguistics, pp. 770\u2013779."},{"issue":"2","key":"2016111818202994000_fqw049v1.6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.21248\/jlcl.29.2014.187","article-title":"The CoMeRe corpus for French: structuring and annotating heterogeneous CMC genres, Special issue on Building And Annotating Corpora Of Computer-Mediated Discourse: Issues and Challenges at the Interface of Corpus and Computational Linguistics","volume":"29","author":"Chanier","year":"2014","journal-title":"JLCL (Journal of Language Technology and Computational Linguistics)"},{"issue":"94","key":"2016111818202994000_fqw049v1.7","first-page":"39","article-title":"C\u2019est \u00e9crire comme je parle. Une \u00e9tude comparatiste de vari\u00e9t\u00e9s de fran\u00e7ais dans l\u2019\u00e9crit sms","volume":"2","author":"Cougnon","year":"2010","journal-title":"Les voix des Fran\u00e7ais. Modern French Identities"},{"key":"2016111818202994000_fqw049v1.8","unstructured":"Cougnon L.-A. Fairon C. (eds) (2014). SMS Communication. A linguistic Approach. Amsterdam\/Philadelphia: John Benjamins."},{"key":"2016111818202994000_fqw049v1.9","unstructured":"Cougnon L.-A. (2015). Langage et sms. Une \u00e9tude internationale des pratiques actuelles. Louvain-la-Neuve: Presses universitaires de Louvain."},{"key":"2016111818202994000_fqw049v1.10","unstructured":"Drouin P. Guilbault C. (2016). De \u201cViens watcher la partie avec moi\u201d \u00e0 \u201cCome regarder the game with me\u201d. Louvain-la-Neuve, Belgium. Abstracts, PLIN 2016, 12 May, http:\/\/www.plindayucl.com."},{"key":"2016111818202994000_fqw049v1.11","doi-asserted-by":"crossref","unstructured":"D\u00fcrscheid C. Stark E. (2011). SMS4science: an international corpus-based texting project and the specific challenges for multilingual Switzerland. In: Thurlow C. Mroczek K. (eds), Digital Discourse, Language in the New Media. Oxford: Oxford University Press, pp 299\u2013320.","DOI":"10.1093\/acprof:oso\/9780199795437.003.0014"},{"key":"2016111818202994000_fqw049v1.12","unstructured":"Fairon C. Klein J.-R. Paumier S. (2006). SMS pour la science. Corpus de 30.000 SMS et logiciel de consultation. Louvain-la-Neuve: Presses universitaires de Louvain. Manuel+CD-Rom, http:\/\/www.smspourlascience.be\/."},{"key":"2016111818202994000_fqw049v1.13","unstructured":"Goumi A. Bernicot J. (2011). Un corpus de SMS produits par de jeunes adolescents: m\u00e9thode de recueil et premi\u00e8res donn\u00e9es, invited seminar, sud4science LR project, MSH-M, 15\/3\/2011."},{"key":"2016111818202994000_fqw049v1.14","unstructured":"Guilbault C. Drouin P. (2016). Pratiques li\u00e9es aux alternances de code dans un corpus anglais et fran\u00e7ais au Canada, Talk, Cercle linguistique Belge, 13 May 2016, Louvain-la-Neuve, Belgium."},{"key":"2016111818202994000_fqw049v1.15","unstructured":"Guimier de Neef \u00c9. Fessard S. (2007). \u00c9valuation d\u2019un syst\u00e8me de transcription de SMS. In Proceedings of the 26th International Conference on Lexis and Grammar, Bonifacio, France, October 2-6, 2007."},{"key":"2016111818202994000_fqw049v1.16","unstructured":"Ide N. Pustejovsky J. (eds) (forthcoming). Handbook of Linguistic Annotation. Berlin: Springer."},{"key":"2016111818202994000_fqw049v1.17","unstructured":"Kobus C. Yvon F. Damnati G. (2008). Transcrire les SMS comme on reconna\u00eet la parole. In Proceedings, TALN 2008, Avignon, pp. 128\u201338. https:\/\/perso.limsi.fr\/yvon\/publications\/sources\/Kobus08transcrire.pdf (accessed 4 September 2016)."},{"key":"2016111818202994000_fqw049v1.18","doi-asserted-by":"crossref","unstructured":"Langlais P. Drouin P. Paulus A. Rompr\u00e9 Brodeur E. Cottin F. (2012). Texto4Science: a Quebec French Database of Annotated Short Text Messages. Proceedings, LREC, Istanbul, pp.1047\u201354.","DOI":"10.1075\/li.35.2.07lan"},{"key":"2016111818202994000_fqw049v1.19","unstructured":"Lopez C. Bestandji R. Roche M. Panckhurst R. (2014). Towards Electronic SMS Dictionary Construction: an Alignment-based Approach. Proceedings, LREC (Language Resources and Evaluation Conference), Reykjavik, Iceland, May 26-31, pp 2833\u20138."},{"key":"2016111818202994000_fqw049v1.20","doi-asserted-by":"crossref","first-page":"71","DOI":"10.26034\/tranel.2015.2970","article-title":"Classification des items inconnus de 88milSMS: aide \u00e0 l'identification automatique de la cr\u00e9ativit\u00e9 scripturale","volume":"63","author":"Lopez","year":"2015","journal-title":"Travaux neuch\u00e2telois de linguistique (revue TRANEL)"},{"key":"2016111818202994000_fqw049v1.21","unstructured":"Lopez C. Roche M. Panckhurst R. (2016), Non-standard texts: from theoretical positions to Natural Language Processing normalisation. Louvain-la-Neuve, Belgium, Abstracts, PLIN 2016, 12 May, http:\/\/www.plindayucl.com."},{"key":"2016111818202994000_fqw049v1.22","unstructured":"McEnery T. Hardie A. (2012). Corpus Linguistics: Method, theory and practice. Cambridge: Cambridge University Press."},{"key":"2016111818202994000_fqw049v1.23","unstructured":"Mo\u00efse C. (2013). Lol non tkt on ta pas oubli\u00e9. Rapports \u00e0 la norme et valeurs de la faute dans l\u2019\u00e9criture Sms (projet et corpus Sud4science). R\u00e9flexions sociolinguistiques, Plenary, Colloquium Si j'aurais su, j'aurais pas venu! Linguistique des formes exclues: description, genre, \u00e9pist\u00e9mologie, Universit\u00e9 Libre de Bruxelles, Montpellier for Panckhurst, June 20-22."},{"key":"2016111818202994000_fqw049v1.24","unstructured":"Panckhurst R. (2009). Short Message Service (SMS): typologie et probl\u00e9matiques futures. In Arnavielle T. (coord.), Polyphonies, pour Michelle Lanvin. Universit\u00e9 Paul-Val\u00e9ry Montpellier 3, Montpellier, pp. 33\u201352."},{"key":"2016111818202994000_fqw049v1.25","unstructured":"Panckhurst R. D\u00e9trie C. Lopez C. Mo\u00efse C. Roche M. Verine B. (2013). Sud4science, de l'acquisition d'un grand corpus de SMS en fran\u00e7ais \u00e0 l'analyse de l'\u00e9criture SMS. \u00c9pist\u00e9m\u00e8 \u2013 revue internationale de sciences sociales appliqu\u00e9es, 9, Des usages num\u00e9riques aux pratiques scripturales \u00e9lectroniques, University Korea \u2013 Centre for Applied Cultural Studies, Seoul, pp. 107\u201338."},{"key":"2016111818202994000_fqw049v1.26","unstructured":"Panckhurst R. D\u00e9trie C. Lopez C. Mo\u00efse C. Roche M. Verine B. (2014). 88milSMS. A corpus of authentic text messages in French. Produced by the University Paul-Val\u00e9ry Montpellier and the CNRS, in collaboration with the Catholic University of Louvain, funded with support from the MSH-M and the Ministry of Culture (General Delegation for the French language and the languages of France) and with the financial participation of Praxiling, Lirmm, Lidilem, Tetis, Viseo. http:\/\/88milsms.huma-num.fr\/ ISLRN: 024-713-187-947-8."},{"key":"2016111818202994000_fqw049v1.27","doi-asserted-by":"crossref","unstructured":"Panckhurst R. Mo\u00efse C. (2014). French text messages. From SMS data collection to preliminary analysis. In Cougnon L.-A. Fairon C. (eds), SMS Communication. A Linguistic Approach. Amsterdam\/Philadelphia: John Benjamins, University Korea - Centre for Applied Cultural Studies, Seoul, pp. 141\u201368.","DOI":"10.1075\/bct.61.09pan"},{"key":"2016111818202994000_fqw049v1.28","unstructured":"Panckhurst R. Roche M. Lopez C. (2015). Donn\u00e9es authentiques: un grand corpus de SMS en fran\u00e7ais, Abstracts, SHESL-HTL 2015 colloquium, Corpus et constitution des savoirs linguistiques, Paris, 30-31 January 2015, pp. 33\u201335."},{"key":"2016111818202994000_fqw049v1.29","unstructured":"Panckhurst R. D\u00e9trie C. Lopez C. Mo\u00efse C. Roche M. Verine B. (2016). \u201c88milSMS. A corpus of authentic text messages in French\u201d (new version of the ISLRN 024-713-187-947-8 corpus). In Chanier T. (ed), CoMeRe corpora repository. Nancy: Ortolang. [cmr-88milsms-tei-v1; https:\/\/hdl.handle.net\/11403\/comere\/cmr-88milsms\/cmr-88milsms-tei-v1]."},{"key":"2016111818202994000_fqw049v1.30","doi-asserted-by":"crossref","unstructured":"Panckhurst R. Roche M. Lopez C. Verine B. D\u00e9trie C. Mo\u00efse C. (forthcoming), De la collecte \u00e0 l\u2019analyse d\u2019un corpus de SMS authentiques: une d\u00e9marche pluridisciplinaire, H.E.L., 38(2). http:\/\/www.hel-journal.org\/fr\/.","DOI":"10.1051\/hel\/2016380206"},{"key":"2016111818202994000_fqw049v1.31","doi-asserted-by":"crossref","unstructured":"Patel N. Accorsi P. Inkpen D. Lopez C. Roche M. (2013). Approaches of anonymisation of an SMS corpus. In Proceedings of CICLING (Conference on Intelligent Text Processing and Computational Linguistics), LNCS, Springer Verlag, March 24\u201330, 2013, University of the Aegean, Samos, Greece, pp. 77\u201388.","DOI":"10.1007\/978-3-642-37247-6_7"},{"key":"2016111818202994000_fqw049v1.32","unstructured":"Petit Robert de la langue fran\u00e7aise (2014). Digital Multisupport Version of the Dictionary."},{"key":"2016111818202994000_fqw049v1.33","unstructured":"Riou S. Sagot B. (2016). \u00c9tiquetage morpho-syntaxique du corpus FAVI [corpus]. D'apr\u00e8s Yun, H. and Chanier, T. (2014). Corpus d'apprentissage FAVI (Fran\u00e7ais acad\u00e9mique virtuel international) [cmr-favi-tei-v1]. Banque de corpus CoMeRe. Ortolang.fr: Nancy. [https:\/\/hdl.handle.net\/11403\/comere\/cmr-favi\/cmr-favi-tei-v2]."},{"key":"2016111818202994000_fqw049v1.34","unstructured":"Roche M. Verine B. Lopez C. Panckhurst R. (forthcoming). La n\u00e9ographie dans un grand corpus de SMS fran\u00e7ais: 88milSMS. Proceedings, CINEO 2015. Salamanca."},{"key":"2016111818202994000_fqw049v1.35","unstructured":"Sagot B. (2010). The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French. In Proceedings, LREC, 2010, Valletta, Malta. http:\/\/hal.inria.fr\/inria-00521242\/; Project Alexina: https:\/\/gforge.inria.fr\/projects\/alexina\/."},{"key":"2016111818202994000_fqw049v1.36","doi-asserted-by":"crossref","unstructured":"Vilari\u00f1o D. Pinto D. Beltr\u00e1n B. Le\u00f3n S. Castillo E. Tovar M. (2012). A machine-translation method for normalization of SMS. In Pattern Recognition (pp. 293\u2013302). Berlin\/Heidelberg: Springer. http:\/\/www.cs.buap.mx\/~dpinto\/research\/MCPR2012\/MCPR2012_Vilarino.pdf.","DOI":"10.1007\/978-3-642-31149-9_30"}],"container-title":["Digital Scholarship in the Humanities"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/dsh\/article-pdf\/32\/suppl_1\/i92\/17751451\/fqw049.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,21]],"date-time":"2023-08-21T00:03:25Z","timestamp":1692576205000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/dsh\/article-lookup\/doi\/10.1093\/llc\/fqw049"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,11,17]]},"references-count":36,"alternative-id":["10.1093\/llc\/fqw049"],"URL":"https:\/\/doi.org\/10.1093\/llc\/fqw049","relation":{},"ISSN":["2055-7671","2055-768X"],"issn-type":[{"value":"2055-7671","type":"print"},{"value":"2055-768X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,11,17]]}}}