{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,7]],"date-time":"2025-03-07T05:25:54Z","timestamp":1741325154145,"version":"3.38.0"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2011,7,6]],"date-time":"2011-07-06T00:00:00Z","timestamp":1309910400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Lang Resources & Evaluation"],"published-print":{"date-parts":[[2011,9]]},"DOI":"10.1007\/s10579-011-9155-y","type":"journal-article","created":{"date-parts":[[2011,7,5]],"date-time":"2011-07-05T01:04:53Z","timestamp":1309827893000},"page":"311-330","source":"Crossref","is-referenced-by-count":3,"title":["Expanding a multilingual media monitoring and information extraction tool to a new language: Swahili"],"prefix":"10.1007","volume":"45","author":[{"given":"Ralf","family":"Steinberger","sequence":"first","affiliation":[]},{"given":"Sylvia","family":"Ombuya","sequence":"additional","affiliation":[]},{"given":"Mijail","family":"Kabadjov","sequence":"additional","affiliation":[]},{"given":"Bruno","family":"Pouliquen","sequence":"additional","affiliation":[]},{"given":"Leo","family":"Della Rocca","sequence":"additional","affiliation":[]},{"given":"Jenya","family":"Belyaeva","sequence":"additional","affiliation":[]},{"given":"Monica","family":"de Paola","sequence":"additional","affiliation":[]},{"given":"Camelia","family":"Ignat","sequence":"additional","affiliation":[]},{"given":"Erik","family":"van der Goot","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,7,6]]},"reference":[{"key":"9155_CR1","unstructured":"Bering, C., Dro\u017cd\u017cy\u0144ski, W., Erbach, G., Guasch, L., Homola, P., Lehmann, S., et al. (2003). 
Corpora and evaluation tools for multilingual named entity grammar development. In Proceedings of the multilingual corpora workshop at corpus linguistics (pp. 42\u201352). Lancaster, UK."},{"key":"9155_CR2","doi-asserted-by":"crossref","unstructured":"Carenini, M., Whyte, A., Bertorello, L., & Vanocchi, M. (2007). Improving communication in E-democracy using natural language processing. In IEEE Intelligent Systems, 22(1), 20\u201327.","DOI":"10.1109\/MIS.2007.11"},{"key":"9155_CR3","first-page":"303","volume":"18","author":"G De Pauw","year":"2008","unstructured":"De Pauw, G., & de Schryver, G.-M. (2008). Improving the computational morphological analysis of a Swahili corpus for lexicographic purposes. Lexikos, 18, 303\u2013318.","journal-title":"Lexikos"},{"key":"9155_CR4","unstructured":"De Pauw, G., de Schryver, G.-M., & Wagacha, P. W. (2006). Data-driven part-of-speech tagging of Kiswahili. In Text, speech and dialogue (Vol. 4188, pp. 197\u2013204). Berlin: Springer."},{"key":"9155_CR5","doi-asserted-by":"crossref","first-page":"340","DOI":"10.4314\/lex.v19i1.49134","volume":"19","author":"G De Pauw","year":"2009","unstructured":"De Pauw, G., de Schryver, G.-M., & Wagacha, P. W. (2009). A corpus-based survey of four electronic Swahili\u2013English bilingual dictionaries. Lexikos, 19, 340\u2013352.","journal-title":"Lexikos"},{"key":"9155_CR6","unstructured":"De Pauw, G., Wagacha, P., & de Schryver, G.-M. (2011). Exploring the SAWA corpus\u2014Collection and deployment of a parallel corpus English\u2014Swahili. Language Resources and Evaluation Journal. Special Issue on African Language Technology, Springer."},{"key":"9155_CR7","unstructured":"Gamon, M., Lozano, C., Pinkham, J., & Reutter, T. (1997). Practical experience with grammar sharing in multilingual NLP. In Proceedings of ACL\/EACL, Madrid, Spain, pp. 49\u201356."},{"key":"9155_CR8","unstructured":"Ignat, C., Pouliquen, B., Ribeiro, A., & Steinberger, R. (2003). 
Extending an information extraction tool set to central and eastern European languages. In Proceedings of the workshop information extraction for slavonic and other central and eastern European languages (IESL\u20192003) (pp. 33\u201339). Borovets, Bulgaria, 8\u20139 Sep 2003."},{"key":"9155_CR9","unstructured":"Landauer, T., & Littman, M. (1991). A statistical method for language-independent representation of the topical content of text segments. In 11th International conference expert systems and their applications (Vol. 8, pp. 77\u201385), Avignon, France."},{"key":"9155_CR10","unstructured":"Leek, T., Jin, H., Sista, S., & Schwartz, R. (1999). The BBN crosslingual topic detection and tracking system. In 1999 TDT evaluation system summary papers (pp. 214\u2013221). Vienna, VA, USA."},{"key":"9155_CR11","unstructured":"Manny, R., & Bouillon, P. (1996). Adapting the core language engine to French and Spanish. In Proceedings of the international conference NLP+IA (pp. 224\u2013232). Moncton, Canada."},{"key":"9155_CR12","doi-asserted-by":"crossref","unstructured":"Maynard, D., Tablan, V., Cunningham, H., Ursu, C., Saggion, H., Bontcheva, K., & Wilks, Y. (2002). Architectural elements of language engineering robustness. Natural Language Engineering, 8(3), 257\u2013274. Special Issue on Robust Methods in Analysis of Natural Language Data.","DOI":"10.1017\/S1351324902002930"},{"key":"9155_CR13","unstructured":"Ng\u2019ang\u2019a, W. (2005). Word sense disambiguation of Swahili: Extending Swahili language technology with machine learning. Ph.D. thesis, Helsinki University."},{"key":"9155_CR14","doi-asserted-by":"crossref","unstructured":"Och, F., & Ney, H. (2003). A systematic comparison of various statistical alignment models. Computational Linguistics, 29(1), 19\u201351.","DOI":"10.1162\/089120103321337421"},{"key":"9155_CR15","unstructured":"Pastra, K., Maynard, D., Hamza, O., Cunningham, H., & Wilks, Y. (2002). 
How feasible is the reuse of grammars for Named Entity Recognition? In Proceedings of LREC (pp. 412\u20131418). Las Palmas, Spain."},{"key":"9155_CR16","unstructured":"Pouliquen, B., Kimler, M., Steinberger, R., Ignat, C., Oellinger, T., Blackler, K., et al. (2006). Geocoding multilingual texts: Recognition, disambiguation and visualisation. In Proceedings of LREC\u20192006, (pp. 53\u201358). Genoa, Italy, 24\u201326 May 2006."},{"key":"9155_CR17","unstructured":"Pouliquen, B., & Steinberger, R. (2009). Automatic construction of multilingual name dictionaries. In C. Goutte, N. Cancedda, M. Dymetman & G. Foster (Eds.), Learning machine translation (pp. 59\u201378). Cambridge: MIT Press\u2014Advances in Neural Information Processing Systems Series (NIPS)."},{"key":"9155_CR18","unstructured":"Pouliquen, B., Steinberger, R., & Best, C. (2007). Automatic detection of quotations in multilingual news. In Proceedings of the international conference recent advances in natural language processing (RANLP\u20192007) (pp. 487\u2013492). Borovets, Bulgaria, 27\u201329.09.2007."},{"key":"9155_CR19","unstructured":"Shah, R., Lin, B., Gershman, A., & Frederking, R. (2010). SYNERGY: A named entity recognition system for resource-scarce languages such as Swahili using online machine translation. In Proceedings of the second workshop on African language technology (AfLAT), Malta, 9 July 2010."},{"key":"9155_CR20","unstructured":"Sproat, R., Roth, D., Zhai, C., Benmamoun, E., Fister, A., Karlinsky, N., et al. (2005). Named entity recognition and transliteration for 50 languages. Keynote address at the second midwest computational linguistics colloquium, 14\u201315 May 2010, The Ohio State University."},{"key":"9155_CR21","unstructured":"Steinberger, R. (2011). A survey of methods to ease the development of highly multilingual text mining applications. 
Language Resources and Evaluation Journal, Special issue on LREC\u20192010."},{"key":"9155_CR22","doi-asserted-by":"crossref","unstructured":"Steinberger, R., Fuart, F., van der Goot, E., Best, C., von Etter, P., & Yangarber, R. (2008b). Text mining from the web for medical intelligence. In F. Fogelman-Souli\u00e9, D. Perrotta, J. Piskorski, & R. Steinberger (Eds.), Mining massive data sets for security (pp. 295\u2013310). Amsterdam, The Netherlands: IOS Press.","DOI":"10.3233\/978-1-58603-898-4-295"},{"key":"9155_CR23","doi-asserted-by":"crossref","unstructured":"Steinberger, R., Pouliquen, B., & Ignat, C. (2008a). Using language-independent rules to achieve high multilinguality in text mining. In F. Fogelman-Souli\u00e9, D. Perrotta, J. Piskorski, & R. Steinberger (Eds.), Mining massive data sets for security (pp. 217\u2013240). Amsterdam, The Netherlands: IOS Press.","DOI":"10.3233\/978-1-58603-898-4-217"},{"key":"9155_CR24","unstructured":"Steinberger, R., Pouliquen, B., & van der Goot, E. (2009). An Introduction to the Europe media monitor family of applications. In F. Gey, N. Kando, & J. Karlgren (Eds.), Information access in a multilingual world. Proceedings of SIGIR-CLIR (pp. 1\u20138). Boston, USA. 23 July 2009."},{"key":"9155_CR25","first-page":"1473","volume":"15","author":"A Vinokourov","year":"2002","unstructured":"Vinokourov, A., Shawe-Taylor, J., & Cristianini, N. (2002). Inferring a semantic representation of text via cross-language correlation analysis. Advances of Neural Information Processing Systems, 15, 1473\u20131480.","journal-title":"Advances of Neural Information Processing Systems"},{"key":"9155_CR26","unstructured":"Wactlar, H. (1999). New directions in video information extraction and summarization. In Proceedings of the 10th DELOS workshop (pp. 1\u201310). Santorini, Greece."},{"key":"9155_CR27","unstructured":"Wentland, W., Knopp, J., Silberer, C., & Hartung, M. (2008). 
Building a multilingual lexical resource for named entity disambiguation, translation and transliteration. In Proceedings of LREC (pp. 3230\u20133237). Genoa, Italy."},{"key":"9155_CR28","doi-asserted-by":"crossref","unstructured":"Yarowsky, D., Ngai, G., & Wicentowski, R. (2001). Inducing multilingual text analysis tools via robust projection across aligned corpora. In Proceedings of the 1st international conference on Human Language Technology research (HLT) (pp. 1\u20138). Stroudsburg, PA, USA.","DOI":"10.3115\/1072133.1072187"}],"container-title":["Language Resources and Evaluation"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-011-9155-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10579-011-9155-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-011-9155-y","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,6]],"date-time":"2025-03-06T23:01:26Z","timestamp":1741302086000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10579-011-9155-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,7,6]]},"references-count":28,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2011,9]]}},"alternative-id":["9155"],"URL":"https:\/\/doi.org\/10.1007\/s10579-011-9155-y","relation":{},"ISSN":["1574-020X","1574-0218"],"issn-type":[{"type":"print","value":"1574-020X"},{"type":"electronic","value":"1574-0218"}],"subject":[],"published":{"date-parts":[[2011,7,6]]}}}