{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,6,3]],"date-time":"2022-06-03T08:25:34Z","timestamp":1654244734328},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2012,11,24]],"date-time":"2012-11-24T00:00:00Z","timestamp":1353715200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/2.0"},{"start":{"date-parts":[[2012,11,24]],"date-time":"2012-11-24T00:00:00Z","timestamp":1353715200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Braz Comput Soc"],"published-print":{"date-parts":[[2013,6]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>As in many other natural language processing (NLP) fields, the use of statistical methods is now part of mainstream natural language generation (NLG). In the development of systems of this kind, however, there is the issue of data sparseness, a problem that is particularly evident in the case of morphologically-rich languages such as Portuguese. This work presents a shallow surface realisation system that makes use of factored language models (FLMs) of Portuguese to overcome some of these difficulties. The system combines FLMs trained on a large corpus with a number of NLP resources that have been made publicly available by the Brazilian NLP research community in recent years, such as corpora, dictionaries, thesauri and others. 
Our FLM-based approach to surface realisation has been successfully applied to the generation of Brazilian newspapers headlines, and the results are shown to outperform a number of statistical and non-statistical baseline systems alike.<\/jats:p>","DOI":"10.1007\/s13173-012-0095-1","type":"journal-article","created":{"date-parts":[[2012,11,23]],"date-time":"2012-11-23T14:56:22Z","timestamp":1353682582000},"page":"135-146","update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["Portuguese text generation using factored language models"],"prefix":"10.1007","volume":"19","author":[{"given":"Eder Miranda","family":"de Novais","sequence":"first","affiliation":[]},{"given":"Ivandr\u00e9","family":"Paraboni","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,11,24]]},"reference":[{"key":"95_CR1","doi-asserted-by":"crossref","unstructured":"Reiter E (2007) An architecture for data-to-text systems. In: European natural language generation workshop (ENLG-2007), pp 97\u2013104","DOI":"10.3115\/1610163.1610180"},{"key":"95_CR2","unstructured":"Langkilde I (2000) Forest-based statistical sentence generation. In: Proceedings of ANLP-NAACL\u201900, pp 170\u2013177"},{"key":"95_CR3","doi-asserted-by":"crossref","unstructured":"Varges S (2006) Overgeneration and ranking for spoken dialogue systems. In: Proceedings of the 4th international natural language generation conference (INLG-2006), Sydney, Australia, pp 20\u201322","DOI":"10.3115\/1706269.1706275"},{"issue":"4","key":"95_CR4","doi-asserted-by":"publisher","first-page":"431","DOI":"10.1017\/S1351324907004664","volume":"14","author":"A Belz","year":"2008","unstructured":"Belz A (2008) Automatic generation of weather forecast texts using comprehensive probabilistic generation-space models. 
Nat Lang Eng 14(4):431\u2013455","journal-title":"Nat Lang Eng"},{"key":"95_CR5","doi-asserted-by":"crossref","unstructured":"DeVault D, Traum D, Arstein R (2008) Practical grammar-based NLG from examples. In: Proceedings of the 5th international natural language generation conference (INLG-2008), Columbus, USA, pp 77\u201385","DOI":"10.3115\/1708322.1708338"},{"key":"95_CR6","doi-asserted-by":"crossref","unstructured":"Novais EM, Paraboni I (2011) Highly-inflected language generation using factored language models. In: 12th International conference on intelligent text processing and computational linguistics (CICLing-2011). LNCS, vol 6608. Springer, Berlin-Heidelberg, pp 429\u2013438","DOI":"10.1007\/978-3-642-19400-9_34"},{"key":"95_CR7","unstructured":"Nunes MGV, Vieira FMC, Zavaglia C, Sossolote CRC, Hernandez J (1996) A constru\u00e7\u00e3o de um l\u00e9xico para o portugu\u00eas do Brasil: li\u00e7\u00f5es aprendidas e perspectivas. II PROPOR, pp 61\u201370"},{"key":"95_CR8","doi-asserted-by":"crossref","unstructured":"Bilmes J, Kirchhoff K (2003) Factored language models and generalized parallel backoff. In: Proceedings of HLT-NAACL-2003, vol 2, pp 4\u20136","DOI":"10.3115\/1073483.1073485"},{"key":"95_CR9","unstructured":"Muniz MCM (2004) A constru\u00e7\u00e3o de recursos lingu\u00edstico-computacionais para o portugu\u00eas do Brasil: o projeto de Unitex-PB. Msc. dissertation, ICMC\/USP"},{"key":"95_CR10","doi-asserted-by":"crossref","unstructured":"Maziero EG, Pardo TAS, di Felippo A, Dias-da-Silva BC (2008) A Base de Dados Lexical e a Interface Web do TeP 2.0\u2013Thesaurus Eletr\u00f4nico para o Portugu\u00eas do Brasil. VI Workshop on information and human language technology (TIL-2008), pp 390\u2013392","DOI":"10.1145\/1809980.1810076"},{"key":"95_CR11","unstructured":"Corston-Oliver S, Gamon M, Ringger E, Moore R (2002) An overview of Amalgam: a machine-learned generation module. 
In: Proceedings of the international natural language generation conference (INLG-2002), pp 33\u201340"},{"key":"95_CR12","unstructured":"Belz A, White M, Espinosa D, Kow E, Hogan D, Stent A (2011) The first surface realisation shared task: overview and evaluation results. In: Proceedings of the 13th European workshop on natural language generation, pp 217\u2013226"},{"key":"95_CR13","unstructured":"Belz A, Bohnet B, Mille S, Wanner L, White M (2012) The surface realisation task: recent developments and future plans. In: Proceedings of the 7th international natural language generation conference (INLG-2012), pp 136\u2013140"},{"key":"95_CR14","unstructured":"Bohnet B, Mille S, Favre B, Wanner L (2011) StuMaBa: from deep representation to surface. In: Proceedings of the 13th European workshop on natural language generation, pp 232\u2013235"},{"key":"95_CR15","unstructured":"Rajkumar R, Espinosa D, White M (2011) The OSU system for surface realization at generation challenges 2011. In: Proceedings of the 13th European workshop on natural language generation, pp 236\u2013238"},{"key":"95_CR16","unstructured":"Guo Y, Hogan D, van Genabith J (2011) DCU* at generation challenges 2011 surface realisation track. In: Proceedings of the 13th European workshop on natural language generation, pp 227\u2013229"},{"key":"95_CR17","unstructured":"Stent A (2011) ATT-0: submission to generation challenges 2011 surface realization shared task. In: Proceedings of the 13th European workshop on natural language generation, pp 230\u2013231"},{"key":"95_CR18","unstructured":"Gervas P (2011) UCM submission to the surface realization challenge. In: Proceedings of the 13th European workshop on natural language generation, pp 239\u2013241"},{"key":"95_CR19","doi-asserted-by":"crossref","unstructured":"Gatt A, Reiter E (2009) SimpleNLG: a realization engine for practical applications. 
In: European natural language generation workshop (ENLG-2009), pp 90\u201393","DOI":"10.3115\/1610195.1610208"},{"key":"95_CR20","unstructured":"Langkilde-Geary I (2002) An empirical verification of coverage and correctness for a general-purpose sentence generator. In: Proceedings of the international natural language generation conference (INLG-2002), pp 17\u201324"},{"key":"95_CR21","doi-asserted-by":"crossref","unstructured":"Novais E, Tadeu TD, Paraboni I (2010) Improved text generation using N-gram statistics. In: 12th Ibero-American conference on artificial intelligence (IBERAMIA-2010). LNAI, vol 6433, pp 316\u2013325. Springer, Berlin-Heidelberg","DOI":"10.1007\/978-3-642-16952-6_32"},{"key":"95_CR22","unstructured":"Novais EM, Paraboni I, da Silva Junior DFP (2012) Portuguese text generation from large corpora. In: 8th International conference on language resources and evaluation (LREC-2012), Istanbul, pp 4010\u20134014"},{"key":"95_CR23","unstructured":"Abreu SC, Carbonel TI, Coelho JCB, Fuchs JT, Rino LHM, Vieira R (2007) Summit: um corpus anotado com informa\u00e7\u00f5es discursivas visando \u00e0 sumariza\u00e7\u00e3o autom\u00e1tica. In: V Workshop on information and human language technology (TIL-2007), pp 1605\u20131610"},{"key":"95_CR24","doi-asserted-by":"crossref","unstructured":"Alu\u00edsio SM, Specia L, Pardo TAS, Maziero E, Fortes RPM (2008) Towards Brazilian Portuguese automatic text simplification systems. The ACM Symposium on Document Engineering, pp 240\u2013248","DOI":"10.1145\/1410140.1410191"},{"issue":"4","key":"95_CR25","doi-asserted-by":"publisher","first-page":"545","DOI":"10.1162\/089120102762671981","volume":"28","author":"E Reiter","year":"2002","unstructured":"Reiter E, Sripada S (2002) Human variation and lexical choice. Comput Linguist 28(4):545\u2013553","journal-title":"Comput Linguist"},{"key":"95_CR26","doi-asserted-by":"crossref","unstructured":"Bangalore S, Rambow O (2000) Corpus-based lexical choice in natural language generation. 
In: 38th Meeting of the ACL, Hong Kong, pp 464\u2013471","DOI":"10.3115\/1075218.1075277"},{"key":"95_CR27","volume-title":"Default reasoning and lexical organization","author":"B Carpenter","year":"1993","unstructured":"Carpenter B (1993) Skeptical and credulous unification with applications to lexical templates and inheritance. In: Briscoe T, Copestake A, de Paiva V (eds) Default reasoning and lexical organization. Cambridge University Press, Cambridge"},{"key":"95_CR28","doi-asserted-by":"crossref","unstructured":"Oh A, Rudnicky A (2000) Stochastic language generation for spoken dialogue systems. In: Proceedings of the ANLP-NAACL\u201900 workshop on conversational systems, pp 27\u201332","DOI":"10.3115\/1605285.1605291"},{"key":"95_CR29","unstructured":"Ratnaparkhi A (2000) Trainable methods for surface natural language generation. In: Proceedings of ANLP-NAACL\u201900, pp 194\u2013201"},{"key":"95_CR30","doi-asserted-by":"crossref","unstructured":"Malouf R (2000) The order of prenominal adjectives in natural language generation. In: Proceedings of ACL-2000, Hong Kong, pp 85\u201392","DOI":"10.3115\/1075218.1075230"},{"key":"95_CR31","doi-asserted-by":"crossref","unstructured":"Mitchell M (2009) Class-based ordering of prenominal modifiers. In: Proceedings of the 12th European workshop on natural language generation, Athens, pp 50\u201357","DOI":"10.3115\/1610195.1610203"},{"key":"95_CR32","unstructured":"White M, Rajkumar R, Martin S (2007) Towards broad coverage surface realization with CCG. In: MT Summit XI workshop using corpora for natural language generation: language generation and machine translation (UCNLG+MT), pp 22\u201330"},{"key":"95_CR33","first-page":"901","volume":"2","author":"A Stolcke","year":"2002","unstructured":"Stolcke A (2002) SRILM: an extensible language modeling toolkit. 
Int Conf Spoken Lang Process 2:901\u2013904","journal-title":"Int Conf Spoken Lang Process"},{"key":"95_CR34","doi-asserted-by":"crossref","unstructured":"da Silva Junior DFP, Paraboni I, Novais EM (2012) Um Sistema de Realiza\u00e7\u00e3o Superficial baseado em Regras para Gera\u00e7\u00e3o de Textos em Portugu\u00eas. USP-EACH technical report, pp 1\u201314","DOI":"10.22456\/2175-2745.26629"},{"key":"95_CR35","unstructured":"Papineni S, Roukos T, Ward W, Zhu W (2002) Bleu: a method for automatic evaluation of machine translation. In: ACL-2002, pp 311\u2013318"},{"key":"95_CR36","unstructured":"NIST (2002) Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. http:\/\/www.nist.gov\/speech\/tests\/mt\/doc\/ngram-study.pdf (2002)"},{"key":"95_CR37","doi-asserted-by":"crossref","unstructured":"Gatt A, Belz A (2010) Introducing shared tasks to NLG: the TUNA shared task evaluation challenges. In: Krahmer E, Theune M (eds) Empirical methods in natural language generation. LNAI, vol 5980, pp 264\u2013293","DOI":"10.1007\/978-3-642-15573-4_14"},{"issue":"45","key":"95_CR38","first-page":"48","volume":"14","author":"DJ Lucena","year":"2010","unstructured":"Lucena DJ, Pereira DB, Paraboni I (2010) From semantic properties to surface text: the generation of domain object descriptions. 
Inteligencia Artif 14(45):48\u201358","journal-title":"Inteligencia Artif"}],"container-title":["Journal of the Brazilian Computer Society"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13173-012-0095-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s13173-012-0095-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s13173-012-0095-1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13173-012-0095-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T20:18:58Z","timestamp":1630527538000},"score":1,"resource":{"primary":{"URL":"https:\/\/journal-bcs.springeropen.com\/articles\/10.1007\/s13173-012-0095-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,11,24]]},"references-count":38,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2013,6]]}},"alternative-id":["95"],"URL":"https:\/\/doi.org\/10.1007\/s13173-012-0095-1","relation":{},"ISSN":["0104-6500","1678-4804"],"issn-type":[{"value":"0104-6500","type":"print"},{"value":"1678-4804","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,11,24]]},"assertion":[{"value":"7 May 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 November 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 November 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}