{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,5]],"date-time":"2026-05-05T17:16:10Z","timestamp":1778001370342,"version":"3.51.4"},"reference-count":51,"publisher":"Cambridge University Press (CUP)","issue":"3","license":[{"start":{"date-parts":[[2022,8,24]],"date-time":"2022-08-24T00:00:00Z","timestamp":1661299200000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":["cambridge.org"],"crossmark-restriction":true},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2023,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Text-to-scene conversion systems map natural language text to formal representations required for visual scenes. The difficulty involved in this mapping is one of the most critical challenges for developing these systems. The current study mapped Persian natural language text as the headmost system to a conceptual scene model. This conceptual scene model is an intermediate semantic representation between natural language and the visual scene and contains descriptions of visual elements of the scene. It will be used to produce meaningful animation based on an input story in this ongoing study. The mapping task was modeled as a sequential labeling problem, and a conditional random field (CRF) model was trained and tested for sequential labeling of scene model elements. To the best of the authors\u2019 knowledge, no dataset for this task exists; thus, the required dataset was collected for this task. The lack of required off-the-shelf natural language processing modules and a significant error rate in the available corpora were important challenges to dataset collection. Some features of the dataset were manually annotated. The results were evaluated using standard text classification metrics, and an average accuracy of 85.7% was obtained, which is satisfactory.<\/jats:p>","DOI":"10.1017\/s1351324922000390","type":"journal-article","created":{"date-parts":[[2022,8,24]],"date-time":"2022-08-24T08:20:12Z","timestamp":1661329212000},"page":"693-719","update-policy":"https:\/\/doi.org\/10.1017\/policypage","source":"Crossref","is-referenced-by-count":3,"title":["Recognition of visual scene elements from a story text in Persian natural language"],"prefix":"10.1017","volume":"29","author":[{"given":"Mojdeh","family":"Hashemi-Namin","sequence":"first","affiliation":[]},{"given":"Mohammad Reza","family":"Jahed-Motlagh","sequence":"additional","affiliation":[]},{"given":"Adel","family":"Torkaman Rahmani","sequence":"additional","affiliation":[]}],"member":"56","published-online":{"date-parts":[[2022,8,24]]},"reference":[{"key":"S1351324922000390_ref6","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1217"},{"key":"S1351324922000390_ref9","doi-asserted-by":"publisher","DOI":"10.1145\/383259.383316"},{"key":"S1351324922000390_ref45","doi-asserted-by":"publisher","DOI":"10.3115\/1596324.1596352"},{"key":"S1351324922000390_ref47","volume-title":"Verb Capacity and Fundamental Structure of Sentence in Current Persian","author":"Tabibzadeh","year":"2006"},{"key":"S1351324922000390_ref15","first-page":"103","article-title":"A method for automatically creating 3D animated scenes from annotated fiction text","volume":"4","author":"Glass","year":"2009","journal-title":"International Journal on Computer Science and Information System"},{"key":"S1351324922000390_ref51","doi-asserted-by":"crossref","unstructured":"Zeng, X. , Tan, M.-l. and Ren, S. (2016). The implementation of graphic constraints for automatic text to scene conversion. In International Conference on Artificial Intelligence and Computer Science, AICS 2016, Guilin, China. World Scientific Pubilshing Company, pp. 364\u2013367.","DOI":"10.12783\/dtcse\/aics2016\/8224"},{"key":"S1351324922000390_ref21","doi-asserted-by":"publisher","DOI":"10.1109\/I2CT.2018.8529491"},{"key":"S1351324922000390_ref25","unstructured":"Lafferty, J. , McCallum, A. and Pereira, F.C. (2001). Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In International Conference on Machine Learning (ICML), MA, USA. Morgan Kaufmann, pp. 282\u2013289."},{"key":"S1351324922000390_ref42","unstructured":"Shamsfard, M. (2011). Challenges and open problems in Persian text processing. In 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics. Lecture Notes in Artificial Intelligence, vol. 8387. Poznan, Poland: Springer, pp. 65\u201369."},{"key":"S1351324922000390_ref43","unstructured":"Shamsfard, M. , Hesabi, A. , Fadaei, H. , Mansoory, N. , Famian, A. , Bagherbeigi, S. , Fekri, E. , Monshizadeh, M. and Assi, S.M. (2010a). Semi automatic development of farsnet; the persian wordnet. In Proceedings of 5th Global WordNet Conference, GWA2010, vol. 29, Mumbai, India. Indian Institute of Technology."},{"key":"S1351324922000390_ref27","first-page":"321","article-title":"From story to animation\u2013full life cycle computer aided animation generation","volume":"28","author":"Lu","year":"2002","journal-title":"Acta Automatica Sinica"},{"key":"S1351324922000390_ref32","unstructured":"Nazari, M. (2006). Film production and play."},{"key":"S1351324922000390_ref33","unstructured":"Okazaki, N. (2007). CRFsuite: A fast implementation of Conditional Random Fields (CRFs)."},{"key":"S1351324922000390_ref44","doi-asserted-by":"crossref","unstructured":"Shamsfard, M. , Jafari, H.S. and Ilbeygi, M. (2010b). STeP-1: A set of fundamental tools for Persian text processing. In 7th Language Resources and Evaluation Conference, LREC 2010, Valletta, Malta. European Language Resources Association, pp. 859\u2013865.","DOI":"10.1109\/NLPKE.2009.5313844"},{"key":"S1351324922000390_ref1","doi-asserted-by":"crossref","unstructured":"Adorni, G. , Di Manzo, M. and Giunchiglia, F. (1984). Natural language driven image generation. In Proceedings of the 10th International Conference on Computational Linguistics, COLING 1984, Stroudsburg, PA, USA. Association for Computational Linguistics, pp. 495\u2013500.","DOI":"10.3115\/980431.980597"},{"key":"S1351324922000390_ref49","unstructured":"Ustalov, D. and Kudryavtsev, A. (2012). An ontology-based approach to text-to-picture synthesis systems. In Proceedings of the Second International Workshop on Concept Discovery in Unstructured Data (CDUD 2012) In Conjunction with the Tenth International Conference on Formal Concept Analysis (ICFCA 2012), vol. 871, Leuven, Belgium. Katholieke Universiteit Leuven, pp. 94\u2013101."},{"key":"S1351324922000390_ref3","unstructured":"Arian, N. and Sabbagh, M. (2017). Semantic labeling of sentences in Persian language with supervised method. In Proceedings of the 22nd National CSI Computer Conference, CSICC 2017, Tehran, Iran. Computer Society of Iran, pp. 1\u20138."},{"key":"S1351324922000390_ref7","first-page":"17","volume-title":"Workshop on Semantic Parsing","author":"Chang","year":"2014"},{"key":"S1351324922000390_ref31","doi-asserted-by":"publisher","DOI":"10.1145\/219717.219748"},{"key":"S1351324922000390_ref10","first-page":"111","article-title":"Frame semantics","author":"Fillmore","year":"1982","journal-title":"Linguistics in the Morning Calm."},{"key":"S1351324922000390_ref48","unstructured":"Takahashi, N. , Ramamonjisoa, D. and Ogata, T. (2007). A tool for supporting an animated movie making based on writing stories in xml. In Proceedings of IADIS International Conference Applied Computing, Salamanca, Spain. International Association for Development of the Information Society, pp. 405\u2013409."},{"key":"S1351324922000390_ref41","volume-title":"FrameNet II: Extended Theory and Practice","author":"Ruppenhofer","year":"2016"},{"key":"S1351324922000390_ref12","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00057"},{"key":"S1351324922000390_ref13","volume-title":"The WEKA Workbench. Online Appendix for \u201cData Mining: Practical Machine Learning Tools and Techniques\u201d","author":"Frank","year":"2016"},{"key":"S1351324922000390_ref50","doi-asserted-by":"crossref","first-page":"3023","DOI":"10.30534\/ijatcse\/2020\/81932020","article-title":"Generating animations from instructional text","volume":"9","author":"Yadav","year":"2020","journal-title":"International Journal of Advanced Trends in Computer Science and Engineering"},{"key":"S1351324922000390_ref4","unstructured":"Chang, A.X. , Eric, M. , Savva, M. and Manning, C.D. (2017). SceneSeer: 3D Scene Design with Natural Language. CoRR, pp. 1\u201310."},{"key":"S1351324922000390_ref39","unstructured":"Qur\u2019anic Question and Answer Project (2014b). Syntactic labeling manual of style on the basis of dependency grammar in Persian. Technical report, Iran Telecommunication Research Center, Tehran, Iran."},{"key":"S1351324922000390_ref8","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-15384-6_40"},{"key":"S1351324922000390_ref29","unstructured":"Mesgar, M. , Hajizade, M. , Darrudi, E. , Farhoodi, M. , Mohamadzade, M. , Alavi, T. , Davoudi, M. , Sarabi, Z. and Khalash, M. (2014). Semantic role labeling of Persian language based on dependency tree. Technical report, Iran Telecommunication Research Center, Tehran, Iran. sent to get published."},{"key":"S1351324922000390_ref22","doi-asserted-by":"crossref","unstructured":"Johansson, R. , Nugues, P. and Williams, D. (2004). Carsim: A system to convert written accident reports into animated 3D scenes. In Proceedings of the 2nd Joint SAIS\/SSLS Workshop Artificial Intelligence and Learning Systems, AILS-04. Department of Computer Science, Lund University, pp. 76\u201386.","DOI":"10.3115\/1628275.1628283"},{"key":"S1351324922000390_ref23","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2009.04.002"},{"key":"S1351324922000390_ref40","volume-title":"Master of Science","author":"Rouhizadeh","year":"2013"},{"key":"S1351324922000390_ref37","volume-title":"C4.5: Programs for Machine Learning","author":"Quinlan","year":"1993"},{"key":"S1351324922000390_ref36","first-page":"1","volume-title":"ICCICT","author":"Pardhi","year":"2021"},{"key":"S1351324922000390_ref26","doi-asserted-by":"crossref","first-page":"159","DOI":"10.2307\/2529310","article-title":"The measurement of observer agreement for categorical data","volume":"33","author":"Landis","year":"1977","journal-title":"Biometrics"},{"key":"S1351324922000390_ref28","unstructured":"Ma, M. (2006). Automatic Conversion of Natural Language to 3D Animation. PhD Thesis, University of Ulster."},{"key":"S1351324922000390_ref38","unstructured":"Qur\u2019anic Question and Answer Project (2014a). Semantic role labeling manual of style. Technical report, Iran Telecommunication Research Center, Tehran, Iran."},{"key":"S1351324922000390_ref18","first-page":"63","article-title":"Development and evaluation of text-to-scene model for Korean language writing education as a Foreign language","volume":"31","author":"Hong","year":"2018","journal-title":"Journal of The Korean Society for Computer Game"},{"key":"S1351324922000390_ref24","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-59286-5_57"},{"key":"S1351324922000390_ref16","doi-asserted-by":"publisher","DOI":"10.1145\/2932710"},{"key":"S1351324922000390_ref2","volume-title":"Introduction to Machine Learning","author":"Alpaydin","year":"2014"},{"key":"S1351324922000390_ref14","unstructured":"Glass, K. and Bangay, S. (2008). Automating the creation of 3D animation from annotated fiction text. In IADIS 2008: Proceedings of the International Conference on Computer Graphics and Visualization 2008, MM\u201910, Amsterdam, The Netherlands. IADIS Press, pp. 3\u201310. 00006."},{"key":"S1351324922000390_ref5","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P15-1006"},{"key":"S1351324922000390_ref17","first-page":"1","article-title":"Generating Scene Descriptor from Indonesian Narrative","author":"Helfiandri","year":"2020","journal-title":"Text."},{"key":"S1351324922000390_ref19","unstructured":"Iran Telecommunication Research Center (2014). Qur\u2019anic Question and Answer Project. http:\/\/quranjooy.itrc.ac.ir."},{"key":"S1351324922000390_ref30","volume-title":"Studies in Computational Intelligence","volume":"181","author":"Miaoulis","year":"2009"},{"key":"S1351324922000390_ref46","doi-asserted-by":"publisher","DOI":"10.1561\/2200000013"},{"key":"S1351324922000390_ref20","volume-title":"Current Studies in Linguistics Series","author":"Jackendoff","year":"1990"},{"key":"S1351324922000390_ref35","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-00831-3_2"},{"key":"S1351324922000390_ref34","first-page":"1","article-title":"The proposition bank: A corpus annotated with semantic roles","volume":"31","author":"Palmer","year":"2005","journal-title":"Computational Linguistics Journal"},{"key":"S1351324922000390_ref11","doi-asserted-by":"publisher","DOI":"10.3115\/1219840.1219885"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324922000390","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,2]],"date-time":"2024-10-02T12:28:57Z","timestamp":1727872137000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324922000390\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,24]]},"references-count":51,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,5]]}},"alternative-id":["S1351324922000390"],"URL":"https:\/\/doi.org\/10.1017\/s1351324922000390","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"value":"1351-3249","type":"print"},{"value":"1469-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,8,24]]},"assertion":[{"value":"\u00a9 The Author(s), 2022. Published by Cambridge University Press","name":"copyright","label":"Copyright","group":{"name":"copyright_and_licensing","label":"Copyright and Licensing"}}]}}