{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T15:21:33Z","timestamp":1781104893403,"version":"3.54.1"},"reference-count":0,"publisher":"IGI Global Scientific Publishing","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,4,1]]},"abstract":"<p>Audiovisual documents provide a wide range of content description through more descriptors from different media types. Indeed, the extraction of these descriptions has received an increasing attention. But, the lack of semantic description always persists. In fact, this lack affects the retrieval process. To address this problem, this paper describes an automatic and semantic description of cinematic audiovisual documents. This description is based not only on the audiovisual flux in this post-production phase but also in the documentation in the pre-production phase by using textual and visual modalities. In this context, to extract content description, we find it is essential to extract texts superposed in the image. This process is mainly based on the neural network classifier. Moreover, an effective OCR (Tesseract) is adapted for texts recognition. Experiments results confirmed the interesting performance through two databases, namely, \u201cICDAR 2011\u201d and our own created database from the Internet Movie Database Imdb.<\/p>","DOI":"10.4018\/ijmdem.2015040104","type":"journal-article","created":{"date-parts":[[2015,6,8]],"date-time":"2015-06-08T12:36:52Z","timestamp":1433767012000},"page":"52-70","source":"Crossref","is-referenced-by-count":6,"title":["Towards Fusion of Textual and Visual Modalities for Describing Audiovisual Documents"],"prefix":"10.4018","volume":"6","author":[{"given":"Manel","family":"Fourati","sequence":"first","affiliation":[{"name":"Laboratory MIR@CL, University of Sfax, Sfax, Tunisia"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Anis","family":"Jedidi","sequence":"additional","affiliation":[{"name":"Laboratory MIR@CL, University of Sfax, Sfax, Tunisia"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hanen","family":"Ben Hassin","sequence":"additional","affiliation":[{"name":"Laboratory MIR@CL, University of Sfax, Sfax, Tunisia"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Faiez","family":"Gargouri","sequence":"additional","affiliation":[{"name":"Laboratory MIR@CL, University of Sfax, Sfax, Tunisia"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"2432","container-title":["International Journal of Multimedia Data Engineering and Management"],"original-title":[],"language":"ng","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=130339","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,6,1]],"date-time":"2022-06-01T20:35:12Z","timestamp":1654115712000},"score":1,"resource":{"primary":{"URL":"https:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/IJMDEM.2015040104"}},"subtitle":[""],"short-title":[],"issued":{"date-parts":[[2015,4,1]]},"references-count":0,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2015,4]]}},"URL":"https:\/\/doi.org\/10.4018\/ijmdem.2015040104","relation":{},"ISSN":["1947-8534","1947-8542"],"issn-type":[{"value":"1947-8534","type":"print"},{"value":"1947-8542","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,4,1]]}}}