{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T02:17:16Z","timestamp":1771467436720,"version":"3.50.1"},"reference-count":37,"publisher":"SAGE Publications","issue":"6","license":[{"start":{"date-parts":[[2019,7,30]],"date-time":"2019-07-30T00:00:00Z","timestamp":1564444800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2019,12,23]]},"abstract":"<jats:p>The aim of the work is to report the results of the Chist-Era project AMIS (Access Multilingual Information opinionS). The purpose of AMIS is to answer the following question: How to make the information in a foreign language accessible for everyone? This issue is not limited to translate a source video into a target language video since the objective is to provide only the main idea of an Arabic video in English. This objective necessitates developing research in several areas that are not, all arrived at a maturity state: Video summarization, Speech recognition, Machine translation, Audio summarization and Speech segmentation. In this article we present several possible architectures to achieve our objective, yet we focus on only one of them. The scientific locks are be presented, and we explain how to deal with them. One of the big challenges of this work is to conceive a way to evaluate objectively a system composed of several components knowing that each of them has its limits and can propagate errors through the first component. Also, a subjective evaluation procedure is proposed in which several annotators have been mobilized to test the quality of the achieved summaries.<\/jats:p>","DOI":"10.3233\/jifs-179350","type":"journal-article","created":{"date-parts":[[2019,8,2]],"date-time":"2019-08-02T12:17:15Z","timestamp":1564748235000},"page":"7415-7426","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":3,"title":["Summarizing videos into a target language: Methodology, architectures and evaluation"],"prefix":"10.1177","volume":"37","author":[{"given":"Kamel","family":"Sma\u00efli","sequence":"first","affiliation":[{"name":"Loria University of Lorraine \u2013 France"}]},{"given":"Dominique","family":"Fohr","sequence":"additional","affiliation":[{"name":"Loria University of Lorraine \u2013 France"}]},{"given":"Carlos-Emiliano","family":"Gonz\u00e1lez-Gallardo","sequence":"additional","affiliation":[{"name":"LIA Avignon Universit\u00e9 \u2013 France"}]},{"given":"Micha\u0142","family":"Grega","sequence":"additional","affiliation":[{"name":"AGH University of Science and Technology Krak\u00f3w \u2013 Poland"}]},{"given":"Lucjan","family":"Janowski","sequence":"additional","affiliation":[{"name":"Loria University of Lorraine \u2013 France"}]},{"given":"Denis","family":"Jouvet","sequence":"additional","affiliation":[{"name":"Loria University of Lorraine \u2013 France"}]},{"given":"Arian","family":"Ko\u017abia\u0142","sequence":"additional","affiliation":[{"name":"AGH University of Science and Technology Krak\u00f3w \u2013 Poland"}]},{"given":"David","family":"Langlois","sequence":"additional","affiliation":[{"name":"Loria University of Lorraine \u2013 France"}]},{"given":"Miko\u0142laj","family":"Leszczuk","sequence":"additional","affiliation":[{"name":"Loria University of Lorraine \u2013 France"}]},{"given":"Odile","family":"Mella","sequence":"additional","affiliation":[{"name":"Loria University of Lorraine \u2013 France"}]},{"given":"Mohamed-Amine","family":"Menacer","sequence":"additional","affiliation":[{"name":"Loria University of Lorraine \u2013 France"}]},{"given":"Amaia","family":"Mendez","sequence":"additional","affiliation":[{"name":"University of DEUSTO Bilbao \u2013 Spain"}]},{"given":"Elvys Linhares","family":"Pontes","sequence":"additional","affiliation":[{"name":"LIA Avignon Universit\u00e9 \u2013 France"}]},{"given":"Eric","family":"SanJuan","sequence":"additional","affiliation":[{"name":"LIA Avignon Universit\u00e9 \u2013 France"}]},{"given":"Juan-Manuel","family":"Torres-Moreno","sequence":"additional","affiliation":[{"name":"LIA Avignon Universit\u00e9 \u2013 France"},{"name":"Polytechnique Montr\u00e9al \u2013 Canada"}]},{"given":"Bego\u00f1a","family":"Garcia-Zapirain","sequence":"additional","affiliation":[{"name":"University of DEUSTO Bilbao \u2013 Spain"}]}],"member":"179","published-online":{"date-parts":[[2019,7,30]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"crossref","unstructured":"BaranR. and ZejaA. The imcop system for data enrichment and content discovery and delivery In 2015 International Conference on Computational Science and Computational Intelligence (CSCI) 2015 pp. 143\u2013146.","DOI":"10.1109\/CSCI.2015.137"},{"key":"e_1_3_2_3_2","first-page":"730","volume-title":"Interspeech","author":"Bell P.","year":"2015","unstructured":"BellP., LaiC., LlewellynC., BirchA. and SinclairM., A system for automatic broadcast news summarisation, geolocation and translation. In Interspeech, 2015, pp. 730\u2013731."},{"issue":"2","key":"e_1_3_2_4_2","first-page":"263","article-title":"The mathematics of statistical machine translation: Parameter estimation","volume":"19","author":"Brown P.F.","year":"1993","unstructured":"BrownP.F., Della PietraV.J., Della PietraS.A. and MercerR.L., The mathematics of statistical machine translation: Parameter estimation, Computational Linguistics 19(2) (1993), 263\u2013311.","journal-title":"Computational Linguistics"},{"key":"e_1_3_2_5_2","article-title":"Network of data centres (netdc): Bnsc-an arabic broadcast news speech corpus","author":"Choukri K.","year":"2004","unstructured":"ChoukriK., NikkhouM. and PaulssonN., Network of data centres (netdc): Bnsc-an arabic broadcast news speech corpus, In LREC, 2004.","journal-title":"LREC"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-24752-4_17"},{"key":"e_1_3_2_7_2","article-title":"Multiun: A multilingual corpus from united nation documents","author":"Eisele A.","year":"2010","unstructured":"EiseleA. and YuC., Multiun: A multilingual corpus from united nation documents, In LREC, 2010.","journal-title":"LREC"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1006\/csla.1998.0043"},{"key":"e_1_3_2_9_2","unstructured":"Gonz\u00e1lez-GallardoC.-E. and Torres-MorenoJ.-M. Sentence boundary detection for french with subword-level information vectors and convolutional neural networks. arXiv preprint arXiv:1802.04559 2018."},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298928"},{"key":"e_1_3_2_11_2","unstructured":"HuangM. MahajanA.B. and DeMenthonD.F. Automatic Performance Evaluation for Video Summarization. AD-a448 064. Maryland Univ. College Park Inst. for Advanced Computer Studies 2004."},{"key":"e_1_3_2_12_2","unstructured":"JouvetD. LangloisD. MenacerM.A. FohrD. MellaO. and KamelS. Adaptation of speech recognition vocabularies for improved transcription of youtube videos In ICNLSSP Conference 2017."},{"key":"e_1_3_2_13_2","first-page":"38","volume-title":"Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra)","author":"Khan M.U.G.","year":"2012","unstructured":"KhanM.U.G., NawabR.M.A. and GotohY., Natural language descriptions of visual scenes corpus generation and analysis. In Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra), 2012, pp. 38\u201347. ACL."},{"key":"e_1_3_2_14_2","first-page":"181e4","article-title":"Improved backing-off for m-gram language modeling","volume":"1","author":"Kneser R.","year":"1995","unstructured":"KneserR. and NeyH., Improved backing-off for m-gram language modeling, In icassp, volume 1, 1995, p. 181e4.","journal-title":"icassp"},{"key":"e_1_3_2_15_2","doi-asserted-by":"crossref","unstructured":"KoehnP. HoangH. BirchA. Callison-BurchC. FedericoM. BertoldiN. CowanB. ShenW. MoranC. ZensR. et al. Moses: Open source toolkit for statistical machine translation In 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions 2007 pp. 177\u2013180. ACL.","DOI":"10.3115\/1557769.1557821"},{"key":"e_1_3_2_16_2","doi-asserted-by":"crossref","unstructured":"KoehnP. OchF.J. and MarcuD. Statistical phrase-based translation In 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1 2003 pp. 48\u201354. ACL.","DOI":"10.3115\/1073445.1073462"},{"key":"e_1_3_2_17_2","first-page":"424","volume-title":"Multimedia and Network Information Systems","author":"Komorowski A.","year":"2019","unstructured":"KomorowskiA., JanowskiL. and LeszczukM., Evaluation of multimedia content summarization algorithms. In Choro\u015bK., KopelM. , KuklaE. and Siemi\u0144skiA. , editors, Multimedia and Network Information Systems, Cham, Springer International Publishing, 2019, pp. 424\u2013433."},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-69911-0_7"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2005.06.002"},{"key":"e_1_3_2_20_2","doi-asserted-by":"crossref","unstructured":"LouisA. and NenkovaA. Automatically evaluating content selection in summarization without human models. In 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1 ACL 2009 pp. 306\u2013314.","DOI":"10.3115\/1699510.1699550"},{"key":"e_1_3_2_21_2","unstructured":"MaegaardB. ChoukriK. J\u00f8rgensen L.D. and KrauwerS. Nemlar: Arabic language resources and tools. In Arabic Language Resources and Tools Conference 2004 pp. 42\u201354."},{"key":"e_1_3_2_22_2","unstructured":"ManiI. Summarization evaluation: An overview 2001."},{"key":"e_1_3_2_23_2","unstructured":"ManiI. and MarkT. Maybury Advances in Automatic Text Summarization. MIT Press Cambridge MA USA 1999."},{"key":"e_1_3_2_24_2","doi-asserted-by":"crossref","unstructured":"McFeeB. RaffelC. LiangD. EllisD.P.W. McVicarM. BattenbergE. and NietoO. Librosa: Audio and music signal analysis in python In 14th Python in Science Conference 2015 pp. 18\u201325.","DOI":"10.25080\/Majora-7b98e3ed-003"},{"key":"e_1_3_2_25_2","unstructured":"MenacerM.A. MellaO. FohrD. JouvetD. LangloisD. and Sma\u00efliK. Development of the Arabic Loria Automatic Speech Recognition system (ALASR) and its evaluation for Algerian dialect. In ACLing 2017 \u20133rd International Conference on Arabic Computational Linguistics Dubai United Arab Emirates 2017 pp. 1\u20138."},{"key":"e_1_3_2_26_2","first-page":"374","article-title":"Distance measures for speech recognition, psychological and instrumental","volume":"116","author":"Mermelstein P.","year":"1976","unstructured":"MermelsteinP., Distance measures for speech recognition, psychological and instrumental, Pattern Recognition and Artificial Intelligence 116 (1976), 374\u2013388.","journal-title":"Pattern Recognition and Artificial Intelligence"},{"key":"e_1_3_2_27_2","unstructured":"NenkovaA. Automatic text summarization of newswire: Lessons learned from the document understanding conference In 20th National Conference on Artificial Intelligence \u2013 Volume 3 AAAI\u201905 AAAI Press 2005 pp. 1436\u20131441."},{"key":"e_1_3_2_28_2","unstructured":"OchF.J. Giza++: Training of statistical translation models http:\/\/www.isi.edu\/\u223coch\/GIZA++.html 2001."},{"key":"e_1_3_2_29_2","first-page":"160","author":"Och F.J.","year":"2003","unstructured":"OchF.J., Minimum error rate training in statistical machine translation, In 41st Annual Meeting on Association for Computational Linguistics-Volume 1, ACL, 2003, pp. 160\u2013167.","journal-title":"Minimum error rate training in statistical machine translation"},{"key":"e_1_3_2_30_2","first-page":"1","volume-title":"Workshop on Evaluation Metrics and System Comparison for Automatic Summarization","author":"Owczarzak K.","year":"2012","unstructured":"OwczarzakK., ConroyJ.M., DangH.T. and NenkovaA., An assessment of the accuracy of automatic evaluation in summarization. In Workshop on Evaluation Metrics and System Comparison for Automatic Summarization, Stroudsburg, PA, USA, 2012, pp. 1\u20139. ACL."},{"key":"e_1_3_2_31_2","article-title":"The kaldi speech recognition toolkit","author":"Povey D.","year":"2011","unstructured":"PoveyD., GhoshalA., BoulianneG., BurgetL., GlembekO., GoelN., HannemannM., MotlicekP., QianY., SchwarzP., SilovskyJ., StemmerG. and VeselyK., The kaldi speech recognition toolkit, In IEEE 2011 Workshop on Automatic Speech Recognition and Understanding IEEE Signal Processing Society, IEEE Catalog No.: CFP11SRW-USB, 2011.","journal-title":"IEEE 2011 Workshop on Automatic Speech Recognition and Understanding IEEE Signal Processing Society"},{"key":"e_1_3_2_32_2","unstructured":"QuemyA. JamrogK. and JaniszewskiM. Unsupervised video semantic partitioning using ibm watson and topic modelling In Workshops of the EDBT\/ICDT 2018 Joint Conference 2018 pp. 44\u201349."},{"key":"e_1_3_2_33_2","first-page":"583","article-title":"Music\/voice separation using the similarity matrix","author":"Rafii Z.","year":"2012","unstructured":"RafiiZ. and PardoB., Music\/voice separation using the similarity matrix. In ISMIR, 2012, pp. 583\u2013588.","journal-title":"ISMIR"},{"key":"e_1_3_2_34_2","doi-asserted-by":"crossref","unstructured":"SharghiA. LaurelJ.S. and GongB. Queryfocused video summarization: Dataset evaluation and a memory network based approach In 2017 IEEE Conference on Computer Vision and Pattern Recognition CVPR 2017 Honolulu HI USA pp. 2127\u20132136. IEEE Computer Society 2017.","DOI":"10.1109\/CVPR.2017.229"},{"key":"e_1_3_2_35_2","unstructured":"StolckeA. Entropy-based pruning of backoff language models. arXiv preprint cs\/0006025 2000."},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1002\/9781119004752"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.17562\/PB-42-2"},{"key":"e_1_3_2_38_2","doi-asserted-by":"crossref","DOI":"10.21437\/Interspeech.2013-548","article-title":"Sequence-discriminative training of deep neural networks","author":"Vesel\u00fd K.","year":"2013","unstructured":"Vesel\u00fdK., GhoshalA., BurgetL. and PoveyD., Sequence-discriminative training of deep neural networks, In Interspeech\u201913, 2013.","journal-title":"Interspeech\u201913"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179350","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-179350","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179350","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,4]],"date-time":"2026-02-04T18:59:28Z","timestamp":1770231568000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-179350"}},"subtitle":[],"editor":[{"given":"Ngoc Thanh","family":"Nguyen","sequence":"additional","affiliation":[]},{"given":"Edward","family":"Szczerbicki","sequence":"additional","affiliation":[]},{"given":"Bogdan","family":"Trawi\u0144ski","sequence":"additional","affiliation":[]},{"given":"Van Du","family":"Nguyen","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,7,30]]},"references-count":37,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2019,12,23]]}},"alternative-id":["10.3233\/JIFS-179350"],"URL":"https:\/\/doi.org\/10.3233\/jifs-179350","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,7,30]]}}}