{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,25]],"date-time":"2025-03-25T14:11:55Z","timestamp":1742911915459,"version":"3.40.3"},"publisher-location":"Cham","reference-count":34,"publisher":"Springer International Publishing","isbn-type":[{"type":"print","value":"9783319149974"},{"type":"electronic","value":"9783319149981"}],"license":[{"start":{"date-parts":[[2015,1,1]],"date-time":"2015-01-01T00:00:00Z","timestamp":1420070400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"},{"start":{"date-parts":[[2015,1,1]],"date-time":"2015-01-01T00:00:00Z","timestamp":1420070400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015]]},"DOI":"10.1007\/978-3-319-14998-1_13","type":"book-chapter","created":{"date-parts":[[2015,4,2]],"date-time":"2015-04-02T18:57:00Z","timestamp":1428001020000},"page":"295-310","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Multimodal Fusion: Combining Visual and Textual Cues for Concept Detection in Video"],"prefix":"10.1007","author":[{"given":"Damianos","family":"Galanopoulos","sequence":"first","affiliation":[]},{"given":"Milan","family":"Dojchinovski","sequence":"additional","affiliation":[]},{"given":"Krishna","family":"Chandramouli","sequence":"additional","affiliation":[]},{"given":"Tom\u00e1\u0161","family":"Kliegr","sequence":"additional","affiliation":[]},{"given":"Vasileios","family":"Mezaris","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2015,4,1]]},"reference":[{"key":"13_CR1","unstructured":"Bao L, Yu SI, Lan ZZ, Overwijk A, Jin Q, Langner B, Garbus 
M, Burger S, Metze F, Hauptmann A (2011) Informedia@ TRECVID 2011 multimedia event detection, semantic indexing. TRECVID compet 1:107\u2013123"},{"key":"13_CR2","doi-asserted-by":"crossref","unstructured":"Bay H, Tuytelaars T, Van Gool L (2006) SURF: speeded up robust features. In: Computer vision-ECCV 2006. Springer, Heidelberg, pp 404\u2013417","DOI":"10.1007\/11744023_32"},{"issue":"1","key":"13_CR3","doi-asserted-by":"publisher","first-page":"82","DOI":"10.1109\/TCSVT.2005.856896","volume":"16","author":"Z Cernekova","year":"2006","unstructured":"Cernekova Z, Pitas I, Nikou C (2006) Information theory-based shot cut\/fade detection and video summarization. IEEE Trans Circuits Syst Video Technol 16(1):82\u201391","journal-title":"IEEE Trans Circuits Syst Video Technol"},{"key":"13_CR4","doi-asserted-by":"crossref","unstructured":"Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2:27:1\u201327:27. Software available at http:\/\/www.csie.ntu.edu.tw\/~cjlin\/libsvm","DOI":"10.1145\/1961189.1961199"},{"key":"13_CR5","unstructured":"Chavez GC, Precioso F, Cord M, Philipp-Foliguet S, Araujo AdA (2006) Shot boundary detection at TRECVID 2006. In: Proceedings of the TREC video retrieval evaluation, p 1\u20138"},{"key":"13_CR6","unstructured":"Delezoide B, Precioso F, Gosselin PH, Redi M, M\u00e9rialdo B, Granjon L, Pellerin D, Rombaut M, J\u00e9gou H, Vieux R et al (2011) IRIM at TRECVID 2011: semantic indexing and instance search. In: Notebook papers of the TREC video retrieval evaluation workshop (TRECVID)"},{"key":"13_CR7","first-page":"1871","volume":"9","author":"RE Fan","year":"2008","unstructured":"Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ (2008) LIBLINEAR: a library for large linear classification. 
J Mach Learn Res 9:1871\u20131874","journal-title":"J Mach Learn Res"},{"key":"13_CR8","first-page":"1606","volume":"7","author":"E Gabrilovich","year":"2007","unstructured":"Gabrilovich E, Markovitch S (2007) Computing semantic relatedness using Wikipedia-based explicit semantic analysis. IJCAI 7:1606\u20131611","journal-title":"IJCAI"},{"issue":"1","key":"13_CR9","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1016\/S0167-6393(01)00061-9","volume":"37","author":"JL Gauvain","year":"2002","unstructured":"Gauvain JL, Lamel L, Adda G (2002) The LIMSI broadcast news transcription system. Speech Commun 37(1):89\u2013108","journal-title":"Speech Commun"},{"key":"13_CR10","doi-asserted-by":"crossref","unstructured":"Hamadi A, Mulhem P, Qu\u00e9not G (2013) Conceptual feedback for semantic multimedia indexing. In: Proceedings of the 11th international workshop on content-based multimedia indexing (CBMI). IEEE, pp 53\u201358","DOI":"10.1109\/CBMI.2013.6576552"},{"key":"13_CR11","doi-asserted-by":"crossref","unstructured":"Harris C, Stephens M (1988) A combined corner and edge detector. In: Alvey vision conference, vol 15. Manchester, p 50","DOI":"10.5244\/C.2.23"},{"key":"13_CR12","doi-asserted-by":"crossref","unstructured":"Kliegr T, Chandramouli K, Nemrava J, Svatek V, Izquierdo E (2008) Combining image captions and visual analysis for image concept classification. In: Proceedings of the 9th international workshop on multimedia data mining: held in conjunction with the ACM SIGKDD 2008, MDM \u201908. ACM, New York, pp 8\u201317","DOI":"10.1145\/1509212.1509214"},{"key":"13_CR13","doi-asserted-by":"crossref","unstructured":"Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, vol 2. 
IEEE, pp 2169\u20132178","DOI":"10.1109\/CVPR.2006.68"},{"key":"13_CR14","unstructured":"Leong CW, Mihalcea R, Hassan S (2010) Text mining for automatic image tagging. In: Proceedings of the 23rd international conference on computational linguistics: posters. Association for Computational Linguistics, pp 647\u2013655"},{"key":"13_CR15","doi-asserted-by":"crossref","unstructured":"Lin WH, Hauptmann A (2002) News video classification using SVM-based multimodal classifiers and combination strategies. In: Proceedings of the 10th ACM international conference on multimedia. ACM, pp 323\u2013326","DOI":"10.1145\/641007.641075"},{"key":"13_CR16","unstructured":"Liu C, Liu H, Jiang S, Huang Q, Zheng Y, Zhang W (2006) JDL at TRECVID 2006 shot boundary detection. In: TRECVID 2006 workshop"},{"key":"13_CR17","doi-asserted-by":"crossref","unstructured":"Lowe DG (1999) Object recognition from local scale-invariant features. In: Proceedings of the 7th IEEE international conference on computer vision, vol 2. IEEE, pp 1150\u20131157","DOI":"10.1109\/ICCV.1999.790410"},{"key":"13_CR18","unstructured":"Markatopoulou F, Moumtzidou A, Tzelepis C, Avgerinakis K, Gkalelis N, Vrochidis S, Mezaris V, Kompatsiaris I (2013) ITI-CERTH participation to TRECVID 2013. In: Proceedings of TRECVID 2013 workshop. TRECVID 2013"},{"issue":"2","key":"13_CR19","doi-asserted-by":"publisher","first-page":"230","DOI":"10.1109\/TKDE.2004.1269600","volume":"16","author":"A Mittal","year":"2004","unstructured":"Mittal A, Cheong LF (2004) Addressing the problems of Bayesian network classification of video using high-dimensional features. IEEE Trans Knowl Data Eng 16(2):230\u2013244","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"13_CR20","unstructured":"Moumtzidou A, Gkalelis N, Sidiropoulos P, Dimopoulos M, Nikolopoulos S, Vrochidis S, Mezaris V, Kompatsiaris I (2012) ITI-CERTH participation to TRECVID 2012. In: Proceedings of TRECVID 2012 workshop. 
TRECVID 2012"},{"key":"13_CR21","unstructured":"Over P, Awad G, Michel M, Fiscus J, Sanders G, Kraaij W, Smeaton AF, Qu\u00e9not G (2013) TRECVID 2013\u2014an overview of the goals, tasks, data, evaluation mechanisms and metrics. In: Proceedings of TRECVID 2013. NIST"},{"key":"13_CR22","unstructured":"Over P, Awad G, Michel M, Fiscus J, Sanders G, Shaw B, Kraaij W, Smeaton AF, Qu\u00e9not G (2012) TRECVID 2012\u2014an overview of the goals, tasks, data, evaluation mechanisms and metrics. In: Proceedings of TRECVID 2012. NIST"},{"key":"13_CR23","unstructured":"Qu\u00e9not G, Moraru D, Besacier L (2003) Clips at TRECVID: shot boundary detection and feature detection. In: TRECVID 2003 workshop notebook papers. Citeseer"},{"key":"13_CR24","doi-asserted-by":"crossref","unstructured":"Radinsky K, Agichtein E, Gabrilovich E, Markovitch S (2011) A word at a time: computing word relatedness using temporal semantic analysis. In: Proceedings of the 20th international conference on world wide web. ACM, pp 337\u2013346","DOI":"10.1145\/1963405.1963455"},{"issue":"5","key":"13_CR25","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1016\/0306-4573(88)90021-0","volume":"24","author":"G Salton","year":"1988","unstructured":"Salton G, Buckley C (1988) Term-weighting approaches in automatic text retrieval. Inf Process Manag 24(5):513\u2013523","journal-title":"Inf Process Manag"},{"issue":"1","key":"13_CR26","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/505282.505283","volume":"34","author":"F Sebastiani","year":"2002","unstructured":"Sebastiani F (2002) Machine learning in automated text categorization. ACM Comput Surv (CSUR) 34(1):1\u201347","journal-title":"ACM Comput Surv (CSUR)"},{"key":"13_CR27","doi-asserted-by":"crossref","unstructured":"Sechidis K, Tsoumakas G, Vlahavas I (2011) On the stratification of multi-label data. In: Machine learning and knowledge discovery in databases. 
Springer, Berlin, pp 145\u2013158","DOI":"10.1007\/978-3-642-23808-6_10"},{"key":"13_CR28","doi-asserted-by":"crossref","unstructured":"Sidiropoulos P, Mezaris V, Kompatsiaris I (2013) Enhancing video concept detection with the use of tomographs. In: Proceedings of the 20th IEEE international conference on image processing (ICIP), pp 3991\u20133995","DOI":"10.1109\/ICIP.2013.6738822"},{"key":"13_CR29","doi-asserted-by":"crossref","unstructured":"Tsamoura E, Mezaris V, Kompatsiaris I (2008) Gradual transition detection using color coherence and other criteria in a video shot meta-segmentation framework. In: Proceedings of the 15th IEEE international conference on image processing (ICIP), pp 45\u201348","DOI":"10.1109\/ICIP.2008.4711687"},{"issue":"9","key":"13_CR30","doi-asserted-by":"publisher","first-page":"1582","DOI":"10.1109\/TPAMI.2009.154","volume":"32","author":"KE Van De Sande","year":"2010","unstructured":"Van De Sande KE, Gevers T, Snoek CG (2010) Evaluating color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell 32(9):1582\u20131596","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"13_CR31","doi-asserted-by":"crossref","unstructured":"Wan KW, Yau WY, Roy S (2013) Metadata enrichment for news video retrieval: a graph-based propagation approach. In: Proceedings of the 21st ACM international conference on multimedia. ACM, pp 373\u2013376","DOI":"10.1145\/2502081.2508122"},{"key":"13_CR32","unstructured":"Witten I, Milne D (2008) An effective, low-cost measure of semantic relatedness obtained from wikipedia links. In: Proceeding of AAAI workshop on Wikipedia and artificial intelligence: an evolving synergy. AAAI Press, Chicago, pp 25\u201330"},{"key":"13_CR33","doi-asserted-by":"crossref","unstructured":"Yilmaz E, Kanoulas E, Aslam JA (2008) A simple and efficient sampling method for estimating AP and NDCG. 
In: Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval. ACM, pp 603\u2013610","DOI":"10.1145\/1390334.1390437"},{"key":"13_CR34","doi-asserted-by":"crossref","unstructured":"Zhao ZC, Cai AN (2006) Shot boundary detection algorithm in compressed domain based on adaboost and fuzzy theory. In: Advances in natural computation. Springer, Berlin, pp 617\u2013626","DOI":"10.1007\/11881223_76"}],"container-title":["Multimedia Data Mining and Analytics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-319-14998-1_13","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,8]],"date-time":"2023-02-08T10:34:02Z","timestamp":1675852442000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-319-14998-1_13"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015]]},"ISBN":["9783319149974","9783319149981"],"references-count":34,"URL":"https:\/\/doi.org\/10.1007\/978-3-319-14998-1_13","relation":{},"subject":[],"published":{"date-parts":[[2015]]},"assertion":[{"value":"1 April 2015","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}}]}}