{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,13]],"date-time":"2026-06-13T20:10:17Z","timestamp":1781381417651,"version":"3.54.1"},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2013,11,18]],"date-time":"2013-11-18T00:00:00Z","timestamp":1384732800000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["EURASIP J. Adv. Signal Process."],"published-print":{"date-parts":[[2013,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>The ability to detect and organize \u2018hot spots\u2019 representing areas of excitement within video streams is a challenging research problem when techniques rely exclusively on video content. A generic method for sports video highlight selection is presented in this study which leverages both video\/image structure as well as audio\/speech properties. Processing begins where the video is partitioned into small segments and several multi-modal features are extracted from each segment. Excitability is computed based on the likelihood of the segmental features residing in certain regions of their joint probability density function space which are considered both exciting and rare. The proposed measure is used to rank order the partitioned segments to compress the overall video sequence and produce a contiguous set of highlights. Experiments are performed on baseball videos based on signal processing advancements for excitement assessment in the commentators\u2019 speech, audio energy, slow motion replay, scene cut density, and motion activity as features. Detailed analysis on correlation between user excitability and various speech production parameters is conducted and an effective scheme is designed to estimate the excitement level of commentator\u2019s speech from the sports videos. Subjective evaluation of excitability and ranking of video segments demonstrate a higher correlation with the proposed measure compared to well-established techniques indicating the effectiveness of the overall approach.<\/jats:p>","DOI":"10.1186\/1687-6180-2013-173","type":"journal-article","created":{"date-parts":[[2013,11,19]],"date-time":"2013-11-19T01:25:36Z","timestamp":1384824336000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Multi-modal highlight generation for sports videos using an information-theoretic excitability measure"],"prefix":"10.1186","volume":"2013","author":[{"given":"Taufiq","family":"Hasan","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hynek","family":"Bo\u0159il","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Abhijeet","family":"Sangwan","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"John H","family":"L Hansen","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2013,11,18]]},"reference":[{"key":"566_CR1","volume-title":"Proc. IEEE ICASSP 7\u201311","author":"H Pan","year":"2001","unstructured":"Pan H, Van Beek P, Sezan M: Detection of slow-motion replay segments in sports video for highlights generation. Proc. IEEE ICASSP 7\u201311 May 2001"},{"issue":"2","key":"566_CR2","doi-asserted-by":"publisher","first-page":"142","DOI":"10.1016\/j.cviu.2007.09.002","volume":"111","author":"M Delakis","year":"2008","unstructured":"Delakis M, Gravier G, Gros P: Audiovisual integration with Segment Models for tennis video parsing. Comput. Vis. Image Underst 2008, 111(2):142-154. 10.1016\/j.cviu.2007.09.002","journal-title":"Comput. Vis. Image Underst"},{"key":"566_CR3","first-page":"333","volume-title":"Proceedings of the 15th Int. Conf. on Multimedia, Augsburg, Germany, 24\u201329 Sept","author":"M Fleischman","year":"2007","unstructured":"Fleischman M, Roy B, Roy D: Temporal feature induction for Baseball highlight classification. In Proceedings of the 15th Int. Conf. on Multimedia, Augsburg, Germany, 24\u201329 Sept.. New York: ACM; 2007:333-336."},{"key":"566_CR4","volume-title":"Proc. IEEE ICIP, Barcelona, Catalonia, 14-17 Sept","author":"Z Xiong","year":"2003","unstructured":"Xiong Z, Radhakrishnan R, Divakaran A: Generation of sports highlights using motion activity in combination with a common audio feature extraction framework. Proc. IEEE ICIP, Barcelona, Catalonia, 14-17 Sept. 2003."},{"issue":"3","key":"566_CR5","doi-asserted-by":"publisher","first-page":"545","DOI":"10.1007\/s11042-009-0337-1","volume":"47","author":"M Kolekar","year":"2010","unstructured":"Kolekar M, Sengupta S: Semantic concept mining in cricket videos for automated highlight generation. Multimedia Tools and Appl 2010, 47(3):545-579. 10.1007\/s11042-009-0337-1","journal-title":"Multimedia Tools and Appl"},{"key":"566_CR6","doi-asserted-by":"publisher","first-page":"471","DOI":"10.1109\/WACV.2011.5711541","volume-title":"2011 IEEE Workshop on Applications of Computer Vision (WACV), Kona, HI, 5\u20137 January","author":"D Tjondronegoro","year":"2011","unstructured":"Tjondronegoro D, Tao X, Sasongko J, Lau C: Multi-modal summarization of key events and top players in sports tournament videos. In 2011 IEEE Workshop on Applications of Computer Vision (WACV), Kona, HI, 5\u20137 January. Piscataway: IEEE; 2011:471-478."},{"issue":"3","key":"566_CR7","doi-asserted-by":"publisher","first-page":"585","DOI":"10.1109\/TMM.2006.870726","volume":"8","author":"C Cheng","year":"2006","unstructured":"Cheng C, Hsu C: Fusion of audio and motion information on HMM-based highlight extraction for baseball games. IEEE Trans. Multimedia 2006, 8(3):585-599.","journal-title":"IEEE Trans. Multimedia"},{"key":"566_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.jvcir.2006.09.002","volume":"18","author":"C Lien","year":"2007","unstructured":"Lien C, Chiang C, Lee C: Scene-based event detection for baseball videos. J. of Visual Comm. and Image Representation 2007, 18: 1-14. 10.1016\/j.jvcir.2006.09.002","journal-title":"J. of Visual Comm. and Image Representation"},{"key":"566_CR9","first-page":"825","volume-title":"Proc. ICME \u201902, Lausanne, Switzerland, 26-29 Aug. 2002 Volume 1","author":"J Assfalg","year":"2002","unstructured":"Assfalg J, Bertini M, Bimbo AD, Nunziati W, Pala P: Soccer highlights detection and recognition using HMMs. In Proc. ICME \u201902, Lausanne, Switzerland, 26-29 Aug. 2002 Volume 1. Piscataway: IEEE; 2002:825-828."},{"issue":"6","key":"566_CR10","doi-asserted-by":"publisher","first-page":"1114","DOI":"10.1109\/TMM.2005.858397","volume":"7","author":"A Hanjalic","year":"2005","unstructured":"Hanjalic A: Adaptive extraction of highlights from a sport video based on excitement modeling. IEEE Trans. Multimedia 2005, 7(6):1114-1122.","journal-title":"IEEE Trans. Multimedia"},{"key":"566_CR11","first-page":"632","volume-title":"Proc. IEEE ICASSP, Hong Kong, China, 6\u201310 April 2003 Volume 5","author":"Z Xiong","year":"2003","unstructured":"Xiong Z, Radhakrishnan R, Divakaran A, Huang T: Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework. In Proc. IEEE ICASSP, Hong Kong, China, 6\u201310 April 2003 Volume 5. Washington, DC: IEEE Computer Society; 2003:632-635."},{"key":"566_CR12","first-page":"609","volume-title":"Proc. Image Process., Rochester, New York, 22\u201325, Sept. Volume 1","author":"P Chang","year":"2002","unstructured":"Chang P, Han M, Gong Y: Extract highlights from baseball game video with hidden Markov models. In Proc. Image Process., Rochester, New York, 22\u201325, Sept. Volume 1. Piscataway: IEEE; 2002:609-612."},{"key":"566_CR13","first-page":"115","volume-title":"Proceedings of the Eighth ACM International Conference on Multimedia, Los Angeles, CA, October 30 \u2013 November 03","author":"Y Rui","year":"2000","unstructured":"Rui Y, Gupta A, Acero A: Automatically extracting highlights for TV baseball programs. In Proceedings of the Eighth ACM International Conference on Multimedia, Los Angeles, CA, October 30 \u2013 November 03. New York: ACM; 2000:115-115."},{"key":"566_CR14","first-page":"542","volume-title":"Proceedings of the tenth ACM international conference on Multimedia, Juan les Pins, France, 1\u20136 December, 2002","author":"Y Ma","year":"2002","unstructured":"Ma Y, Lu L, Zhang H, Li M: A user attention model for video summarization. In Proceedings of the tenth ACM international conference on Multimedia, Juan les Pins, France, 1\u20136 December, 2002. New York: ACM; 2002:542-542."},{"key":"566_CR15","first-page":"2202","volume-title":"Proc. InterSpeech, Makuhari, Chiba, Japan 26\u201330","author":"H Bo\u0159il","year":"2010","unstructured":"Bo\u0159il H, Sangwan A, Hasan T, Hansen JHL: Automatic excitement-level detection for sports highlights generation. Proc. InterSpeech, Makuhari, Chiba, Japan 26\u201330 September 2010 2202-2205."},{"key":"566_CR16","first-page":"2381","volume-title":"IEEE ICASSP, Kyoto, Japan 25\u201330","author":"T Hasan","year":"2012","unstructured":"Hasan T, Bo\u0159il H, Sangwan A, Hansen JHL: A multi-modal highlight extraction scheme for sports videos using an information-theoretic excitability measure. IEEE ICASSP, Kyoto, Japan 25\u201330, March 2012 2381-2384."},{"key":"566_CR17","doi-asserted-by":"publisher","DOI":"10.1002\/0471200611","volume-title":"Elements of Information Theory","author":"TM Cover","year":"1991","unstructured":"Cover TM, Thomas JA: Elements of Information Theory. New York: Wiley-Interscience; 1991."},{"issue":"1-2","key":"566_CR18","doi-asserted-by":"publisher","first-page":"151","DOI":"10.1016\/S0167-6393(96)00050-7","volume":"20","author":"JHL Hansen","year":"1996","unstructured":"Hansen JHL: Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition. Speech Comm 1996, 20(1-2):151-173. 10.1016\/S0167-6393(96)00050-7","journal-title":"Speech Comm"},{"key":"566_CR19","doi-asserted-by":"publisher","first-page":"32","DOI":"10.1109\/79.911197","volume":"18","author":"R Cowie","year":"2001","unstructured":"Cowie R, Cowie Douglas-E, Tsapatsoulis N, Votsis G, Kollias S, Fellenz W, Taylor JG: Emotion recognition in human-computer interaction. IEEE Signal Process. Mag 2001, 18: 32-80. 10.1109\/79.911197","journal-title":"IEEE Signal Process. Mag"},{"issue":"3","key":"566_CR20","doi-asserted-by":"crossref","first-page":"1996","DOI":"10.1121\/1.3385171","volume":"127","author":"H Bo\u0159il","year":"2010","unstructured":"Bo\u0159il H, Kleinschmidt T, Boyraz P, Hansen JHL: Impact of cognitive load and frustration on drivers\u2019 speech. The J. Acoust. Soc. Am 2010, 127(3):1996-1996.","journal-title":"The J. Acoust. Soc. Am"},{"issue":"2","key":"566_CR21","doi-asserted-by":"publisher","first-page":"293","DOI":"10.1109\/TSA.2004.838534","volume":"13","author":"CM Lee","year":"2005","unstructured":"Lee CM, Narayanan SS: Toward detecting emotions in spoken dialogs. IEEE Trans. on Speech & Audio Process 2005, 13(2):293-303.","journal-title":"IEEE Trans. on Speech & Audio Process"},{"key":"566_CR22","volume-title":"Proc. of ICSLP\u201800, Beijing, China, 16\u201320 Oct","author":"K Sjolander","year":"2000","unstructured":"Sjolander K, Beskow J: Wave Surfer-an open source speech tool. Proc. of ICSLP\u201800, Beijing, China, 16\u201320 Oct. 2000 Volume 4"},{"issue":"S1","key":"566_CR23","doi-asserted-by":"publisher","first-page":"S37","DOI":"10.1121\/1.2022786","volume":"78","author":"R Schulman","year":"1985","unstructured":"Schulman R: Dynamic and perceptual constraints of loud speech. The J. Acoust. Soc. Am 1985, 78(S1):S37-S37.","journal-title":"The J. Acoust. Soc. Am"},{"key":"566_CR24","first-page":"39","volume":"28","author":"P Gramming","year":"1987","unstructured":"Gramming P, Sundberg S, Ternstr\u00f6m S, Perkins W: Relationship between changes in voice pitch and loudness. STL-QPSR 1987, 28: 39-55.","journal-title":"STL-QPSR"},{"issue":"5","key":"566_CR25","doi-asserted-by":"publisher","first-page":"3261","DOI":"10.1121\/1.2990705","volume":"124","author":"Y Lu","year":"2008","unstructured":"Lu Y, Cooke M: Speech production modifications produced by competing talkers, babble, and stationary noise. The J. Acoust. Soc. Am 2008, 124(5):3261-3275. 10.1121\/1.2990705","journal-title":"The J. Acoust. Soc. Am"},{"key":"566_CR26","first-page":"1581","volume-title":"Proc. of ICASSP, Tampa, Florida, 26\u201329 March, Volume 10","author":"D Pisoni","year":"1985","unstructured":"Pisoni D, Bernacki R, Nusbaum H, Yuchtman M: Some acoustic-phonetic correlates of speech produced in noise. In Proc. of ICASSP, Tampa, Florida, 26\u201329 March, Volume 10. Piscataway: IEEE; 1985:1581-1584."},{"key":"566_CR27","volume-title":"The Acoustic Analysis of Speech","author":"RD Kent","year":"1992","unstructured":"Kent RD, Read C, San Diego: The Acoustic Analysis of Speech. Whurr Publishers; 1992."},{"key":"566_CR28","volume-title":"Proc. of ICSLP\u201890, Kobe, Japan, 18\u201322","author":"Z Bond","year":"1990","unstructured":"Bond Z, Moore T: A note on Loud and Lombard speech. Proc. of ICSLP\u201890, Kobe, Japan, 18\u201322 November 1990"},{"key":"566_CR29","volume-title":"Robust speech recognition: analysis and equalization of Lombard effect in Czech Corpora, PhD thesis","author":"H Bo\u0159il","year":"2008","unstructured":"Bo\u0159il H: Robust speech recognition: analysis and equalization of Lombard effect in Czech Corpora, PhD thesis. Czech Republic: Czech Technical University in Prague; 2008. http:\/\/www.utdallas.edu\/~hxb076000"},{"key":"566_CR30","doi-asserted-by":"publisher","first-page":"510","DOI":"10.1121\/1.405631","volume":"93","author":"JC Junqua","year":"1993","unstructured":"Junqua JC: The Lombard reflex and its role on human listeners and automatic speech recognizers. The J. Acoust. Soc. Am 1993, 93: 510-524. 10.1121\/1.405631","journal-title":"The J. Acoust. Soc. Am"},{"issue":"2","key":"566_CR31","doi-asserted-by":"publisher","first-page":"183","DOI":"10.1109\/TASSP.1977.1162929","volume":"25","author":"H Wakita","year":"1977","unstructured":"Wakita H: Normalization of vowels by vocal-tract length and its application to vowel identification. IEEE Trans. Acoust. Speech and Signal Processing 1977, 25(2):183-192. 10.1109\/TASSP.1977.1162929","journal-title":"IEEE Trans. Acoust. Speech and Signal Processing"},{"key":"566_CR32","volume-title":"Discrete-Time Signal Processing","author":"A Oppenheim","year":"1999","unstructured":"Oppenheim A, Schafer R: Discrete-Time Signal Processing. Upper Saddle River, NJ: Prentice Hall; 1999."},{"key":"566_CR33","doi-asserted-by":"publisher","first-page":"415","DOI":"10.1016\/j.cviu.2008.08.002","volume":"113","author":"C Liu","year":"2009","unstructured":"Liu C, Huang Q, Jiang S, Xing L, Ye Q, Gao W: A framework for flexible summarization of racquet sports video using multiple modalities. Comput. Vis. Image Underst 2009, 113: 415-424. 10.1016\/j.cviu.2008.08.002","journal-title":"Comput. Vis. Image Underst"},{"key":"566_CR34","doi-asserted-by":"crossref","unstructured":"Liu H, Zhang Wj, Cai J: A fast block-matching algorithm based on variable shape search. J. Zhejiang University - Science A 7: 2006. [10.1631\/jzus.2006.A0194]","DOI":"10.1631\/jzus.2006.A0194"},{"key":"566_CR35","first-page":"227","volume-title":"Proc. of the 8th ACM Inter. Conf. on Multimedia, Los Angeles, CA, October 30 \u2013 November 03, 200","author":"B Truong","year":"2000","unstructured":"Truong B, Dorai C, Venkatesh S: New enhancements to cut, fade, and dissolve detection processes in video segmentation. In Proc. of the 8th ACM Inter. Conf. on Multimedia, Los Angeles, CA, October 30 \u2013 November 03, 200. New York: ACM; 2000:227-227."},{"key":"566_CR36","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1007\/s11042-007-0145-4","volume":"38","author":"W Chu","year":"2008","unstructured":"Chu W, Wu J: Explicit semantic events detection and development of realistic applications for broadcasting baseball videos. Multimedia Tools and Appl 2008, 38: 27-50. 10.1007\/s11042-007-0145-4","journal-title":"Multimedia Tools and Appl"},{"issue":"3","key":"566_CR37","doi-asserted-by":"publisher","first-page":"252","DOI":"10.1016\/1049-9652(92)90055-3","volume":"54","author":"R Van Den Boomgaard","year":"1992","unstructured":"Van Den Boomgaard R, Van Balen R: Methods for fast morphological image transforms using bitmapped binary images. Graphical Models and Image Process 1992, 54(3):252-258.","journal-title":"Graphical Models and Image Process"}],"container-title":["EURASIP Journal on Advances in Signal Processing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1687-6180-2013-173.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1687-6180-2013-173\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1687-6180-2013-173.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,2]],"date-time":"2021-09-02T02:26:50Z","timestamp":1630549610000},"score":1,"resource":{"primary":{"URL":"https:\/\/asp-eurasipjournals.springeropen.com\/articles\/10.1186\/1687-6180-2013-173"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,11,18]]},"references-count":37,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2013,12]]}},"alternative-id":["566"],"URL":"https:\/\/doi.org\/10.1186\/1687-6180-2013-173","relation":{},"ISSN":["1687-6180"],"issn-type":[{"value":"1687-6180","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,11,18]]},"assertion":[{"value":"8 November 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 October 2013","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 November 2013","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"173"}}