{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T10:46:23Z","timestamp":1769510783620,"version":"3.49.0"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2011,12,1]],"date-time":"2011-12-01T00:00:00Z","timestamp":1322697600000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J AUDIO SPEECH MUSIC PROC."],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>The main objective of the work presented in this paper was to develop a complete system that would accomplish the original visions of the MALACH project. Those goals were to employ automatic speech recognition and information retrieval techniques to provide improved access to the large video archive containing recorded testimonies of the Holocaust survivors. The system has been so far developed for the Czech part of the archive only. It takes advantage of the state-of-the-art speech recognition system tailored to the challenging properties of the recordings in the archive (elderly speakers, spontaneous speech and emotionally loaded content) and its close coupling with the actual search engine. The design of the algorithm adopting the spoken term detection approach is focused on the speed of the retrieval. The resulting system is able to search through the 1,000 h of video constituting the Czech portion of the archive and find query word occurrences in the matter of seconds. The phonetic search implemented alongside the search based on the lexicon words allows to find even the words outside the ASR system lexicon such as names, geographic locations or Jewish slang.<\/jats:p>","DOI":"10.1186\/1687-4722-2011-10","type":"journal-article","created":{"date-parts":[[2011,12,6]],"date-time":"2011-12-06T07:27:57Z","timestamp":1323156477000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":22,"title":["System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive"],"prefix":"10.1186","volume":"2011","author":[{"given":"Josef","family":"Psutka","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jan","family":"\u0160vec","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Josef V","family":"Psutka","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jan","family":"Van\u011bk","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ale\u0161","family":"Pra\u017e\u00e1k","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lubo\u0161","family":"\u0160m\u00eddl","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pavel","family":"Ircing","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2011,12,5]]},"reference":[{"key":"35_CR1","unstructured":"USC Shoah Foundation Institute2011. [http:\/\/college.usc.edu\/vhi\/]"},{"key":"35_CR2","unstructured":"MALACH Multilingual Access to Large Spoken Archives2007. [http:\/\/malach.umiacs.umd.edu\/]"},{"issue":"4","key":"35_CR3","doi-asserted-by":"publisher","first-page":"420","DOI":"10.1109\/TSA.2004.828702","volume":"12","author":"W Byrne","year":"2004","unstructured":"Byrne W, Doermann D, Franz M, Gustman S, Haji\u010d J, Oard D, Picheny M, Psutka J, Ramabhadran B, Soergel D, Ward T, Zhu WJ: Automatic recognition of spontaneous speech for access to multilingual oral history archives. IEEE Trans Speech Audio Process 2004,12(4):420-435. 10.1109\/TSA.2004.828702","journal-title":"IEEE Trans Speech Audio Process"},{"key":"35_CR4","doi-asserted-by":"publisher","first-page":"712","DOI":"10.1007\/978-3-540-85760-0_90","volume-title":"Advances in Multilingual and Multimodal Information Retrieval, Lecture Notes in Computer Science","author":"P Ircing","year":"2008","unstructured":"Ircing P, Psutka J, Vavru\u0161ka J: What can and cannot be found in Czech spontaneous speech using document-oriented IR methods--UWB at CLEF 2007 CL-SR track. In Advances in Multilingual and Multimodal Information Retrieval, Lecture Notes in Computer Science. Volume 5152. Edited by: Peters C, Jijkoun V, Mandl T, M\u00fcller H, Oard D, Pe\u0144as A, Petras V, Santos D. Springer, Berlin; 2008:712-718. 10.1007\/978-3-540-85760-0_90"},{"issue":"4","key":"35_CR5","doi-asserted-by":"publisher","first-page":"840","DOI":"10.1109\/TASL.2009.2014217","volume":"17","author":"P Ircing","year":"2009","unstructured":"Ircing P, Psutka JV, Psutka J: Using Morphological Information for Robust Language Modeling in Czech ASR System. IEEE Trans Audio Speech Lang Process 2009,17(4):840-847.","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"35_CR6","volume-title":"P\u0159\u00edru\u010dn\u00ed mluvnice \u010de\u0161tiny","author":"M Grepl","year":"1996","unstructured":"Grepl M, Hladk\u00e1 Z, Jel\u00ednek M, Karl\u00edk P, Kr\u010dmov\u00e1 M, Nekula M, Rus\u00ednov\u00e1 Z, \u0160losar D: P\u0159\u00edru\u010dn\u00ed mluvnice \u010de\u0161tiny. NLN, Praha; 1996."},{"key":"35_CR7","first-page":"587","volume-title":"Proceedings of ICSP 2008","author":"A Pra\u017e\u00e1k","year":"2008","unstructured":"Pra\u017e\u00e1k A, Ircing P, \u0160vec J, Psutka J: Efficient combination of N-gram language models and recognition grammars in real-time LVCSR decoder. In Proceedings of ICSP 2008. China; 2008:587-591."},{"key":"35_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/1322391.1322394","volume":"5","author":"M Creutz","year":"2007","unstructured":"Creutz M, Hirsim\u00e4ki T, Kurimo M, Puurula A, Pylkk\u00f6nen J, Siivola V, Varjokallio M, Arisoy E, Sara\u00e7lar M, Stolcke A: Morph-based speech recognition and modeling of out-of-vocabulary words across languages. ACM Trans Speech Lang Process 2007, 5: 1-29.","journal-title":"ACM Trans Speech Lang Process"},{"key":"35_CR9","first-page":"139","volume-title":"Text, Speech and Dialogue, Lecture Notes in Computer Science","author":"W Byrne","year":"2000","unstructured":"Byrne W, Haji\u010d J, Ircing P, Krbec P, Psutka J: Morpheme based language models for speech recognition of Czech. In Text, Speech and Dialogue, Lecture Notes in Computer Science. Volume 1902. Edited by: Sojka P, Kopecek I, Pala K. Springer, Berlin; 2000:139-162. 10.1007\/3-540-45323-7_24"},{"key":"35_CR10","volume-title":"Variation in Language: Code Switching in Czech as a Challenge for Sociolinguistics","year":"1992","unstructured":"Sgall P, Hronek J, Stich A, Horeck\u00fd J, (eds.): Variation in Language: Code Switching in Czech as a Challenge for Sociolinguistics. John Benjamins, Amsterdam; 1992."},{"key":"35_CR11","first-page":"749","volume-title":"Proceedings of ICASSP 2004","author":"J Psutka","year":"2004","unstructured":"Psutka J, Haji\u010d J, Byrne W: The development of ASR for Slavic languages in the MALACH project. In Proceedings of ICASSP 2004. Montreal, Canada; 2004:749-752."},{"issue":"4","key":"35_CR12","doi-asserted-by":"publisher","first-page":"1738","DOI":"10.1121\/1.399423","volume":"87","author":"H Hermansky","year":"1990","unstructured":"Hermansky H: Perceptual linear predictive (PLP) analysis of speech. J Acoust Soc Am 1990,87(4):1738-1752. 10.1121\/1.399423","journal-title":"J Acoust Soc Am"},{"key":"35_CR13","volume-title":"The HTK Book","author":"S Young","year":"2000","unstructured":"Young S, Kershaw D, Odell J, Ollason D, Valtchev V, Woodland P:The HTK Book. Entropic, Cambridge; 2000. [http:\/\/htk.eng.cam.ac.uk\/]"},{"key":"35_CR14","volume-title":"Discriminative Training for Large Vocabulary Speech Recognition","author":"D Povey","year":"2003","unstructured":"Povey D: Discriminative Training for Large Vocabulary Speech Recognition. University of Cambridge, Cambridge; 2003. PhD thesis"},{"key":"35_CR15","doi-asserted-by":"publisher","first-page":"331","DOI":"10.1007\/978-3-642-04208-9_46","volume-title":"Text, Speech and Dialogue, Lecture Notes in Computer Science","author":"J Van\u011bk","year":"2009","unstructured":"Van\u011bk J, Psutka J, Zelinka J, Pra\u017e\u00e1k A, Psutka J: Discriminative training of gender-dependent acoustic models. In Text, Speech and Dialogue, Lecture Notes in Computer Science. Volume 5729. Edited by: Ma-tou\u0161ek V, Mautner P. Springer, Berlin; 2009:331-338. 10.1007\/978-3-642-04208-9_46"},{"key":"35_CR16","first-page":"1821","volume-title":"Proceedings of Eurospeech 2003","author":"J Psutka","year":"2003","unstructured":"Psutka J, Ircing P, Psutka JV, Radov\u00e1 V, Byrne W, Haji\u010d J, M\u00edrovsk\u00fd J, Gustman S: Large vocabulary ASR for spontaneous Czech in the MALACH project. In Proceedings of Eurospeech 2003. Geneva, Switzerland; 2003:1821-1824."},{"key":"35_CR17","first-page":"607","volume-title":"Proceedings of LREC 2004","author":"J Psutka","year":"2004","unstructured":"Psutka J, Ircing P, Haji\u010d J, Radov\u00e1 V, Psutka JV, Byrne W, Gustman S: Issues in annotation of the Czech spontaneous speech corpus in the MALACH project. In Proceedings of LREC 2004. Lisbon, Portugal; 2004:607-610."},{"issue":"8","key":"35_CR18","doi-asserted-by":"publisher","first-page":"221","DOI":"10.1109\/97.611282","volume":"4","author":"R Iyer","year":"1997","unstructured":"Iyer R, Ostendorf M, Gish H: Using out-of-domain data to improve in-domain language models. IEEE Signal Process Lett 1997,4(8):221-223. 10.1109\/97.611282","journal-title":"IEEE Signal Process Lett"},{"key":"35_CR19","first-page":"901","volume-title":"Proceedings of ICSLP 2002","author":"A Stolcke","year":"2002","unstructured":"Stolcke A: SRILM--an extensible language modeling toolkit. In Proceedings of ICSLP 2002. Denver, USA; 2002:901-904."},{"key":"35_CR20","volume-title":"An Empirical Study of Smoothing Techniques for Language Modeling. Technical Report TR-10-98","author":"SF Chen","year":"1998","unstructured":"Chen SF, Goodman J: An Empirical Study of Smoothing Techniques for Language Modeling. Technical Report TR-10-98. Computer Science Group, Harvard University, Cambridge, MA; 1998."},{"key":"35_CR21","first-page":"139","volume-title":"Proceedings of SIGMAP 2007","author":"A Pra\u017e\u00e1k","year":"2007","unstructured":"Pra\u017e\u00e1k A, M\u00fcller L, Psutka JV, Psutka J: LIVE TV SUBTITLING--fast 2-pass LVCSR system for online subtitling. In Proceedings of SIGMAP 2007. Barcelona, Spain; 2007:139-142."},{"key":"35_CR22","doi-asserted-by":"publisher","first-page":"274","DOI":"10.1007\/978-3-642-04208-9_39","volume-title":"Text, Speech and Dialogue, Lecture Notes in Computer Science","author":"Z Zaj\u00edc","year":"2009","unstructured":"Zaj\u00edc Z, Machlica L, M\u00fcller L: Refinement approach for adaptation based on combination of MAP and fMLLR. In Text, Speech and Dialogue, Lecture Notes in Computer Science. Volume 5729. Edited by: Matou\u0161ek V, Mautner P. Springer, Heidelberg; 2009:274-281. 10.1007\/978-3-642-04208-9_39"},{"key":"35_CR23","first-page":"323","volume-title":"Proceedings of DHMS 2008","author":"P Ircing","year":"2008","unstructured":"Ircing P, Psutka JV, Psutka J, Pra\u017e\u00e1k A, Tychtl Z: Automatic speech recognition and information retrieval techniques for facilitating access to video archives of cultural heritage. In Proceedings of DHMS 2008. Athens, Greece; 2008:323-328."},{"key":"35_CR24","doi-asserted-by":"publisher","first-page":"759","DOI":"10.1007\/978-3-540-74999-8_95","volume-title":"Evaluation of Multilingual and Multi-modal Information Retrieval, Lecture Notes in Computer Science","author":"P Ircing","year":"2007","unstructured":"Ircing P, M\u00fcller L: Benefit of proper language processing for Czech speech retrieval in the CL-SR task at CLEF 2006. In Evaluation of Multilingual and Multi-modal Information Retrieval, Lecture Notes in Computer Science. Volume 4730. Edited by: Peters C, Clough P, Gey F, Karlgren J, Magnini B, Oard D, de Rijke M, Stempfhuber M. Springer, Berlin; 2007:759-765. 10.1007\/978-3-540-74999-8_95"},{"key":"35_CR25","doi-asserted-by":"publisher","first-page":"674","DOI":"10.1007\/978-3-540-85760-0_86","volume-title":"Advances in Multilingual and Multimodal Information Retrieval, Lecture Notes in Computer Science","author":"P Pecina","year":"2008","unstructured":"Pecina P, Hoffmannov\u00e1 P, Jones G, Zhang Y, Oard D: Overview of the CLEF-2007 cross-language speech retrieval track. In Advances in Multilingual and Multimodal Information Retrieval, Lecture Notes in Computer Science. Volume 5152. Edited by: Peters C, Jijkoun V, Mandl T, M\u00fcller H, Oard D, Pe\u0144as A, Petras V, Santos D. Springer Berlin; 2008:674-686. 10.1007\/978-3-540-85760-0_86"},{"key":"35_CR26","unstructured":"NIST Spoken Term Detection Portal2006. [http:\/\/www.itl.nist.gov\/iad\/mig\/\/tests\/std\/]"},{"key":"35_CR27","doi-asserted-by":"publisher","first-page":"132","DOI":"10.1007\/11551874_17","volume-title":"Text, Speech and Dialogue, Lecture Notes in Computer Science","author":"J Kanis","year":"2005","unstructured":"Kanis J, M\u00fcller L: Automatic lemmatizer construction with focus on OOV words lemmatization. In Text, Speech and Dialogue, Lecture Notes in Computer Science. Volume 3658. Edited by: Matou\u0161ek V, Mautner P, Pavelka T. Springer, Berlin; 2005:132-139. 10.1007\/11551874_17"},{"key":"35_CR28","first-page":"18","volume-title":"Proceedings of JCDL 2002","author":"S Gustman","year":"2002","unstructured":"Gustman S, Soergel D, Oard D, Byrne W, Picheny M, Ramabhadran B, Greenberg D: Supporting access to large digital oral history archives. In Proceedings of JCDL 2002. Portland, USA; 2002:18-27."}],"container-title":["EURASIP Journal on Audio, Speech, and Music Processing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1687-4722-2011-10.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1687-4722-2011-10\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1687-4722-2011-10.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T17:56:21Z","timestamp":1630518981000},"score":1,"resource":{"primary":{"URL":"https:\/\/asmp-eurasipjournals.springeropen.com\/articles\/10.1186\/1687-4722-2011-10"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,12]]},"references-count":28,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["35"],"URL":"https:\/\/doi.org\/10.1186\/1687-4722-2011-10","relation":{},"ISSN":["1687-4722"],"issn-type":[{"value":"1687-4722","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,12]]},"assertion":[{"value":"26 July 2011","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 December 2011","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 December 2011","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"10"}}