{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,10]],"date-time":"2026-01-10T20:12:59Z","timestamp":1768075979081,"version":"3.49.0"},"reference-count":64,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,4,7]],"date-time":"2020-04-07T00:00:00Z","timestamp":1586217600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,4,7]],"date-time":"2020-04-07T00:00:00Z","timestamp":1586217600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J AUDIO SPEECH MUSIC PROC."],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In order to improve the performance of hand-crafted features to detect playback speech, two discriminative features, constant-Q variance-based octave coefficients and constant-Q mean-based octave coefficients, are proposed for playback speech detection in this work. They rely on our findings that variance-based modified log magnitude spectrum and mean-based modified log magnitude spectrum can enhance the discriminative power between genuine speech and playback speech. Then constant-Q variance-based octave coefficients (constant-Q mean-based octave coefficients) can be obtained by combining variance-based modified log magnitude spectrum (mean-based modified log magnitude spectrum), octave segmentation, and discrete cosine transform. Finally, constant-Q variance-based octave coefficients and constant-Q mean-based octave coefficients are evaluated on ASVspoof 2017 corpus version 2.0 and ASVspoof 2019 physical access, respectively. Experimental results show that variance-based modified log magnitude spectrum and mean-based modified log magnitude spectrum can produce discriminative features toward playback speech. Further results on the two databases show that constant-Q variance-based octave coefficients and constant-Q mean-based octave coefficients can perform better than some common features, such as mel frequency cepstral coefficients and constant-Q cepstral coefficients.<\/jats:p>","DOI":"10.1186\/s13636-020-00173-5","type":"journal-article","created":{"date-parts":[[2020,4,7]],"date-time":"2020-04-07T10:02:35Z","timestamp":1586253755000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Discriminative features based on modified log magnitude spectrum for playback speech detection"],"prefix":"10.1186","volume":"2020","author":[{"given":"Jichen","family":"Yang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Longting","family":"Xu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bo","family":"Ren","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yunyun","family":"Ji","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,4,7]]},"reference":[{"key":"173_CR1","doi-asserted-by":"publisher","unstructured":"T. Kinnunen, M. Sahidullah, M. Falcone, L. Costantini, R. G. Hautamaki, D. Thomsen, S. Achintya, Z. -H. Tan, H. Delgado, M. Todisco, N. Evans, V. Hautamaki, K. A. Lee, in IEEE International Conference on Acoustics, Speech, and Signal Processing. RedDots replayed: a new replay spoofing attack corpus for text-dependent speaker verification research, (2017), pp. 5395\u20135399. https:\/\/doi.org\/10.1109\/icassp.2017.7953187.","DOI":"10.1109\/icassp.2017.7953187"},{"key":"173_CR2","doi-asserted-by":"publisher","unstructured":"T. Kinnunen, M. Sahidullah, H. Delgado, M. Todisco, N. Evans, J. Yamagishi, K. A. Lee, in Annual Conference of the International Speech Communication Association. The ASVspoof 2017 challenge: assessing the limits of replay spoofing attack detection, (2017), pp. 2\u20136. https:\/\/doi.org\/10.21437\/interspeech.2017-1111.","DOI":"10.21437\/interspeech.2017-1111"},{"key":"173_CR3","doi-asserted-by":"publisher","unstructured":"K. N. R. K. R. Alluri, S. Achanta, S. R. Kadiri, S. V. Gangasheetty, A. K. Vuppala, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). Sff anti-spoofer: IIIT-H submission for automatic speaker verificatio spoofing and countermeasures channlenge 2017, (2017), pp. 107\u2013111. https:\/\/doi.org\/10.21437\/interspeech.2017-676.","DOI":"10.21437\/interspeech.2017-676"},{"key":"173_CR4","doi-asserted-by":"publisher","unstructured":"Z. Chen, Z. Xie, W. Zhang, X. Xu, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). ResNet and model fusion for automatic spoofing detection, (2017), pp. 102\u2013106. https:\/\/doi.org\/10.21437\/interspeech.2017-1085.","DOI":"10.21437\/interspeech.2017-1085"},{"key":"173_CR5","doi-asserted-by":"publisher","unstructured":"S. Jelil, R. K. Das, S. R. M. Prasanna, R. Sinha, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). Spoof detection using source, instantaneous frequecny and cepstral features, (2017), pp. 22\u201326. https:\/\/doi.org\/10.21437\/interspeech.2017-930.","DOI":"10.21437\/interspeech.2017-930"},{"key":"173_CR6","doi-asserted-by":"publisher","unstructured":"H. A. Patil, M. R. Kamble, T. B. Patel, M. Soni, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). Novel variable length teager engery separation based on instantaneous frequency features for replay detection, (2017), pp. 12\u201316. https:\/\/doi.org\/10.21437\/interspeech.2017-1362.","DOI":"10.21437\/interspeech.2017-1362"},{"key":"173_CR7","doi-asserted-by":"publisher","unstructured":"A. G. Alan\u00c3s, A. M. Peinado, J. A. Gonzalez, A. Gomez, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). A deep identity representation for noise robust spoofing detection, (2018), pp. 676\u2013680. https:\/\/doi.org\/10.21437\/interspeech.2018-1909.","DOI":"10.21437\/interspeech.2018-1909"},{"key":"173_CR8","doi-asserted-by":"publisher","unstructured":"G. Suthokumar, V. Sethu, C. Wijenayake, E. Ambikairajah, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Modulation dynamic features for the detection of replay attacks, (2018), pp. 691\u2013695. https:\/\/doi.org\/10.21437\/interspeech.2018-1846.","DOI":"10.21437\/interspeech.2018-1846"},{"key":"173_CR9","doi-asserted-by":"publisher","unstructured":"B. Wickramasinghe, S. Irtza, E. Ambikairajah, J. Epps, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Frequency domain linear prediction features for replay spoofing attack detection, (2018), pp. 661\u2013665. https:\/\/doi.org\/10.21437\/interspeech.2018-1574.","DOI":"10.21437\/interspeech.2018-1574"},{"key":"173_CR10","doi-asserted-by":"publisher","unstructured":"M. S. Saranya, H. Murthy, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Decision-level feature switching as a paradigm for replay attack detection, (2018), pp. 686\u2013690. https:\/\/doi.org\/10.21437\/interspeech.2018-1494.","DOI":"10.21437\/interspeech.2018-1494"},{"key":"173_CR11","doi-asserted-by":"publisher","unstructured":"L. Li, Y. Chen, D. Wang, T. F. Zheng, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). A study on replay attack and anti-spoofing for automatic speaker verification, (2017), pp. 92\u201396. https:\/\/doi.org\/10.21437\/interspeech.2017-456.","DOI":"10.21437\/interspeech.2017-456"},{"key":"173_CR12","doi-asserted-by":"publisher","unstructured":"R. Font, J. M. Esp\u00c3n, M. J. Cano, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). Experimental analysis of features for replay attack detection-results on the asvspoof 2017 challenge, (2017), pp. 7\u201311. https:\/\/doi.org\/10.21437\/interspeech.2017-450.","DOI":"10.21437\/interspeech.2017-450"},{"key":"173_CR13","doi-asserted-by":"publisher","unstructured":"M. Withowski, S. Kacprasko, P. Zelasko, K. Kowlczyk, J. Galka, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). Audio replay attack detection using high-frequency features, (2017), pp. 27\u201331. https:\/\/doi.org\/10.21437\/interspeech.2017-776.","DOI":"10.21437\/interspeech.2017-776"},{"key":"173_CR14","doi-asserted-by":"publisher","unstructured":"G. Lavrentyeva, S. Novoselov, E. Malykh, A. Kozlov, O. Kudasher, V. Shchemelinin, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). Audio replay attack detection with deep learning framework, (2017), pp. 82\u201386. https:\/\/doi.org\/10.21437\/interspeech.2017-360.","DOI":"10.21437\/interspeech.2017-360"},{"key":"173_CR15","doi-asserted-by":"publisher","unstructured":"D. Li, L. Wang, J. Dang, M. Liu, Z. Oo, S. Nakagawa, H. Guan, X. Li, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Multiple phase information combination for replay attacks detection, (2018), pp. 656\u2013660. https:\/\/doi.org\/10.21437\/interspeech.2018-2001.","DOI":"10.21437\/interspeech.2018-2001"},{"key":"173_CR16","doi-asserted-by":"publisher","unstructured":"T. Gunendradasan, B. Wickramasinghe, N. P. Le, E. Ambikairajah, J. Epps, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Detection of replay-spoofing attacks using frequency modulation features, (2018), pp. 636\u2013640. https:\/\/doi.org\/10.21437\/interspeech.2018-1473.","DOI":"10.21437\/interspeech.2018-1473"},{"key":"173_CR17","doi-asserted-by":"publisher","unstructured":"H. B. Sailor, M. R. Kamble, H. A. Patil, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Auditory filterbank learning for temporal modulation features in replay spoof speech detection, (2018), pp. 666\u2013670. https:\/\/doi.org\/10.21437\/interspeech.2018-1651.","DOI":"10.21437\/interspeech.2018-1651"},{"key":"173_CR18","doi-asserted-by":"publisher","unstructured":"M. Kamble, H. Tak, H. A. Patil, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Effectiveness of speech demodulation-based features for replay detection, (2018), pp. 641\u2013645. https:\/\/doi.org\/10.21437\/interspeech.2018-1675.","DOI":"10.21437\/interspeech.2018-1675"},{"key":"173_CR19","doi-asserted-by":"publisher","unstructured":"W. Cai, H. Wu, D. Cai, M. Li, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). The DKU replay detection system for the ASVspoof 2019 challenge on data augmentation, feature representation, clssifier and fusion (Graz, Austria, 2019), pp. 1023\u20131027. https:\/\/doi.org\/10.21437\/interspeech.2019-1230.","DOI":"10.21437\/interspeech.2019-1230"},{"key":"173_CR20","doi-asserted-by":"publisher","unstructured":"R. Bialobrzeski, M. Kosmider, M. Matuszewski, M. Plata, A. Rakowski, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). Robust Bayesian and light neural networks for voice spoofing detection (Graz, Austria, 2019), pp. 1028\u20131032. https:\/\/doi.org\/10.21437\/interspeech.2019-2676.","DOI":"10.21437\/interspeech.2019-2676"},{"key":"173_CR21","doi-asserted-by":"publisher","unstructured":"G. Lavrentyva, S. Novoselov, A. Tseren, M. Volkova, A. Gorlanov, A. Kozlos, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). STC antispoofing systems for the ASVspoof2019 challenge (Graz, Austria, 2019), pp. 1033\u20131037. https:\/\/doi.org\/10.21437\/interspeech.2019-1768.","DOI":"10.21437\/interspeech.2019-1768"},{"key":"173_CR22","doi-asserted-by":"publisher","unstructured":"Y. Yang, H. Wang, H. Dinkel, Z. Chen, S. Wang, Y. Qian, K. Yu, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). The SJTU robust anti-spoofing system for the ASVspoof 2019 challenge (Graz, Austria, 2019), pp. 1038\u20131042. https:\/\/doi.org\/10.21437\/interspeech.2019-2170.","DOI":"10.21437\/interspeech.2019-2170"},{"key":"173_CR23","doi-asserted-by":"publisher","unstructured":"J. -w. Jung, H. -j. Shim, H. -S. Heo, H. -J. Yu, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). Replay attack detection with complementary high-resolution information using end-to-end for the ASVspoof 2019 challenge (Graz, Austria, 2019), pp. 1083\u20131087. https:\/\/doi.org\/10.21437\/interspeech.2019-1991.","DOI":"10.21437\/interspeech.2019-1991"},{"key":"173_CR24","doi-asserted-by":"publisher","unstructured":"W. Shang, M. Stevenson, in Proceddings of Canadian Conference onElectrical and Computer Engineering. A preliminary study of factors affecting the performance of a playback attack detector, (2008), pp. 459\u2013464. https:\/\/doi.org\/10.1109\/ccece.2008.4564576.","DOI":"10.1109\/ccece.2008.4564576"},{"key":"173_CR25","unstructured":"Z. Wang, Q. He, X. Zhang, H. Luo, Z. Su, in Journal of South China University of Technology (Natural Science Edition). Playback attack detection based on channel pattern noise, (2011), pp. 1708\u20131713."},{"key":"173_CR26","doi-asserted-by":"publisher","unstructured":"S. K. Ergunay, E. Khoury, A. Lazaridis, S. Marcel, in Proceedings of the Seventh IEEE International Conference on Biometric Theory, Application Ans Systems. On the vulnerability of speaker verification to realistic voice spoofing, (2015), pp. 1\u20136. https:\/\/doi.org\/10.1109\/btas.2015.7358783.","DOI":"10.1109\/btas.2015.7358783"},{"key":"173_CR27","doi-asserted-by":"publisher","unstructured":"W. Shang, M. Stevenson, in IEEE International Conference on Acoustic, Speech and Signal Processing. Score normalization in playback attack detection, (2010), pp. 1678\u20131681. https:\/\/doi.org\/10.1109\/icassp.2010.5495503.","DOI":"10.1109\/icassp.2010.5495503"},{"key":"173_CR28","doi-asserted-by":"publisher","unstructured":"Z. Wang, G. Wei, Q. He, in Proceedings of the 2011 International Conference on Machine Learning and Cybernetics, vol. 39. Channel pattern noise based on playback attack detection algorithm for speaker recognition, (2011), pp. 5\u201312. https:\/\/doi.org\/10.1109\/icmlc.2011.6016982.","DOI":"10.1109\/icmlc.2011.6016982"},{"key":"173_CR29","doi-asserted-by":"publisher","unstructured":"C. Wang, Y. Zou, S. Liu, W. Zheng, in IEEE Second International Conference on Multimedia Big Data. An efficient learning based smartphone playback attack detection using GMM supervector, (2016), pp. 385\u2013389. https:\/\/doi.org\/10.1109\/bigmm.2016.14.","DOI":"10.1109\/bigmm.2016.14"},{"key":"173_CR30","doi-asserted-by":"publisher","unstructured":"H. Delgado, M. Todisco, M. Sahidullah, N. Evans, T. Kinnunen, K. Lee, J. Yamagishi, in Speaker and Language Recognition Workshop. ASVspoof 2017 version 2.0: meta-data analysis and baseline enhancements, (2018), pp. 296\u2013303. https:\/\/doi.org\/10.21437\/odyssey.2018-42.","DOI":"10.21437\/odyssey.2018-42"},{"key":"173_CR31","doi-asserted-by":"publisher","unstructured":"K. Sriskandaraja, V. Sethu, E. Ambikairajah, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Deep siamese architecture based replay detection for secure voice biometric, (2018), pp. 671\u2013675. https:\/\/doi.org\/10.21437\/interspeech.2018-1819.","DOI":"10.21437\/interspeech.2018-1819"},{"key":"173_CR32","doi-asserted-by":"publisher","unstructured":"F. Tom, M. Jain, P. Dey, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). End-to-end audio replay attack detection using deep convolutional networks with attention, (2018), pp. 681\u2013685. https:\/\/doi.org\/10.21437\/interspeech.2018-2279.","DOI":"10.21437\/interspeech.2018-2279"},{"key":"173_CR33","doi-asserted-by":"publisher","unstructured":"X. Cheng, M. Xu, T. F. Zheng, in Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). Replay detection using cqt-based modified group (Lanzhou, China, 2019), pp. 540\u2013545. https:\/\/doi.org\/10.1109\/apsipaasc47483.2019.9023158.","DOI":"10.1109\/apsipaasc47483.2019.9023158"},{"key":"173_CR34","doi-asserted-by":"publisher","unstructured":"M. Todisco, H. Delgado, N. Evans, in Speaker and Language Recognition Workshop. A new feature for automatic speaker verification anti-spoofing: constant Q cepstral coefficients, (2016). https:\/\/doi.org\/10.21437\/odyssey.2016-41.","DOI":"10.21437\/odyssey.2016-41"},{"key":"173_CR35","doi-asserted-by":"publisher","first-page":"516","DOI":"10.1016\/j.csl.2017.01.001","volume":"45","author":"M. Todisco","year":"2017","unstructured":"M. Todisco, H. Delgado, N. Evans, Constant Q cepstral coefficients: a spoofing countermeasure for automatic Speaker verification. Comput. Speech. Lang.45:, 516\u2013535 (2017).","journal-title":"Comput. Speech. Lang."},{"key":"173_CR36","doi-asserted-by":"publisher","unstructured":"Z. Ji, Z. -Y. Li, P. Li, M. An, S. Gao, D. Wu, F. Zhao, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). Ensemble learning for countermeasure of audio repaly spoofing attack in asvspoof2017, (2017), pp. 87\u201391. https:\/\/doi.org\/10.21437\/interspeech.2017-1246.","DOI":"10.21437\/interspeech.2017-1246"},{"key":"173_CR37","doi-asserted-by":"publisher","unstructured":"X. Wang, Y. Xiao, X. Zhu, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). Feature selection based on CQCCs for automatic speaker verification spoofing, (2017), pp. 32\u201336. https:\/\/doi.org\/10.21437\/interspeech.2017-304.","DOI":"10.21437\/interspeech.2017-304"},{"key":"173_CR38","doi-asserted-by":"publisher","unstructured":"J. Yang, R. K. Das, H. Li, in Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). Extended constant-Q cepstral coefficients for detection of spoofing attacks, (2018), pp. 1024\u20131029. https:\/\/doi.org\/10.23919\/apsipa.2018.8659537.","DOI":"10.23919\/apsipa.2018.8659537"},{"key":"173_CR39","doi-asserted-by":"publisher","unstructured":"R. K. Das, J. Yang, H. Li, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). Long range acoustic features for spoofed speech detection (Graz, Austria, 2019), pp. 1058\u20131062. https:\/\/doi.org\/10.21437\/interspeech.2019-1887.","DOI":"10.21437\/interspeech.2019-1887"},{"issue":"14","key":"173_CR40","doi-asserted-by":"publisher","first-page":"901","DOI":"10.1049\/el.2018.0739","volume":"54","author":"J. Yang","year":"2018","unstructured":"J. Yang, L. Liu, Playback speech detection based on magnitude-phase spectrum. Electron. Lett.54(14), 901\u2013903 (2018).","journal-title":"Electron. Lett."},{"key":"173_CR41","doi-asserted-by":"publisher","unstructured":"J. Yang, C. You, Q. He, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Feature with complementarity of statistics and principal information for spoofing detection, (2018), pp. 651\u2013655. https:\/\/doi.org\/10.21437\/interspeech.2018-1693.","DOI":"10.21437\/interspeech.2018-1693"},{"key":"173_CR42","doi-asserted-by":"publisher","unstructured":"J. Alam, G. Bhattacharya, P. Kenny, in In Speaker and Language Recognition Workshop (ODYESSEY). Boosting the performance of spoofing detection systems of replay attacks using Q-logarithm domain feature normalization, (2018), pp. 393\u2013398. https:\/\/doi.org\/10.21437\/odyssey.2018-55.","DOI":"10.21437\/odyssey.2018-55"},{"key":"173_CR43","doi-asserted-by":"publisher","unstructured":"M. Kamble, H. A. Patil, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Novel variable length energy separation algorithm using instantaneous amplitude features for replay detection, (2018), pp. 646\u2013650. https:\/\/doi.org\/10.21437\/interspeech.2018-1687.","DOI":"10.21437\/interspeech.2018-1687"},{"key":"173_CR44","doi-asserted-by":"publisher","unstructured":"S. Jelil, R. K. Das, S. R. M. Prasanna, R. Sinha, in 18th Annual Conference of the International Speech Communication Association (INTERSPEECH). Spoof detection using source, instantaneous frequency and cepstral features, (2017), pp. 22\u201326. https:\/\/doi.org\/10.21437\/interspeech.2017-930.","DOI":"10.21437\/interspeech.2017-930"},{"key":"173_CR45","doi-asserted-by":"publisher","unstructured":"R. K. Das, H. Li, in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). Instantaneous phase and excitation source features for detection of replay attacks (Honolulu, Hawaii, 2018), pp. 1030\u20131037. https:\/\/doi.org\/10.23919\/apsipa.2018.8659789.","DOI":"10.23919\/apsipa.2018.8659789"},{"key":"173_CR46","doi-asserted-by":"publisher","unstructured":"K. N. R. K. R. Alluri, A. K. Vupala, Replay spoofing countermeasures using high spectro-temporal resolutional features. Int. J. Speech Technol., 1\u201311 (2019). https:\/\/doi.org\/10.1007\/s10772-019-09602-z.","DOI":"10.1007\/s10772-019-09602-z"},{"key":"173_CR47","doi-asserted-by":"publisher","unstructured":"K. N. R. K. R. Alluri, A. K. Vupala, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). IIIT-H spoofing countermeasures for automatic speaker verification spoofing and countermesures challenge 2019 (Graz, Austria, 2019), pp. 1043\u20131047. https:\/\/doi.org\/10.21437\/interspeech.2019-1623.","DOI":"10.21437\/interspeech.2019-1623"},{"key":"173_CR48","unstructured":"M. Todisco, X. Wang, V. Vestman, M. Sahidullah, H. Delgado, A. Nautsch, J. Yamagishi, N. Evas, T. Kinnunen, K. A. Lee, in 20th Annual Conference of the International Speech Communication Association. ASVspoof 2019: Future horizons in spoofed and fake audio detection, (2019), pp. 1008\u20131012."},{"issue":"6B","key":"173_CR49","doi-asserted-by":"publisher","first-page":"2044","DOI":"10.1121\/1.1913065","volume":"51","author":"J. J. Wolf","year":"1972","unstructured":"J. J. Wolf, Efficient acoustic parameter for speaker recognition. J. Acoust. Soc. Am.51(6B), 2044\u20132056 (1972).","journal-title":"J. Acoust. Soc. Am."},{"key":"173_CR50","doi-asserted-by":"publisher","first-page":"605","DOI":"10.1109\/JSTSP.2017.2684705","volume":"11","author":"D. Paul","year":"2017","unstructured":"D. Paul, M. Pal, G. Saha, Spectral features for synthetic speech detection. IEEE J. Sel. Top. Signal Process.11:, 605\u2013617 (2017).","journal-title":"IEEE J. Sel. Top. Signal Process."},{"key":"173_CR51","doi-asserted-by":"publisher","unstructured":"J. Youngberg, S. Boll, in IEEE International Conference on Acoustics, Speech, and Signal Processing. Constant-q signal analysis and synthesis, (1978), pp. 375\u2013378. https:\/\/doi.org\/10.1109\/icassp.1978.1170547.","DOI":"10.1109\/icassp.1978.1170547"},{"issue":"1","key":"173_CR52","doi-asserted-by":"publisher","first-page":"425","DOI":"10.1121\/1.400476","volume":"89","author":"J. C. Brown","year":"1991","unstructured":"J. C. Brown, Calculation of a constant Q spectral transform. J. Acoust. Soc. Am.89(1), 425\u2013434 (1991).","journal-title":"J. Acoust. Soc. Am."},{"key":"173_CR53","doi-asserted-by":"publisher","unstructured":"J. Yang, C. You, Q. He, in 19th Annual Conference of the International Speech Communication Association (INTERSPEECH). Feature with complementarity of statistics and principal information for spoofing detection, (2018), pp. 651\u2013655. https:\/\/doi.org\/10.21437\/interspeech.2018-1693.","DOI":"10.21437\/interspeech.2018-1693"},{"key":"173_CR54","doi-asserted-by":"publisher","unstructured":"F. Seide, G. Li, X. Chen, D. Yu, in IEEE Workshop on Automatic Speech Recognition and Understanding. Feature engineering in context-dependent deep neural networks for conversational speech transcription, (2011), pp. 24\u201329. https:\/\/doi.org\/10.1109\/asru.2011.6163899.","DOI":"10.1109\/asru.2011.6163899"},{"key":"173_CR55","doi-asserted-by":"publisher","unstructured":"F. Seide, A. Agarwal, in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. CNTK:Microsoft\u2019s open-source deep learning toolkit, (2016), pp. 2135\u20132135. https:\/\/doi.org\/10.1145\/2939672.2945397.","DOI":"10.1145\/2939672.2945397"},{"key":"173_CR56","doi-asserted-by":"publisher","unstructured":", in The Speaker and Language Recognition Workshop. t-DCF: a detection cost function for tandem assessment of spoofing countermeasures and automatic speaker verification, (2018), pp. 312\u2013319. https:\/\/doi.org\/10.21437\/odyssey.2018-44.","DOI":"10.21437\/odyssey.2018-44"},{"key":"173_CR57","doi-asserted-by":"publisher","unstructured":"C. -I. Lai, N. Chen, J. Villaba, N. Dehak, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). ASSERT:Anti-spoofing with squeeze-excitation and residual networks (Graz, Austria, 2019), pp. 1013\u20131017. https:\/\/doi.org\/10.21437\/interspeech.2019-1794.","DOI":"10.21437\/interspeech.2019-1794"},{"key":"173_CR58","doi-asserted-by":"publisher","unstructured":"B. Chettri, D. Stoller, V. Morfi, M. A. M. Ramirez, E. Benetos, B. L. Sturm, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). Ensemble models for spoofing detection in autoamtic speaker verification (Graz, Austria, 2019), pp. 1018\u20131022. https:\/\/doi.org\/10.21437\/interspeech.2019-2505.","DOI":"10.21437\/interspeech.2019-2505"},{"key":"173_CR59","doi-asserted-by":"publisher","unstructured":"R. Li, M. Zhao, Z. Li, L. Li, Q. Hong, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). Anti-spoofing speaker verification system with multi-feature integration and multi-task learning (Graz, Austria, 2019), pp. 1048\u20131052. https:\/\/doi.org\/10.21437\/interspeech.2019-1698.","DOI":"10.21437\/interspeech.2019-1698"},{"key":"173_CR60","doi-asserted-by":"publisher","unstructured":"J. Willoams, J. Rownika, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). Speech replay detection with x-vector attack emneddings and spectral features (Graz, Austria, 2019), pp. 1053\u20131057. https:\/\/doi.org\/10.21437\/interspeech.2019-1760.","DOI":"10.21437\/interspeech.2019-1760"},{"key":"173_CR61","doi-asserted-by":"publisher","unstructured":"S. -Y. Chang, K. C. Wu, C. -P. Chen, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). Transfer-representation learning for detecting spoofing attacks with converted and synthesized speech in automatic speaker verification system (Graz, Austria, 2019), pp. 1063\u20131067. https:\/\/doi.org\/10.21437\/interspeech.2019-2014.","DOI":"10.21437\/interspeech.2019-2014"},{"key":"173_CR62","doi-asserted-by":"publisher","unstructured":"A. Gomez-Alanis, A. M. Peinado, J. A. Gonzalez, A. M. Gomez, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). A light convolutional GRU-RNN deep feature extractor for ASV spoofing detection (Graz, Austria, 2019), pp. 1068\u20131072. https:\/\/doi.org\/10.21437\/interspeech.2019-2212.","DOI":"10.21437\/interspeech.2019-2212"},{"key":"173_CR63","doi-asserted-by":"publisher","unstructured":"M. Alzanto, Z. Wang, M. B. Srivastava, in 20th Annual Conference of the International Speech Communication Association (INTERSPEECH). Deep residual neural networks for audio spoofing detection (Graz, Austria, 2019), pp. 1078\u20131082. https:\/\/doi.org\/10.21437\/interspeech.2019-3174.","DOI":"10.21437\/interspeech.2019-3174"},{"key":"173_CR64","doi-asserted-by":"publisher","unstructured":"R. K. Das, J. Yang, H. Li, in Automatic Speech Recognition and Understanding Workshop (ASRU). Long range acoustic and deep features perspective on ASVspoof 2019 (Sentosa Island, Singapore, 2019), pp. 1018\u20131025. https:\/\/doi.org\/10.1109\/asru46091.2019.9003845.","DOI":"10.1109\/asru46091.2019.9003845"}],"container-title":["EURASIP Journal on Audio, Speech, and Music Processing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13636-020-00173-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s13636-020-00173-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13636-020-00173-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,4,6]],"date-time":"2021-04-06T23:10:44Z","timestamp":1617750644000},"score":1,"resource":{"primary":{"URL":"https:\/\/asmp-eurasipjournals.springeropen.com\/articles\/10.1186\/s13636-020-00173-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,4,7]]},"references-count":64,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["173"],"URL":"https:\/\/doi.org\/10.1186\/s13636-020-00173-5","relation":{},"ISSN":["1687-4722"],"issn-type":[{"value":"1687-4722","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,4,7]]},"assertion":[{"value":"29 May 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 March 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 April 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"6"}}