{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T16:44:37Z","timestamp":1759941877815},"reference-count":12,"publisher":"Institute of Electronics, Information and Communications Engineers (IEICE)","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IEICE Trans. Inf. &amp; Syst."],"published-print":{"date-parts":[[2020,3,1]]},"DOI":"10.1587\/transinf.2019edl8150","type":"journal-article","created":{"date-parts":[[2020,2,29]],"date-time":"2020-02-29T22:10:34Z","timestamp":1583014234000},"page":"714-715","source":"Crossref","is-referenced-by-count":3,"title":["A Non-Intrusive Speech Intelligibility Estimation Method Based on Deep Learning Using Autoencoder Features"],"prefix":"10.1587","volume":"E103.D","author":[{"given":"Yoonhee","family":"KIM","sequence":"first","affiliation":[{"name":"Seoul National University of Science and Technology"}]},{"given":"Deokgyu","family":"YUN","sequence":"additional","affiliation":[{"name":"Seoul National University of Science and Technology"}]},{"given":"Hannah","family":"LEE","sequence":"additional","affiliation":[{"name":"Seoul National University of Science and Technology"}]},{"given":"Seung Ho","family":"CHOI","sequence":"additional","affiliation":[{"name":"Seoul National University of Science and Technology"}]}],"member":"532","reference":[{"key":"1","doi-asserted-by":"publisher","unstructured":"[1] L. Malfait, J. Berger, and M. Kastner, \u201cP.563 \u2014 The ITU-T standard for single-ended speech quality assessment,\u201d IEEE Trans. Audio, Speech, Language Process., vol.14, no.6, pp.1924-1934, 2006. 10.1109\/tasl.2006.883177","DOI":"10.1109\/TASL.2006.883177"},{"key":"2","doi-asserted-by":"publisher","unstructured":"[2] C.H. Taal, R.C. Hendriks, R. Heusdens, and J. Jensen, \u201cAn algorithm for intelligibility prediction of time-frequency weighted noisy speech,\u201d IEEE Trans. Audio, Speech, Language Process., vol.19, no.7, pp.2125-2136, 2011. 10.1109\/tasl.2011.2114881","DOI":"10.1109\/TASL.2011.2114881"},{"key":"3","doi-asserted-by":"publisher","unstructured":"[3] A.H. Andersen, J.M. de Haan, Z. Tan and J. Jensen, \u201cNonintrusive Speech Intelligibility Prediction Using Convolutional Neural Networks,\u201d IEEE\/ACM Trans. Audio, Speech, Language Process., vol.26, no.10, pp.1925-1939, Oct. 2018. 10.1109\/taslp.2018.2847459","DOI":"10.1109\/TASLP.2018.2847459"},{"key":"4","doi-asserted-by":"publisher","unstructured":"[4] C. Spille, S.D. Ewert, B. Kollmeier, and B.T. Meyer, \u201cPredicting speech intelligibility with deep neural networks,\u201d Computer Speech &amp; Language, vol.48, pp.51-66, 2018. 10.1016\/j.csl.2017.10.004","DOI":"10.1016\/j.csl.2017.10.004"},{"key":"5","doi-asserted-by":"publisher","unstructured":"[5] D. Yun, H. Lee, and S.H. Choi, \u201cA deep learning-based approach to non-intrusive objective speech intelligibility estimation,\u201d IEICE Trans. Inf. &amp; Syst., vol.E101-D, no.4, pp.1207-1208, 2018. 10.1587\/transinf.2017edl8225","DOI":"10.1587\/transinf.2017EDL8225"},{"key":"6","unstructured":"[6] T.N. Sainath, B. Kingsbury, and B. Ramab, \u201cAuto-encoder bottleneck features using deep belief networks,\u201d Proc. ICASSP, pp.4153-4156, 2012."},{"key":"7","doi-asserted-by":"crossref","unstructured":"[7] X. Lu, Y. Tsao, S. Matsuda, and C. Hori, \u201cSpeech Enhancement Based on Deep Denoising Autoencoder,\u201d Proc. INTERSPEECH, pp.436-440, 2013.","DOI":"10.21437\/Interspeech.2013-130"},{"key":"8","doi-asserted-by":"publisher","unstructured":"[8] S. Hochreiter and J. Schmidhuber, \u201cLong short-term memory,\u201d Neural Computation, vol.9, no.8, pp.1735-1780, Nov. 1997. 10.1162\/neco.1997.9.8.1735","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"9","doi-asserted-by":"crossref","unstructured":"[9] H. Sak, A.W. Senior, and F. Beaufays, \u201cLong short-term memory recurrent neural network architectures for large scale acoustic modeling models,\u201d Proc. INTERSPEECH, pp.338-342, 2014.","DOI":"10.21437\/Interspeech.2014-80"},{"key":"10","unstructured":"[10] V. Nair and G.E. Hinton, \u201cRectified linear units improve restricted Boltzmann machines,\u201d Proc. 27th international conference on machine learning (ICML-10), pp.807-814, 2010."},{"key":"11","unstructured":"[11] D.P. Kingma and J. Ba, \u201cAdam: A method for stochastic optimization,\u201d arXiv preprint arXiv:1412.6980, 2014."},{"key":"12","doi-asserted-by":"crossref","unstructured":"[12] J.S. Garofolo, L.F. Lamel, W.M. Fisher, J.G. Fiscus, and D.S.Pallett, \u201cDARPA TIMIT acoustic phonetic continuous speech corpus CDROM,\u201d NIST, 1993.","DOI":"10.6028\/NIST.IR.4930"}],"container-title":["IEICE Transactions on Information and Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/transinf\/E103.D\/3\/E103.D_2019EDL8150\/_pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,10,17]],"date-time":"2022-10-17T01:09:25Z","timestamp":1665968965000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/transinf\/E103.D\/3\/E103.D_2019EDL8150\/_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,1]]},"references-count":12,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2020]]}},"URL":"https:\/\/doi.org\/10.1587\/transinf.2019edl8150","relation":{},"ISSN":["0916-8532","1745-1361"],"issn-type":[{"value":"0916-8532","type":"print"},{"value":"1745-1361","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,3,1]]}}}