{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,10]],"date-time":"2024-09-10T15:17:53Z","timestamp":1725981473166},"publisher-location":"Cham","reference-count":20,"publisher":"Springer International Publishing","isbn-type":[{"type":"print","value":"9783319943060"},{"type":"electronic","value":"9783319943077"}],"license":[{"start":{"date-parts":[[2018,1,1]],"date-time":"2018-01-01T00:00:00Z","timestamp":1514764800000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018]]},"DOI":"10.1007\/978-3-319-94307-7_10","type":"book-chapter","created":{"date-parts":[[2018,6,19]],"date-time":"2018-06-19T13:28:33Z","timestamp":1529414913000},"page":"130-143","update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Localized Mandarin Speech Synthesis Services for Enterprise Scenarios"],"prefix":"10.1007","author":[{"given":"Yishuang","family":"Ning","sequence":"first","affiliation":[]},{"given":"Huan","family":"Chen","sequence":"additional","affiliation":[]},{"given":"Chunxiao","family":"Xing","sequence":"additional","affiliation":[]},{"given":"Liang-Jie","family":"Zhang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2018,6,20]]},"reference":[{"key":"10_CR1","doi-asserted-by":"crossref","unstructured":"Na, X., Xie, X., Kuang, J.: Low latency parameter generation for real-time speech synthesis system. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), pp. 1\u20136 (2014)","DOI":"10.1109\/ICME.2014.6890197"},{"issue":"11","key":"10_CR2","doi-asserted-by":"publisher","first-page":"1039","DOI":"10.1016\/j.specom.2009.04.004","volume":"51","author":"H Zen","year":"2009","unstructured":"Zen, H., Tokuda, K., Black, A.: Statistical parametric speech synthesis. Speech Commun. 51(11), 1039\u20131064 (2009)","journal-title":"Speech Commun."},{"issue":"1","key":"10_CR3","doi-asserted-by":"publisher","first-page":"463","DOI":"10.1007\/s11042-013-1601-y","volume":"73","author":"F Meng","year":"2014","unstructured":"Meng, F., Wu, Z., Jia, J., Meng, H., Cai, L.: Synthesizing English emphatic speech for multimodal corrective feedback in computer-aided pronunciation training. Multimedia Tools Appl. 73(1), 463\u2013489 (2014)","journal-title":"Multimedia Tools Appl."},{"issue":"22","key":"10_CR4","doi-asserted-by":"publisher","first-page":"9909","DOI":"10.1007\/s11042-014-2164-2","volume":"74","author":"Z Wu","year":"2015","unstructured":"Wu, Z., Ning, Y., Zang, X., Jia, J., Meng, F., Meng, H., Cai, L.: Generating emphatic speech with hidden Markov model for expressive speech synthesis. Multimedia Tools Appl. 74(22), 9909\u20139925 (2015)","journal-title":"Multimedia Tools Appl."},{"issue":"5","key":"10_CR5","doi-asserted-by":"publisher","first-page":"1234","DOI":"10.1109\/JPROC.2013.2251852","volume":"101","author":"K Tokuda","year":"2013","unstructured":"Tokuda, K., Nankaku, Y., Toda, T., Zen, H., Yamagishi, J., Oura, K.: Speech synthesis based on hidden Markov models. Proc. IEEE 101(5), 1234\u20131252 (2013)","journal-title":"Proc. IEEE"},{"issue":"2","key":"10_CR6","first-page":"327","volume":"16","author":"B Tth","year":"2012","unstructured":"Tth, B.: Optimizing HMM speech synthesis for low-resource devices. J. Adv. Comput. Intell. 16(2), 327\u2013334 (2012)","journal-title":"J. Adv. Comput. Intell."},{"key":"10_CR7","doi-asserted-by":"crossref","unstructured":"Sheikhzadeh, H., Cornu, E., Brennan, R., Schneider, T.: Real-time speech synthesis on an ultra low-resource, programmable DSP system. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2002)","DOI":"10.1109\/ICASSP.2002.1005769"},{"key":"10_CR8","doi-asserted-by":"crossref","unstructured":"Parlikar, A., Black, A.: Data-driven phrasing for speech synthesis in low-resource languages. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4013\u20134016 (2012)","DOI":"10.1109\/ICASSP.2012.6288798"},{"issue":"11","key":"10_CR9","doi-asserted-by":"publisher","first-page":"1039","DOI":"10.1016\/j.specom.2009.04.004","volume":"51","author":"H Zen","year":"2009","unstructured":"Zen, H., Tokuda, K., Black, A.: Statistical parametric speech synthesis. Speech Commun. 51(11), 1039\u20131064 (2009)","journal-title":"Speech Commun."},{"key":"10_CR10","doi-asserted-by":"crossref","unstructured":"Zen, H., Agiomyrgiannakis, Y., Egberts, N., Henderson, F., Szczepaniak, P.: Fast, compact, and high quality LSTM-RNN based statistical parametric speech synthesizers for mobile devices. arXiv preprint arXiv:1606.06061 (2016)","DOI":"10.21437\/Interspeech.2016-522"},{"key":"10_CR11","doi-asserted-by":"crossref","unstructured":"Fan, Y., Qian, Y., Xie, F. L., Soong, F.: TTS synthesis with bidirectional LSTM based recurrent neural networks. In: Proceedings of the Annual Conference of the International Speech Communication Association (2014)","DOI":"10.21437\/Interspeech.2014-443"},{"key":"10_CR12","doi-asserted-by":"crossref","unstructured":"Zen, H., Sak, H.: Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4470\u20134474 (2015)","DOI":"10.1109\/ICASSP.2015.7178816"},{"key":"10_CR13","series-title":"Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1007\/978-3-642-17502-2_4","volume-title":"Security and Privacy in Mobile Information and Communication Systems","author":"P Verhaeghe","year":"2010","unstructured":"Verhaeghe, P., Verslype, K., Lapon, J., Naessens, V., De Decker, B.: A mobile and reliable anonymous ePoll infrastructure. In: Schmidt, A.U., Russello, G., Lioy, A., Prasad, N.R., Lian, S. (eds.) MobiSec 2010. LNICST, vol. 47, pp. 41\u201352. Springer, Heidelberg (2010). https:\/\/doi.org\/10.1007\/978-3-642-17502-2_4"},{"key":"10_CR14","unstructured":"Yamagishi, J.: An introduction to HMM-based speech synthesis. Technical report (2006)"},{"key":"10_CR15","unstructured":"Huang, X., Acero, A., Alleva, F., Hwang, M., Jiang, L., Mahajan, M.: Microsoft windows highly intelligent speech recognizer: whisper. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (1995)"},{"key":"10_CR16","unstructured":"Burkardt, J.: K-means clustering. Virginia Tech, Advanced Research Computing, Interdisciplinary Center for Applied Mathematics (2009)"},{"issue":"4","key":"10_CR17","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1109\/89.917686","volume":"9","author":"P Baggenstoss","year":"2001","unstructured":"Baggenstoss, P.: A modified Baum-Welch algorithm for hidden Markov models with multiple observation spaces. IEEE Trans. Speech Audio Process. 9(4), 411\u2013416 (2001)","journal-title":"IEEE Trans. Speech Audio Process."},{"issue":"5","key":"10_CR18","doi-asserted-by":"publisher","first-page":"1947","DOI":"10.1109\/TSP.2006.872540","volume":"54","author":"S Yu","year":"2006","unstructured":"Yu, S., Kobayashi, H.: Practical implementation of an efficient forward-backward algorithm for an explicit-duration hidden Markov model. IEEE Trans. Signal Process. 54(5), 1947\u20131951 (2006)","journal-title":"IEEE Trans. Signal Process."},{"key":"10_CR19","doi-asserted-by":"crossref","unstructured":"Ning, Y., Wu, Z., Jia, J., Meng, F., Meng, H., Cai, L.: HMM-based emphatic speech synthesis for corrective feedback in computer-aided pronunciation training. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4934\u20134938 (2015)","DOI":"10.1109\/ICASSP.2015.7178909"},{"key":"10_CR20","unstructured":"Meng, F.: Analysis and generation of focus in continuous speech. Ph.D. Thesis, Tsinghua University (2013)"}],"container-title":["Lecture Notes in Computer Science","Cognitive Computing \u2013 ICCC 2018"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-319-94307-7_10","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,8,25]],"date-time":"2022-08-25T23:25:14Z","timestamp":1661469914000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/978-3-319-94307-7_10"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018]]},"ISBN":["9783319943060","9783319943077"],"references-count":20,"URL":"https:\/\/doi.org\/10.1007\/978-3-319-94307-7_10","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"type":"print","value":"0302-9743"},{"type":"electronic","value":"1611-3349"}],"subject":[],"published":{"date-parts":[[2018]]}}}