{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:24:43Z","timestamp":1750220683457,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":15,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,12,3]],"date-time":"2020-12-03T00:00:00Z","timestamp":1606953600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61861005"],"award-info":[{"award-number":["61861005"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,12,3]]},"DOI":"10.1145\/3452940.3453037","type":"proceedings-article","created":{"date-parts":[[2021,5,17]],"date-time":"2021-05-17T15:28:54Z","timestamp":1621265334000},"page":"503-507","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Single Channel Target Speaker Extraction Based on Deep Learning"],"prefix":"10.1145","author":[{"given":"Zhixiong","family":"Wang","sequence":"first","affiliation":[{"name":"School of Electronic Engineering, Guangxi Normal University, Guilin China"}]},{"given":"Weiping","family":"Hu","sequence":"additional","affiliation":[{"name":"School of Electronic Engineering, Guangxi Normal University, Guilin China"}]},{"given":"Ting","family":"Xiao","sequence":"additional","affiliation":[{"name":"School of Electronic Engineering, Guangxi Normal University, Guilin China"}]}],"member":"320","published-online":{"date-parts":[[2021,5,17]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/3200108"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2010.2064307"},{"key":"e_1_3_2_1_3_1","first-page":"31","author":"Hershey Z.","year":"2016","unstructured":"J. R Hershey , Z. Chen , J. Le Roux , and S. Watanabe , \"Deep clustering: Discriminative embeddings for segmentation and separation, \" in Proc. of ICASSP. IEEE , 2016 , pp. 31 -- 35 J. R Hershey, Z. Chen, J. Le Roux, and S. Watanabe, \"Deep clustering: Discriminative embeddings for segmentation and separation, \" in Proc. of ICASSP. IEEE, 2016, pp. 31--35","journal-title":"of ICASSP. IEEE"},{"key":"e_1_3_2_1_4_1","volume-title":"Deep attractor network for single-microphone speaker separation[J]","author":"Chen Z","year":"2016","unstructured":"Chen Z , Luo Y , Mesgarani N. Deep attractor network for single-microphone speaker separation[J] . 2016 . Chen Z, Luo Y, Mesgarani N. Deep attractor network for single-microphone speaker separation[J]. 2016."},{"key":"e_1_3_2_1_5_1","first-page":"1","volume":"2017","author":"Kolbaek M","unstructured":"Kolbaek M , Yu D , Tan Z H , Multi -talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks[J]. IEEE\/ACM Transactions on Audio Speech & Language Processing , 2017 : 1 -- 1 . Kolbaek M, Yu D, Tan Z H, et al. Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks[J]. IEEE\/ACM Transactions on Audio Speech & Language Processing, 2017:1--1.","journal-title":"IEEE\/ACM Transactions on Audio Speech & Language Processing"},{"key":"e_1_3_2_1_6_1","volume-title":"TasNet: time-domain audio separation network for realtime, single-channel speech separation[J]","author":"Luo Y","year":"2017","unstructured":"Luo Y , Mesgarani N. TasNet: time-domain audio separation network for realtime, single-channel speech separation[J] . 2017 . Luo Y, Mesgarani N. TasNet: time-domain audio separation network for realtime, single-channel speech separation[J]. 2017."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2018-1205"},{"key":"e_1_3_2_1_8_1","volume-title":"Deep neural network-based speaker embeddings for end-to-end speaker verification[C]\/\/2016 IEEE Spoken Language Technology Workshop (SLT)","author":"Snyder D","year":"2016","unstructured":"Snyder D , Ghahremani P , Povey D , Deep neural network-based speaker embeddings for end-to-end speaker verification[C]\/\/2016 IEEE Spoken Language Technology Workshop (SLT) . IEEE , 2016 : 165--170. Snyder D, Ghahremani P, Povey D, et al. Deep neural network-based speaker embeddings for end-to-end speaker verification[C]\/\/2016 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2016: 165--170."},{"key":"e_1_3_2_1_9_1","first-page":"1492","volume":"2017","author":"Williamson D","unstructured":"Williamson D , Wang D L . Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising[J]. IEEE\/ ACM Transactions on Audio, Speech , and Language Processing , 2017 : 1492 -- 1501 . Williamson D, Wang D L. Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising[J]. IEEE\/ACM Transactions on Audio, Speech, and Language Processing, 2017:1492--1501.","journal-title":"Language Processing"},{"key":"e_1_3_2_1_10_1","volume-title":"SDR-half-baked or well done?[C]\/\/ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Le Roux J","year":"2019","unstructured":"Le Roux J , Wisdom S , Erdogan H , SDR-half-baked or well done?[C]\/\/ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . IEEE , 2019 : 626--630. Le Roux J, Wisdom S, Erdogan H, et al. SDR-half-baked or well done?[C]\/\/ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2019: 626--630."},{"key":"e_1_3_2_1_11_1","volume-title":"Linguistic Data Consortium","author":"Garofolo D","year":"1993","unstructured":"J. Garofolo , D Graff , D Paul , and D Pallett , \"Csr-i(wsj0) complete ldc93s6a, \" Philadelphia : Linguistic Data Consortium , 1993 . J. Garofolo, D Graff, D Paul, and D Pallett, \"Csr-i(wsj0) complete ldc93s6a, \" Philadelphia: Linguistic Data Consortium, 1993."},{"key":"e_1_3_2_1_12_1","volume-title":"Ba J. Adam: A method for stochastic optimization[J]. arXiv preprint arXiv:1412.6980","author":"Kingma D P","year":"2014","unstructured":"Kingma D P , Ba J. Adam: A method for stochastic optimization[J]. arXiv preprint arXiv:1412.6980 , 2014 . Kingma D P, Ba J. Adam: A method for stochastic optimization[J]. arXiv preprint arXiv:1412.6980, 2014."},{"key":"e_1_3_2_1_13_1","volume-title":"Single channel target speaker extraction and recognition with speaker beam[C]\/\/2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Delcroix M","year":"2018","unstructured":"Delcroix M , Zmolikova K , Kinoshita K , Single channel target speaker extraction and recognition with speaker beam[C]\/\/2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . IEEE , 2018 : 5554--5558. Delcroix M, Zmolikova K, Kinoshita K, et al. Single channel target speaker extraction and recognition with speaker beam[C]\/\/2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2018: 5554--5558."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2005.858005"},{"key":"e_1_3_2_1_15_1","first-page":"3623","volume":"2018","author":"Huang Z","unstructured":"Huang Z , Wang S , Yu K. Angular Softmax for Short-Duration Text-independent Speaker Verification[C]\/\/ Interspeech. 2018 : 3623 -- 3627 . Huang Z, Wang S, Yu K. Angular Softmax for Short-Duration Text-independent Speaker Verification[C]\/\/Interspeech. 2018: 3623--3627.","journal-title":"Interspeech."}],"event":{"name":"ICITEE2020: The 3rd International Conference on Information Technologies and Electrical Engineering","acronym":"ICITEE2020","location":"Changde City Hunan China"},"container-title":["Proceedings of the 3rd International Conference on Information Technologies and Electrical Engineering"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3452940.3453037","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3452940.3453037","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:03:27Z","timestamp":1750197807000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3452940.3453037"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12,3]]},"references-count":15,"alternative-id":["10.1145\/3452940.3453037","10.1145\/3452940"],"URL":"https:\/\/doi.org\/10.1145\/3452940.3453037","relation":{},"subject":[],"published":{"date-parts":[[2020,12,3]]},"assertion":[{"value":"2021-05-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}