{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,1,18]],"date-time":"2025-01-18T15:10:24Z","timestamp":1737213024192,"version":"3.33.0"},"reference-count":16,"publisher":"Wiley","issue":"6","license":[{"start":{"date-parts":[[2007,3,21]],"date-time":"2007-03-21T00:00:00Z","timestamp":1174435200000},"content-version":"vor","delay-in-days":6867,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Systems &amp; Computers in Japan"],"published-print":{"date-parts":[[1988,6]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>At present, one of the most important problems in speech recognition and speaker recognition is the extraction of individual information from the speech waveform. This paper describes the extraction of individual information by the vector\u2010quantization and the text\u2010independent speaker information based on that method. A feature vector is proposed for the first time which is the quantized distribution by the frequency of the vector\u2010quantization code to represent the individual features of the speaker. The properties of the feature vector are investigated, and effectiveness is verified by an actual speaker\u2010identification experiment. The quantization distribution is a feature representing the distribution density in the space for the acoustic features, e.g., the spectrum uttered by the individual. As the acoustic feature parameters, the cepstrum for stationary part, and the change of the cepstrum, are used to construct the quantization distribution. The identification rates are compared. As a result of the identification experiment for 10 speakers, an identification rate of 100 percent was achieved by the quantization distribution of cepstrum for 10 input words, which are different from the training samples. In the experiment using 200 speakers, an identification rate of 88 percent was achieved for the first candidates, and a cumulative identification rate of 95 percent was achieved for up to the second candidate.<\/jats:p>","DOI":"10.1002\/scj.4690190606","type":"journal-article","created":{"date-parts":[[2007,7,7]],"date-time":"2007-07-07T15:16:51Z","timestamp":1183821411000},"page":"63-72","source":"Crossref","is-referenced-by-count":2,"title":["Speaker identification based on frequency distribution of vector\u2010quantized spectra"],"prefix":"10.1002","volume":"19","author":[{"given":"Katsuhiko","family":"Shirai","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kazunori","family":"Mano","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shunichi","family":"Ishige","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","published-online":{"date-parts":[[2007,3,21]]},"reference":[{"issue":"80","key":"e_1_2_1_2_2","article-title":"A consideration of surmounting individu\u2010uality for speaker\u2010independent speech recognitionm","volume":"85","author":"Ishige S.","year":"1986","journal-title":"Trans. of the Committee on Speech Res., Acoust. Soc. Jap."},{"key":"e_1_2_1_3_2","first-page":"1","article-title":"Representation of speaker individuality for speaker\u2010independent speech recognition and its application","volume":"1","author":"Ishige S.","year":"1986","journal-title":"Annual Meet. of Acoust. Soc. Jap."},{"key":"e_1_2_1_4_2","first-page":"206","volume-title":"Speech Recognition","author":"Niimi Y.","year":"1979"},{"issue":"5","key":"e_1_2_1_5_2","first-page":"537","article-title":"Review and perspective of speaker recognition","volume":"67","author":"Niimi Y.","year":"1984","journal-title":"Jour. I.E.C.E., Japan"},{"issue":"2","key":"e_1_2_1_6_2","first-page":"63","article-title":"Speaker verification from actual telephone voicem","volume":"35","author":"Ichikawa A.","year":"1979","journal-title":"J. Acoust. Soc. Jap."},{"issue":"60","key":"e_1_2_1_7_2","article-title":"Text\u2010independent speaker identification based on piecewise canonical discriminant analysis","volume":"75","author":"Matsumoto H.","year":"1975","journal-title":"Tech. Rep. I.E.C.E., Japan"},{"issue":"2","key":"e_1_2_1_8_2","first-page":"183","article-title":"Speaker recognition by statistical features of cepstrum parametersm","volume":"65","author":"Furui S.","year":"1982","journal-title":"Trans. (A) I.E.C.E., Japan"},{"key":"e_1_2_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASSP.1981.1163530"},{"issue":"10","key":"e_1_2_1_10_2","first-page":"549","article-title":"Talker recognition by lag time averaged speech spectrumm","volume":"55","author":"Furui S.","year":"1972","journal-title":"Trans. (A) I.E.C.E., Japan"},{"key":"e_1_2_1_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASSP.1977.1162961"},{"issue":"3","key":"e_1_2_1_12_2","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1109\/TASSP.1985.1164616","article-title":"Talker recognition in tandem with talker\u2010independent isolated word recognitionm","volume":"33","author":"Rosenberg A. E.","year":"1986","journal-title":"I.E.E.E. Trans. Acoust., Speech & Signal Process."},{"key":"e_1_2_1_13_2","doi-asserted-by":"crossref","unstructured":"F. K.SoongandA. E.Rosenberg. On the use of instantaneous and transitional spectral information in speaker recognition Proc. I.E.E.E. Int. Conf. on Acoust. Speech & Signal Processing pp.877\u2013880(April1986).","DOI":"10.1109\/ICASSP.1986.1168882"},{"key":"e_1_2_1_14_2","doi-asserted-by":"crossref","unstructured":"A. E.RosenbergandF. K.Soong. Evaluation of a vector quantization talker recognition system in text independent and text dependent modes Proc. I.E.E.E. Int. Conf. on Acoust. Speech & Signal Processing pp.873\u2013876(April1986).","DOI":"10.1109\/ICASSP.1986.1168881"},{"key":"e_1_2_1_15_2","doi-asserted-by":"crossref","unstructured":"F. K.Soong A. E.Rosenberg L. R.RabinerandB. H.Juang. A vector quantization approach to speaker recognition Proc. Int. Conf. on Acoust. Speech & Signal Processing pp.387\u2013390(1985).","DOI":"10.1109\/ICASSP.1985.1168412"},{"key":"e_1_2_1_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCOM.1980.1094577"},{"key":"e_1_2_1_17_2","unstructured":"K. P.LiandE. H.Wrench Jr.An approach to text\u2010independent speaker recognition with short utterances Proc. I.E.E.E. Int. Conf. on Acoust. Speech & Signal Processing pp.555\u2013558(1983)."}],"container-title":["Systems and Computers in Japan"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fscj.4690190606","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/scj.4690190606","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,18]],"date-time":"2025-01-18T14:41:13Z","timestamp":1737211273000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/scj.4690190606"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1988,6]]},"references-count":16,"journal-issue":{"issue":"6","published-print":{"date-parts":[[1988,6]]}},"alternative-id":["10.1002\/scj.4690190606"],"URL":"https:\/\/doi.org\/10.1002\/scj.4690190606","archive":["Portico"],"relation":{},"ISSN":["0882-1666","1520-684X"],"issn-type":[{"type":"print","value":"0882-1666"},{"type":"electronic","value":"1520-684X"}],"subject":[],"published":{"date-parts":[[1988,6]]}}}