{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,9]],"date-time":"2024-06-09T10:08:28Z","timestamp":1717927708048},"reference-count":11,"publisher":"World Scientific Pub Co Pte Lt","issue":"06","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2005,9]]},"abstract":"<jats:p> Rapid increase in the amount of audio data demands an efficient method to automatically segment or classify audio stream based on its content. In this paper, based on the Gabor wavelet features, an audio classification and segmentation method is proposed. This method will first divide an audio stream into clips, each of which contains one-second audio information. Then, each clip is classified as one of two classes or five classes. Two classes contain speech and music; pure speech, pure music, song, speech with music background, and speech with environmental noise background are for five classes. Finally, a merge technique is provided to do segmentation. <\/jats:p><jats:p> In order to make the proposed method robust for a variety of audio sources, we use Fisher Linear Discriminator to obtain features with the highest discriminative ability. Experimental results show that the proposed method can achieve over 98% accuracy rate for speech and music discrimination, and more than 95% for a five-way discrimination. By checking the class types of adjacent clips, we can also identify more than 95% audio scene breaks in audio sequence. <\/jats:p>","DOI":"10.1142\/s0218001405004289","type":"journal-article","created":{"date-parts":[[2005,9,23]],"date-time":"2005-09-23T10:42:07Z","timestamp":1127472127000},"page":"807-822","source":"Crossref","is-referenced-by-count":3,"title":["A NEW APPROACH FOR AUDIO CLASSIFICATION AND SEGMENTATION USING GABOR WAVELETS AND FISHER LINEAR DISCRIMINATOR"],"prefix":"10.1142","volume":"19","author":[{"given":"RUEI-SHIANG","family":"LIN","sequence":"first","affiliation":[{"name":"Department of Computer and Information Science, National Chiao Tung University, 1001 Ta Hsueh Rd., Hsinchu, Taiwan 30050, R.O.C."}]},{"given":"LING-HWEI","family":"CHEN","sequence":"additional","affiliation":[{"name":"Department of Computer and Information Science, National Chiao Tung University, 1001 Ta Hsueh Rd., Hsinchu, Taiwan 30050, R.O.C."}]}],"member":"219","published-online":{"date-parts":[[2011,11,21]]},"reference":[{"key":"rf1","doi-asserted-by":"publisher","DOI":"10.1109\/34.598228"},{"key":"rf3","volume-title":"Pattern Recognition and Image Preprocessing","author":"Bow S.-T.","year":"1992"},{"key":"rf4","doi-asserted-by":"publisher","DOI":"10.1007\/s005300050106"},{"key":"rf6","first-page":"429","volume":"93","author":"Gabor D.","journal-title":"J. Instit. Electr. Eng."},{"key":"rf9","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-8655(00)00119-7"},{"key":"rf10","first-page":"173","volume":"18","author":"Manjunath B. S.","journal-title":"IEEE Trans. Patt. Anal. Mach. Intell."},{"key":"rf13","volume-title":"Joint Time-Frequency Analysis Methods and Applications","author":"Qian S.","year":"1996"},{"key":"rf14","volume-title":"Computational Auditory Scene Analysis","author":"Rosenthal F. D.","year":"1998"},{"key":"rf19","doi-asserted-by":"publisher","DOI":"10.1109\/93.556537"},{"key":"rf22","doi-asserted-by":"publisher","DOI":"10.1109\/89.917689"},{"key":"rf23","volume-title":"Psychoacoustics, Facts and Models","author":"Zwicker E.","year":"1990"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001405004289","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,7]],"date-time":"2019-08-07T16:42:06Z","timestamp":1565196126000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0218001405004289"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,9]]},"references-count":11,"journal-issue":{"issue":"06","published-online":{"date-parts":[[2011,11,21]]},"published-print":{"date-parts":[[2005,9]]}},"alternative-id":["10.1142\/S0218001405004289"],"URL":"https:\/\/doi.org\/10.1142\/s0218001405004289","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"value":"0218-0014","type":"print"},{"value":"1793-6381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2005,9]]}}}