{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T16:06:24Z","timestamp":1772553984457,"version":"3.50.1"},"reference-count":14,"publisher":"World Scientific Pub Co Pte Ltd","issue":"01","funder":[{"name":"FCT","award":["UIDB\/50014\/2020"],"award-info":[{"award-number":["UIDB\/50014\/2020"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Vietnam J. Comp. Sci."],"published-print":{"date-parts":[[2023,2]]},"abstract":"<jats:p> Bird species identification is a relevant and time-consuming task for ornithologists and ecologists. With growing amounts of audio-annotated data, automatic bird classification using machine learning techniques is an important trend in the scientific community. Analyzing bird behavior and population trends helps detect other organisms in the environment and is an important problem in ecology. Bird populations react quickly to environmental changes, which make their real-time counting and tracking challenging and very useful. A reliable methodology that automatically identifies bird species from audio would therefore be a valuable tool for the experts in different scientific and applicational domains. The goal of this work is to propose a methodology to identify bird sounds. In this paper, we explore deep learning techniques that are being used in this domain, such as Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) to classify the data. In deep learning, audio problems are commonly approached by converting them into images using audio feature extraction techniques such as Mel Spectrograms and Mel Frequency Cepstral Coefficients (MFCCs). We propose and test multiple deep learning and feature extraction combinations in order to find the most suitable approach to this problem. <\/jats:p>","DOI":"10.1142\/s2196888822500300","type":"journal-article","created":{"date-parts":[[2022,6,4]],"date-time":"2022-06-04T16:35:02Z","timestamp":1654360502000},"page":"39-54","source":"Crossref","is-referenced-by-count":34,"title":["Automatic Classification of Bird Sounds: Using MFCC and Mel Spectrogram Features with Deep Learning"],"prefix":"10.1142","volume":"10","author":[{"given":"Silvestre","family":"Carvalho","sequence":"first","affiliation":[{"name":"Instituto Superior de Engenharia do Porto, Rua Dr. Bernardino de Almeida, 431, 4200-072, Porto, Portugal"}]},{"given":"Elsa Ferreira","family":"Gomes","sequence":"additional","affiliation":[{"name":"Instituto Superior de Engenharia do Porto & INESC TEC, Rua Dr. Bernardino de Almeida, 431, 4200-072, Porto, Portugal"}]}],"member":"219","published-online":{"date-parts":[[2022,8,10]]},"reference":[{"key":"S2196888822500300BIB001","doi-asserted-by":"publisher","DOI":"10.1016\/j.biocon.2011.10.019"},{"key":"S2196888822500300BIB002","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-021-26488-1"},{"issue":"4","key":"S2196888822500300BIB003","volume":"8","author":"Gavali P.","year":"2019","journal-title":"Int. J. Eng. Res. Technol."},{"key":"S2196888822500300BIB004","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2017.08.250"},{"key":"S2196888822500300BIB005","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2008.02.059"},{"key":"S2196888822500300BIB006","first-page":"73","volume-title":"Proc. Int. Conf. Computer Science and Software Engineering","author":"Colonna J."},{"key":"S2196888822500300BIB007","doi-asserted-by":"publisher","DOI":"10.1155\/2007\/38637"},{"key":"S2196888822500300BIB008","first-page":"129","volume-title":"Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007","author":"Wielgat R."},{"key":"S2196888822500300BIB010","doi-asserted-by":"publisher","DOI":"10.1109\/TASSP.1980.1163420"},{"key":"S2196888822500300BIB012","doi-asserted-by":"publisher","DOI":"10.1038\/nature14539"},{"key":"S2196888822500300BIB015","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"S2196888822500300BIB019","volume-title":"CLEF","author":"Lasseck M."},{"issue":"6","key":"S2196888822500300BIB022","first-page":"536","volume":"7","author":"Butterworth S.","year":"1930","journal-title":"Wireless Eng."},{"key":"S2196888822500300BIB025","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2007.55"}],"container-title":["Vietnam Journal of Computer Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S2196888822500300","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,25]],"date-time":"2023-02-25T04:51:02Z","timestamp":1677300662000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S2196888822500300"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,10]]},"references-count":14,"journal-issue":{"issue":"01","published-print":{"date-parts":[[2023,2]]}},"alternative-id":["10.1142\/S2196888822500300"],"URL":"https:\/\/doi.org\/10.1142\/s2196888822500300","relation":{},"ISSN":["2196-8888","2196-8896"],"issn-type":[{"value":"2196-8888","type":"print"},{"value":"2196-8896","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,8,10]]}}}