{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T14:04:19Z","timestamp":1760709859306,"version":"build-2065373602"},"reference-count":45,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2019,3,20]],"date-time":"2019-03-20T00:00:00Z","timestamp":1553040000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>The application of machine learning techniques to sound signals requires the previous characterization of said signals. In many cases, their description is made using cepstral coefficients that represent the sound spectra. In this paper, the performance in obtaining cepstral coefficients by two integral transforms, Discrete Fourier Transform (DFT) and Discrete Cosine Transform (DCT), are compared in the context of processing anuran calls. Due to the symmetry of sound spectra, it is shown that DCT clearly outperforms DFT, and decreases the error representing the spectrum by more than 30%. Additionally, it is demonstrated that DCT-based cepstral coefficients are less correlated than their DFT-based counterparts, which leads to a significant advantage for DCT-based cepstral coefficients if these features are later used in classification algorithms. Since the DCT superiority is based on the symmetry of sound spectra and not on any intrinsic advantage of the algorithm, the conclusions of this research can definitely be extrapolated to include any sound signal.<\/jats:p>","DOI":"10.3390\/sym11030405","type":"journal-article","created":{"date-parts":[[2019,3,21]],"date-time":"2019-03-21T04:11:56Z","timestamp":1553141516000},"page":"405","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Exploiting the Symmetry of Integral Transforms for Featuring Anuran Calls"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1205-4722","authenticated-orcid":false,"given":"Amalia","family":"Luque","sequence":"first","affiliation":[{"name":"Ingenier\u00eda del Dise\u00f1o, Escuela Polit\u00e9cnica Superior, Universidad de Sevilla, 41004 Sevilla, Spain"}]},{"given":"Jes\u00fas","family":"G\u00f3mez-Bellido","sequence":"additional","affiliation":[{"name":"Ingenier\u00eda del Dise\u00f1o, Escuela Polit\u00e9cnica Superior, Universidad de Sevilla, 41004 Sevilla, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9474-3929","authenticated-orcid":false,"given":"Alejandro","family":"Carrasco","sequence":"additional","affiliation":[{"name":"Tecnolog\u00eda Electr\u00f3nica, Escuela Ingenier\u00eda Inform\u00e1tica, Universidad de Sevilla, 41004 Sevilla, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9132-6158","authenticated-orcid":false,"given":"Julio","family":"Barbancho","sequence":"additional","affiliation":[{"name":"Tecnolog\u00eda Electr\u00f3nica, Escuela Polit\u00e9cnica Superior, Universidad de Sevilla, 41004 Sevilla, Spain"}]}],"member":"1968","published-online":{"date-parts":[[2019,3,20]]},"reference":[{"key":"ref_1","first-page":"39","article-title":"A critical review and analysis on techniques of speech recognition: The road ahead","volume":"22","author":"Haridas","year":"2018","journal-title":"Int. J. Knowl.-Based Intell. Eng. Syst."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1016\/j.bspc.2018.09.003","article-title":"On the design of automatic voice condition analysis systems. Part II: Review of speaker recognition techniques and study on the effects of different variability factors","volume":"48","year":"2019","journal-title":"Biomed. Signal Process. Control"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Vo, T., Nguyen, T., and Le, C. (2018). Race Recognition Using Deep Convolutional Neural Networks. Symmetry, 10.","DOI":"10.3390\/sym10110564"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Dahake, P.P., Shaw, K., and Malathi, P. (2016, January 9\u201310). Speaker dependent speech emotion recognition using MFCC and Support Vector Machine. Proceedings of the 2016 International Conference on Automatic Control and Dynamic Optimization Techniques (ICACDOT), Pune, India.","DOI":"10.1109\/ICACDOT.2016.7877753"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Chakraborty, S.S., and Parekh, R. (2018). Improved Musical Instrument Classification Using Cepstral Coefficients and Neural Networks. Methodologies and Application Issues of Contemporary Computing Framework, Springer.","DOI":"10.1007\/978-981-13-2345-4_10"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Panteli, M., Benetos, E., and Dixon, S. (2017). A computational study on outliers in world music. PLoS ONE, 12.","DOI":"10.1371\/journal.pone.0189399"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Noda, J.J., S\u00e1nchez-Rodr\u00edguez, D., and Travieso-Gonz\u00e1lez, C.M. (2018). A Methodology Based on Bioacoustic Information for Automatic Identification of Reptiles and Anurans. Reptiles and Amphibians, IntechOpen.","DOI":"10.5772\/intechopen.74333"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1786","DOI":"10.1121\/1.5035847","article-title":"Supervised and unsupervised machine learning approaches to classifying chimpanzee vocalizations","volume":"143","author":"Desai","year":"2018","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"2834","DOI":"10.1121\/1.5036628","article-title":"Automatic fish sounds classification","volume":"143","author":"Malfante","year":"2018","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Wang, Y., Sun, B., Yang, X., and Meng, Q. (2017, January 20\u201322). Heart sound identification based on MFCC and short-term energy. Proceedings of the 2017 Chinese Automation Congress (CAC), Jinan, China.","DOI":"10.1109\/CAC.2017.8244117"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Usman, M., Zubair, M., Shiblee, M., Rodrigues, P., and Jaffar, S. (2018). Probabilistic Modeling of Speech in Spectral Domain using Maximum Likelihood Estimation. Symmetry, 10.","DOI":"10.3390\/sym10120750"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Cao, J., Cao, M., Wang, J., Yin, C., Wang, D., and Vidal, P.P. (2018). Urban noise recognition with convolutional neural network. Multimed. Tools Appl.","DOI":"10.1007\/s11042-018-6295-8"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Xu, J., Wang, Z., Tan, C., Lu, D., Wu, B., Su, Z., and Tang, Y. (2018). Cutting Pattern Identification for Coal Mining Shearer through Sound Signals Based on a Convolutional Neural Network. Symmetry, 10.","DOI":"10.3390\/sym10120736"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Lee, J., Choi, H., Park, D., Chung, Y., Kim, H.Y., and Yoon, S. (2016). Fault detection and diagnosis of railway point machines by sound analysis. Sensors, 16.","DOI":"10.3390\/s16040549"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Choi, Y., Atif, O., Lee, J., Park, D., and Chung, Y. (2018). Noise-Robust Sound-Event Classification System with Texture Analysis. Symmetry, 10.","DOI":"10.3390\/sym10090402"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Guyon, I., and Elisseeff, A. (2006). An introduction to feature extraction. Feature Extraction, Springer.","DOI":"10.1007\/978-3-540-35488-8"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Al\u00edas, F., Socor\u00f3, J., and Sevillano, X. (2016). A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds. Appl. Sci., 6.","DOI":"10.3390\/app6050143"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Zhang, H., McLoughlin, I., and Song, Y. (2015, January 19\u201324). Robust sound event recognition using convolutional neural networks. Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia.","DOI":"10.1109\/ICASSP.2015.7178031"},{"key":"ref_19","first-page":"1","article-title":"Feature extraction methods LPC, PLP and MFCC in speech recognition","volume":"1","author":"Dave","year":"2013","journal-title":"Int. J. Adv. Res. Eng. Technol."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"605","DOI":"10.1109\/JSTSP.2017.2684705","article-title":"Spectral features for synthetic speech detection","volume":"11","author":"Paul","year":"2017","journal-title":"IEEE J. Sel. Top. Signal Process."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Taebi, A., and Mansy, H.A. (2017, January 2). Analysis of seismocardiographic signals using polynomial chirplet transform and smoothed pseudo Wigner-Ville distribution. Proceedings of the 2017 IEEE Signal Processing in Medicine and Biology Symposium (SPMB), Philadelphia, PA, USA.","DOI":"10.1109\/SPMB.2017.8257022"},{"key":"ref_22","unstructured":"Dayou, J., Han, N.C., Mun, H.C., Ahmad, A.H., Muniandy, S.V., and Dalimin, M.N. (2011, January 7\u20139). Classification and identification of frog sound based on entropy approach. Proceedings of the 2011 International Conference on Life Science and Technology, Mumbai, India."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"582","DOI":"10.1007\/BF02943243","article-title":"Comparison of different implementations of MFCC","volume":"16","author":"Zheng","year":"2001","journal-title":"J. Comput. Sci. Technol."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Hussain, H., Ting, C.M., Numan, F., Ibrahim, M.N., Izan, N.F., Mohammad, M.M., and Sh-Hussain, H. (2017, January 12\u201314). Analysis of ECG biosignal recognition for client identifiction. Proceedings of the 2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA), Kuching, Malaysia.","DOI":"10.1109\/ICSIPA.2017.8120572"},{"key":"ref_25","first-page":"57","article-title":"Classification of Acceleration Data for Biometric Gait Recognition on Mobile Devices","volume":"11","author":"Nickel","year":"2011","journal-title":"Biosig"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Muheidat, F., Tyrer, W.H., and Popescu, M. (2018, January 18\u201321). Walk Identification using a smart carpet and Mel-Frequency Cepstral Coefficient (MFCC) features. Proceedings of the 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, USA.","DOI":"10.1109\/EMBC.2018.8513340"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Negi, S.S., Kishor, N., Negi, R., and Uhlen, K. (2018, January 24\u201327). Event signal characterization for disturbance interpretation in power grid. Proceedings of the 2018 First International Colloquium on Smart Grid Metrology (SmaGriMet), Split, Croatia.","DOI":"10.23919\/SMAGRIMET.2018.8369844"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1007\/s10462-016-9529-z","article-title":"Frog call classification: A survey","volume":"49","author":"Xie","year":"2018","journal-title":"Artif. Int. Rev."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.eswa.2018.03.062","article-title":"Feature evaluation for unsupervised bioacoustic signal segmentation of anuran calls","volume":"106","author":"Colonna","year":"2018","journal-title":"Expert Syst. Appl."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1016\/j.eswa.2017.11.016","article-title":"Non-sequential automatic classification of anuran sounds for the estimation of climate-change indicators","volume":"95","author":"Luque","year":"2018","journal-title":"Expert Syst. Appl."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"e4732","DOI":"10.7717\/peerj.4732","article-title":"Temporally-aware algorithms for the classification of anuran sounds","volume":"6","author":"Luque","year":"2018","journal-title":"PeerJ"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Luque, A., Romero-Lemos, J., Carrasco, A., and Barbancho, J. (2018). Improving Classification Algorithms by Considering Score Series in Wireless Acoustic Sensor Networks. Sensors, 18.","DOI":"10.3390\/s18082465"},{"key":"ref_33","unstructured":"Romero, J., Luque, A., and Carrasco, A. (2016, January 14\u201316). Anuran sound classification using MPEG-7 frame descriptors. Proceedings of the XVII Conferencia de la Asociaci\u00f3n Espa\u00f1ola para la Inteligencia Artificial (CAEPIA), Salamanca, Spain."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Luque, A., G\u00f3mez-Bellido, J., Carrasco, A., Personal, E., and Leon, C. (2017). Evaluation of the processing times in anuran sound classification. Wireless Communications and Mobile Computing.","DOI":"10.1155\/2017\/8079846"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Luque, A., G\u00f3mez-Bellido, J., Carrasco, A., and Barbancho, J. (2018). Optimal Representation of Anuran Call Spectrum in Environmental Monitoring Systems Using Wireless Sensor Networks. Sensors, 18.","DOI":"10.3390\/s18061803"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Hershey, S., Chaudhuri, S., Ellis, D.P., Gemmeke, J.F., Jansen, A., Moore, R.C., Plakal, M., Platt, D., Saurous, R.A., and Seybold, B. (2017, January 5\u20139). CNN architectures for large-scale audio classification. Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA.","DOI":"10.1109\/ICASSP.2017.7952132"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Dai, W., Dai, C., Qu, S., Li, J., and Das, S. (2017, January 5\u20139). Very deep convolutional neural networks for raw waveforms. Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA.","DOI":"10.1109\/ICASSP.2017.7952190"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Strout, J., Rogan, B., Seyednezhad, S.M., Smart, K., Bush, M., and Ribeiro, E. (2017, January 5\u20139). Anuran call classification with deep learning. Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA.","DOI":"10.1109\/ICASSP.2017.7952639"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Colonna, J., Peet, T., Ferreira, C.A., Jorge, A.M., Gomes, E.F., and Gama, J. (2016, January 20\u201322). Automatic classification of anuran sounds using convolutional neural networks. Proceedings of the Ninth International Conference on Computer Science & Software Engineering, Porto, Portugal.","DOI":"10.1145\/2948992.2949016"},{"key":"ref_40","first-page":"1","article-title":"Comparative performance analysis of hamming, hanning and blackman window","volume":"96","author":"Podder","year":"2014","journal-title":"Int. J. Comput. Appl."},{"key":"ref_41","unstructured":"O\u2019shaughnessy, D. (1999). Speech Communication: Human and Machine, Wiley-IEEE Press. [2nd ed.]."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Bhatia, R. (2005). Fourier Series, American Mathematical Society.","DOI":"10.5948\/UPO9781614441045"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Broughton, S.A., and Bryan, K. (2018). Discrete Fourier Analysis and Wavelets: Applications to Signal and Image Processing, John Wiley & Sons.","DOI":"10.1002\/9781119473329"},{"key":"ref_44","unstructured":"Rao, K.R., and Yip, P. (2014). Discrete Cosine Transform: Algorithms, Advantages, Applications, Academic Press."},{"key":"ref_45","unstructured":"Tan, L., and Jiang, J. (2018). Digital Signal Processing: Fundamentals and Applications, Academic Press."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/11\/3\/405\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:39:13Z","timestamp":1760186353000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/11\/3\/405"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,3,20]]},"references-count":45,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2019,3]]}},"alternative-id":["sym11030405"],"URL":"https:\/\/doi.org\/10.3390\/sym11030405","relation":{},"ISSN":["2073-8994"],"issn-type":[{"type":"electronic","value":"2073-8994"}],"subject":[],"published":{"date-parts":[[2019,3,20]]}}}