{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,1]],"date-time":"2026-06-01T12:52:01Z","timestamp":1780318321230,"version":"3.54.1"},"reference-count":35,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2018,12,19]],"date-time":"2018-12-19T00:00:00Z","timestamp":1545177600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["51179157"],"award-info":[{"award-number":["51179157"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>Detecting and classifying ships based on radiated noise provide practical guidelines for the reduction of underwater noise footprint of shipping. In this paper, the detection and classification are implemented by auditory inspired convolutional neural networks trained from raw underwater acoustic signal. The proposed model includes three parts. The first part is performed by a multi-scale 1D time convolutional layer initialized by auditory filter banks. Signals are decomposed into frequency components by convolution operation. In the second part, the decomposed signals are converted into frequency domain by permute layer and energy pooling layer to form frequency distribution in auditory cortex. Then, 2D frequency convolutional layers are applied to discover spectro-temporal patterns, as well as preserve locality and reduce spectral variations in ship noise. In the third part, the whole model is optimized with an objective function of classification to obtain appropriate auditory filters and feature representations that are correlative with ship categories. The optimization reflects the plasticity of auditory system. Experiments on five ship types and background noise show that the proposed approach achieved an overall classification accuracy of 79.2%, which improved by 6% compared to conventional approaches. Auditory filter banks were adaptive in shape to improve accuracy of classification.<\/jats:p>","DOI":"10.3390\/e20120990","type":"journal-article","created":{"date-parts":[[2018,12,19]],"date-time":"2018-12-19T12:12:44Z","timestamp":1545221564000},"page":"990","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":50,"title":["Auditory Inspired Convolutional Neural Networks for Ship Type Classification with Raw Hydrophone Data"],"prefix":"10.3390","volume":"20","author":[{"given":"Sheng","family":"Shen","sequence":"first","affiliation":[{"name":"School of Marine Science and Technology, Northwestern Polytechnical University, Xi\u2019an 710072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7611-4192","authenticated-orcid":false,"given":"Honghui","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Marine Science and Technology, Northwestern Polytechnical University, Xi\u2019an 710072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Junhao","family":"Li","sequence":"additional","affiliation":[{"name":"School of Marine Science and Technology, Northwestern Polytechnical University, Xi\u2019an 710072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Guanghui","family":"Xu","sequence":"additional","affiliation":[{"name":"School of Marine Science and Technology, Northwestern Polytechnical University, Xi\u2019an 710072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Meiping","family":"Sheng","sequence":"additional","affiliation":[{"name":"School of Marine Science and Technology, Northwestern Polytechnical University, Xi\u2019an 710072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2018,12,19]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"2265","DOI":"10.1121\/1.4900181","article-title":"The classification of underwater acoustic target signals based on wave structure and support vector machine","volume":"136","author":"Meng","year":"2014","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2242","DOI":"10.1121\/1.4920186","article-title":"A wave structure based method for recognition of marine acoustic target signals","volume":"137","author":"Meng","year":"2015","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_3","first-page":"8","article-title":"Underwater target recognition based on wavelet packet and principal component analysis","volume":"28","author":"Wei","year":"2011","journal-title":"Comput. Simul."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Siddagangaiah, S., Li, Y., Guo, X., Chen, X., Zhang, Q., Yang, K., and Yang, Y. (2016). A complexity-based approach for the detection of weak signals in ocean ambient noise. Entropy, 18.","DOI":"10.3390\/e18030101"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1049\/iet-rsn.2011.0142","article-title":"Marine vessel classification based on passive sonar data: The cepstrum-based approach","volume":"7","author":"Das","year":"2013","journal-title":"Iet Radar Sonar Navig."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1016\/j.apacoust.2016.06.008","article-title":"ShipsEar: An underwater vessel noise database","volume":"113","year":"2016","journal-title":"Appl. Acoust."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"134304","DOI":"10.7498\/aps.63.134304","article-title":"Underwater acoustic target classification and auditory feature identification based on dissimilarity evaluation","volume":"63","author":"Yang","year":"2014","journal-title":"Acta Phys. Sin."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"7864213","DOI":"10.1155\/2016\/7864213","article-title":"Feature extraction of underwater target signal using Mel frequency cepstrum coefficients based on acoustic vector sensor","volume":"2016","author":"Zhang","year":"2016","journal-title":"J. Sens."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"978","DOI":"10.1038\/nature04485","article-title":"Efficient auditory coding","volume":"439","author":"Smith","year":"2006","journal-title":"Nature"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Yang, H., Gan, A., Chen, H., and Pan, Y. (2016, January 12\u201316). Underwater acoustic target recognition using SVM ensemble via weighted sample and feature selection. Proceedings of the International Bhurban Conference on Applied Sciences and Technology, Islamabad, Pakistan.","DOI":"10.1109\/IBCAST.2016.7429928"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"605","DOI":"10.1049\/iet-rsn.2010.0157","article-title":"Preprocessing passive sonar signals for neural classification","volume":"5","author":"Filho","year":"2011","journal-title":"IET Radar Sonar Navig."},{"key":"ref_12","first-page":"31","article-title":"Classification of underwater signals using neural networks","volume":"3","author":"Chen","year":"2000","journal-title":"Tamkang J. Sci. Eng."},{"key":"ref_13","unstructured":"Damianos, K., Jan, S., Richard, S., William, H., and John, M. (2018, January 15\u201320). Individual Ship Detection Using Underwater Acoustics. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Calgary, AB, Canada."},{"key":"ref_14","unstructured":"Damianos, K., William, H., Richard, S., John, M., Stavros, T., Edin, I., and George, S. (2017, January 18\u201321). Applying speech technology to the ship-type classification problem. Proceedings of the OCEANS 2017, Anchorage, AK, USA."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1049\/iet-rsn.2015.0179","article-title":"Class-modular multi-layer perceptron networks for supporting passive sonar signal classification","volume":"10","author":"Filho","year":"2016","journal-title":"IET Radar Sonar Navig."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Sainath, T.N., Kingsbury, B., Mohamed, A.R., and Ramabhadran, B. (2013, January 8\u201312). Learning filter banks within a deep neural network framework. Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Olomouc, Czech Republic.","DOI":"10.1109\/ASRU.2013.6707746"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Kamal, S., Mohammed, S.K., Pillai, P.R.S., and Supriya, M.H. (2013, January 23\u201325). Deep learning architectures for underwater target recognition. Proceedings of the 2013 International Symposium on Ocean Electronics, Kochi, India.","DOI":"10.1109\/SYMPOL.2013.6701911"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Cao, X., Zhang, X., Yu, Y., and Niu, L. (2017, January 16\u201318). Deep learning-based recognition of underwater target. Proceedings of the IEEE International Conference on Digital Signal Processing, Beijing, China.","DOI":"10.1109\/ICDSP.2016.7868522"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Yang, H., Shen, S., Yao, X., Sheng, M., and Wang, C. (2018). Competitive deep-belief networks for underwater acoustic target recognition. Sensors, 18.","DOI":"10.3390\/s18040952"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Shen, S., Yang, H., and Sheng, M. (2018). Compression of a deep competitive network based on mutual information for underwater acoustic targets recognition. Entropy, 20.","DOI":"10.3390\/e20040243"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Mu, L., Peng, Y., Qiu, M., Yang, X., Hu, C., and Zhang, F. (2016, January 9\u201311). Study on modulation spectrum feature extraction of ship radiated noise based on auditory model. Proceedings of the 2016 IEEE\/OES China Ocean Acoustics, Harbin, China.","DOI":"10.1109\/COA.2016.7535765"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1109\/48.180304","article-title":"A neural network based hybrid system for detection, characterization, and classification of short-duration oceanic signals","volume":"17","author":"Ghosh","year":"2002","journal-title":"IEEE J. Ocean. Eng."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1063\/1.1387599","article-title":"Psychoacoustics: Facts and models","volume":"54","author":"Zwicker","year":"2001","journal-title":"Phys. Today"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Moore, B.C.J. (2008). Cochlear Hearing Loss: Physiological, Psychological and Technical Issues, John Wiley and Sons. [2nd ed.].","DOI":"10.1002\/9780470987889"},{"key":"ref_25","unstructured":"Gelf, S.A. (2009). Hearing: An Introduction to Psychological and Physiological Acoustics, CRC Press."},{"key":"ref_26","unstructured":"Slaney, M. (1993). An Efficient Implementation of the Patterson-Holdsworth Auditory Filter Bank, Apple Computer."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1016\/0378-5955(90)90170-T","article-title":"Derivation of auditory filter shapes from notched-noise data","volume":"47","author":"Glasberg","year":"1990","journal-title":"Hear. Res."},{"key":"ref_28","unstructured":"Arora, S., Bhaskara, A., Ge, R., and Ma, T. (2014, January 21\u201326). Provable bounds for learning some deep representations. Proceedings of the International Conference on Machine Learning, Beijing, China."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1162\/0899766052530839","article-title":"Efficient coding of time-relative structure using spikes","volume":"17","author":"Smith","year":"2005","journal-title":"Neural Comput."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"18968","DOI":"10.1073\/pnas.1111242109","article-title":"Auditory abstraction from spectro-temporal features to coding auditory entities","volume":"109","author":"Chechik","year":"2012","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1439","DOI":"10.1126\/science.280.5368.1439","article-title":"Optimizing sound features for cortical neurons","volume":"280","author":"Decharms","year":"1998","journal-title":"Science"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Baqar, M., and Zaidi, S.S.H. (2017, January 10\u201314). Performance evaluation of linear and multi-linear subspace learning techniques for object classification based on underwater acoustics. Proceedings of the International Bhurban Conference on Applied Sciences and Technology, Islamabad, Pakistan.","DOI":"10.1109\/IBCAST.2017.7868124"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Meddis, R., and Lopez-Poveda, E.A. (2010). Auditory Periphery: From Pinna to Auditory Nerve, Springer.","DOI":"10.1007\/978-1-4419-5934-8_2"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1121\/1.3664100","article-title":"Underwater radiated noise from modern commercial ships","volume":"131","author":"Mckenna","year":"2012","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_35","first-page":"2579","article-title":"Visualizing high-dimensional data using t-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"Vigiliae Christianae"}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/20\/12\/990\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:34:55Z","timestamp":1760196895000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/20\/12\/990"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,12,19]]},"references-count":35,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2018,12]]}},"alternative-id":["e20120990"],"URL":"https:\/\/doi.org\/10.3390\/e20120990","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,12,19]]}}}