{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,26]],"date-time":"2025-11-26T11:06:16Z","timestamp":1764155176490,"version":"3.46.0"},"reference-count":63,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2025,11,26]],"date-time":"2025-11-26T00:00:00Z","timestamp":1764115200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["MAKE"],"abstract":"<jats:p>Marine mammal vocalizations, such as those of the Northern Right Whale (NARW), are often masked by underwater acoustic noise. The acoustic vocalization signals are characterized by features such as their amplitude, timing, modulation, duration, and spectral content, which cannot be robustly captured by a single feature extraction method. These complex signals pose additional detection challenges beyond their low SNR. Consequently, this study proposes a novel low-SNR NARW classifier for passive acoustic monitoring (PAM). This approach employs an ideal binary mask with a bidirectional long short-term memory highway network (IBM-BHN) to effectively detect and classify NARW upcalls in challenging conditions. To enhance model performance, the reported literature limitations were addressed by employing a hybrid feature extraction method and leveraging the BiLSTM to capture and learn temporal dependencies. Furthermore, the integration of a highway network improves information flow, enabling near-real-time classification and superior model performance. Experimental results show the IBM-BHN method outperformed five considered state-of-the-art baseline models. Specifically, the IBM-BHN achieved an accuracy of 98%, surpassing ResNet (94%), CNN (85%), LSTM (83%), ANN (82%), and SVM (67%). These findings highlight the practical potential of IBM-BHN to support near-real-time monitoring and inform evidence-based, adaptive policy enforcement critical for NARW conservation.<\/jats:p>","DOI":"10.3390\/make7040154","type":"journal-article","created":{"date-parts":[[2025,11,26]],"date-time":"2025-11-26T10:04:57Z","timestamp":1764151497000},"page":"154","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Low-SNR Northern Right Whale Upcall Detection and Classification Using Passive Acoustic Monitoring to Reduce Adverse Human\u2013Whale Interactions"],"prefix":"10.3390","volume":"7","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3059-6920","authenticated-orcid":false,"given":"Doyinsola D.","family":"Olatinwo","sequence":"first","affiliation":[{"name":"Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 4R2, Canada"}]},{"given":"Mae L.","family":"Seto","sequence":"additional","affiliation":[{"name":"Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 4R2, Canada"}]}],"member":"1968","published-online":{"date-parts":[[2025,11,26]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"105100","DOI":"10.1016\/j.ocecoaman.2020.105100","article-title":"Reflections on the ecosystem services of whales and valuing their contribution to human well-being","volume":"186","author":"Cook","year":"2020","journal-title":"Ocean Coast. Manag."},{"key":"ref_2","unstructured":"NOAA Fisheries (2023, November 02). Laws Policies: Marine Mammal Protection Act, Available online: https:\/\/www.fisheries.noaa.gov\/topic\/laws-policies\/marine-mammal-protection-act."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Erceg, M., and Palamas, G. (2023, January 4\u20138). Towards Harmonious Coexistence: A Bioacoustic-Driven Animal-Computer. Proceedings of the Interaction System for Preventing Ship Collisions with North Atlantic Right Whales, ACI, Raleigh, NC, USA.","DOI":"10.1145\/3637882.3637890"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Roman, J., Estes, J.A., Morissette, L., Smith, C., Costa, D.M., Nation, J.B., Nicol, S., Pershing, A., and Smetacek, V. (2014). Whales as marine ecosystem engineers. Front. Ecol. Environ., 12.","DOI":"10.1890\/130220"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Olatinwo, D.D., and Seto, M.L. (2024, January 23\u201326). Detection of Marine Mammal Vocalizations in Low SNR Environments with Ideal Binary Mask. Proceedings of the IEEE OCEANS Conference, Halifax, NS, Canada.","DOI":"10.1109\/OCEANS55160.2024.10754093"},{"key":"ref_6","unstructured":"Chami, R., and Cosimano, T.C. (2024, September 23). Nature\u2019s Solution to Climate Change. Available online: https:\/\/www.imf.org\/en\/publications\/fandd\/issues\/2019\/12\/natures-solution-to-climate-change-chami."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Brunoldi, M., Bozzini, G., Casale, A., Corvisiero, P., Grosso, D., Magnoli, N., Alessi, J., Bianchi, C.N., Mandich, A., and Morri, C. (2016). A permanent automated real-time passive acoustic monitoring system for bottlenose dolphin conservation in the mediterranean sea. PLoS ONE, 11.","DOI":"10.1371\/journal.pone.0145362"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1111\/brv.12001","article-title":"Estimating animal population density using passive acoustics","volume":"88","author":"Marques","year":"2013","journal-title":"Biol. Rev."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"3651","DOI":"10.1121\/1.3651817","article-title":"Vocal characteristics of pygmy blue whales and their change over time","volume":"130","author":"Gavrilov","year":"2011","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Gillespie, D., Hastie, G., Montabaranom, J., Longden, E., Rapson, K., Holoborodko, A., and Sparling, C. (2023). Automated detection and tracking of marine mammals in the vicinity of tidal turbines using multibeam sonar. J. Mar. Sci. Eng., 11.","DOI":"10.3390\/jmse11112095"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1195","DOI":"10.1017\/S0025315413000568","article-title":"Long-term passive acoustic monitoring revealed seasonal and diel patterns of cetacean presence in the Istanbul strait","volume":"94","author":"Dede","year":"2014","journal-title":"J. Mar. Biol. Assoc. United Kingd."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Sherin, B.M., and Supriya, M.H. (2015, January 23). Selection and Parameter Optimization of SVM Kernel Function for Underwater Target Classification. Proceedings of the 2015 IEEE Underwater Technology (UT) 2015, Chennai, India.","DOI":"10.1109\/UT.2015.7108260"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"127141","DOI":"10.1109\/ACCESS.2024.3454815","article-title":"Convolutional Neural Networks for enhancing detection of Dolphin whistles in a dense acoustic environment","volume":"12","author":"Scaradozzi","year":"2024","journal-title":"IEEE Access"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"22","DOI":"10.3390\/make4010002","article-title":"Transfer learning evaluation of deep neural networks for image classification","volume":"4","author":"Zengeler","year":"2022","journal-title":"Mach. Learn. Knowl. Extr."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"3672","DOI":"10.1121\/10.0039669","article-title":"North Atlantic right whale detection range performance quantification on a bottom-mounted linear hydrophone array using a calibrated acoustic source","volume":"158","author":"Premus","year":"2025","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"102906","DOI":"10.1016\/j.ecoinf.2024.102906","article-title":"A deep learning model for detecting and classifying multiple marine mammal species from passive acoustic data","volume":"84","author":"Hamard","year":"2024","journal-title":"Ecol. Inform."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"107020","DOI":"10.1016\/j.apacoust.2019.107020","article-title":"Trends in audio signal feature extraction methods","volume":"158","author":"Sharma","year":"2020","journal-title":"Appl. Acoust."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"101036","DOI":"10.1016\/j.ecoinf.2019.101036","article-title":"Active contourbased detection of estuarine dolphin whistles in spectrogram images","volume":"55","author":"Serra","year":"2020","journal-title":"Ecol. Inform."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Ibrahim, A.K., Zhuang, H., Erdol, N., and Ali, A.M. (2016, January 4\u20136). A new approach for north atlantic right whale upcall detection. Proceedings of the IEEE 2016 International Symposium on Computer, Consumer and Control (IS3C), Xi\u2019an, China.","DOI":"10.1109\/IS3C.2016.76"},{"key":"ref_20","unstructured":"Pourhomayoun, M., Dugan, P., Popescu, M., Risch, D., Lewis, H., and Clark, C. (2013, January 16\u201321). Classification for Big Dataset of Bioacoustic Signals Based on Human Scoring System and Artificial Neural Network. Proceedings of the ICML 2013 Workshop on Machine Learning for Bioacoustic, Atlanta, GA, USA."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1256","DOI":"10.1016\/j.dsp.2009.10.024","article-title":"Blue whale calls classification using short time fourier and wavelet packet transforms and artificial neural network","volume":"20","author":"Bahoura","year":"2010","journal-title":"Digit. Signal Process."},{"key":"ref_22","first-page":"14","article-title":"Introduction to machine learning, neural networks, and deep learning","volume":"9","author":"Choi","year":"2020","journal-title":"Transl. Vis. Sci. Technol."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Wang, D., Zhang, L., Lu, Z., and Xu, K. (2018, January 14). Large-scale whale call classification using deep convolutional neural network architectures. Proceedings of the 2018 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), Qingdao, China.","DOI":"10.1109\/ICSPCC.2018.8567758"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Brefeld, U., Fromont, E., Hotho, A., Knobbe, A., Maathuis, M., and Robardet, C. (2020). Marine Mammal Species Classification Using Convolutional Neural Networks and a Novel Acoustic Representation. Machine Learning and Knowledge Discovery in Databases, Springer. ECML PKDD, 2019.","DOI":"10.1007\/978-3-030-46150-8"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"2636","DOI":"10.1121\/10.0001132","article-title":"Performance of a deep neural network at detecting north atlantic right whale upcalls","volume":"147","author":"Kirsebom","year":"2020","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Buchanan, C., Bi, Y., Xue, B., Vennell, R., Childe, S.H., Pine, M.K., and Zhang, M. (2021, January 9). Deep convolutional neural networks for detecting dolphin echolocation clicks. Proceedings of the 2021 36th International Conference on Image and Vision Computing New Zealand (IVCNZ), Tauranga, New Zealand.","DOI":"10.1109\/IVCNZ54163.2021.9653250"},{"key":"ref_27","first-page":"4466037","article-title":"Detection method for echolocation clicks based on LSTM networks","volume":"2022","author":"Duan","year":"2022","journal-title":"Mob. Inf. Syst."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1177\/01445987241269496","article-title":"Comparative study of long short-term memory (LSTM), bidirectional LSTM, and traditional machine learning approaches for energy consumption prediction","volume":"43","author":"Alizadegan","year":"2025","journal-title":"Energy Explor. Exploit."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Makropoulos, D.N., Filntisis, P.P., Prospathopoulos, A., Kassis, D., Tsiami, A., and Maragos, P. (2025, January 25). Improving classification of marine mammal vocalizations using vision transformers and phase-related features. Proceedings of the 2025 25th International Conference on Digital Signal Processing (DSP), Costa Navarino, Greece.","DOI":"10.1109\/DSP65409.2025.11075076"},{"key":"ref_30","first-page":"2049","article-title":"Audio Spectrogram Transformer: General Audio Classification with Image Transformers","volume":"30","author":"Gong","year":"2022","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"You, S.H., Coyotl, E.P., Gunturu, S., and Van Segbroeck, M. (2023, January 4\u201310). Transformer-Based Bioacoustic Sound Event Detection on Few-Shot Learning Tasks. Proceedings of the ICASSP 2023\u20142023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece.","DOI":"10.1109\/ICASSP49357.2023.10097081"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"3684","DOI":"10.1109\/TASLP.2024.3428908","article-title":"Asit: Local-global audio spectrogram vision transformer for event classification","volume":"32","author":"Ahmed","year":"2024","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_33","unstructured":"DCLDE (2025, April 07). DCLDE 2013 Workshop Dataset. Available online: https:\/\/research-portal.st-andrews.ac.uk\/en\/datasets\/dclde-2013-workshop-dataset."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"837","DOI":"10.1111\/j.1748-7692.2010.00376.x","article-title":"Visual and acoustic surveys for North Atlantic right whales, Eubalaena glacialis, in Cape Cod Bay, Massachusetts, 2001\u20132005: Management implications","volume":"26","author":"Clark","year":"2010","journal-title":"Mar. Mammal Sci."},{"key":"ref_35","unstructured":"Thomas, M., Martin, B., Kowarski, K., Gaudet, B., and Matwin, S. (2019, January 14). Detecting Endangered Baleen Whales within Acoustic Recordings using R-CNNs. Proceedings of the AI for Social Good Workshop at NeurIPS, Vancouver, VA, Canada."},{"key":"ref_36","first-page":"105","article-title":"Why 70\/30 or 80\/20 relation between training and testing sets: A pedagogical explanation","volume":"11","author":"Gholamy","year":"2018","journal-title":"Int. J. Intell. Technol. Appl. Stat."},{"key":"ref_37","unstructured":"Pandas (2025, August 08). DataFrame\u2014Pandas 2.3.1 Documentation. Available online: https:\/\/pandas.pydata.org\/docs\/reference\/api\/pandas.DataFrame.html."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zhang, W., Li, X., Zhou, A., Ren, K., and Song, J. (2021, January 24\u201326). Underwater acoustic source separation with deep Bi-LSTM networks. Proceedings of the IEEE 4th International Conference on Information Communication and Signal Processing (ICICSP), Shanghai, China.","DOI":"10.1109\/ICICSP54369.2021.9611911"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Divenyi, P. (2005). On ideal binary mask as the computational goal of auditory scene analysis. Speech Separation by Humans and Machines, Springer.","DOI":"10.1007\/b99695"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"1702","DOI":"10.1109\/TASLP.2018.2842159","article-title":"Supervised speech separation based on deep learning: An overview","volume":"26","author":"Wang","year":"2018","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_41","unstructured":"Hu, G., and Wang, D.L. (2001, January 24). Speech segregation based on pitch tracking and amplitude modulation. In Proceeding of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Platz, NY, USA."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"1135","DOI":"10.1109\/TNN.2004.832812","article-title":"Monaural speech segregation based on pitch tracking and amplitude modulation","volume":"15","author":"Hu","year":"2004","journal-title":"IEEE Trans. Neural Netw."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Chen, J., Liu, C., Xie, J., An, J., and Huang, N. (2022). Time\u2013frequency mask-aware bidirectional lstm: A deep learning approach for underwater acoustic signal separation. Sensors, 15.","DOI":"10.3390\/s22155598"},{"key":"ref_44","unstructured":"Ravid, R. (2019). Practical Statistics for Educators, Rowman & Littlefield Publishers."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"2742","DOI":"10.1109\/TNSRE.2021.3129790","article-title":"Deep correlation analysis for audio-EEG decoding","volume":"29","author":"Katthi","year":"2001","journal-title":"IEEE Trans. Neural Syst. Rehabil. Eng."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"107581","DOI":"10.1016\/j.apacoust.2020.107581","article-title":"Spectral images based environmental sound classification using CNN with meaningful data augmentation","volume":"172","author":"Mushtaq","year":"2001","journal-title":"Appl. Acoust."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1094","DOI":"10.1121\/10.0017118","article-title":"Machine-learning-based simultaneous detection and ranging of impulsive baleen whale vocalizations using a single hydrophone","volume":"153","author":"Goldwater","year":"2023","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Raju, V.G., Lakshmi, K.P., Jain, V.M., Kalidindi, A., and Padma, V. (2020, January 20). Study the influence of normalization\/transformation process on the accuracy of supervised classification. Proceedings of the IEEE Third International Conference on Smart Systems and Inventive Technology (ICSSIT), Tirunelveli, India.","DOI":"10.1109\/ICSSIT48917.2020.9214160"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Ahsan, M.M., Mahmud, M.P., Saha, P.K., Gupta, K.D., and Siddique, Z. (2021). Effect of data scaling methods on machine learning algorithms and model performance. Technologies, 9.","DOI":"10.3390\/technologies9030052"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Wilkinson, N., and Niesler, T. (2021, January 6). A hybrid CNN-BiLSTM voice activity detector. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada.","DOI":"10.1109\/ICASSP39728.2021.9415081"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Hendricks, B., Keen, E.M., Wray, J.L., Alidina, H.M., McWhinnie, L., Meuter, H., Picard, C.R., and Gulliver, T.A. (2018, January 28). Automated monitoring and analysis of marine mammal vocalizations in coastal habitats. Proceedings of the IEEE OCEANS-MTS\/IEEE Kobe Techno-Oceans (OTO), Kobe, Japan.","DOI":"10.1109\/OCEANSKOBE.2018.8559432"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Zhu, C., Seri, S.G., Mohebbi-Kalkhoran, H., and Ratilal, P. (2020). Long-range automatic detection, acoustic signature characterization and bearing-time estimation of multiple ships with coherent hydrophone array. Remote Sens., 12.","DOI":"10.3390\/rs12223731"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Graves, A., Fern\u00e1ndez, S., and Schmidhuber, J. (2005, January 11\u201315). Bidirectional LSTM networks for improved phoneme classification and recognition. Proceedings of the International Conference on Artificial Neural Networks, Warsaw, Poland.","DOI":"10.1007\/11550907_126"},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1016\/j.neucom.2020.01.006","article-title":"Bidirectional lstm with self-attention mechanism and multi-channel features for sentiment classification","volume":"387","author":"Li","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1016\/j.neucom.2019.01.078","article-title":"Bidirectional lstm with attention mechanism and convolutional layer for text classification","volume":"337","author":"Liu","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"1235","DOI":"10.1162\/neco_a_01199","article-title":"A review of recurrent neural networks: LSTM cells and network architectures","volume":"31","author":"Yu","year":"2019","journal-title":"Neural Comput."},{"key":"ref_57","unstructured":"Zilly, J.G. (2017, January 17). Recurrent highway networks. Proceedings of the 34th International Conference on Machine Learning, PMLR, Sydney, Australia."},{"key":"ref_58","first-page":"418","article-title":"Mechanism of Overfitting Avoidance Techniques for Training Deep Neural Networks","volume":"2022","author":"Sabiri","year":"2022","journal-title":"ICEIS"},{"key":"ref_59","unstructured":"Brownlee, J. (2018). Better Deep Learning: Train Faster, Reduce Overfitting, And Make Better Predictions, Machine Learning Mastery."},{"key":"ref_60","unstructured":"Chollet, F. (2021). Deep Learning with Python, Simon and Schuster."},{"key":"ref_61","unstructured":"Raschka, S. (2025, November 09). Stat 451: Machine Learning Lecture Notes. Available online: https:\/\/sebastianraschka.com\/pdf\/lecture-notes\/stat451fs20\/07-ensembles__notes.pdf."},{"key":"ref_62","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1109\/TPAMI.2022.3152247","article-title":"A survey on vision transformer","volume":"45","author":"Han","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intelligence"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Song, G., Guo, X., Zhang, Q., Li, J., and Ma, L. (2023). Underwater Noise Modeling and its Application in Noise Classification with Small-Sized Samples. J. Electron., 12.","DOI":"10.3390\/electronics12122669"}],"container-title":["Machine Learning and Knowledge Extraction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-4990\/7\/4\/154\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,26]],"date-time":"2025-11-26T10:56:15Z","timestamp":1764154575000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-4990\/7\/4\/154"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,26]]},"references-count":63,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["make7040154"],"URL":"https:\/\/doi.org\/10.3390\/make7040154","relation":{},"ISSN":["2504-4990"],"issn-type":[{"value":"2504-4990","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,11,26]]}}}