{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T00:18:30Z","timestamp":1773965910997,"version":"3.50.1"},"reference-count":34,"publisher":"Springer Science and Business Media LLC","issue":"9","license":[{"start":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T00:00:00Z","timestamp":1751846400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T00:00:00Z","timestamp":1751846400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2025,9]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Accurate and timely detection of infant needs through their suggestive cries is crucial for effective intervention and improved well-being. However, existing infant cry classification methods often struggle with the inherent variability of cries, long-term dependencies within cry patterns, and a lack of adaptability to background noise and individual differences. This paper introduces a novel \u201cAdaptive Infant Cry Classification\u201d model that addresses these limitations by dynamically selecting the most informative features from acoustic, spectral, and temporal domains using a multi-armed bandit (MAB) approach. The adaptive feature selection strategy, integrated within an Attentive Convolutional Recurrent Neural Network architecture, enhances the ability of the model to capture both temporal and spectral patterns in infant cries, leading to improved accuracy, precision, and robustness. Evaluated on a comprehensive dataset of infant cry recordings from Baby Chilanto and Donate Cry databases, our model achieves state-of-the-art performance, demonstrating its potential for real-world applications, including early detection of infant distress, infants\u2019 personalized care plans, and the development of new interventions. Experimental results demonstrated significant improvements in classification accuracy (97%) and robustness compared to conventional classical methods. Notably, the proposed framework surpasses standard baseline CNN\u2013RNN-based classifiers by 5\u20137% across multiple cry types, reducing overall error rates from around 12% to just under 5%. Ablation studies reveal that the MAB-based feature selection contributes up to a 10% relative increase in accuracy compared to static methods, while the attention components provide an additional 5% improvement. Combined, these features lead to a 10% absolute gain in F1-score under high noise conditions. This shows the model\u2019s suitability for clinical and home-based environments, aiming to improve artificial parenting anytime and anywhere.<\/jats:p>","DOI":"10.1007\/s40747-025-02000-w","type":"journal-article","created":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T00:30:58Z","timestamp":1751848258000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Adaptive infant cry classification using multi-armed bandit modality selection in an attentive convolutional recurrent neural network model"],"prefix":"10.1007","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0009-0002-6602-4534","authenticated-orcid":false,"given":"Geofrey","family":"Owino","sequence":"first","affiliation":[]},{"given":"Timothy","family":"Kamanu","sequence":"additional","affiliation":[]},{"given":"John","family":"Ndiritu","sequence":"additional","affiliation":[]},{"given":"Conlet Biketi","family":"Kikechi","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,7,7]]},"reference":[{"issue":"4","key":"2000_CR1","doi-asserted-by":"publisher","first-page":"443","DOI":"10.1017\/S0140525X0400010X","volume":"27","author":"J Soltis","year":"2004","unstructured":"Soltis J (2004) The signal functions of early infant crying. Behav Brain Sci 27(4):443\u2013458. https:\/\/doi.org\/10.1017\/S0140525X0400010X","journal-title":"Behav Brain Sci"},{"issue":"12","key":"2000_CR2","doi-asserted-by":"publisher","first-page":"2665","DOI":"10.3390\/math11122665","volume":"11","author":"D Ivanko","year":"2023","unstructured":"Ivanko D, Ryumin D, Karpov A (2023) A review of recent advances on deep learning methods for audio-visual speech recognition. Mathematics 11(12):2665. https:\/\/doi.org\/10.3390\/math11122665","journal-title":"Mathematics"},{"issue":"1","key":"2000_CR3","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1186\/s13636-021-00197-5","volume":"2021","author":"C Ji","year":"2021","unstructured":"Ji C, Mudiyanselage TB, Gao Y, Lui S, Li H (2021) A review of infant cry analysis and classification. J Audio Speech Music Process 2021(1):8. https:\/\/doi.org\/10.1186\/s13636-021-00197-5","journal-title":"J Audio Speech Music Process"},{"issue":"3","key":"2000_CR4","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1007\/BF02150709","volume":"20","author":"O Wasz-H\u00f6ckert","year":"1964","unstructured":"Wasz-H\u00f6ckert O, Partanen TJ, Vuorenkoski V, Michelsson K, Valanne E (1964) The identification of some specific meanings in infant vocalization. Experientia 20(3):154\u2013154. https:\/\/doi.org\/10.1007\/BF02150709","journal-title":"Experientia"},{"key":"2000_CR5","doi-asserted-by":"publisher","DOI":"10.3389\/fpubh.2021.670352","volume":"9","author":"PM Vincent","year":"2021","unstructured":"Vincent PM, Srinivasan K, Chang CY (2021) Deep learning assisted neonatal cry classification via support vector machine models. Front Public Health 9:670352. https:\/\/doi.org\/10.3389\/fpubh.2021.670352","journal-title":"Front Public Health"},{"issue":"3","key":"2000_CR6","doi-asserted-by":"publisher","first-page":"572","DOI":"10.1016\/j.patcog.2010.09.020","volume":"44","author":"M Ayadi","year":"2011","unstructured":"Ayadi M, Kamel MS, Karray F (2011) Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recogn 44(3):572\u2013587. https:\/\/doi.org\/10.1016\/j.patcog.2010.09.020","journal-title":"Pattern Recogn"},{"key":"2000_CR7","doi-asserted-by":"publisher","first-page":"919","DOI":"10.1007\/s10772-016-9379-8","volume":"19","author":"A Chittora","year":"2016","unstructured":"Chittora A, Patil HA (2016) Newborn infant\u2019s cry analysis. Int J Speech Technol 19:919\u2013928. https:\/\/doi.org\/10.1007\/s10772-016-9379-8","journal-title":"Int J Speech Technol"},{"issue":"2","key":"2000_CR8","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1016\/j.cmpb.2011.07.010","volume":"108","author":"M Hariharan","year":"2012","unstructured":"Hariharan M, Sindhu R, Yaacob S (2012) Normal and hypoacoustic infant cry signal classification using time\u2013frequency analysis and general regression neural network. Comput Methods Progr Biomed 108(2):559\u2013569. https:\/\/doi.org\/10.1016\/j.cmpb.2011.07.010","journal-title":"Comput Methods Progr Biomed"},{"key":"2000_CR9","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/1528\/1\/012019","volume":"1528","author":"TN Maghfira","year":"2020","unstructured":"Maghfira TN, Basaruddin T, Krisnadhi A (2020) Infant cry classification using cnn-rnn. J Phy Conf Ser 1528:012019. https:\/\/doi.org\/10.1088\/1742-6596\/1528\/1\/012019","journal-title":"J Phy Conf Ser"},{"key":"2000_CR10","doi-asserted-by":"publisher","unstructured":"Bella V, Sanjaya SA (2023) Refining baby cry classification using data augmentation (time-stretching and pitch-shifting), mfcc feature extraction, and lstm modeling. In: Proceedings of the 2023 7th International Conference on New Media Studies (CONMEDIA). IEEE, Jakarta, Indonesia, pp 250\u2013256. https:\/\/doi.org\/10.1109\/CONMEDIA60526.2023.10428158","DOI":"10.1109\/CONMEDIA60526.2023.10428158"},{"key":"2000_CR11","doi-asserted-by":"publisher","first-page":"106620","DOI":"10.1109\/ACCESS.2023.3318015","volume":"11","author":"K Zaman","year":"2023","unstructured":"Zaman K, Sah M, Direkoglu C, Unoki M (2023) A survey of audio classification using deep learning. IEEE Access 11:106620\u2013106649. https:\/\/doi.org\/10.1109\/ACCESS.2023.3318015","journal-title":"IEEE Access"},{"key":"2000_CR12","doi-asserted-by":"publisher","first-page":"8543","DOI":"10.1007\/s00521-022-08129-w","volume":"35","author":"G Coro","year":"2023","unstructured":"Coro G, Bardelli S, Cuttano A, Menchetti A, Marangi C (2023) A self-training automatic infant-cry detector. Neural Comput Appl 35:8543\u20138559. https:\/\/doi.org\/10.1007\/s00521-022-08129-w","journal-title":"Neural Comput Appl"},{"issue":"3","key":"2000_CR13","doi-asserted-by":"publisher","first-page":"218","DOI":"10.2991\/jrnal.k.210922.013","volume":"8","author":"T Jian","year":"2021","unstructured":"Jian T, Peng Y, Peng W, Yang Z (2021) Research on lstm+attention model of infant cry classification. J Robot Netw Artif Life 8(3):218\u2013223. https:\/\/doi.org\/10.2991\/jrnal.k.210922.013","journal-title":"J Robot Netw Artif Life"},{"key":"2000_CR14","doi-asserted-by":"publisher","first-page":"665","DOI":"10.1007\/978-3-031-42467-0_62","volume-title":"Open science in engineering","author":"Y Martinez-Ca\u00f1ete","year":"2023","unstructured":"Martinez-Ca\u00f1ete Y, Cano-Ortiz SD, Langmann R (2023) Work-in-progress: deep learning classification models for infant cry diagnostic. In: Auer ME, Langmann R, Tsiatsos T (eds) Open science in engineering. Springer, Cham, pp 665\u2013673. https:\/\/doi.org\/10.1007\/978-3-031-42467-0_62"},{"key":"2000_CR15","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.119750","volume":"222","author":"D Salvati","year":"2023","unstructured":"Salvati D, Drioli C, Foresti GL (2023) A late fusion deep neural network for robust speaker identification using raw waveforms and gammatone cepstral coefficients. Expert Syst Appl 222:119750. https:\/\/doi.org\/10.1016\/j.eswa.2023.119750","journal-title":"Expert Syst Appl"},{"key":"2000_CR16","doi-asserted-by":"publisher","unstructured":"Alagundi D, Patil A, Haridas V, Chikkamath S, R, NS, Budihal S (2024) Infant cry classification using cnn-mfcc fusion. In: 2024 IEEE international conference on contemporary computing and communications (InC4), vol 1, pp 1\u20136. https:\/\/doi.org\/10.1109\/InC460750.2024.10649119","DOI":"10.1109\/InC460750.2024.10649119"},{"key":"2000_CR17","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.116621","volume":"196","author":"A Tiwari","year":"2022","unstructured":"Tiwari A, Chaturvedi A (2022) A hybrid feature selection approach based on information theory and dynamic butterfly optimization algorithm for data classification. Expert Syst Appl 196:116621. https:\/\/doi.org\/10.1016\/j.eswa.2022.116621","journal-title":"Expert Syst Appl"},{"key":"2000_CR18","doi-asserted-by":"publisher","DOI":"10.1016\/j.comnet.2023.109783","volume":"230","author":"Z Shu","year":"2023","unstructured":"Shu Z, Feng H, Taleb T, Zhang Z (2023) A novel combinatorial multi-armed bandit game to identify online the changing top-k flows in software-defined networks. Comput Netw 230:109783. https:\/\/doi.org\/10.1016\/j.comnet.2023.109783","journal-title":"Comput Netw"},{"key":"2000_CR19","doi-asserted-by":"publisher","first-page":"947","DOI":"10.1145\/3394486.3403134","volume":"15","author":"Y Ban","year":"2020","unstructured":"Ban Y, He J (2020) Generic outlier detection in multi-armed bandit. Proc 26th ACM SIGKDD Conf Knowl Discov Data Mining 15:947\u2013955. https:\/\/doi.org\/10.1145\/3394486.3403134","journal-title":"Proc 26th ACM SIGKDD Conf Knowl Discov Data Mining"},{"issue":"2","key":"2000_CR20","doi-asserted-by":"publisher","first-page":"248","DOI":"10.3390\/electronics14020248","volume":"14","author":"SV Shayegh","year":"2025","unstructured":"Shayegh SV, Tadj C (2025) Deep audio features and self-supervised learning for early diagnosis of neonatal diseases: sepsis and respiratory distress syndrome classification from infant cry signals. Electronics 14(2):248. https:\/\/doi.org\/10.3390\/electronics14020248","journal-title":"Electronics"},{"key":"2000_CR21","doi-asserted-by":"publisher","DOI":"10.1017\/9781108571401","volume-title":"Bandit algorithms","author":"T Lattimore","year":"2020","unstructured":"Lattimore T, Szepesv\u00e1ri C (2020) Bandit algorithms. Cambridge University Press, Cambridge"},{"key":"2000_CR22","doi-asserted-by":"publisher","unstructured":"Zhou Y, Zou Y, Wu Y, Shi Y, Zhang J (2025) Chapter 3\u2014learning to detect via multi-armed bandit. In: Zhou Y, Zou Y, Wu Y, Shi Y, Zhang J (eds) Machine learning for low-latency communications. Academic Press, Cambridge, MA, pp 29\u201346. https:\/\/doi.org\/10.1016\/B978-0-44-322073-9.00014-6","DOI":"10.1016\/B978-0-44-322073-9.00014-6"},{"issue":"7553","key":"2000_CR23","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y LeCun","year":"2015","unstructured":"LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436\u2013444. https:\/\/doi.org\/10.1038\/nature14539","journal-title":"Nature"},{"key":"2000_CR24","doi-asserted-by":"publisher","first-page":"1229","DOI":"10.1007\/s11063-019-10126-9","volume":"51","author":"R Zhao","year":"2020","unstructured":"Zhao R, Lee K (2020) Gru-based recurrent neural network for temporal audio classification. Neural Process Lett 51:1229\u20131244. https:\/\/doi.org\/10.1007\/s11063-019-10126-9","journal-title":"Neural Process Lett"},{"key":"2000_CR25","doi-asserted-by":"publisher","unstructured":"Luong M-T, Pham H, Manning CD (2015) Effective approaches to attention-based neural machine translation. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, Lisbon, Portugal, pp 1412\u20131421. https:\/\/doi.org\/10.18653\/v1\/D15-1166","DOI":"10.18653\/v1\/D15-1166"},{"key":"2000_CR26","volume-title":"Deep learning","author":"I Goodfellow","year":"2016","unstructured":"Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge"},{"key":"2000_CR27","volume-title":"Reinforcement learning: an introduction","author":"RS Sutton","year":"2018","unstructured":"Sutton RS, Barto AG (2018) Reinforcement learning: an introduction, 2nd edn. MIT Press, Cambridge","edition":"2"},{"key":"2000_CR28","doi-asserted-by":"publisher","first-page":"18","DOI":"10.25080\/Majora-7b98e3ed-003","volume":"8","author":"B McFee","year":"2015","unstructured":"McFee B, Raffel C, Liang D, Ellis DP, McVicar M, Battenberg E, Nieto O (2015) librosa: audio and music signal analysis in python. Proc 14th Python Sci Conf 8:18\u201325 (Citeseer)","journal-title":"Proc 14th Python Sci Conf"},{"key":"2000_CR29","doi-asserted-by":"publisher","first-page":"343","DOI":"10.1016\/j.procs.2015.04.230","volume":"49","author":"P Singh","year":"2015","unstructured":"Singh P, Kumar A, Sharma AK (2015) Feature extraction techniques for classification of infant cry: a review. Procedia Comput Sci 49:343\u2013352. https:\/\/doi.org\/10.1016\/j.procs.2015.04.230","journal-title":"Procedia Comput Sci"},{"key":"2000_CR30","doi-asserted-by":"publisher","DOI":"10.1016\/j.bspc.2023.105261","volume":"86","author":"A Abbaskhah","year":"2023","unstructured":"Abbaskhah A, Sedighi H, Marvi H (2023) Infant cry classification by mfcc feature extraction with mlp and cnn structures. Biomed Signal Process Control 86:105261. https:\/\/doi.org\/10.1016\/j.bspc.2023.105261","journal-title":"Biomed Signal Process Control"},{"key":"2000_CR31","doi-asserted-by":"publisher","DOI":"10.1016\/j.bspc.2023.104648","volume":"83","author":"T Ozseven","year":"2023","unstructured":"Ozseven T (2023) Infant cry classification by using different deep neural network models and hand-crafted features. Biomed Signal Process Control 83:104648. https:\/\/doi.org\/10.1016\/j.bspc.2023.104648","journal-title":"Biomed Signal Process Control"},{"issue":"20","key":"2000_CR32","doi-asserted-by":"publisher","first-page":"6575","DOI":"10.3390\/s24206575","volume":"24","author":"F Li","year":"2024","unstructured":"Li F, Cui C, Hu Y (2024) Classification of infant crying sounds using se-resnet-transformer. Sensors 24(20):6575. https:\/\/doi.org\/10.3390\/s24206575","journal-title":"Sensors"},{"issue":"3","key":"2000_CR33","doi-asserted-by":"publisher","DOI":"10.1016\/j.kjs.2024.100221","volume":"51","author":"X Qiao","year":"2024","unstructured":"Qiao X, Jiao S, Li H, Liu G, Gao X, Li Z (2024) Infant cry classification using an efficient graph structure and attention-based model. Kuwait J Sci 51(3):100221. https:\/\/doi.org\/10.1016\/j.kjs.2024.100221","journal-title":"Kuwait J Sci"},{"key":"2000_CR34","doi-asserted-by":"publisher","unstructured":"Laguna A, Pusil S, Baz\u00e1n Zegarra-Valdivia JA, Paltrinieri AL, Piras P, Palomares i Perera C, Pardos V\u00e9glia A, Garcia-Algar O, Orlandi S (2023) Multi-modal analysis of infant cry types characterization: acoustics, body language and brain signals. Comput Biol Med 167:107626. https:\/\/doi.org\/10.1016\/j.compbiomed.2023.107626","DOI":"10.1016\/j.compbiomed.2023.107626"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-025-02000-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-025-02000-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-025-02000-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,7]],"date-time":"2025-09-07T01:13:34Z","timestamp":1757207614000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-025-02000-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,7]]},"references-count":34,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2025,9]]}},"alternative-id":["2000"],"URL":"https:\/\/doi.org\/10.1007\/s40747-025-02000-w","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,7]]},"assertion":[{"value":"15 March 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 June 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 July 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no Conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"This study did not involve any research on humans or animals.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Research involving human and\/or animals"}},{"value":"Not applicable.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Informed consent"}}],"article-number":"375"}}