{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T16:26:34Z","timestamp":1770740794029,"version":"3.49.0"},"reference-count":29,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2023,3,29]],"date-time":"2023-03-29T00:00:00Z","timestamp":1680048000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100018806","name":"outstanding young and middle-aged science and technology innovation teams of colleges and universities in Hubei province","doi-asserted-by":"publisher","award":["T201907"],"award-info":[{"award-number":["T201907"]}],"id":[{"id":"10.13039\/501100018806","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100018806","name":"outstanding young and middle-aged science and technology innovation teams of colleges and universities in Hubei province","doi-asserted-by":"publisher","award":["2021EHB018"],"award-info":[{"award-number":["2021EHB018"]}],"id":[{"id":"10.13039\/501100018806","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100018806","name":"International Science and Technology Cooperation Key Research and Development Program of Science and Technology Agency in Hubei Province","doi-asserted-by":"publisher","award":["T201907"],"award-info":[{"award-number":["T201907"]}],"id":[{"id":"10.13039\/501100018806","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100018806","name":"International Science and Technology Cooperation Key Research and Development Program of Science and Technology Agency in Hubei Province","doi-asserted-by":"publisher","award":["2021EHB018"],"award-info":[{"award-number":["2021EHB018"]}],"id":[{"id":"10.13039\/501100018806","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>To achieve high-quality voice communication technology without noise interference in flammable, explosive and strong electromagnetic environments, the speech enhancement technology of a fiber-optic external Fabry\u2013Perot interferometric (EFPI) acoustic sensor based on deep learning is studied in this paper. The combination of a complex-valued convolutional neural network and a long short-term memory (CV-CNN-LSTM) model is proposed for speech enhancement in the EFPI acoustic sensing system. Moreover, the 3 \u00d7 3 coupler algorithm is used to demodulate voice signals. Then, the short-time Fourier transform (STFT) spectrogram features of voice signals are divided into a training set and a test set. The training set is input into the established CV-CNN-LSTM model for model training, and the test set is input into the trained model for testing. The experimental findings reveal that the proposed CV-CNN-LSTM model demonstrates exceptional speech enhancement performance, boasting an average Perceptual Evaluation of Speech Quality (PESQ) score of 3.148. In comparison to the CV-CNN and CV-LSTM models, this innovative model achieves a remarkable PESQ score improvement of 9.7% and 11.4%, respectively. Furthermore, the average Short-Time Objective Intelligibility (STOI) score witnesses significant enhancements of 4.04 and 2.83 when contrasted with the CV-CNN and CV-LSTM models, respectively.<\/jats:p>","DOI":"10.3390\/s23073574","type":"journal-article","created":{"date-parts":[[2023,3,30]],"date-time":"2023-03-30T01:31:30Z","timestamp":1680139890000},"page":"3574","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Deep Learning-Based Speech Enhancement of an Extrinsic Fabry\u2013Perot Interferometric Fiber Acoustic Sensor System"],"prefix":"10.3390","volume":"23","author":[{"given":"Shiyi","family":"Chai","sequence":"first","affiliation":[{"name":"School of Science, Hubei University of Technology, Wuhan 430068, China"},{"name":"Hubei Engineering Technology Research Center of Energy Photoelectric Device and System, Hubei University of Technology, Wuhan 430068, China"}]},{"given":"Can","family":"Guo","sequence":"additional","affiliation":[{"name":"School of Science, Hubei University of Technology, Wuhan 430068, China"},{"name":"Hubei Engineering Technology Research Center of Energy Photoelectric Device and System, Hubei University of Technology, Wuhan 430068, China"}]},{"given":"Chenggang","family":"Guan","sequence":"additional","affiliation":[{"name":"Hubei Engineering Technology Research Center of Energy Photoelectric Device and System, Hubei University of Technology, Wuhan 430068, China"}]},{"given":"Li","family":"Fang","sequence":"additional","affiliation":[{"name":"School of Science, Hubei University of Technology, Wuhan 430068, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,3,29]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"17882","DOI":"10.1109\/JSEN.2021.3086107","article-title":"A Gold Diaphragm-Based Fabry-Perot Interferometer with a Fiber-Optic Collimator for Acoustic Sensing","volume":"21","author":"Xiang","year":"2021","journal-title":"IEEE Sens. J."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"85821","DOI":"10.1109\/ACCESS.2019.2924736","article-title":"A comprehensive study of optical fiber acoustic sensing","volume":"7","author":"Wang","year":"2019","journal-title":"IEEE Access"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"11025","DOI":"10.1364\/OE.26.011025","article-title":"Ultrasonic imaging of seismic physical models using a fringe visibility enhanced fiber-optic Fabry-Perot interferometric sensor","volume":"26","author":"Zhang","year":"2018","journal-title":"Opt. Express"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"38191","DOI":"10.1364\/OE.381197","article-title":"Multiplexing fiber-optic Fabry\u2013Perot acoustic sensors using self-calibrating wavelength shifting interferometry","volume":"27","author":"Liu","year":"2019","journal-title":"Opt. Express"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"785","DOI":"10.1016\/j.yofte.2013.07.009","article-title":"Miniature fiber acoustic sensors using a photonic-crystal membrane","volume":"19","author":"Jo","year":"2013","journal-title":"Opt. Fiber Technol."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Zhang, W., Lu, P., and Qu, Z. (2021). Passive Homodyne Phase Demodulation Technique Based on LF-TIT-DCM Algorithm for Interferometric Sensors. Sensors, 21.","DOI":"10.3390\/s21248257"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"24300","DOI":"10.1364\/OE.27.024300","article-title":"Micromachined extrinsic Fabry-P\u00e9rot cavity for low-frequency acoustic wave sensing","volume":"27","author":"Fu","year":"2019","journal-title":"Opt. Express"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Chaudhari, A., and Dhonde, S.B. (2015, January 8\u201310). A review on speech enhancement techniques. Proceedings of the 2015 International Conference on Pervasive Computing (ICPC), Pune, India.","DOI":"10.1109\/PERVASIVE.2015.7087096"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1368","DOI":"10.1109\/TASLP.2021.3066303","article-title":"An overview of deep-learning-based audio-visual speech enhancement and separation","volume":"29","author":"Michelsanti","year":"2021","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_10","unstructured":"Vaswani, A., Shazeer, N., and Parmar, N. (2017). Attention is all you need. Adv. Neural Inf. Process. Syst., 30."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"4340","DOI":"10.1109\/TGRS.2020.3016820","article-title":"More diverse means better: Multimodal deep learning meets remote-sensing imagery classification","volume":"59","author":"Hong","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"5966","DOI":"10.1109\/TGRS.2020.3015157","article-title":"Graph convolutional networks for hyperspectral image classification","volume":"59","author":"Hong","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_13","unstructured":"Ge, Z., Liu, S., and Wang, F. (2021). Yolox: Exceeding yolo series in 2021. arXiv."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1109\/TIP.2022.3228497","article-title":"UIU-Net: U-Net in U-Net for infrared small object detection","volume":"32","author":"Wu","year":"2022","journal-title":"IEEE Trans. Image Process."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1486","DOI":"10.1121\/1.3184603","article-title":"An algorithm that improves speech intelligibility in noise for normal-hearing listeners","volume":"126","author":"Kim","year":"2009","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1109\/TASL.2012.2215596","article-title":"Towards generalizing classification based speech separation","volume":"21","author":"Han","year":"2012","journal-title":"IEEE Trans. Audio Speech Lang. Process."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.neucom.2018.01.013","article-title":"Training and compensation of class-conditioned NMF bases for speech enhancement","volume":"284","author":"Chung","year":"2018","journal-title":"Neurocomputing"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"2136","DOI":"10.1109\/TASLP.2015.2468583","article-title":"Joint optimization of masks and deep recurrent neural networks for monaural source separation","volume":"23","author":"Huang","year":"2015","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Kishore, V., Tiwari, N., and Paramasivam, P. (2020, January 25\u201329). Improved Speech Enhancement Using TCN with Multiple Encoder-Decoder Layers. Proceedings of the Interspeech 2020, Shanghai, China.","DOI":"10.21437\/Interspeech.2020-3122"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Tan, K., and Wang, D.L. (2018, January 2\u20136). A convolutional recurrent neural network for real-time speech enhancement. Proceedings of the Interspeech 2018, Hyderabad, India.","DOI":"10.21437\/Interspeech.2018-1405"},{"key":"ref_21","unstructured":"Choi, H.S., Kim, J.H., and Huh, J. (2019, January 6\u20139). Phase-aware speech enhancement with deep complex u-net. Proceedings of the International Conference on Learning Representations, New Orleans, LA, USA."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Cao, R., Abdulatif, S., and Yang, B. (2022). CMGAN: Conformer-based metric GAN for speech enhancement. arXiv, preprint.","DOI":"10.36227\/techrxiv.21187846.v2"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Park, H.J., Kang, B.H., and Shin, W. (2022, January 22\u201327). Manner: Multi-view attention network for noise erasure. Proceedings of the ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Singapore.","DOI":"10.1109\/ICASSP43922.2022.9747120"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"5224","DOI":"10.1109\/JLT.2018.2868964","article-title":"A miniature fiber-optic microphone based on an annular corrugated MEMS diaphragm","volume":"36","author":"Lu","year":"2018","journal-title":"J. Light. Technol."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Ge, Y.X., Wang, M., and Yan, H.T. (2008, January 7\u20139). Mesa diaphragm-based Fabry-Perot optical MEMS pressure sensor. Proceedings of the 2008 1st Asia-Pacific Optical Fiber Sensors Conference, Chengdu, China.","DOI":"10.1109\/APOS.2008.5226325"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1117\/12.191852","article-title":"Digital demodulation for passive homodyne optical fiber interferometry based on a 3 by 3 coupler","volume":"Volume 2292","author":"Chiu","year":"1994","journal-title":"Proceedings of the Fiber Optic and Laser Sensors XII"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Veaux, C., Yamagishi, J., and King, S. (2013, January 25\u201327). The voice bank corpus: Design, collection and data analysis of a large regional accent speech database. Proceedings of the 2013 International Conference Oriental COCOSDA Held Jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA\/CASLRE), Gurgaon, India.","DOI":"10.1109\/ICSDA.2013.6709856"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"749","DOI":"10.1109\/ICASSP.2001.941023","article-title":"Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs","volume":"Volume 2","author":"Rix","year":"2001","journal-title":"Proceedings of the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"2125","DOI":"10.1109\/TASL.2011.2114881","article-title":"An algorithm for intelligibility prediction of time\u2013frequency weighted noisy speech","volume":"19","author":"Taal","year":"2011","journal-title":"IEEE Trans. Audio Speech Lang. Process."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/7\/3574\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:06:06Z","timestamp":1760123166000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/7\/3574"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,29]]},"references-count":29,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2023,4]]}},"alternative-id":["s23073574"],"URL":"https:\/\/doi.org\/10.3390\/s23073574","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,29]]}}}