{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T22:45:14Z","timestamp":1769553914349,"version":"3.49.0"},"reference-count":32,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T00:00:00Z","timestamp":1769472000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61902205"],"award-info":[{"award-number":["61902205"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100007129","name":"Shandong Provincial Natural Science Foundation","doi-asserted-by":"publisher","award":["ZR2023MF052"],"award-info":[{"award-number":["ZR2023MF052"]}],"id":[{"id":"10.13039\/501100007129","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>Continuous missing data is a prevalent challenge in long-term air quality monitoring, undermining the reliability of public health protection and sustainable urban development. In this paper, we propose ConFill, a novel contrastive learning-based framework for reconstructing continuous missing data in air quality time series. By leveraging temporal continuity as a supervisory signal, our method constructs positive sample pairs from adjacent subsequences and negative pairs from distant and shuffled segments. Through contrastive learning, the model learns robust representations that preserve intrinsic temporal dynamics, and enable accurate imputation of continuous missing segments. A novel data augmentation strategy is proposed, to integrate noise injection, subsequence masking, and time warping to enhance the diversity and representativeness of training samples. Extensive experiments are conducted on a large scale real-world dataset comprising multi-pollutant observations from 209 monitoring stations across China over a three-year period. Results show that ConFill outperforms baseline imputation methods under various missing scenarios, especially in reconstructing long consecutive gaps. Ablation studies confirm the effectiveness of both the contrastive learning module and the proposed augmentation technique.<\/jats:p>","DOI":"10.3390\/info17020121","type":"journal-article","created":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T09:21:40Z","timestamp":1769505700000},"page":"121","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Long-Term Air Quality Data Filling Based on Contrastive Learning"],"prefix":"10.3390","volume":"17","author":[{"given":"Zihe","family":"Liu","sequence":"first","affiliation":[{"name":"School of Information and Control Engineering, Qingdao University of Technology, Qingdao 266520, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6004-6073","authenticated-orcid":false,"given":"Keyong","family":"Hu","sequence":"additional","affiliation":[{"name":"School of Information and Control Engineering, Qingdao University of Technology, Qingdao 266520, China"}]},{"given":"Jingxuan","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Information and Control Engineering, Qingdao University of Technology, Qingdao 266520, China"}]},{"given":"Xingchen","family":"Ren","sequence":"additional","affiliation":[{"name":"School of Information and Control Engineering, Qingdao University of Technology, Qingdao 266520, China"}]},{"given":"Xi","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Liberal Arts, Journalism and Communication, Ocean University of China, Qingdao 266100, China"}]}],"member":"1968","published-online":{"date-parts":[[2026,1,27]]},"reference":[{"key":"ref_1","first-page":"356","article-title":"Atmospheric pollution assessed by in situ measurement of magnetic susceptibility on lichens","volume":"235","author":"Marie","year":"2018","journal-title":"Environ. Pollut."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"115229","DOI":"10.1016\/j.envpol.2020.115229","article-title":"Fine air pollution particles trapped by street tree barks: In situ magnetic biomonitoring","volume":"266","author":"Chaparro","year":"2020","journal-title":"Environ. Pollut."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"106957","DOI":"10.1016\/j.asoc.2020.106957","article-title":"Intelligent modeling strategies for forecasting air quality time series: A review","volume":"102","author":"Liu","year":"2021","journal-title":"Appl. Soft Comput."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"710","DOI":"10.1109\/TIP.2004.826093","article-title":"Linear interpolation revitalized","volume":"13","author":"Blu","year":"2004","journal-title":"IEEE Trans. Image Process."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"112827","DOI":"10.1016\/j.rse.2021.112827","article-title":"A review of statistical methods used for developing large-scale and long-term PM2.5 models from satellite data","volume":"269","author":"Ma","year":"2022","journal-title":"Remote Sens. Environ."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"7436","DOI":"10.1021\/es5009399","article-title":"Estimating ground-level PM2.5 in China using satellite remote sensing","volume":"48","author":"Ma","year":"2014","journal-title":"Environ. Sci. Technol."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"136991","DOI":"10.1016\/j.scitotenv.2020.136991","article-title":"A machine-learning framework for predicting multiple air pollutants\u2019 concentrations via multi-target regression and feature selection","volume":"715","author":"Masmoudi","year":"2020","journal-title":"Sci. Total. Environ."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Wahid, H., Ha, Q., and Duc, H. (July, January 29). Computational intelligence estimation of natural background ozone level and its distribution for air quality modelling and emission control. Proceedings of the 28th International Symposium on Automation and Robotics in Construction (ISARC 2011), Seoul, Republic of Korea.","DOI":"10.22260\/ISARC2011\/0212"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"101250","DOI":"10.1016\/j.apr.2021.101250","article-title":"Source apportionment modelling of PM2.5 using CMAQ-ISAM over a tropical coastal-urban area","volume":"12","author":"Kitagawa","year":"2021","journal-title":"Atmos. Pollut. Res."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"118620","DOI":"10.1016\/j.atmosenv.2021.118620","article-title":"An integrated model combining random forests and WRF\/CMAQ model for high accuracy spatiotemporal PM2.5 predictions in the Kansai region of Japan","volume":"262","author":"Thongthammachart","year":"2021","journal-title":"Atmos. Environ."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"753","DOI":"10.1016\/j.scitotenv.2017.11.291","article-title":"PM10 concentration forecasting in the metropolitan area of Oviedo (Northern Spain) using models based on SVM, MLP, VARMA and ARIMA: A case study","volume":"621","author":"Nieto","year":"2018","journal-title":"Sci. Total. Environ."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"150792","DOI":"10.1016\/j.scitotenv.2021.150792","article-title":"Responses of surface O3 and PM2.5 trends to changes of anthropogenic emissions in summer over Beijing during 2014\u20132019: A study based on multiple linear regression and WRF-Chem","volume":"807","author":"He","year":"2022","journal-title":"Sci. Total. Environ."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Rodr\u00edguez-Trejo, A., B\u00f6hnel, H.N., Ibarra-Ortega, H.E., Salcedo, D., Gonz\u00e1lez-Guzm\u00e1n, R., Casta\u00f1eda Miranda, A.G., S\u00e1nchez-Ramos, L.E., Chaparro, M.A.E., and Chaparro, M.A.E. (2024). Air Quality Monitoring with Low-Cost Sensors: A Record of the Increase of PM2.5 during Christmas and New Year\u2019s Eve Celebrations in the City of Queretaro, Mexico. Atmosphere, 15.","DOI":"10.3390\/atmos15080879"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"119347","DOI":"10.1016\/j.atmosenv.2022.119347","article-title":"Deep learning for air pollutant concentration prediction: A review","volume":"290","author":"Zhang","year":"2022","journal-title":"Atmos. Environ."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Rijal, N., Gutta, R., Cao, T., Lin, J., Bo, Q., and Zhang, J. (2018, January 27\u201329). Ensemble of deep neural networks for estimating particulate matter from images. Proceedings of the 2018 IEEE 3rd international conference on image, Vision and Computing (ICIVC), Chongqing, China.","DOI":"10.1109\/ICIVC.2018.8492790"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"316","DOI":"10.1016\/j.ecoenv.2018.11.024","article-title":"Sequential prediction of quantitative health risk assessment for the fine particulate matter in an underground facility using deep recurrent neural networks","volume":"169","author":"Vilela","year":"2019","journal-title":"Ecotoxicol. Environ. Saf."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Li, Y., Gan, Z., Shen, Y., Liu, J., Cheng, Y., Wu, Y., Carin, L., Carlson, D., and Gao, J. (2019). StoryGAN: A Sequential Conditional GAN for Story Visualization. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE.","DOI":"10.1109\/CVPR.2019.00649"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"114513","DOI":"10.1016\/j.eswa.2020.114513","article-title":"Multi-hour and multi-site air quality index forecasting in Beijing using CNN, LSTM, CNN-LSTM, and spatiotemporal clustering","volume":"169","author":"Yan","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Liang, Y., Xia, Y., Ke, S., Wang, Y., Wen, Q., Zhang, J., Zheng, Y., and Zimmermann, R. (2023, January 7\u201314). Airformer: Predicting nationwide air quality in China with transformers. Proceedings of the AAAI Conference on Artificial Intelligence, Washington, DC, USA.","DOI":"10.1609\/aaai.v37i12.26676"},{"key":"ref_20","unstructured":"Tolstikhin, I., Housby, N., Kolesnikov, A., Beyer, L., Zhai, X., Unterthiner, T., Yung, J., Steiner, A., Keysers, D., and Uszkoreit, J. (2021). MLP-Mixer: An all-MLP Architecture for Vision. arXiv."},{"key":"ref_21","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A., Kaiser, U., and Polosukhin, I. (2017). Attention is All You Need. Adv. Neural Inf. Process. Syst., 30."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1109\/TPAMI.2022.3152247","article-title":"A Survey on Vision Transformer","volume":"45","author":"Han","year":"2023","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Zhou, H., Zhang, S., Peng, J., Zhang, S., Li, J., Xiong, H., and Zhang, W. (2021, January 2\u20139). Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting. Proceedings of the AAAI Conference on Artificial Intelligence, AAAI, Virtually.","DOI":"10.1609\/aaai.v35i12.17325"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Gulati, A., Qin, J., Chiu, C.C., Parmar, N., Zhang, Y., Yu, J., Han, W., Wang, S., Zhang, Z., and Wu, Y. (2020, January 25\u201329). Conformer: Convolution-augmented Transformer for Speech Recognition. Proceedings of the Interspeech 2020, ISCA, ICLR, Shanghai, China.","DOI":"10.21437\/Interspeech.2020-3015"},{"key":"ref_25","first-page":"27268","article-title":"FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting","volume":"Volume 162","author":"Chaudhuri","year":"2022","journal-title":"Proceedings of the 39th International Conference on Machine Learning, Baltimore, MD, USA, 17\u201323 July 2022"},{"key":"ref_26","unstructured":"Kitaev, N., Kaiser, L., and Levskaya, A. (2020). Reformer: The Efficient Transformer. arXiv."},{"key":"ref_27","unstructured":"Wu, H., Xu, J., Wang, J., and Long, M. (2021). Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting. arXiv."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"100800","DOI":"10.1016\/j.uclim.2021.100800","article-title":"Multi-directional temporal convolutional artificial neural network for PM2.5 forecasting with missing values: A deep learning approach","volume":"36","author":"Samal","year":"2021","journal-title":"Urban Clim."},{"key":"ref_29","first-page":"125260","article-title":"The Air Quality Index (AQI) in historical and analytical perspective: A tutorial review","volume":"265","author":"Horn","year":"2023","journal-title":"Talanta"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Liu, J., and Chen, S. (2024, January 26\u201327). TimesURL: Self-Supervised Contrastive Learning for Universal Time Series Representation Learning. Proceedings of the AAAI Conference on Artificial Intelligence, AAAI, Vancouver, BC, Canada.","DOI":"10.1609\/aaai.v38i12.29299"},{"key":"ref_31","unstructured":"Lee, S., Park, T., and Lee, K. (2024, January 7\u201311). Soft Contrastive Learning for Time Series. Proceedings of the International Conference on Learning Representations (ICLR), Vienna, Austria."},{"key":"ref_32","unstructured":"Chen, M., Xu, Z., Zeng, A., and Xu, Q. (2023). FrAug: Frequency Domain Augmentation for Time Series Forecasting. arXiv."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/17\/2\/121\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T09:27:04Z","timestamp":1769506024000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/17\/2\/121"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,1,27]]},"references-count":32,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2026,2]]}},"alternative-id":["info17020121"],"URL":"https:\/\/doi.org\/10.3390\/info17020121","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,1,27]]}}}