{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T15:17:51Z","timestamp":1759331871842,"version":"3.41.0"},"reference-count":62,"publisher":"Springer Science and Business Media LLC","issue":"7","license":[{"start":{"date-parts":[[2025,2,24]],"date-time":"2025-02-24T00:00:00Z","timestamp":1740355200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,2,24]],"date-time":"2025-02-24T00:00:00Z","timestamp":1740355200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Fraunhofer-Institut f\u00fcr Optronik, Systemtechnik und Bildauswertung IOSB"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Comput Vis"],"published-print":{"date-parts":[[2025,7]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Detecting airborne dust in standard RGB images presents significant challenges. Nevertheless, the monitoring of airborne dust holds substantial potential benefits for climate protection, environmentally sustainable construction, scientific research, and various other fields. To develop an efficient and robust algorithm for airborne dust monitoring, several hurdles have to be addressed. Airborne dust can be opaque or translucent, exhibit considerable variation in density, and possess indistinct boundaries. Moreover, distinguishing dust from other atmospheric phenomena, such as fog or clouds, can be particularly challenging. To meet the demand for a high-performing and reliable method for monitoring airborne dust, we introduce DustNet++, a neural network designed for dust density estimation. DustNet++ leverages feature maps from multiple resolution scales and semantic levels through window and grid attention mechanisms to maintain a sparse, globally effective receptive field with linear complexity. To validate our approach, we benchmark the performance of DustNet++ against existing methods from the domains of crowd counting and monocular depth estimation using the Meteodata airborne dust dataset and the URDE binary dust segmentation dataset. Our findings demonstrate that DustNet++ surpasses comparative methodologies in terms of regression and localization capabilities.<\/jats:p>","DOI":"10.1007\/s11263-025-02376-9","type":"journal-article","created":{"date-parts":[[2025,2,24]],"date-time":"2025-02-24T14:49:49Z","timestamp":1740408589000},"page":"4220-4244","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["DustNet++: Deep Learning-Based Visual Regression for Dust Density Estimation"],"prefix":"10.1007","volume":"133","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-1789-9104","authenticated-orcid":false,"given":"Andreas","family":"Michel","sequence":"first","affiliation":[]},{"given":"Martin","family":"Weinmann","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0009-9699-4305","authenticated-orcid":false,"given":"Jannick","family":"Kuester","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0005-1909-4646","authenticated-orcid":false,"given":"Faisal","family":"AlNasser","sequence":"additional","affiliation":[]},{"given":"Tomas","family":"Gomez","sequence":"additional","affiliation":[]},{"given":"Mark","family":"Falvey","sequence":"additional","affiliation":[]},{"given":"Rainer","family":"Schmitz","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8654-7546","authenticated-orcid":false,"given":"Wolfgang","family":"Middelmann","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7323-9800","authenticated-orcid":false,"given":"Stefan","family":"Hinz","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,2,24]]},"reference":[{"key":"2376_CR1","doi-asserted-by":"crossref","unstructured":"Agarwal, A., & Arora, C. (2023). Attention attention everywhere: Monocular depth prediction with skip attention. In Proceedings of the IEEE\/CVF winter conference on applications of computer vision (pp. 5861\u20135870).","DOI":"10.1109\/WACV56688.2023.00581"},{"key":"2376_CR2","doi-asserted-by":"crossref","unstructured":"Avvenuti, M., Bongiovanni, M., Ciampi, L., Falchi, F., Gennaro, C., & Messina, N. (2022). A spatio-temporal attentive network for video-based crowd counting. In Proceedings of the 2022 IEEE symposium on computers and communications (pp. 1\u20136). IEEE.","DOI":"10.1109\/ISCC55528.2022.9913019"},{"key":"2376_CR3","unstructured":"Bhat, S. F., Alhashim, I., & Wonka, P. (2021). Adabins: Depth estimation using adaptive bins. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 4009\u20134018)."},{"key":"2376_CR4","doi-asserted-by":"crossref","unstructured":"Brodersen, K. H., Ong, C. S., Stephan, K. E., & Buhmann, J. M. (2010). The balanced accuracy and its posterior distribution. In Proceedings of the 2010 20th international conference on pattern recognition (pp. 3121\u20133124). IEEE.","DOI":"10.1109\/ICPR.2010.764"},{"key":"2376_CR5","doi-asserted-by":"crossref","unstructured":"Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., & Zagoruyko, S. (2020). End-to-end object detection with transformers. In European conference on computer vision (pp. 213\u2013229). Springer.","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"2376_CR6","unstructured":"Chen, T., Xu, B., Zhang, C., & Guestrin, C. (2016). Training deep nets with sublinear memory cost. arXiv:1604.06174"},{"key":"2376_CR7","unstructured":"Cheng, B., Choudhuri, A., Misra, I., Kirillov, A., Girdhar, R., & Schwing, A. G. (2021). Mask2former for video instance segmentation. arXiv:2112.10764"},{"key":"2376_CR8","doi-asserted-by":"crossref","unstructured":"Cheng, Z.-Q., Dai, Q., Li, H., Song, J., Wu, X., & Hauptmann, A. G. (2022). Rethinking spatial invariance of convolutional networks for object counting. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 19638\u201319648).","DOI":"10.1109\/CVPR52688.2022.01902"},{"key":"2376_CR9","unstructured":"Chung, J., Gulcehre, C., Cho, K., & Bengio, Y. (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv:1412.3555"},{"key":"2376_CR10","first-page":"3965","volume":"34","author":"Z Dai","year":"2021","unstructured":"Dai, Z., Liu, H., Le, Q. V., & Tan, M. (2021). Coatnet: Marrying convolution and attention for all data sizes. Advances in Neural Information Processing Systems, 34, 3965\u20133977.","journal-title":"Advances in Neural Information Processing Systems"},{"issue":"1","key":"2376_CR11","doi-asserted-by":"publisher","first-page":"14","DOI":"10.1038\/s41597-022-01918-x","volume":"10","author":"A De Silva","year":"2023","unstructured":"De Silva, A., Ranasinghe, R., Sounthararajah, A., Haghighi, H., & Kodikara, J. (2023). A benchmark dataset for binary segmentation and quantification of dust emissions from unsealed roads. Scientific Data, 10(1), 14.","journal-title":"Scientific Data"},{"key":"2376_CR12","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., et al. (2020). An image is worth 16x16 words: Transattentions for image recognition at scale. arXiv:2010.11929"},{"key":"2376_CR13","unstructured":"Eigen, D., Puhrsch, C., Fergus, R. (2014). Depth map prediction from a single image using a multi-scale deep network. In Advances in neural information processing systems, Vol. 27."},{"key":"2376_CR14","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1016\/j.neunet.2017.12.012","volume":"107","author":"S Elfwing","year":"2018","unstructured":"Elfwing, S., Uchibe, E., & Doya, K. (2018). Sigmoid-weighted linear units for neural network function approximation in reinforcement learning. Neural Networks, 107, 3\u201311.","journal-title":"Neural Networks"},{"key":"2376_CR15","unstructured":"FairScale authors. (2021). FairScale: A general purpose modular PyTorch library for high performance and large scale training. https:\/\/github.com\/facebookresearch\/fairscale"},{"key":"2376_CR16","doi-asserted-by":"crossref","unstructured":"Fu, H., Gong, M., Wang, C., Batmanghelich, K., & Tao, D. (2018). Deep ordinal regression network for monocular depth estimation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2002\u20132011).","DOI":"10.1109\/CVPR.2018.00214"},{"key":"2376_CR17","unstructured":"gabort@AdobeStock. (2023). https:\/\/www.stock.adobe.com"},{"key":"2376_CR18","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770\u2013778).","DOI":"10.1109\/CVPR.2016.90"},{"key":"2376_CR19","unstructured":"Hendrycks, D., & Gimpel, K. (2016). Gaussian error linear units (GELUS). arXiv:1606.08415"},{"issue":"8","key":"2376_CR20","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735\u20131780.","journal-title":"Neural Computation"},{"key":"2376_CR21","unstructured":"Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., & Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv:1704.04861"},{"key":"2376_CR22","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., & Sun, G. (2018). Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 7132\u20137141).","DOI":"10.1109\/CVPR.2018.00745"},{"key":"2376_CR23","unstructured":"Ioffe, S., & Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. In Proceedings of the international conference on machine learning (pp. 448\u2013456)."},{"key":"2376_CR24","first-page":"497","volume":"2","author":"P Jain","year":"2020","unstructured":"Jain, P., Jain, A., Nrusimha, A., Gholami, A., Abbeel, P., Gonzalez, J., Keutzer, K., & Stoica, I. (2020). Checkmate: Breaking the memory wall with optimal tensor rematerialization. Proceedings of Machine Learning and Systems, 2, 497\u2013511.","journal-title":"Proceedings of Machine Learning and Systems"},{"key":"2376_CR25","unstructured":"Khan, A. R., & Khan, A. (2023). Maxvit-unet: Multi-axis attention for medical image segmentation. arXiv:2305.08396"},{"issue":"6","key":"2376_CR26","doi-asserted-by":"publisher","first-page":"84","DOI":"10.1145\/3065386","volume":"60","author":"A Krizhevsky","year":"2017","unstructured":"Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2017). Imagenet classification with deep convolutional neural networks. Communications of the ACM, 60(6), 84\u201390.","journal-title":"Communications of the ACM"},{"issue":"3","key":"2376_CR27","doi-asserted-by":"publisher","first-page":"456","DOI":"10.3390\/rs13030456","volume":"13","author":"J Lee","year":"2021","unstructured":"Lee, J., Shi, Y. R., Cai, C., Ciren, P., Wang, J., Gangopadhyay, A., & Zhang, Z. (2021). Machine learning based algorithms for global dust aerosol detection from satellite images: Inter-comparisons and evaluation. Remote Sensing, 13(3), 456.","journal-title":"Remote Sensing"},{"key":"2376_CR28","doi-asserted-by":"crossref","unstructured":"Lee, M., Hwang, S., Park, C., & Lee, S. (2022). Edgeconv with attention module for monocular depth estimation. In Proceedings of the IEEE\/CVF winter conference on applications of computer vision (pp. 2858\u20132867).","DOI":"10.1109\/WACV51458.2022.00242"},{"issue":"12","key":"2376_CR29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10661-019-7825-1","volume":"191","author":"SK Leghari","year":"2019","unstructured":"Leghari, S. K., Zaidi, M. A., Siddiqui, M. F., Sarangzai, A. M., Sheikh, S.-U.-R., & Arsalan. (2019). Dust exposure risk from stone crushing to workers and locally grown plant species in Quetta, Pakistan. Environmental Monitoring and Assessment, 191(12), 1. https:\/\/doi.org\/10.1007\/s10661-019-7825-1","journal-title":"Environmental Monitoring and Assessment"},{"key":"2376_CR30","doi-asserted-by":"crossref","unstructured":"Li, X., Chen, S., Hu, X., & Yang, J. (2019). Understanding the disharmony between dropout and batch normalization by variance shift. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 2682\u20132690).","DOI":"10.1109\/CVPR.2019.00279"},{"key":"2376_CR31","doi-asserted-by":"crossref","unstructured":"Li, Y., Zhang, X., & Chen, D. (2018). Csrnet: Dilated convolutional neural networks for understanding the highly congested scenes. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1091\u20131100).","DOI":"10.1109\/CVPR.2018.00120"},{"key":"2376_CR32","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., Belongie, S. (2017). Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2117\u20132125).","DOI":"10.1109\/CVPR.2017.106"},{"key":"2376_CR33","doi-asserted-by":"crossref","unstructured":"Liu, W., Salzmann, M., Fua, P. (2019). Context-aware crowd counting. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5099\u20135108).","DOI":"10.1109\/CVPR.2019.00524"},{"key":"2376_CR34","doi-asserted-by":"crossref","unstructured":"Liu, Z., Hu, H., Lin, Y., Yao, Z., Xie, Z., Wei, Y., Ning, J., Cao, Y., Zhang, Z., Dong, L., et\u00a0al. (2022). Swin transattention attention v2: Scaling up capacity and resolution. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 12009\u201312019).","DOI":"10.1109\/CVPR52688.2022.01170"},{"key":"2376_CR35","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., & Guo, B. (2021). Swin transattention attention: Hierarchical vision transattention attention using shifted windows. In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 10012\u201310022).","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"2376_CR36","doi-asserted-by":"crossref","unstructured":"Liu, Z., Ning, J., Cao, Y., Wei, Y., Zhang, Z., Lin, S., & Hu, H. (2022). Video swin transattention attention. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 3202\u20133211).","DOI":"10.1109\/CVPR52688.2022.00320"},{"key":"2376_CR37","unstructured":"Loshchilov, I., & Hutter, F. (2017). Decoupled weight decay regularization. arXiv:1711.05101"},{"key":"2376_CR38","doi-asserted-by":"crossref","unstructured":"Luo, A., Yang, F., Li, X., Nie, D., Jiao, Z., Zhou, S., Cheng, H. (2020). Hybrid graph neural networks for crowd counting. In Proceedings of the AAAI conference on artificial intelligence Vol. 34 (pp. 11693\u201311700).","DOI":"10.1609\/aaai.v34i07.6839"},{"key":"2376_CR39","doi-asserted-by":"crossref","unstructured":"Michel, A., Weinmann, M., Schenkel, F., Gomez, T., Falvey, M., Schmitz, R., Middelmann, W., & Hinz, S. (2023). Terrestrial visual dust density estimation based on deep learning. In: Proceedings of the 2023 IEEE international geoscience and remote sensing symposium.","DOI":"10.1109\/IGARSS52108.2023.10281563"},{"key":"2376_CR40","doi-asserted-by":"publisher","unstructured":"Michel, A., Weinmann, M., Schenkel, F., Gomez, T., Falvey, M., Schmitz, R., Middelmann, W., & Hinz, S. (2024). DustNet: Attention to dust (pp. 211\u2013226). https:\/\/doi.org\/10.1007\/978-3-031-54605-1_14","DOI":"10.1007\/978-3-031-54605-1_14"},{"key":"2376_CR41","doi-asserted-by":"crossref","unstructured":"Morman, S. A., & Plumlee, G. S. (2014). Dust and human health. Mineral dust: A key player in the Earth system (pp. 385\u2013409).","DOI":"10.1007\/978-94-017-8978-3_15"},{"key":"2376_CR42","unstructured":"NASA. (2023). Mars curiosity image gallery. Retrieved March 08, 2019, from https:\/\/www.nasa.gov\/mission_pages\/msl\/images\/index.html"},{"key":"2376_CR43","doi-asserted-by":"crossref","unstructured":"Patil, V., Sakaridis, C., Liniger, A., & Van\u00a0Gool, L. (2022). P3depth: Monocular depth estimation with a piecewise planarity prior. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 1610\u20131621).","DOI":"10.1109\/CVPR52688.2022.00166"},{"key":"2376_CR44","doi-asserted-by":"crossref","unstructured":"Ranftl, R., Bochkovskiy, A., & Koltun, V. (2021). Vision transattentions for dense prediction. In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 12179\u201312188).","DOI":"10.1109\/ICCV48922.2021.01196"},{"key":"2376_CR45","doi-asserted-by":"crossref","unstructured":"Ren, J., Zhang, M., Yu, C., & Liu, Z. (2022). Balanced MSE for imbalanced visual regression. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 7926\u20137935).","DOI":"10.1109\/CVPR52688.2022.00777"},{"key":"2376_CR46","unstructured":"Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster R-CNN: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems, Vol. 28."},{"key":"2376_CR47","doi-asserted-by":"crossref","unstructured":"Sam, D. B., Surya, S., & Babu, R. V. (2017). Switching convolutional neural network for crowd counting. In Proceedings of the 2017 IEEE conference on computer vision and pattern recognition (pp. 4031\u20134039). IEEE.","DOI":"10.1109\/CVPR.2017.429"},{"key":"2376_CR48","doi-asserted-by":"crossref","unstructured":"Shaw, P., Uszkoreit, J., & Vaswani, A. (2018). Self-attention with relative position representations. arXiv:1803.02155","DOI":"10.18653\/v1\/N18-2074"},{"key":"2376_CR49","doi-asserted-by":"crossref","unstructured":"Shi, W., Caballero, J., Husz\u00e1r, F., Totz, J., Aitken, A. P., Bishop, R., Rueckert, D., & Wang, Z. (2016). Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1874\u20131883).","DOI":"10.1109\/CVPR.2016.207"},{"issue":"11","key":"2376_CR50","doi-asserted-by":"publisher","first-page":"4381","DOI":"10.1109\/TCSVT.2021.3049869","volume":"31","author":"M Song","year":"2021","unstructured":"Song, M., Lim, S., & Kim, W. (2021). Monocular depth estimation using Laplacian pyramid-based depth residuals. IEEE Transactions on Circuits and Systems for Video Technology, 31(11), 4381\u20134393.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"issue":"1","key":"2376_CR51","first-page":"1929","volume":"15","author":"N Srivastava","year":"2014","unstructured":"Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. (2014). Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research, 15(1), 1929\u20131958.","journal-title":"The Journal of Machine Learning Research"},{"key":"2376_CR52","doi-asserted-by":"crossref","unstructured":"Tay, Y., Dehghani, M., Abnar, S., Chung, H. W., Fedus, W., Rao, J., Narang, S., Tran, V. Q., Yogatama, D., & Metzler, D. (2022). Scaling laws vs model architectures: How does inductive bias influence scaling? arXiv:2207.10551","DOI":"10.18653\/v1\/2023.findings-emnlp.825"},{"key":"2376_CR53","doi-asserted-by":"crossref","unstructured":"Tu, Z., Talebi, H., Zhang, H., Yang, F., Milanfar, P., Bovik, A., & Li, Y. (2022). Maxvit: Multi-axis vision transformer. In European conference on computer vision (pp. 459\u2013479). Springer.","DOI":"10.1007\/978-3-031-20053-3_27"},{"key":"#cr-split#-2376_CR54.1","unstructured":"U.S. Environmental Protection Agency. (2022). Particulate Matter"},{"key":"#cr-split#-2376_CR54.2","unstructured":"(PM) 2022 Report. Retrieved 22 May, 2024, from https:\/\/www.epa.gov\/system\/files\/documents\/2023-06\/PM_2022.pdf"},{"key":"2376_CR55","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, \u0141., & Polosukhin, I. (2017). Attention is all you need. In Advances in neural information processing systems, Vol. 30."},{"key":"2376_CR56","doi-asserted-by":"crossref","unstructured":"Wang, L., Zhang, J., Wang, Y., Lu, H., & Ruan, X. (2020). Cliffnet for monocular depth estimation with hierarchical embedding loss. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part V (pp. 316\u2013331). Springer.","DOI":"10.1007\/978-3-030-58558-7_19"},{"key":"2376_CR57","doi-asserted-by":"publisher","first-page":"2301","DOI":"10.1109\/TIP.2019.2946126","volume":"29","author":"F Yuan","year":"2020","unstructured":"Yuan, F., Zhang, L., Xia, X., Huang, Q., & Li, X. (2020). A wave-shaped deep neural network for smoke density estimation. IEEE Transactions on Image Processing, 29, 2301\u20132313.","journal-title":"IEEE Transactions on Image Processing"},{"key":"2376_CR58","doi-asserted-by":"crossref","unstructured":"Yuan, W., Gu, X., Dai, Z., Zhu, S., & Tan, P. (2022). Neural window fully-connected crfs for monocular depth estimation. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 3916\u20133925).","DOI":"10.1109\/CVPR52688.2022.00389"},{"key":"2376_CR59","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Zhou, D., Chen, S., Gao, S., & Ma, Y. (2016) . Single-image crowd counting via multi-column convolutional neural network. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 589\u2013597).","DOI":"10.1109\/CVPR.2016.70"},{"key":"2376_CR60","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., & Jia, J. (2017). Pyramid scene parsing network. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2881\u20132890).","DOI":"10.1109\/CVPR.2017.660"},{"key":"2376_CR61","doi-asserted-by":"crossref","unstructured":"Zhu, X. X., Tuia, D., Mou, L., Xia, G.-S., Zhang, L., Xu, F., & Fraundorfer, F. (2017). Deep learning in remote sensing: A comprehensive review and list of resources. IEEE Geoscience and Remote Sensing Magazine, 5(4), 8\u201336.","DOI":"10.1109\/MGRS.2017.2762307"}],"container-title":["International Journal of Computer Vision"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-025-02376-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11263-025-02376-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-025-02376-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,7]],"date-time":"2025-06-07T06:00:34Z","timestamp":1749276034000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11263-025-02376-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,24]]},"references-count":62,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2025,7]]}},"alternative-id":["2376"],"URL":"https:\/\/doi.org\/10.1007\/s11263-025-02376-9","relation":{},"ISSN":["0920-5691","1573-1405"],"issn-type":[{"type":"print","value":"0920-5691"},{"type":"electronic","value":"1573-1405"}],"subject":[],"published":{"date-parts":[[2025,2,24]]},"assertion":[{"value":"28 May 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 February 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 February 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}