{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,18]],"date-time":"2026-03-18T05:27:59Z","timestamp":1773811679365,"version":"3.50.1"},"reference-count":53,"publisher":"MDPI AG","issue":"13","license":[{"start":{"date-parts":[[2020,7,7]],"date-time":"2020-07-07T00:00:00Z","timestamp":1594080000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004955","name":"Austrian Research Promotion Agency","doi-asserted-by":"publisher","award":["865997"],"award-info":[{"award-number":["865997"]}],"id":[{"id":"10.13039\/501100004955","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>In this paper, we present a Remote Sens. approach to localize parking cars in a city in order to enable the development of parking space availability models. We propose to use high-resolution stereo satellite images for this problem, as they provide enough details to make individual cars recognizable and the time interval between the stereo shots allows to reason about the moving or static condition of a car. Consequently, we describe a complete processing pipeline where raw satellite images are georeferenced, ortho-rectified, equipped with a digital surface model and an inclusion layer generated from Open Street Model vector data, and finally analyzed for parking cars by means of an adapted Faster R-CNN oriented bounding box detector. As a test site for the proposed approach, a new publicly available dataset of the city of Barcelona labeled with parking cars is presented. On this dataset, a Faster R-CNN model directly trained on the two ortho-rectified stereo images achieves an average precision of 0.65 for parking car detection. Finally, an extensive empirical and analytical evaluation shows the validity of our idea, as parking space occupancy can be broadly derived in fully visible areas, whereas moving cars are efficiently ruled out. Our evaluation also includes an in-depth analysis of the stereo occlusion problem in view of our application scenario as well as the suitability of using a reconstructed Digital Surface Model (DSM) as additional data modality for car detection. While an additional adoption of the DSM in our pipeline does not provide a beneficial cue for the detection task, the stereo images provide essentially two views of the dynamic scene at different timestamps. Therefore, for future studies, we recommend a satellite image acquisition geometry with smaller incidence angles, to decrease occlusions by buildings and thus improve the results with respect to completeness.<\/jats:p>","DOI":"10.3390\/rs12132170","type":"journal-article","created":{"date-parts":[[2020,7,7]],"date-time":"2020-07-07T10:41:09Z","timestamp":1594118469000},"page":"2170","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":24,"title":["Detection of Parking Cars in Stereo Satellite Images"],"prefix":"10.3390","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3459-8122","authenticated-orcid":false,"given":"Sebastian","family":"Zambanini","sequence":"first","affiliation":[{"name":"Computer Vision Lab, Institute of Visual Computing and Human-Centered Technology, TU Wien, A-1040 Vienna, Austria"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8995-289X","authenticated-orcid":false,"given":"Ana-Maria","family":"Loghin","sequence":"additional","affiliation":[{"name":"Department of Geodesy and Geoinformation, TU Wien, A-1040 Vienna, Austria"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2348-7929","authenticated-orcid":false,"given":"Norbert","family":"Pfeifer","sequence":"additional","affiliation":[{"name":"Department of Geodesy and Geoinformation, TU Wien, A-1040 Vienna, Austria"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Elena M\u00e0rmol","family":"Soley","sequence":"additional","affiliation":[{"name":"Parkbob GmbH, A-1200 Vienna, Austria"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Robert","family":"Sablatnig","sequence":"additional","affiliation":[{"name":"Computer Vision Lab, Institute of Visual Computing and Human-Centered Technology, TU Wien, A-1040 Vienna, Austria"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,7,7]]},"reference":[{"key":"ref_1","unstructured":"(2020, July 06). IBM Global Parking Survey: Drivers Share Worldwide Parking Woes. Available online: https:\/\/www-03.ibm.com\/press\/us\/en\/pressrelease\/35515.wss."},{"key":"ref_2","unstructured":"Rodrigue, J.P., Comtois, C., and Slack, B. (2013). The Geography of Transport Systems, Routledge. [3rd ed.]."},{"key":"ref_3","unstructured":"(2020, July 06). Searching for Parking Costs Americans $73 Billion a Year. Available online: https:\/\/inrix.com\/press-releases\/parking-pain-us\/."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A survey on object detection in optical Remote Sens. images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.isprsjprs.2016.02.007","article-title":"Street-side vehicle detection, classification and change detection using mobile laser scanning data","volume":"114","author":"Xiao","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_6","first-page":"1","article-title":"Error Bounded Foreground and Background Modeling for Moving Object Detection in Satellite Videos","volume":"58","author":"Zhang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1074","DOI":"10.1109\/TGRS.2019.2943366","article-title":"Object Tracking in Satellite Videos by Improved Correlation Filters With Motion Estimations","volume":"58","author":"Xuan","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_8","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201312). Faster R-CNN: Towards real-time object detection with region proposal networks. Proceedings of the Advances in Neural Information Processing Systems 2015, Montreal, QC, Canada."},{"key":"ref_9","unstructured":"Zambanini, S., Login, A.M., Pfeifer, N., Soley, E.M., and Sablatnig, R. (2020, July 06). PaCaBa - Parking Cars Barcelona Dataset. Available online: https:\/\/doi.org\/10.5281\/zenodo.3701453."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Mundhenk, T.N., Konjevod, G., Sakla, W.A., and Boakye, K. (2016, January 8\u201316). A large contextual dataset for classification, detection and counting of cars with deep learning. Proceedings of the European Conference on Computer Vision 2016, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46487-9_48"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201322). DOTA: A large-scale dataset for object detection in aerial images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2018, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1016\/j.jvcir.2015.11.002","article-title":"Vehicle detection in aerial imagery: A small target detection benchmark","volume":"34","author":"Razakarivony","year":"2016","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_13","unstructured":"Liu, L., Ouyang, W., Wang, X., Fieguth, P., Chen, J., Liu, X., and Pietik\u00e4inen, M. (1809). Deep learning for generic object detection: A survey. Int. J. Comput. Vis., 1\u201358."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Liu, W., Zhang, Z., Li, S., and Tao, D. (2017). Road Detection by Using a Generalized Hough Transform. Remote Sens., 9.","DOI":"10.3390\/rs9060590"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"He, H., Zhou, J., Chen, M., Chen, T., Li, D., and Cheng, P. (2019). Building Extraction from UAV Images Jointly Using 6D-SLIC and Multiscale Siamese Convolutional Networks. Remote Sens., 11.","DOI":"10.3390\/rs11091040"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1109\/LGRS.2012.2193552","article-title":"Building detection in very high spatial resolution multispectral images using the hit-or-miss transform","volume":"10","author":"Stankov","year":"2012","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1016\/j.isprsjprs.2003.09.002","article-title":"Object extraction and revision by image analysis using existing geodata and knowledge: Current status and steps towards operational systems","volume":"58","author":"Baltsavias","year":"2004","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_18","unstructured":"Bishop, C.M. (2006). Pattern Recognition and Machine Learning, Springer."},{"key":"ref_19","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201326). Histograms of oriented gradients for human detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2005, San Diego, CA, USA."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Ammour, N., Alhichri, H., Bazi, Y., Benjdira, B., Alajlan, N., and Zuair, M. (2017). Deep learning approach for car detection in UAV imagery. Remote Sens., 9.","DOI":"10.3390\/rs9040312"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Audebert, N., Le Saux, B., and Lef\u00e8vre, S. (2017). Segment-before-Detect: Vehicle Detection and Classification through Semantic Segmentation of Aerial Images. Remote Sens., 9.","DOI":"10.3390\/rs9040368"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Terrail, J.O.D., and Jurie, F. (2017, January 17\u201320). On the use of deep neural networks for the detection of small vehicles in ortho-images. Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China.","DOI":"10.1109\/ICIP.2017.8297076"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2015, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 8\u201316). SSD: Single shot multibox detector. Proceedings of the European Conference on Computer Vision 2016, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2017, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Tang, T., Zhou, S., Deng, Z., Lei, L., and Zou, H. (2017). Arbitrary-oriented vehicle detection in aerial imagery with single convolutional neural networks. Remote Sens., 9.","DOI":"10.3390\/rs9111170"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Azimi, S.M., Vig, E., Bahmanyar, R., K\u00f6rner, M., and Reinartz, P. (2018, January 2\u20136). Towards multi-class object detection in unconstrained Remote Sens. imagery. Proceedings of the Asian Conference on Computer Vision 2018, Perth, Australia.","DOI":"10.1007\/978-3-030-20893-6_10"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G., and Lu, Q. (2019, January 16\u201320). Learning RoI Transformer for Oriented Object Detection in Aerial Images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00296"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"AL-Alimi, D., Shao, Y., Feng, R., Al-qaness, M.A., Elaziz, M.A., and Kim, S. (2019). Multi-Scale Geospatial Object Detection Based on Shallow-Deep Feature Extraction. Remote Sens., 11.","DOI":"10.3390\/rs11212525"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Fan, Q., Brown, L., and Smith, J. (2016, January 19\u201322). A closer look at Faster R-CNN for vehicle detection. Proceedings of the 2016 IEEE Intelligent Vehicles Symposium (IV), Gotenburg, Sweden.","DOI":"10.1109\/IVS.2016.7535375"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Tang, T., Zhou, S., Deng, Z., Zou, H., and Lei, L. (2017). Vehicle detection in aerial images based on region convolutional neural networks and hard negative example mining. Sensors, 17.","DOI":"10.3390\/s17020336"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Ren, Y., Zhu, C., and Xiao, S. (2018). Small object detection in optical Remote Sens. images via modified faster R-CNN. Appl. Sci., 8.","DOI":"10.3390\/app8050813"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Yang, M.Y., Liao, W., Li, X., and Rosenhahn, B. (2018, January 7\u201310). Deep learning for vehicle detection in aerial images. Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, Athens, Greece.","DOI":"10.1109\/ICIP.2018.8451454"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"5028","DOI":"10.1109\/TGRS.2019.2895362","article-title":"R3-Net: A Deep Network for Multioriented Vehicle Detection in Aerial Images and Videos","volume":"57","author":"Li","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","article-title":"Arbitrary-Oriented Scene Text Detection via Rotation Proposals","volume":"20","author":"Ma","year":"2018","journal-title":"IEEE Trans. Multimedia"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1111\/j.1477-9730.2011.00665.x","article-title":"Review of developments in geometric modelling for high resolution satellite pushbroom sensors","volume":"27","author":"Poli","year":"2012","journal-title":"Photogramm. Rec."},{"key":"ref_37","unstructured":"(2020, July 06). Trimble Geospatial. Available online: https:\/\/de.geospatial.trimble.com\/products-and-solutions\/inpho."},{"key":"ref_38","first-page":"83","article-title":"Volume based DTM generation from Very High Resolution Photogrammetric DSMs","volume":"41","author":"Piltz","year":"2016","journal-title":"ISPRS Arch."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1016\/j.compenvurbsys.2013.11.002","article-title":"OPALS\u2013A framework for Airborne Laser Scanning data analysis","volume":"45","author":"Pfeifer","year":"2014","journal-title":"Comput. Environ. Urban Syst."},{"key":"ref_40","unstructured":"Bang, K.I., and Habib, A.F. (2007, January 7\u201311). Comparative Analysis of Alternative Methodologies for True Ortho-Photo Generation from High Resolution Satellite Imagery. Proceedings of the ASPRS Annual Conference 2007, Tampa, FL, USA."},{"key":"ref_41","first-page":"16","article-title":"The generation of true orthophotos using a 3D building model in conjunction with a conventional DTM","volume":"32","author":"Amhar","year":"1998","journal-title":"ISPRS Arch."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Zhang, L., Lin, L., Liang, X., and He, K. (2016, January 8\u201316). Is Faster R-CNN Doing Well for Pedestrian Detection?. Proceedings of the European Conference on Computer Vision 2016, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46475-6_28"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Koo, J., Seo, J., Jeon, S., Choe, J., and Jeon, T. (2018, January 6\u20139). RBox-CNN: Rotated bounding box based CNN for ship detection in remote sensing image. Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, WA, USA.","DOI":"10.1145\/3274895.3274915"},{"key":"ref_44","unstructured":"Simonyan, K., and Zisserman, A. (2015, January 7\u20139). Very Deep Convolutional Networks for Large-Scale Image Recognition. Proceedings of the International Conference on Learning Representations (ICLR) 2015, San Diego, CA, USA."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision, 2015, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_46","unstructured":"Feng, D., Haase-Schuetz, C., Rosenbaum, L., Hertlein, H., Timm, F., Glaeser, C., Wiesbeck, W., and Dietmayer, K. (2019). Deep Multi-modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges. arXiv, Available online: https:\/\/arxiv.org\/abs\/1902.07830."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., and Fei-Fei, L. (2009, January 22\u201324). Imagenet: A large-scale hierarchical image database. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2009, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The PASCAL Visual Object Classes (VOC) Challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Yan, J., Wang, H., Yan, M., Diao, W., Sun, X., and Li, H. (2019). IoU-adaptive deformable R-CNN: Make full use of IoU for multi-class object detection in Remote Sens. imagery. Remote Sens., 11.","DOI":"10.3390\/rs11030286"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Yang, X., Yang, J., Yan, J., Zhang, Y., Zhang, T., Guo, Z., Sun, X., and Fu, K. (November, January 27). SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects. Proceedings of the IEEE International Conference on Computer Vision (ICCV) 2019, Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00832"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Stentoumis, C., Karkalou, E., and Karras, G. (2015, January 3\u20135). A review and evaluation of penalty functions for Semi-Global Matching. Proceedings of the 2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP), Cluj-Napoca, Romania.","DOI":"10.1109\/ICCP.2015.7312624"},{"key":"ref_52","unstructured":"Lhuillier, M., and Quan, L. (2000, January 3\u20137). Robust dense matching using local and global geometric constraints. Proceedings of the 15th International Conference on Pattern Recognition 2000, Barcelona, Spain."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Loghin, A.M., Otepka-Schremmer, J., and Pfeifer, N. (2020). Potential of Pl\u00e9iades and WorldView-3 Tri-Stereo DSMs to Represent Heights of Small Isolated Objects. Sensors, 20.","DOI":"10.3390\/s20092695"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/13\/2170\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T09:48:29Z","timestamp":1760176109000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/13\/2170"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,7]]},"references-count":53,"journal-issue":{"issue":"13","published-online":{"date-parts":[[2020,7]]}},"alternative-id":["rs12132170"],"URL":"https:\/\/doi.org\/10.3390\/rs12132170","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,7,7]]}}}