{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,22]],"date-time":"2026-02-22T05:44:28Z","timestamp":1771739068405,"version":"3.50.1"},"reference-count":46,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2018,4,24]],"date-time":"2018-04-24T00:00:00Z","timestamp":1524528000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Many applications, such as autonomous navigation, urban planning, and asset monitoring, rely on the availability of accurate information about objects and their geolocations. In this paper, we propose the automatic detection and computation of the coordinates of recurring stationary objects of interest using street view imagery. Our processing pipeline relies on two fully convolutional neural networks: the first segments objects in the images, while the second estimates their distance from the camera. To geolocate all the detected objects coherently we propose a novel custom Markov random field model to estimate the objects\u2019 geolocation. The novelty of the resulting pipeline is the combined use of monocular depth estimation and triangulation to enable automatic mapping of complex scenes with the simultaneous presence of multiple, visually similar objects of interest. We validate experimentally the effectiveness of our approach on two object classes: traffic lights and telegraph poles. The experiments report high object recall rates and position precision of approximately 2 m, which is approaching the precision of single-frequency GPS receivers.<\/jats:p>","DOI":"10.3390\/rs10050661","type":"journal-article","created":{"date-parts":[[2018,4,24]],"date-time":"2018-04-24T04:44:48Z","timestamp":1524545088000},"page":"661","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":79,"title":["Automatic Discovery and Geotagging of Objects from Street View Imagery"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9734-5974","authenticated-orcid":false,"given":"Vladimir A.","family":"Krylov","sequence":"first","affiliation":[{"name":"ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin, Dublin 2, Ireland"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1895-1368","authenticated-orcid":false,"given":"Eamonn","family":"Kenny","sequence":"additional","affiliation":[{"name":"ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin, Dublin 2, Ireland"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0983-3052","authenticated-orcid":false,"given":"Rozenn","family":"Dahyot","sequence":"additional","affiliation":[{"name":"ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin, Dublin 2, Ireland"}]}],"member":"1968","published-online":{"date-parts":[[2018,4,24]]},"reference":[{"key":"ref_1","unstructured":"BBC News (2018, April 20). Google\u2019s Street View Cameras Get Quality Boost. Available online: http:\/\/www.bbc.com\/news\/technology-41082920."},{"key":"ref_2","unstructured":"(2018, April 20). Mapillary: Celebrating 200 Million Images. Available online: https:\/\/blog.mapillary.com\/update\/2017\/10\/05\/200-million-images.html."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Mattyus, G., Wang, S., Fidler, S., and Urtasun, R. (July, January 26). HD maps: Fine-grained road segmentation by parsing ground and aerial images. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.393"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Wegner, J.D., Branson, S., Hall, D., Schindler, K., and Perona, P. (July, January 26). Cataloging public objects using aerial and street-level images-urban trees. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.647"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Workman, S., Zhai, M., Crandall, D.J., and Jacobs, N. (2017, January 21\u201326). A unified model for near and remote sensing. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/ICCV.2017.293"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1145\/2717513","article-title":"Improving public transit accessibility for blind riders by crowdsourcing bus stop landmark locations with Google Street View: An extended analysis","volume":"6","author":"Hara","year":"2015","journal-title":"ACM Trans. Access. Comput."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1596","DOI":"10.1109\/TITS.2015.2513086","article-title":"Crowdsourcing in ITS: The state of the work and the networking","volume":"17","author":"Wang","year":"2016","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_8","unstructured":"Wired (2018, April 20). Google\u2019s New Street View Cameras will Help Algorithms Index the Real World. Available online: https:\/\/www.wired.com\/story\/googles-new-street-view-cameras-will-help-algorithms-index-the-real-world\/."},{"key":"ref_9","unstructured":"(2018, April 20). Seamless Google Street View Panoramas. Available online: https:\/\/research.googleblog.com\/2017\/11\/seamless-google-street-view-panoramas.html."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Hebbalaguppe, R., Garg, G., Hassan, E., Ghosh, H., and Verma, A. (2017, January 24\u201331). Telecom Inventory management via object recognition and localisation on Google Street View Images. Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, CA, USA.","DOI":"10.1109\/WACV.2017.86"},{"key":"ref_11","unstructured":"Federal Aviation Administration (2018). Global Positioning System (GPS) Standard Positioning Service (SPS) Performance Analysis Report #100, Technical Report."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Timofte, R., and Van Gool, L. (2011, January 6\u201313). Multi-view manhole detection, recognition, and 3D localisation. Proceedings of the 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), Barcelona, Spain.","DOI":"10.1109\/ICCVW.2011.6130242"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1884","DOI":"10.1109\/JPROC.2017.2684300","article-title":"Toward Seamless Multiview Scene Analysis From Satellite to Street Level","volume":"105","author":"Tuia","year":"2017","journal-title":"Proc. IEEE"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Hays, J., and Efros, A.A. (2008, January 23\u201328). IM2GPS: Estimating geographic information from a single image. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Anchorage, Alaska.","DOI":"10.1109\/CVPR.2008.4587784"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"675","DOI":"10.1016\/j.ufug.2015.06.006","article-title":"Assessing street-level urban greenery using Google Street View and a modified green view index","volume":"14","author":"Li","year":"2015","journal-title":"Urban For. Urban Green."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.cag.2017.01.005","article-title":"Social media based 3D visual popularity","volume":"63","author":"Bulbul","year":"2017","journal-title":"Comput. Graph."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Du, R., and Varshney, A. (2016, January 22\u201324). Social Street View: Blending Immersive Street Views with Geo-tagged Social Media. Proceedings of the International Conference on Web3D Technology, Anaheim, CA, USA.","DOI":"10.1145\/2945292.2945294"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1016\/j.compenvurbsys.2017.03.001","article-title":"Parcel-based urban land use classification in megacity using airborne LiDAR, high resolution orthoimagery, and Google Street View","volume":"64","author":"Zhang","year":"2017","journal-title":"Comput. Environ. Urban Syst."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1007\/s00138-017-0845-3","article-title":"Urban 3D segmentation and modelling from street view images and LiDAR point clouds","volume":"28","author":"Babahajiani","year":"2017","journal-title":"Mach. Vis. Appl."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/j.isprsjprs.2014.01.006","article-title":"3D change detection at street level using mobile laser scanning point clouds and terrestrial images","volume":"90","author":"Qin","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1016\/j.patcog.2017.09.013","article-title":"A survey on Visual-Based Localization: On the benefit of heterogeneous data","volume":"74","author":"Piasco","year":"2018","journal-title":"Pattern Recognit."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Ardeshir, S., Zamir, A.R., Torroella, A., and Shah, M. (2014, January 6\u201312). GIS-assisted object detection and geospatial localization. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10599-4_39"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Wang, S., Fidler, S., and Urtasun, R. (2015, January 7\u201312). Holistic 3D scene understanding from a single geo-tagged image. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299022"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Zhen, Z., Quackenbush, L.J., and Zhang, L. (2016). Trends in automatic individual tree crown detection and delineation\u2014Evolution of lidar data. Remote Sens., 8.","DOI":"10.3390\/rs8040333"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Ord\u00f3\u00f1ez, C., Cabo, C., and Sanz-Ablanedo, E. (2017). Automatic Detection and Classification of Pole-Like Objects for Urban Cartography Using Mobile Laser Scanning Data. Sensors, 17.","DOI":"10.3390\/s17071465"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1374","DOI":"10.1109\/TGRS.2014.2338915","article-title":"Semiautomated extraction of street light poles from mobile LiDAR point-clouds","volume":"53","author":"Yu","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1635","DOI":"10.1109\/TGRS.2013.2253108","article-title":"Automatic car counting method for unmanned aerial vehicle images","volume":"52","author":"Moranduzzo","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1074","DOI":"10.3390\/rs70101074","article-title":"UAV remote sensing for urban vegetation mapping using random forest and texture analysis","volume":"7","author":"Feng","year":"2015","journal-title":"Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2012.11.009","article-title":"Detection and 3D reconstruction of traffic signs from multiple view color images","volume":"77","author":"Soheilian","year":"2013","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_30","unstructured":"Trehard, G., Pollard, E., Bradai, B., and Nashashibi, F. (2014, January 7\u201310). Tracking both pose and status of a traffic light via an interacting multiple model filter. Proceedings of the International Conference on Information Fusion (FUSION), Salamanca, Spain."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1800","DOI":"10.1109\/TITS.2015.2509509","article-title":"Vision for looking at traffic lights: Issues, survey, and perspectives","volume":"17","author":"Jensen","year":"2016","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_32","unstructured":"Viola, P., and Jones, M. (2001, January 8\u201314). Rapid object detection using a boosted cascade of simple features. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Kauai, HI, USA."},{"key":"ref_33","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (July, January 26). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_34","unstructured":"Simonyan, K., and Zisserman, A. (2015, January 7\u20139). Very deep convolutional networks for large-scale image recognition. Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"640","DOI":"10.1109\/TPAMI.2016.2572683","article-title":"Fully convolutional networks for semantic segmentation","volume":"39","author":"Shelhamer","year":"2017","journal-title":"IEEE T-PAMI"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster r-cnn: Towards real-time object detection with region proposal networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE T-PAMI"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). SSD: Single Shot MultiBox Detector. Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zheng, S., Jayasumana, S., Romera-Paredes, B., Vineet, V., Su, Z., Du, D., Huang, C., and Torr, P.H. (2015, January 7\u201312). Conditional random fields as recurrent neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/ICCV.2015.179"},{"key":"ref_39","unstructured":"Seitz, S.M., Curless, B., Diebel, J., Scharstein, D., and Szeliski, R. (July, January 26). A comparison and evaluation of multi-view stereo reconstruction algorithms. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA."},{"key":"ref_40","unstructured":"Tron, R., Zhou, X., and Daniilidis, K. (July, January 26). A survey on rotation optimization in structure from motion. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Las Vegas, NV, USA."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Godard, C., Mac Aodha, O., and Brostow, G.J. (2017, January 21\u201326). Unsupervised Monocular Depth Estimation with Left-Right Consistency. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.699"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Laina, I., Rupprecht, C., Belagiannis, V., Tombari, F., and Navab, N. (2016, January 25\u201328). Deeper depth prediction with fully convolutional residual networks. Proceedings of the International Conference on 3D Vision, Stanford, CA, USA.","DOI":"10.1109\/3DV.2016.32"},{"key":"ref_43","unstructured":"Li, B., Shen, C., Dai, Y., van den Hengel, A., and He, M. (2015, January 7\u201312). Depth and surface normal estimation from monocular images using regression on deep features and hierarchical CRFs. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1561\/2000000035","article-title":"Markov random fields in image segmentation","volume":"5","author":"Kato","year":"2012","journal-title":"Found. Trends Signal Process."},{"key":"ref_45","unstructured":"(2018, April 20). Mapillary Vistas Dataset. Available online: https:\/\/www.mapillary.com\/dataset\/vistas."},{"key":"ref_46","unstructured":"Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R., Franke, U., Roth, S., and Schiele, B. (July, January 26). The cityscapes dataset for semantic urban scene understanding. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/5\/661\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:01:52Z","timestamp":1760194912000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/5\/661"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,4,24]]},"references-count":46,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2018,5]]}},"alternative-id":["rs10050661"],"URL":"https:\/\/doi.org\/10.3390\/rs10050661","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,4,24]]}}}