{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,3]],"date-time":"2025-12-03T20:37:29Z","timestamp":1764794249014,"version":"3.37.3"},"reference-count":44,"publisher":"Springer Science and Business Media LLC","issue":"7","license":[{"start":{"date-parts":[[2021,10,20]],"date-time":"2021-10-20T00:00:00Z","timestamp":1634688000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,10,20]],"date-time":"2021-10-20T00:00:00Z","timestamp":1634688000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"SINTEF AS"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Appl Intell"],"published-print":{"date-parts":[[2022,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In the heavy asset industry, such as oil &amp; gas, offshore personnel need to locate various equipment on the installation on a daily basis for inspection and maintenance purposes. However, locating equipment in such GPS denied environments is very time consuming due to the complexity of the environment and the large amount of equipment. To address this challenge we investigate an alternative approach to study the navigation problem based on visual imagery data instead of current ad-hoc methods where engineering drawings or large CAD models are used to find equipment. In particular, this paper investigates the combination of deep learning and decomposition for the image retrieval problem which is central for visual navigation. A convolutional neural network is first used to extract relevant features from the image database. The database is then decomposed into clusters of visually similar images, where several algorithms have been explored in order to make the clusters as independent as possible. 
The Bag-of-Words (BoW) approach is then applied on each cluster to build a vocabulary forest. During the searching process the vocabulary forest is exploited to find the most relevant images to the query image. To validate the usefulness of the proposed framework, intensive experiments have been carried out using both standard datasets and images from industrial environments. We show that the suggested approach outperforms the BoW-based image retrieval solutions, both in terms of computing time and accuracy. We also show the applicability of this approach on real industrial scenarios by applying the model on imagery data from offshore oil platforms.<\/jats:p>","DOI":"10.1007\/s10489-021-02908-z","type":"journal-article","created":{"date-parts":[[2021,10,20]],"date-time":"2021-10-20T17:53:02Z","timestamp":1634752382000},"page":"8101-8117","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["Deep learning based decomposition for visual navigation in industrial platforms"],"prefix":"10.1007","volume":"52","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0135-7450","authenticated-orcid":false,"given":"Youcef","family":"Djenouri","sequence":"first","affiliation":[]},{"given":"Johan","family":"Hatleskog","sequence":"additional","affiliation":[]},{"given":"Jon","family":"Hjelmervik","sequence":"additional","affiliation":[]},{"given":"Elias","family":"Bjorne","sequence":"additional","affiliation":[]},{"given":"Trygve","family":"Utstumo","sequence":"additional","affiliation":[]},{"given":"Milad","family":"Mobarhan","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,10,20]]},"reference":[{"key":"2908_CR1","doi-asserted-by":"publisher","first-page":"26,549","DOI":"10.1109\/ACCESS.2020.2971172","volume":"8","author":"A Anwar","year":"2020","unstructured":"Anwar A, Raychowdhury A (2020) Autonomous navigation via deep reinforcement learning for resource 
constraint edge nodes using transfer learning. IEEE Access 8:26,549\u201326,560","journal-title":"IEEE Access"},{"doi-asserted-by":"crossref","unstructured":"Arandjelovic R, Gronat P, Torii A, Pajdla T, Sivic J (2016) Netvlad: Cnn architecture for weakly supervised place recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5297\u20135307","key":"2908_CR2","DOI":"10.1109\/CVPR.2016.572"},{"doi-asserted-by":"crossref","unstructured":"Bai Y, Yu W, Xiao T, Xu C, Yang K, Ma WY, Zhao T (2014) Bag-of-words based deep neural network for image retrieval. In: Proceedings of the 22nd ACM international conference on Multimedia, pp 229\u2013232","key":"2908_CR3","DOI":"10.1145\/2647868.2656402"},{"doi-asserted-by":"crossref","unstructured":"Ban X, Lv X, Chen J (2009) Color image retrieval and classification using fuzzy similarity measure and fuzzy clustering method. In: Proceedings of the 48th IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference. IEEE, pp 7777\u20137782","key":"2908_CR4","DOI":"10.1109\/CDC.2009.5400757"},{"doi-asserted-by":"crossref","unstructured":"Baumgartl H, Buettner R (2020) Development of a highly precise place recognition module for effective human-robot interactions in changing lighting and viewpoint conditions. In: Proceedings of the 53rd Hawaii International Conference on System Sciences","key":"2908_CR5","DOI":"10.24251\/HICSS.2020.069"},{"key":"2908_CR6","doi-asserted-by":"publisher","first-page":"10,569","DOI":"10.1109\/ACCESS.2020.2964682","volume":"8","author":"A Belhadi","year":"2020","unstructured":"Belhadi A, Djenouri Y, Lin JCW, Zhang C, Cano A (2020) Exploring pattern mining algorithms for hashtag retrieval problem. IEEE Access 8:10,569\u201310,583","journal-title":"IEEE Access"},{"unstructured":"Bholowalia P, Kumar A (2014) Ebk-means: A clustering technique based on elbow method and k-means in wsn. 
Int J Comput Appl 105(9)","key":"2908_CR7"},{"doi-asserted-by":"crossref","unstructured":"Cao F, Yan F, Wang S, Zhuang Y, Wang W (2020) Season-invariant and viewpoint-tolerant lidar place recognition in gps denied environments. IEEE Transactions on Industrial Electronics","key":"2908_CR8","DOI":"10.1109\/TIE.2019.2962416"},{"doi-asserted-by":"crossref","unstructured":"Cao Y, Long M, Wang J, Liu S (2017) Deep visual-semantic quantization for efficient image retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1328\u20131337","key":"2908_CR9","DOI":"10.1109\/CVPR.2017.104"},{"key":"2908_CR10","doi-asserted-by":"publisher","first-page":"30,480","DOI":"10.1109\/ACCESS.2020.2971938","volume":"8","author":"A Carrio","year":"2020","unstructured":"Carrio A, Tordesillas J, Vemprala S, Saripalli S, Campoy P, How JP (2020) Onboard detection and localization of drones using depth maps. IEEE Access 8:30,480\u201330,490","journal-title":"IEEE Access"},{"issue":"2","key":"2908_CR11","doi-asserted-by":"publisher","first-page":"993","DOI":"10.1109\/LRA.2020.2967324","volume":"5","author":"M Chancan","year":"2020","unstructured":"Chancan M, Hernandez-Nunez L, Narendra A, Barron AB, Milford M (2020) A hybrid compact neural architecture for visual place recognition. IEEE Robot Autom Lett 5(2):993\u20131000","journal-title":"IEEE Robot Autom Lett"},{"issue":"10","key":"2908_CR12","doi-asserted-by":"publisher","first-page":"2039","DOI":"10.1007\/s00371-020-01902-9","volume":"36","author":"J Choi","year":"2020","unstructured":"Choi J, Son MG, Lee YY, Lee KH, Park J, Yeo CH, Park J, Choi S, Kim WD, Kang TW et al (2020) Position-based augmented reality platform for aiding construction and inspection of offshore plants. Vis Comput 36(10):2039\u20132049","journal-title":"Vis Comput"},{"doi-asserted-by":"crossref","unstructured":"Cormack GV, Lynam TR (2006) Statistical precision of information retrieval evaluation. 
In: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pp 533\u2013540","key":"2908_CR13","DOI":"10.1145\/1148170.1148262"},{"unstructured":"Arthur D, Vassilvitskii S (2007) K-means++: The advantages of careful seeding. In: 18th annual ACM-SIAM symposium on discrete algorithms (SODA), New Orleans, pp 1027\u20131035","key":"2908_CR14"},{"key":"2908_CR15","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1016\/j.ins.2019.11.034","volume":"514","author":"VQ Dinh","year":"2020","unstructured":"Dinh VQ, Munir F, Azam S, Yow KC, Jeon M (2020) Transfer learning for vehicle detection using two cameras with different focal lengths. Inf Sci 514:71\u201387","journal-title":"Inf Sci"},{"key":"2908_CR16","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1016\/j.ins.2018.04.008","volume":"453","author":"Y Djenouri","year":"2018","unstructured":"Djenouri Y, Belhadi A, Fournier-Viger P, Lin JCW (2018) Fast and effective cluster-based information retrieval using frequent closed itemsets. Inf Sci 453:154\u2013167","journal-title":"Inf Sci"},{"key":"2908_CR17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.ins.2017.08.043","volume":"420","author":"Y Djenouri","year":"2017","unstructured":"Djenouri Y, Comuzzi M (2017) Combining apriori heuristic and bio-inspired algorithms for solving the frequent itemsets mining problem. Inf Sci 420:1\u201315","journal-title":"Inf Sci"},{"doi-asserted-by":"crossref","unstructured":"Djenouri Y, Hjelmervik J (2021) Hybrid decomposition convolution neural network and vocabulary forest for image retrieval. In: 25th International Conference on Pattern Recognition, pp in press. IEEE","key":"2908_CR18","DOI":"10.1109\/ICPR48806.2021.9412104"},{"doi-asserted-by":"crossref","unstructured":"Doan AD, Latif Y, Chin TJ, Liu Y, Do TT, Reid I (2019) Scalable place recognition under appearance change for autonomous driving. 
In: Proceedings of the IEEE International Conference on Computer Vision, pp 9319\u20139328","key":"2908_CR19","DOI":"10.1109\/ICCV.2019.00941"},{"doi-asserted-by":"crossref","unstructured":"Eder M, Reip M, Steinbauer G (2021) Creating a robot localization monitor using particle filter and machine learning approaches. Appl Intell:1\u201315","key":"2908_CR20","DOI":"10.1007\/s10489-020-02157-6"},{"unstructured":"Erra U, Senatore S (2011) Hand-draw sketching for image retrieval through fuzzy clustering techniques. In: SEBD, pp 413\u2013420","key":"2908_CR21"},{"issue":"2","key":"2908_CR22","doi-asserted-by":"publisher","first-page":"1688","DOI":"10.1109\/LRA.2020.2969197","volume":"5","author":"B Ferrarini","year":"2020","unstructured":"Ferrarini B, Waheed M, Waheed S, Ehsan S, Milford M, McDonald-Maier K (2020) Exploring performance bounds of visual place recognition using extended precision. IEEE Robot Autom Lett 5 (2):1688\u20131695","journal-title":"IEEE Robot Autom Lett"},{"issue":"11","key":"2908_CR23","doi-asserted-by":"publisher","first-page":"1231","DOI":"10.1177\/0278364913491297","volume":"32","author":"A Geiger","year":"2013","unstructured":"Geiger A, Lenz P, Stiller C, Urtasun R (2013) Vision meets robotics: The kitti dataset. Int J Robot Res 32(11):1231\u20131237","journal-title":"Int J Robot Res"},{"issue":"1","key":"2908_CR24","doi-asserted-by":"publisher","first-page":"1223","DOI":"10.1007\/s11042-020-09759-9","volume":"80","author":"D Giveki","year":"2021","unstructured":"Giveki D (2021) Scale-space multi-view bag of words for scene categorization. Multimed Tools Appl 80(1):1223\u20131245","journal-title":"Multimed Tools Appl"},{"doi-asserted-by":"crossref","unstructured":"Hong Z, Petillot Y, Lane D, Miao Y, Wang S (2019) Textplace: Visual place recognition and topological localization through reading scene texts. 
In: Proceedings of the IEEE International Conference on Computer Vision, pp 2861\u20132870","key":"2908_CR25","DOI":"10.1109\/ICCV.2019.00295"},{"doi-asserted-by":"crossref","unstructured":"Husbands P, Shim Y, Garvie M, Dewar A, Domcsek N, Graham P, Knight J, Nowotny T, Philippides A (2021) Recent advances in evolutionary and bio-inspired adaptive robotics: Exploiting embodied dynamics. Appl Intell:1\u201330","key":"2908_CR26","DOI":"10.1007\/s10489-021-02275-9"},{"doi-asserted-by":"crossref","unstructured":"Khaliq A, Ehsan S, Chen Z, Milford M, McDonald-Maier K (2019) A holistic visual place recognition approach using lightweight cnns for significant viewpoint and appearance changes. IEEE Transactions on Robotics","key":"2908_CR27","DOI":"10.1109\/TRO.2019.2956352"},{"key":"2908_CR28","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1016\/j.ins.2018.10.006","volume":"477","author":"D Kim","year":"2019","unstructured":"Kim D, Seo D, Cho S, Kang P (2019) Multi-co-training for document classification using various document representations: Tf\u2013idf, lda, and doc2vec. Inf Sci 477:15\u201329","journal-title":"Inf Sci"},{"issue":"1","key":"2908_CR29","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1007\/s10489-020-01827-9","volume":"51","author":"DH Lee","year":"2021","unstructured":"Lee DH, Chen KL, Liou KH, Liu CL, Liu JL (2021) Deep learning and control algorithms of direct perception for autonomous driving. Appl Intell 51(1):237\u2013247","journal-title":"Appl Intell"},{"doi-asserted-by":"crossref","unstructured":"Liu X, Zhang S, Huang T, Tian Q (2019) E2bows: An end-to-end bag-of-words model via deep convolutional neural network for image retrieval. 
Neurocomputing","key":"2908_CR30","DOI":"10.1016\/j.neucom.2017.12.069"},{"issue":"1","key":"2908_CR31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TRO.2015.2496823","volume":"32","author":"S Lowry","year":"2015","unstructured":"Lowry S, S\u00fcnderhauf N, Newman P, Leonard JJ, Cox D, Corke P, Milford M (2015) Visual place recognition: a survey. IEEE Trans Robot 32(1):1\u201319","journal-title":"IEEE Trans Robot"},{"unstructured":"MacQueen J et al (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp 281\u2013297","key":"2908_CR32"},{"issue":"3","key":"2908_CR33","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1177\/0278364917702237","volume":"36","author":"AL Majdik","year":"2017","unstructured":"Majdik AL, Till C, Scaramuzza D (2017) The zurich urban micro aerial vehicle dataset. Int J Robot Res 36(3):269\u2013273","journal-title":"Int J Robot Res"},{"issue":"103","key":"2908_CR34","first-page":"472","volume":"126","author":"SS Mansouri","year":"2020","unstructured":"Mansouri SS, Kanellakis C, Kominiak D, Nikolakopoulos G (2020) Deploying mavs for autonomous navigation in dark underground mine environments. Robot Auton Syst 126(103): 472","journal-title":"Robot Auton Syst"},{"issue":"10","key":"2908_CR35","doi-asserted-by":"publisher","first-page":"3125","DOI":"10.1007\/s10489-020-01704-5","volume":"50","author":"QC Mao","year":"2020","unstructured":"Mao QC, Sun HM, Zuo LQ, Jia RS (2020) Finding every car: a traffic surveillance multi-scale vehicle object detection method. 
Appl Intell 50(10):3125\u20133136","journal-title":"Appl Intell"},{"issue":"103","key":"2908_CR36","first-page":"701","volume":"136","author":"R de Queiroz Mendes","year":"2021","unstructured":"de Queiroz Mendes R, Ribeiro EG, dos Santos Rosa N, Grassi Jr V (2021) On deep learning techniques to boost monocular depth estimation for autonomous navigation. Robot Auton Syst 136(103):701","journal-title":"Robot Auton Syst"},{"doi-asserted-by":"crossref","unstructured":"Sculley D (2010) Web-scale k-means clustering. In: Proceedings of the 19th international conference on World wide web, pp 1177\u20131178","key":"2908_CR37","DOI":"10.1145\/1772690.1772862"},{"key":"2908_CR38","doi-asserted-by":"publisher","first-page":"82,066","DOI":"10.1109\/ACCESS.2020.2989863","volume":"8","author":"H Seong","year":"2020","unstructured":"Seong H, Hyun J, Kim E (2020) Fosnet: an end-to-end trainable deep neural network for scene recognition. IEEE Access 8:82,066\u201382,077","journal-title":"IEEE Access"},{"issue":"2","key":"2908_CR39","doi-asserted-by":"publisher","first-page":"1730","DOI":"10.1109\/LRA.2019.2897160","volume":"4","author":"O Vysotska","year":"2019","unstructured":"Vysotska O, Stachniss C (2019) Effective visual place recognition using multi-sequence maps. IEEE Robot Autom Lett 4(2):1730\u20131736","journal-title":"IEEE Robot Autom Lett"},{"doi-asserted-by":"crossref","unstructured":"Yang X, Gao X, Song B, Han B (2020) Hierarchical deep embedding for aurora image retrieval. IEEE Transactions on Cybernetics","key":"2908_CR40","DOI":"10.1109\/TCYB.2019.2959261"},{"doi-asserted-by":"crossref","unstructured":"Yu J, Zhu C, Zhang J, Huang Q, Tao D (2019) Spatial pyramid-enhanced netvlad with weighted triplet loss for place recognition. 
IEEE transactions on neural networks and learning systems","key":"2908_CR41","DOI":"10.1109\/TNNLS.2019.2908982"},{"issue":"2","key":"2908_CR42","doi-asserted-by":"publisher","first-page":"1835","DOI":"10.1109\/LRA.2020.2969917","volume":"5","author":"M Zaffar","year":"2020","unstructured":"Zaffar M, Ehsan S, Milford M, McDonald-Maier K (2020) Cohog: a light-weight, compute-efficient, and training-free visual place recognition technique for changing environments. IEEE Robot Autom Lett 5 (2):1835\u20131842","journal-title":"IEEE Robot Autom Lett"},{"doi-asserted-by":"crossref","unstructured":"Zhan Z, Zhou G, Yang X (2020) A method of hierarchical image retrieval for real-time photogrammetry based on multiple features. IEEE Access","key":"2908_CR43","DOI":"10.1109\/ACCESS.2020.2969287"},{"doi-asserted-by":"crossref","unstructured":"Zhang J, Peng Y, Yuan M (2018) Unsupervised generative adversarial cross-modal hashing. In: Thirty-second AAAI conference on artificial intelligence","key":"2908_CR44","DOI":"10.1609\/aaai.v32i1.11263"}],"container-title":["Applied 
Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-021-02908-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10489-021-02908-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-021-02908-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,13]],"date-time":"2023-01-13T00:35:18Z","timestamp":1673570118000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10489-021-02908-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,20]]},"references-count":44,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2022,5]]}},"alternative-id":["2908"],"URL":"https:\/\/doi.org\/10.1007\/s10489-021-02908-z","relation":{},"ISSN":["0924-669X","1573-7497"],"issn-type":[{"type":"print","value":"0924-669X"},{"type":"electronic","value":"1573-7497"}],"subject":[],"published":{"date-parts":[[2021,10,20]]},"assertion":[{"value":"7 October 2021","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 October 2021","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}