{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,23]],"date-time":"2026-06-23T02:07:16Z","timestamp":1782180436358,"version":"3.54.5"},"reference-count":79,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2024,3,1]],"date-time":"2024-03-01T00:00:00Z","timestamp":1709251200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Future Internet"],"abstract":"<jats:p>In the digital transformation era, video media libraries\u2019 untapped potential is immense, restricted primarily by their non-machine-readable nature and basic search functionalities limited to standard metadata. This study presents a novel multimodal methodology that utilizes advances in artificial intelligence, including neural networks, computer vision, and natural language processing, to extract and geocode geospatial references from videos. Leveraging the geospatial information from videos enables semantic searches, enhances search relevance, and allows for targeted advertising, particularly on mobile platforms. The methodology involves a comprehensive process, including data acquisition from ARD Mediathek, image and text analysis using advanced machine learning models, and audio and subtitle processing with state-of-the-art linguistic models. Despite challenges like model interpretability and the complexity of geospatial data extraction, this study\u2019s findings indicate significant potential for advancing the precision of spatial data analysis within video content, promising to enrich media libraries with more navigable, contextually rich content. This advancement has implications for user engagement, targeted services, and broader urban planning and cultural heritage applications.<\/jats:p>","DOI":"10.3390\/fi16030087","type":"journal-article","created":{"date-parts":[[2024,3,1]],"date-time":"2024-03-01T07:36:21Z","timestamp":1709278581000},"page":"87","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Advanced Techniques for Geospatial Referencing in Online Media Repositories"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4527-3059","authenticated-orcid":false,"given":"Dominik","family":"Warch","sequence":"first","affiliation":[{"name":"Department of Applied Informatics and Geodesy, School of Technology, Mainz University of Applied Sciences, 55128 Mainz, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Patrick","family":"Stellbauer","sequence":"additional","affiliation":[{"name":"Department of Applied Informatics and Geodesy, School of Technology, Mainz University of Applied Sciences, 55128 Mainz, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5158-796X","authenticated-orcid":false,"given":"Pascal","family":"Neis","sequence":"additional","affiliation":[{"name":"Department of Applied Informatics and Geodesy, School of Technology, Mainz University of Applied Sciences, 55128 Mainz, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2024,3,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Hopfgartner, F., and Sch\u00f6ffmann, K. (2017, January 7\u201311). Interactive Search in Video & Lifelogging Repositories. Proceedings of the 2017 Conference on Conference Human Information Interaction and Retrieval, Oslo, Norway.","DOI":"10.1145\/3020165.3022161"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Rupapara, V., Thipparthy, K.R., Gunda, N.K., Narra, M., and Gandhi, S. (2020, January 23\u201324). Improving Video Ranking on Social Video Platforms. Proceedings of the 2020 7th International Conference on Smart Structures and Systems (ICSSS), Chennai, India.","DOI":"10.1109\/ICSSS49621.2020.9202153"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2061","DOI":"10.1109\/JSAC.2016.2596998","article-title":"Guest Editorial Video Distribution over Future Internet","volume":"34","author":"Westphal","year":"2016","journal-title":"IEEE J. Select. Areas Commun."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2115","DOI":"10.1080\/13658816.2020.1737700","article-title":"GeoVisuals: A Visual Analytics Approach to Leverage the Potential of Spatial Videos and Associated Geonarratives","volume":"34","author":"Jamonnak","year":"2020","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"12","DOI":"10.54097\/ijeh.v4i1.1152","article-title":"Analysis of Algorithm Recommendation Mechanism of TikTok","volume":"4","author":"Chen","year":"2022","journal-title":"Int. J. Educ. Humanit."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Hyv\u00f6nen, E. (2012). Publishing and Using Cultural Heritage Linked Data on the Semantic Web; Synthesis Lectures on Data, Semantics, and Knowledge, Springer International Publishing.","DOI":"10.1007\/978-3-031-79438-4"},{"key":"ref_7","first-page":"33","article-title":"From Text to Geographic Coordinates: The Current State of Geocoding","volume":"19","author":"Goldberg","year":"2007","journal-title":"Urisa J."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1075\/li.30.1.03nad","article-title":"A Survey of Named Entity Recognition and Classification","volume":"30","author":"Nadeau","year":"2007","journal-title":"Lingvisticae Investig."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1007\/s10707-012-0173-8","article-title":"An Algorithm for Local Geoparsing of Microtext","volume":"17","author":"Gelernter","year":"2013","journal-title":"Geoinformatica"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1145\/2047296.2047298","article-title":"Detecting Geographical References in the Form of Place Names and Associated Spatial Natural Language","volume":"3","author":"Leidner","year":"2011","journal-title":"Sigspatial Spec."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"714","DOI":"10.1080\/13658816.2018.1458986","article-title":"A Natural Language Processing and Geospatial Clustering Framework for Harvesting Local Place Names from Geotagged Housing Advertisements","volume":"33","author":"Hu","year":"2018","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_12","unstructured":"Stenetorp, P., Pyysalo, S., Topic, G., Ohta, T., Ananiadou, S., and Tsujii, J. (2012, January 23\u201327). Brat: A Web-Based Tool for NLP-Assisted Text Annotati-on. Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1016\/j.compenvurbsys.2014.11.001","article-title":"Spatiotemporal and Semantic Information Extraction from Web News Reports about Natural Hazards","volume":"50","author":"Wang","year":"2015","journal-title":"Comput. Environ. Urban Syst."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1162\/tacl_a_00141","article-title":"Design Challenges for Entity Linking","volume":"3","author":"Ling","year":"2015","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_15","first-page":"112","article-title":"Location Reference Recognition from Texts: A Survey and Comparison","volume":"56","author":"Hu","year":"2023","journal-title":"ACM Comput. Surv."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3366\/ijhac.2015.0135","article-title":"Geoparsing, GIS, and Textual Analysis: Current Developments in Spatial Humanities Research","volume":"9","author":"Gregory","year":"2015","journal-title":"Int. J. Humanit. Arts Comput."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1111\/tgis.12212","article-title":"Automated Geocoding of Textual Documents: A Survey of Current Approaches","volume":"21","author":"Melo","year":"2017","journal-title":"Trans. GIS"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Leetaru, K.H. (2012). Fulltext Geocoding versus Spatial Metadata for Large Text Archives: Towards a Geographically Enriched Wikipedia. D-Lib Mag., 18.","DOI":"10.1045\/september2012-leetaru"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1007\/s10579-017-9385-8","article-title":"What\u2019s Missing in Geographical Parsing?","volume":"52","author":"Gritta","year":"2018","journal-title":"Lang. Resour. Eval."},{"key":"ref_20","first-page":"164","article-title":"Geographic Information Retrieval: Progress and Challenges in Spatial Search of Text","volume":"12","author":"Purves","year":"2018","journal-title":"FNT Inf. Retr."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Li, L.T., Pedronette, D.C.G., Almeida, J., Penatti, O.A.B., Calumby, R.T., Da, S., and Torres, R. (2012, January 6\u20139). Multimedia Multimodal Geocoding. Proceedings of the 20th International Conference on Advances in Geographic Information Systems, Redondo Beach, CA, USA.","DOI":"10.1145\/2424321.2424393"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Penatti, O.A.B., Li, L.T., Almeida, J., Da, S., and Torres, R. (2012, January 5\u20138). A Visual Approach for Video Geocoding Using Bag-of-Scenes. Proceedings of the 2nd ACM International Conference on Multimedia Retrieval, Hong Kong, China.","DOI":"10.1145\/2324796.2324857"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1007\/978-3-319-12610-4_12","article-title":"Beyond SIFT for Image Categorization by Bag-of-Scenes Analysis","volume":"Volume 318","author":"Fred","year":"2015","journal-title":"Pattern Recognition Applications and Methods"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Trevisiol, M., J\u00e9gou, H., Delhumeau, J., and Gravier, G. (2013, January 16\u201320). Retrieving Geo-Location of Videos with a Divide & Conquer Hierarchical Multimodal Approach. Proceedings of the 3rd ACM Conference on International Conference on Multimedia Retrieval, Dallas, TX, USA.","DOI":"10.1145\/2461466.2461468"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/j.ins.2013.02.045","article-title":"Georeferencing Flickr Resources Based on Textual Meta-Data","volume":"238","author":"Schockaert","year":"2013","journal-title":"Inf. Sci."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1553\/giscience2022_02_s66","article-title":"Methods for Georeferencing Linear and Non-Linear Media Content","volume":"10","author":"Horbach","year":"2023","journal-title":"GI_Forum"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2013). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 580\u2013587.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Graves, A., Mohamed, A., and Hinton, G. (2013, January 26\u201331). Speech Recognition with Deep Recurrent Neural Networks. Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada.","DOI":"10.1109\/ICASSP.2013.6638947"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., and Fei-Fei, L. (2014, January 23\u201328). Large-Scale Video Classification with Convolutional Neural Networks. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.223"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Tran, D., Bourdev, L., Fergus, R., Torresani, L., and Paluri, M. (2014, January 23\u201328). Learning Spatiotemporal Features with 3D Convolutional Networks. Proceedings of the IEEE International Conference on Computer Vision, Columbus, OH, USA.","DOI":"10.1109\/ICCV.2015.510"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"3137","DOI":"10.1109\/TMM.2018.2823900","article-title":"Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification","volume":"20","author":"Jiang","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Poria, S., Chaturvedi, I., Cambria, E., and Hussain, A. (2016, January 12\u201315). Convolutional MKL Based Multimodal Emotion Recognition and Sentiment Analysis. Proceedings of the 2016 IEEE 16th International Conference on Data Mining (ICDM), Barcelona, Spain.","DOI":"10.1109\/ICDM.2016.0055"},{"key":"ref_33","unstructured":"Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., and LeCun, Y. (2014). OverFeat: Integrated Recognition, Localization and Detec-tion Using Convolutional Networks. arXiv."},{"key":"ref_34","unstructured":"Chen, Z., Lam, O., Jacobson, A., and Milford, M. (2014). Convolutional Neural Network-Based Place Recognition. arXiv."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Razavian, A.S., Azizpour, H., Sullivan, J., and Carlsson, S. (2014, January 23\u201328). CNN Features Off-the-Shelf: An Astounding Baseline for Recognition. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops, Columbus, OH, USA.","DOI":"10.1109\/CVPRW.2014.131"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"ImageNet Classification with Deep Convolutional Neural Networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Commun. ACM"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"S\u00fcnderhauf, N., Shirazi, S., Dayoub, F., Upcroft, B., and Milford, M. (October, January 28). On the Performance of ConvNet Features for Place Recognition. Proceedings of the 2015 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany.","DOI":"10.1109\/IROS.2015.7353986"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Hou, Y., Zhang, H., and Zhou, S. (2015, January 8\u201310). Convolutional Neural Network-Based Image Representation for Visual Loop Closure Detection. Proceedings of the 2015 IEEE International Conference on Information and Automation, Lijiang, China.","DOI":"10.1109\/ICInfA.2015.7279659"},{"key":"ref_39","unstructured":"Panphattarasap, P., and Calway, A. (2016). Computer Vision\u2013ACCV 2016, Proceedings of the 13th Asian Conference on Computer Vision, Taipei, Taiwan, 20\u201324 November 2016, Revised Selected Papers, Part IV 13, Springer."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"488","DOI":"10.1049\/cje.2018.03.010","article-title":"CNN Feature Boosted SeqSLAM for Real-Time Loop Closure Detection","volume":"27","author":"Bai","year":"2018","journal-title":"Chin. J. Electron."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Chen, Z., Jacobson, A., Sunderhauf, N., Upcroft, B., Liu, L., Shen, C., Reid, I., and Milford, M. (June, January 29). Deep Learning Features at Scale for Visual Place Recognition. Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore.","DOI":"10.1109\/ICRA.2017.7989366"},{"key":"ref_42","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"484","DOI":"10.1109\/LRA.2016.2517824","article-title":"Beyond Holistic Descriptors, Keypoints, and Fixed Patches: Multiscale Superpixel Grids for Place Recog-nition in Changing Environments","volume":"1","author":"Neubert","year":"2016","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Chen, Z., Maffra, F., Sa, I., and Chli, M. (2017, January 24\u201328). Only Look Once, Mining Distinctive Landmarks from ConvNet for Visual Place Recogni-tion. Proceedings of the 2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada.","DOI":"10.1109\/IROS.2017.8202131"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"4015","DOI":"10.1109\/LRA.2018.2859916","article-title":"Learning Context Flexible Attention Model for Long-Term Visual Place Recognition","volume":"3","author":"Chen","year":"2018","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_46","first-page":"1655","article-title":"Fine-Tuning CNN Image Retrieval with No Human Annotation","volume":"41","author":"Tolias","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Kim, H.J., Dunn, E., and Frahm, J.-M. (2017, January 21\u201326). Learned Contextual Feature Reweighting for Image Geo-Localization. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.346"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Arandjelovic, R., Gronat, P., Torii, A., Pajdla, T., and Sivic, J. (2016, January 27\u201330). NetVLAD: CNN Architecture for Weakly Supervised Place Recogni-tion. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.572"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Weyand, T., Araujo, A., Cao, B., and Sim, J. (2020, January 14\u201319). Google Landmarks Dataset v2\u2014A Large-Scale Benchmark for Instance-Level Recognition and Retrieval. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00265"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Hausler, S., Garg, S., Xu, M., Milford, M., and Fischer, T. (2021, January 19\u201325). Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition. Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01392"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Smith, R. (2007, January 23\u201326). An Overview of the Tesseract OCR Engine. Proceedings of the Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), Curitiba, PR, Brazil.","DOI":"10.1109\/ICDAR.2007.4376991"},{"key":"ref_52","unstructured":"Islam, N., Islam, Z., and Noor, N. (2017). A Survey on Optical Character Recognition System. arXiv."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Mittal, R., and Garg, A. (2020, January 15\u201317). Text Extraction Using OCR: A Systematic Review. Proceedings of the 2020 Second International Conference on Inventive Research in Computing Applications (ICIRCA), Coimbatore, India.","DOI":"10.1109\/ICIRCA48905.2020.9183326"},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"9411","DOI":"10.1007\/s11042-020-10073-7","article-title":"Automatic Speech Recognition: A Survey","volume":"80","author":"Malik","year":"2021","journal-title":"Multimed. Tools Appl."},{"key":"ref_55","unstructured":"Radford, A., Kim, J.W., Xu, T., Brockman, G., McLeavey, C., and Sutskever, I. (2022, January 17\u201323). Robust Speech Recognition via Large-Scale Weak Su-pervision. Proceedings of the International Conference on Machine Learning, Baltimore, MD, USA."},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Sato, T., Kanade, T., Hughes, E.K., and Smith, M.A. (1998, January 3). Video OCR for Digital News Archive. Proceedings of the 1998 IEEE International Workshop on Content-Based Access of Image and Video Database, Bombay, India.","DOI":"10.1109\/CAIVD.1998.646033"},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Saluja, R., Maheshwari, A., Ramakrishnan, G., Chaudhuri, P., and Carman, M. (2019, January 20\u201325). OCR On-the-Go: Robust End-to-End Systems for Reading License Plates & Street Signs. Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR), Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00033"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Priambada, S., and Widyantoro, D.H. (2017, January 1\u20133). Levensthein Distance as a Post-Process to Improve the Performance of OCR in Written Road Signs. Proceedings of the 2017 Second International Conference on Informatics and Computing (ICIC), Jayapura, Indonesia.","DOI":"10.1109\/IAC.2017.8280534"},{"key":"ref_59","first-page":"43","article-title":"Use of Place Names in the Subtitle Corpus of Highest-Grossing Movies of the Past 20 Years","volume":"1","author":"Paiders","year":"2018","journal-title":"J. Int. Symp. Stud. Engl. Croat. Ital. Stud."},{"key":"ref_60","unstructured":"(2024, January 07). ARD Mediathek. Available online: https:\/\/www.ardmediathek.de\/."},{"key":"ref_61","first-page":"120","article-title":"The OpenCV Library","volume":"25","author":"Bradski","year":"2020","journal-title":"Dr. Dobb\u2019s J. Softw. Tools"},{"key":"ref_62","unstructured":"Cao, B., Araujo, A., and Sim, J. (2020). Computer Vision\u2013ECCV 2020, Proceedings of the 16th European Conference, Glasgow, UK, 23\u201328 August 2020, Proceedings, Part XX 16, Springer International Publishing."},{"key":"ref_63","unstructured":"(2024, January 07). Wikimedia Commons. Available online: https:\/\/commons.wikimedia.org\/wiki\/Main_Page."},{"key":"ref_64","unstructured":"(2024, February 09). ABBYY FineReader PDF. Available online: https:\/\/pdf.abbyy.com\/."},{"key":"ref_65","unstructured":"(2024, February 09). Adobe Acrobat: Easily Edit Your Scanned PDF Documents with OCR. Available online: https:\/\/www.adobe.com\/acrobat\/how-to\/ocr-software-convert-pdf-to-text.html."},{"key":"ref_66","unstructured":"(2024, February 09). Google Cloud Vision API: Detect Text in Images. Available online: https:\/\/cloud.google.com\/vision\/docs\/ocr."},{"key":"ref_67","unstructured":"(2024, February 09). Microsoft Azure AI Vision Documentation: OCR\u2014Optical Character Recognition. Available online: https:\/\/learn.microsoft.com\/en-us\/azure\/ai-services\/computer-vision\/overview-ocr."},{"key":"ref_68","unstructured":"(2024, February 09). Amazon Textract: Automatically Extract Printed Text, Handwriting, Layout Elements and Any Data from Any Document. Available online: https:\/\/aws.amazon.com\/textract."},{"key":"ref_69","unstructured":"(2024, January 07). Nominatim: Open-Source Geocoding with OpenStreetMap Data. Available online: https:\/\/nominatim.org\/."},{"key":"ref_70","unstructured":"(2024, January 07). OpenStreetMap. Available online: https:\/\/openstreetmap.org\/."},{"key":"ref_71","unstructured":"Akbik, A., Bergmann, T., Blythe, D., Rasul, K., Schweter, S., and Vollgraf, R. (2019, January 6\u201311). FLAIR: An Easy-to-Use Framework for State-of-the-Art NLP. Proceedings of the NAACL 2019, 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations), Online."},{"key":"ref_72","unstructured":"Akbik, A., Blythe, D., and Vollgraf, R. (2018, January 20\u201326). Contextual String Embeddings for Sequence Labeling. Proceedings of the COLING 2018, 27th International Conference on Computational Linguistics, Santa Fe, NM, USA."},{"key":"ref_73","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1111\/tgis.12510","article-title":"GeoTxt: A Scalable Geoparsing System for Unstructured Text Geolocation","volume":"23","author":"Karimzadeh","year":"2019","journal-title":"Trans. GIS"},{"key":"ref_74","unstructured":"(2024, January 07). spaCy: Industrial-Strength Natural Language Processing. Available online: https:\/\/spacy.io\/."},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Manning, C., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S., and McClosky, D. (2014, January 22\u201327). The Stanford CoreNLP Natural Language Processing Toolkit. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Baltimore, MD, USA.","DOI":"10.3115\/v1\/P14-5010"},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Qi, P., Zhang, Y., Zhang, Y., Bolton, J., and Manning, C.D. (2020). Stanza: A Python Natural Language Processing Toolkit for Many Human Languages. arXiv.","DOI":"10.18653\/v1\/2020.acl-demos.14"},{"key":"ref_77","unstructured":"Benikova, D., Biemann, C., and Reznicek, M. (2014, January 26\u201331). NoSta-D Named Entity Annotation for German: Guidelines and Dataset. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC\u201914), Reykjavik, Iceland."},{"key":"ref_78","unstructured":"(2024, January 07). OpenAI GPT-3.5 Turbo Fine-Tuning and API Updates. Available online: https:\/\/openai.com\/blog\/gpt-3-5-turbo-fine-tuning-and-api-updates."},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"Hossain, M.M., Labib, M.F., Rifat, A.S., Das, A.K., and Mukta, M. (2019, January 28\u201330). Auto-Correction of English to Bengali Transliteration System Using Levenshtein Distance. Proceedings of the 2019 7th International Conference on Smart Computing & Communications (ICSCC), Sarawak, Malaysia.","DOI":"10.1109\/ICSCC.2019.8843613"}],"container-title":["Future Internet"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-5903\/16\/3\/87\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T14:08:00Z","timestamp":1760105280000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-5903\/16\/3\/87"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,1]]},"references-count":79,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2024,3]]}},"alternative-id":["fi16030087"],"URL":"https:\/\/doi.org\/10.3390\/fi16030087","relation":{},"ISSN":["1999-5903"],"issn-type":[{"value":"1999-5903","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,3,1]]}}}