{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,6]],"date-time":"2026-03-06T04:38:15Z","timestamp":1772771895819,"version":"3.50.1"},"reference-count":38,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2021,12,2]],"date-time":"2021-12-02T00:00:00Z","timestamp":1638403200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001665","name":"Agence Nationale de la Recherche","doi-asserted-by":"publisher","award":["16-IDEX-0005"],"award-info":[{"award-number":["16-IDEX-0005"]}],"id":[{"id":"10.13039\/501100001665","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJGI"],"abstract":"<jats:p>Geocoding aims to assign unambiguous locations (i.e., geographic coordinates) to place names (i.e., toponyms) referenced within documents (e.g., within spreadsheet tables or textual paragraphs). This task comes with multiple challenges, such as dealing with referent ambiguity (multiple places with a same name) or reference database completeness. In this work, we propose a geocoding approach based on modeling pairs of toponyms, which returns latitude-longitude coordinates. One of the input toponyms will be geocoded, and the second one is used as context to reduce ambiguities. The proposed approach is based on a deep neural network that uses Long Short-Term Memory (LSTM) units to produce representations from sequences of character n-grams. To train our model, we use toponym co-occurrences collected from different contexts, namely textual (i.e., co-occurrences of toponyms in Wikipedia articles) and geographical (i.e., inclusion and proximity of places based on Geonames data). Experiments based on multiple geographical areas of interest\u2014France, United States, Great-Britain, Nigeria, Argentina and Japan\u2014were conducted. Results show that models trained with co-occurrence data obtained a higher geocoding accuracy, and that proximity relations in combination with co-occurrences can help to obtain a slightly higher accuracy in geographical areas with fewer places in the data sources.<\/jats:p>","DOI":"10.3390\/ijgi10120818","type":"journal-article","created":{"date-parts":[[2021,12,2]],"date-time":"2021-12-02T21:19:08Z","timestamp":1638479948000},"page":"818","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["Deep Learning for Toponym Resolution: Geocoding Based on Pairs of Toponyms"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1783-934X","authenticated-orcid":false,"given":"Jacques","family":"Fize","sequence":"first","affiliation":[{"name":"INSA Lyon, LIRIS UMR CNRS 5205, 69100 Villeurbanne, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1590-9546","authenticated-orcid":false,"given":"Ludovic","family":"Moncla","sequence":"additional","affiliation":[{"name":"INSA Lyon, LIRIS UMR CNRS 5205, 69100 Villeurbanne, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3856-2936","authenticated-orcid":false,"given":"Bruno","family":"Martins","sequence":"additional","affiliation":[{"name":"Instituto Superior T\u00e9cnico and INESC-ID, University of Lisbon, 1049-001 Lisbon, Portugal"}]}],"member":"1968","published-online":{"date-parts":[[2021,12,2]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Smith, D.A., and Crane, G. (2001, January 4\u20139). Disambiguating geographic names in a historical digital library. Proceedings of the International Conference on Theory and Practice of Digital Libraries, Darmstadt, Germany.","DOI":"10.1007\/3-540-44796-2_12"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/j.cageo.2016.07.017","article-title":"A survey on the geographic scope of textual documents","volume":"96","author":"Monteiro","year":"2016","journal-title":"Comput. Geosci."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1145\/2047296.2047300","article-title":"Approaches to Disambiguating Toponyms","volume":"3","author":"Buscaldi","year":"2011","journal-title":"Sigspatial Spec."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"DeLozier, G., Baldridge, J., and London, L. (2015, January 25\u201330). Gazetteer-Independent Toponym Resolution Using Geographic Word Profiles. Proceedings of the 29th AAAI Conference on Artificial Intelligence, Austin, TX, USA.","DOI":"10.1609\/aaai.v29i1.9531"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ardanuy, M.C., and Sporleder, C. (2017, January 1\u20132). Toponym disambiguation in historical documents using semantic and geographic features. Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage, G\u00f6ttingen, Germany.","DOI":"10.1145\/3078081.3078099"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Moncla, L., McDonough, K., Vigier, D., Joliveau, T., and Brenon, A. (2019, January 5). Toponym disambiguation in historical documents using network analysis of qualitative relationships. Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Geospatial Humanities, Chicago, IL, USA.","DOI":"10.1145\/3356991.3365471"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1145\/1328964.1328989","article-title":"Toponym resolution in text: Annotation, evaluation and applications of spatial grounding","volume":"Volume 41","author":"Leidner","year":"2007","journal-title":"ACM SIGIR Forum"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1080\/13658810701626251","article-title":"A conceptual density-based approach for the disambiguation of toponyms","volume":"22","author":"Buscaldi","year":"2008","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Lieberman, M.D., Samet, H., and Sankaranarayanan, J. (2010, January 1\u20136). Geotagging with local lexicons to build indexes for textually-specified spatial data. Proceedings of the 26th International Conference on Data Engineering (ICDE), Long Beach, CA, USA.","DOI":"10.1109\/ICDE.2010.5447903"},{"key":"ref_10","unstructured":"Ester, M., Kriegel, H.P., Sander, J., and Xu, X. (1996, January 2\u20134). A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining KDD\u201996, Portland, OR, USA."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Moncla, L., Renteria-Agualimpia, W., Nogueras-Iso, J., and Gaio, M. (2014, January 4). Geocoding for texts with fine-grain toponyms: An experiment on a geoparsed hiking descriptions corpus. Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Dallas, TX, USA.","DOI":"10.1145\/2666310.2666386"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Amitay, E., Har\u2019El, N., Sivan, R., and Soffer, A. (2004, January 25\u201329). Web-a-where: Geotagging web content. Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Sheffield, UK.","DOI":"10.1145\/1008992.1009040"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1080\/13658810701626236","article-title":"Using co-occurrence models for placename disambiguation","volume":"22","author":"Overell","year":"2008","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Batista, D.S., Ferreira, J.D., Couto, F.M., and Silva, M.J. (2012, January 17\u201320). Toponym disambiguation using ontology-based semantic similarity. Proceedings of the International Conference on Computational Processing of the Portuguese Language, Coimbra, Portugal.","DOI":"10.1007\/978-3-642-28885-2_20"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Scharl, A., and Tochtermann, K. (2007). A Supervised Machine Learning Approach to Toponym Disambiguation. The Geospatial Web: How Geobrowsers, Social Software and the Web 2.0 Are Shaping the Network Society, Springer.","DOI":"10.1007\/978-1-84628-827-2"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Lieberman, M.D., and Samet, H. (2012, January 12\u201316). Adaptive Context Features for Toponym Resolution in Streaming News. Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, OR, USA.","DOI":"10.1145\/2348283.2348381"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"114855","DOI":"10.1016\/j.eswa.2021.114855","article-title":"Geographic Named Entity Recognition and Disambiguation in Mexican News using word embeddings","volume":"176","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1007\/s10708-014-9553-y","article-title":"Using machine learning methods for disambiguating place references in textual documents","volume":"80","author":"Santos","year":"2015","journal-title":"GeoJournal"},{"key":"ref_19","first-page":"1","article-title":"Neural network methods for natural language processing","volume":"10","author":"Goldberg","year":"2017","journal-title":"Synth. Lect. Hum. Lang. Technol."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Kamalloo, E., and Rafiei, D. (2018, January 23\u201327). A coherent unsupervised model for toponym resolution. Proceedings of the 2018 World Wide Web Conference, Lyon, France.","DOI":"10.1145\/3178876.3186027"},{"key":"ref_21","unstructured":"Speriosu, M., and Baldridge, J. (2013, January 4\u20139). Text-driven toponym resolution using indirect supervision. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Sofia, Bulgaria."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1007\/s10579-017-9385-8","article-title":"What is missing in geographical parsing?","volume":"52","author":"Gritta","year":"2018","journal-title":"Lang. Resour. Eval."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Cardoso, A.B., Martins, B., and Estima, J. (2019, January 3\u20136). Using Recurrent Neural Networks for Toponym Resolution in Text. Proceedings of the EPIA Conference on Artificial Intelligence, Vila Real, Portugal.","DOI":"10.1007\/978-3-030-30244-3_63"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Kulkarni, S., Jain, S., Hosseini, M., Baldridge, J., Le, E., and Zhang, L. (2021, January 6). Multi-Level Gazetteer-Free Geocoding. Proceedings of the International Combined Workshop on Spatial Language Understanding and Grounded Communication for Robotics, Bangkok, Thailand.","DOI":"10.18653\/v1\/2021.splurobonlp-1.9"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Peters, M.E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., and Zettlemoyer, L. (2018, January 1\u20136). Deep contextualized word representations. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), New Orleans, LA, USA.","DOI":"10.18653\/v1\/N18-1202"},{"key":"ref_26","unstructured":"Huang, Z., Xu, W., and Yu, K. (2015). Bidirectional LSTM-CRF models for sequence tagging. arXiv."},{"key":"ref_27","unstructured":"Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013, January 2\u20134). Efficient Estimation of Word Representations in Vector Space. Proceedings of the 1st International Conference on Learning Representations, ICLR Workshop Track Proceedings, Scottsdale, AZ, USA."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Akbik, A., Bergmann, T., and Vollgraf, R. (2019, January 3\u20135). Pooled Contextualized Embeddings for Named Entity Recognition. Proceedings of the 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Minneapolis, MN, USA.","DOI":"10.18653\/v1\/N19-1078"},{"key":"ref_29","unstructured":"Agarap, A.F. (2018). Deep learning using rectified linear units (relu). arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1145\/2629489","article-title":"Wikidata: A Free Collaborative Knowledgebase","volume":"57","year":"2014","journal-title":"Commun. ACM"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1086\/427976","article-title":"HEALPix: A Framework for High-Resolution Discretization and Fast Analysis of Data Distributed on the Sphere","volume":"622","author":"Gorski","year":"2005","journal-title":"Astrophys. J."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Cheng, Z., Caverlee, J., and Lee, K. (2010, January 26\u201330). You are where you tweet: A content-based approach to geo-locating twitter users. Proceedings of the 19th ACM International Conference on INFORMATION and Knowledge Management, Toronto, ON, Canada.","DOI":"10.1145\/1871437.1871535"},{"key":"ref_33","unstructured":"Jurgens, D., Finethy, T., McCorriston, J., Xu, Y.T., and Ruths, D. (2015, January 26\u201329). Geolocation prediction in twitter using social networks: A critical analysis and review of current practice. Proceedings of the Ninth International AAAI Conference on Web and Social Media, Oxford, UK."},{"key":"ref_34","unstructured":"Mani, I., Hitzeman, J., Richer, J., Harris, D., Quimby, R., and Wellner, B. (2008, January 28\u201330). SpatialML: Annotation Scheme, Corpora, and Tools. Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Leidner, J.L. (2007). Toponym Resolution in Text: Annotation, Evaluation and Applications of Spatial Grounding of Place Names. [Ph.D. Thesis, University of Edinburgh].","DOI":"10.1145\/1328964.1328989"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Rayson, P., Reinhold, A., Butler, J., Donaldson, C., Gregory, I., and Taylor, J. (2017, January 7\u201310). A deeply annotated testbed for geographical text analysis: The corpus of lake district writing. Proceedings of the 1st ACM SIGSPATIAL Workshop on Geospatial Humanities, Redondo Beach, CA, USA.","DOI":"10.1145\/3149858.3149865"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"DeLozier, G., Wing, B., Baldridge, J., and Nesbit, S. (2016, January 11). Creating a Novel Geolocation Corpus from Historical Texts. Proceedings of the 10th Linguistic Annotation Workshop held in conjunction with ACL 2016 (LAW-X 2016), Berlin, Germany.","DOI":"10.18653\/v1\/W16-1721"},{"key":"ref_38","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv."}],"container-title":["ISPRS International Journal of Geo-Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2220-9964\/10\/12\/818\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:39:06Z","timestamp":1760168346000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2220-9964\/10\/12\/818"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12,2]]},"references-count":38,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2021,12]]}},"alternative-id":["ijgi10120818"],"URL":"https:\/\/doi.org\/10.3390\/ijgi10120818","relation":{},"ISSN":["2220-9964"],"issn-type":[{"value":"2220-9964","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,12,2]]}}}