{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T23:31:13Z","timestamp":1780443073114,"version":"3.54.1"},"reference-count":42,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,1,6]],"date-time":"2021-01-06T00:00:00Z","timestamp":1609891200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,1,6]],"date-time":"2021-01-06T00:00:00Z","timestamp":1609891200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001711","name":"Swiss National Science foundation","doi-asserted-by":"crossref","award":["NRP 75, grant 407540 167149"],"award-info":[{"award-number":["NRP 75, grant 407540 167149"]}],"id":[{"id":"10.13039\/501100001711","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks-the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available.<\/jats:p>","DOI":"10.1186\/s40537-020-00383-w","type":"journal-article","created":{"date-parts":[[2021,1,6]],"date-time":"2021-01-06T13:06:31Z","timestamp":1609938391000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":50,"title":["Querying knowledge graphs in natural language"],"prefix":"10.1186","volume":"8","author":[{"given":"Shiqi","family":"Liang","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kurt","family":"Stockinger","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tarcisio Mendes","family":"de Farias","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Maria","family":"Anisimova","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Manuel","family":"Gil","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2021,1,6]]},"reference":[{"key":"383_CR1","doi-asserted-by":"crossref","unstructured":"Boutet E, Lieberherr D, Tognolli M, Schneider M, Bairoch A. Uniprotkb\/swiss-prot. In: Plant Bioinformatics, pp. 89\u2013112. Springer, 2007.","DOI":"10.1007\/978-1-59745-535-0_4"},{"issue":"3","key":"383_CR2","doi-asserted-by":"publisher","first-page":"529","DOI":"10.1007\/s10115-017-1100-y","volume":"55","author":"D Diefenbach","year":"2018","unstructured":"Diefenbach D, Lopez V, Singh K, Maret P. Core techniques of question answering systems over knowledge bases: a survey. Knowl Informat syst. 2018;55(3):529\u201369.","journal-title":"Knowl Informat syst"},{"issue":"1","key":"383_CR3","doi-asserted-by":"publisher","first-page":"73","DOI":"10.14778\/2735461.2735468","volume":"8","author":"F Li","year":"2014","unstructured":"Li F, Jagadish H. Constructing an interactive natural language interface for relational databases. Proceed VLDB Endowment. 2014;8(1):73\u201384.","journal-title":"Proceed VLDB Endowment"},{"key":"383_CR4","doi-asserted-by":"crossref","unstructured":"Basik F, H\u00e4ttasch B, Ilkhechi A, Usta A, Ramaswamy S, Utama P, Weir N, Binnig C, Cetintemel U. Dbpal: A learned nl-interface for databases. In: Proceedings of the 2018 International Conference on Management of Data, ACM 2018;1765\u20131768.","DOI":"10.1145\/3183713.3193562"},{"key":"383_CR5","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-019-00567-8","author":"K Affolter","year":"2019","unstructured":"Affolter K, Stockinger K, Bernstein A. A comparative survey of recent natural language interfaces for databases. VLDB J. 2019. https:\/\/doi.org\/10.1007\/s00778-019-00567-8.","journal-title":"VLDB J."},{"issue":"6","key":"383_CR6","doi-asserted-by":"publisher","first-page":"895","DOI":"10.3233\/SW-160247","volume":"8","author":"K H\u00f6ffner","year":"2017","unstructured":"H\u00f6ffner K, Walter S, Marx E, Usbeck R, Lehmann J, Ngonga Ngomo A-C. Survey on challenges of question answering in the semantic web. Semant Web. 2017;8(6):895\u2013920.","journal-title":"Semant Web"},{"key":"383_CR7","unstructured":"Sing K, Lytra I, Radhakrishna AS, Shekarpour S, Vidal M-E, Lehmann J. No one is perfect: Analysing the performance of question answering components over the dbpedia knowledge graph. arXiv preprint arXiv:1809.10044. 2018."},{"key":"383_CR8","doi-asserted-by":"publisher","first-page":"baz106","DOI":"10.1093\/database\/baz106","volume":"2019","author":"AC Sima","year":"2019","unstructured":"Sima AC, Mendes de Farias T, Zbinden E, Anisimova M, Gil M, Stockinger H, Stockinger K, Robinson-Rechavi M, Dessimoz C. Enabling semantic queries across federated bioinformatics databases. Database. 2019;2019: baz106.","journal-title":"Database."},{"key":"383_CR9","doi-asserted-by":"crossref","unstructured":"Zafar H, Napolitano G, Lehmann J. Formal query generation for question answering over knowledge bases. In: European Semantic Web Conference, 2018;714\u2013728. Springer","DOI":"10.1007\/978-3-319-93417-4_46"},{"key":"383_CR10","doi-asserted-by":"crossref","unstructured":"Singh K, Radhakrishna AS, Both A, Shekarpour S, Lytra I, Usbeck R, Vyas A, Khikmatullaev A, Punjani D, Lange C, Vidal ME, Lehmann J, Auer S. Why reinvent the wheel: Let\u2019s build question answering systems together. In: Proceedings of the 2018 World Wide Web Conference 2018.","DOI":"10.1145\/3178876.3186023"},{"key":"383_CR11","unstructured":"Trivedi P, Maheshwari G, Dubey M, Lehmann J. Lc-quad: A corpus for complex question answering over knowledge graphs. In: International Semantic Web Conference, 2017;210\u2013218. Springer"},{"key":"383_CR12","doi-asserted-by":"crossref","unstructured":"Tai KS, Socher R, Manning CD. Improved semantic representations from tree-structured long short-term memory networks. arXiv preprint arXiv:1503.00075. 2015.","DOI":"10.3115\/v1\/P15-1150"},{"issue":"4","key":"383_CR13","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1017\/S0269888900005476","volume":"5","author":"A Copestake","year":"1990","unstructured":"Copestake A, Jones KS. Natural language interfaces to databases. Knowl Eng Rev. 1990;5(4):225\u201349.","journal-title":"Knowl Eng Rev"},{"issue":"1","key":"383_CR14","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1017\/S135132490000005X","volume":"1","author":"I Androutsopoulos","year":"1995","unstructured":"Androutsopoulos I, Ritchie GD, Thanisch P. Natural language interfaces to databases-an introduction. Nat Lang Eng. 1995;1(1):29\u201381.","journal-title":"Nat Lang Eng"},{"key":"383_CR15","doi-asserted-by":"publisher","unstructured":"Popescu A-M, Etzioni O, Kautz H. Towards a theory of natural language interfaces to databases. In: Proceedings of the 8th International Conference on Intelligent User Interfaces. IUI \u201903, pp. 149\u2013157. Association for Computing Machinery, New York 2003. https:\/\/doi.org\/10.1145\/604045.604070.","DOI":"10.1145\/604045.604070"},{"key":"383_CR16","unstructured":"Dong L, Lapata M. Language to logical form with neural attention. CoRR abs\/1601.01280. 1601.01280. 2016."},{"key":"383_CR17","unstructured":"Xu X, Liu C, Song D. Sqlnet: Generating structured queries from natural language without reinforcement learning. CoRR abs\/1711.04436. 1711.04436CoRR 2017."},{"key":"383_CR18","doi-asserted-by":"crossref","unstructured":"Guo J, Zhan Z, Gao Y, Xiao Y, Lou J, Liu T, Zhang D. Towards complex text-to-sql in cross-domain database with intermediate representation. CoRR abs\/1905.08205. 2019. 1905.08205","DOI":"10.18653\/v1\/P19-1444"},{"key":"383_CR19","doi-asserted-by":"crossref","unstructured":"Wang B, Shin R, Liu X, Polozov O, Richardson M. Rat-sql: Relation-aware schema encoding and linking for text-to-sql parsers. 2019. arXiv preprint arXiv:1911.04942.","DOI":"10.18653\/v1\/2020.acl-main.677"},{"key":"383_CR20","doi-asserted-by":"publisher","unstructured":"Zou L, Huang R, Wang H, Yu J, He W, Zhao D. Natural language question answering over rdf - a graph data driven approach. Proceedings of the ACM SIGMOD International Conference on Management of Data. 2014. https:\/\/doi.org\/10.1145\/2588555.2610525.","DOI":"10.1145\/2588555.2610525"},{"key":"383_CR21","doi-asserted-by":"publisher","first-page":"84","DOI":"10.1007\/978-3-319-69146-6_8","volume-title":"Semantic Web Challenges","author":"D Diefenbach","year":"2017","unstructured":"Diefenbach D, Singh K, Maret P. Wdaqua-core0: a question answering component for the research community. In: Dragoni M, Solanki M, Blomqvist E, editors. Semantic Web Challenges. Cham: Springer; 2017. p. 84\u201389."},{"key":"383_CR22","unstructured":"Diefenbach D, Both A, Singh K, Maret P. Towards a question answering system over the semantic web. Semantic Web. 2018;1\u201319:"},{"key":"383_CR23","unstructured":"Chakraborty N, Lukovnikov D, Maheshwari G, Trivedi P, Lehmann J, Fischer A. Introduction to neural network based approaches for question answering over knowledge graphs. 2019. arXiv preprint arXiv:1907.09361."},{"key":"383_CR24","doi-asserted-by":"crossref","unstructured":"Abdelkawi A, Zafar H, Maleshkova M, Lehmann J. Complex query augmentation for question answering over knowledge graphs. In: OTM Confederated International Conferences\u201d On the Move to Meaningful Internet Systems\u201d, 2019:571\u2013587. Springer","DOI":"10.1007\/978-3-030-33246-4_36"},{"key":"383_CR25","doi-asserted-by":"crossref","unstructured":"Honnibal M, Johnson M. An improved non-monotonic transition system for dependency parsing. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015:1373\u20131378","DOI":"10.18653\/v1\/D15-1162"},{"key":"383_CR26","volume-title":"Modern Information Retrieval","author":"R Baeza-Yates","year":"1999","unstructured":"Baeza-Yates R, Ribeiro-Neto B, et al. Modern Information Retrieval, vol. 463. New York: ACM press; 1999."},{"issue":"1","key":"383_CR27","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L. Random forests. Machine learn. 2001;45(1):5\u201332.","journal-title":"Machine learn"},{"key":"383_CR28","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1108\/00330331211221828","volume":"46","author":"M Morsey","year":"2012","unstructured":"Morsey M, Lehmann J, Auer S, Stadler C, Hellmann S. Dbpedia and the live extraction of structured data from wikipedia. Program Electron Libr Informat Syst. 2012;46:157\u201381. https:\/\/doi.org\/10.1108\/00330331211221828.","journal-title":"Program Electron Libr Informat Syst"},{"key":"383_CR29","doi-asserted-by":"crossref","unstructured":"Daiber J, Jakob M, Hokamp C, Mendes PN. Improving efficiency and accuracy in multilingual entity extraction. In: Proceedings of the 9th International Conference on Semantic Systems (I-Semantics) 2013.","DOI":"10.1145\/2506182.2506198"},{"key":"383_CR30","doi-asserted-by":"publisher","unstructured":"Ferragina P, Scaiella U. Tagme: On-the-fly annotation of short text fragments (by wikipedia entities). In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management. CIKM \u201910, pp. 1625\u20131628. ACM, New York, 2010. https:\/\/doi.org\/10.1145\/1871437.1871689.","DOI":"10.1145\/1871437.1871689"},{"key":"383_CR31","doi-asserted-by":"publisher","unstructured":"Dubey M, Banerjee D, Chaudhuri D, Lehmann J. EARL: joint entity and relation linking for question answering over knowledge graphs. CoRR abs\/1801.03825 2018;. https:\/\/doi.org\/10.1007\/s00778-019-00567-80","DOI":"10.1007\/s00778-019-00567-8"},{"key":"383_CR32","doi-asserted-by":"publisher","unstructured":"Sakor A, Onando\u00a0Mulang\u2019 I, Singh K, Shekarpour S, Esther\u00a0Vidal M, Lehmann J, Auer S. Old is gold: Linguistic driven approach for entity and relation linking of short text. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 2336\u20132346. Association for Computational Linguistics, Minneapolis, Minnesota 2019;. https:\/\/doi.org\/10.1007\/s00778-019-00567-81","DOI":"10.1007\/s00778-019-00567-8"},{"key":"383_CR33","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1016\/j.websem.2013.05.006","volume":"21","author":"V Lopez","year":"2013","unstructured":"Lopez V, Unger C, Cimiano P, Motta E. Evaluating question answering over linked data. Web Semant Sci Serv Agents World Wide Web. 2013;21:3\u201313. https:\/\/doi.org\/10.1016\/j.websem.2013.05.0062.","journal-title":"Web Semant Sci Serv Agents World Wide Web"},{"key":"383_CR34","doi-asserted-by":"publisher","first-page":"210","DOI":"10.1007\/978-3-319-68204-4_22","volume-title":"The Semantic Web-ISWC 2017","author":"P Trivedi","year":"2017","unstructured":"Trivedi P, Maheshwari G, Dubey M, Lehmann J. Lc-quad: A corpus for complex question answering over knowledge graphs. In: d\u2019Amato C, Fernandez M, Tamma V, Lecue F, Cudr\u00e9-Mauroux P, Sequeda J, Lange C, Heflin J, editors. The Semantic Web-ISWC 2017. Cham: Springer; 2017. p. 210\u2013218."},{"key":"383_CR35","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1007\/978-3-319-69146-6_6","volume-title":"Semantic Web Challenges","author":"R Usbeck","year":"2017","unstructured":"Usbeck R, Ngomo A-CN, Haarmann B, Krithara A, R\u00f6der M. Napolitano G. 7th open challenge on question answering over linked data (qald-7). In: Dragoni M, Solanki M, Blomqvist E, editors. Semantic Web Challenges. Cham: Springer; 2017. p. 59\u201369."},{"key":"383_CR36","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1162\/tacl_a_00051","volume":"5","author":"P Bojanowski","year":"2017","unstructured":"Bojanowski P, Grave E, Joulin A, Mikolov T. Enriching word vectors with subword information. Transact Assoc Computat Linguist. 2017;5:135\u201346.","journal-title":"Transact Assoc Computat Linguist"},{"issue":"Jul","key":"383_CR37","first-page":"2121","volume":"12","author":"J Duchi","year":"2011","unstructured":"Duchi J, Hazan E, Singer Y. Adaptive subgradient methods for online learning and stochastic optimization. J Mach Learn Res. 2011;12(Jul):2121\u201359.","journal-title":"J Mach Learn Res"},{"key":"383_CR38","unstructured":"Kullback S. Information Theory and Statistics.: Courier Corporation; 1997."},{"key":"383_CR39","doi-asserted-by":"publisher","unstructured":"Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J. Freebase: A collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. SIGMOD \u201908, pp. 1247\u20131250. ACM, New York 2008. https:\/\/doi.org\/10.1145\/1376616.1376746.","DOI":"10.1145\/1376616.1376746"},{"key":"383_CR40","doi-asserted-by":"crossref","unstructured":"Raiman JR, Raiman OM. Deeptype: multilingual entity linking by neural type system evolution. In: Thirty-Second AAAI Conference on Artificial Intelligence 2018.","DOI":"10.1609\/aaai.v32i1.12008"},{"key":"383_CR41","doi-asserted-by":"crossref","unstructured":"Vrande\u010di\u0107 D, Kr\u00f6tzsch M. Wikidata: a free collaborative knowledge base 2014.","DOI":"10.1145\/2629489"},{"key":"383_CR42","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1016\/j.artint.2012.06.001","volume":"194","author":"J Hoffart","year":"2013","unstructured":"Hoffart J, Suchanek FM, Berberich K, Weikum G. Yago2: a spatially and temporally enhanced knowledge base from wikipedia. Artifici Intell. 2013;194:28\u201361.","journal-title":"Artifici Intell"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00383-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s40537-020-00383-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00383-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,10]],"date-time":"2022-12-10T15:06:01Z","timestamp":1670684761000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-020-00383-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,6]]},"references-count":42,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["383"],"URL":"https:\/\/doi.org\/10.1186\/s40537-020-00383-w","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-70794\/v1","asserted-by":"object"}]},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,6]]},"assertion":[{"value":"11 September 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 November 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 January 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"3"}}