{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,1]],"date-time":"2025-11-01T22:25:00Z","timestamp":1762035900409,"version":"build-2065373602"},"reference-count":90,"publisher":"Informa UK Limited","issue":"1-2","license":[{"start":{"date-parts":[[2021,2,28]],"date-time":"2021-02-28T00:00:00Z","timestamp":1614470400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"content-domain":{"domain":["www.tandfonline.com"],"crossmark-restriction":true},"short-container-title":["New Review of Hypermedia and Multimedia"],"published-print":{"date-parts":[[2021,4,3]]},"DOI":"10.1080\/13614568.2021.1889692","type":"journal-article","created":{"date-parts":[[2021,3,1]],"date-time":"2021-03-01T07:53:27Z","timestamp":1614585207000},"page":"128-176","update-policy":"https:\/\/doi.org\/10.1080\/tandf_crossmark_01","source":"Crossref","is-referenced-by-count":9,"title":["Knowledge models from PDF textbooks"],"prefix":"10.1080","volume":"27","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6931-9787","authenticated-orcid":false,"given":"Isaac","family":"Alpizar-Chacon","sequence":"first","affiliation":[{"name":"Utrecht University, Utrecht, The Netherlands"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8023-1770","authenticated-orcid":false,"given":"Sergey","family":"Sosnovsky","sequence":"additional","affiliation":[{"name":"Utrecht University, Utrecht, The Netherlands"}]}],"member":"301","published-online":{"date-parts":[[2021,2,28]]},"reference":[{"key":"CIT0001","unstructured":"The chicago manual of style. 2017. The University of Chicago Press."},{"key":"CIT0002","doi-asserted-by":"publisher","DOI":"10.3115\/992628.992635"},{"key":"CIT0003","unstructured":"Alpizar-Chacon, I. & Sosnovsky, S. (2019). Interlingua: Linking textbooks across different languages. In S. Sosnovsky, P. Brusilovsky, R. Baraniuk, R. Agrawal, & A. Lan (Eds.),Proceedings of the First Workshop on Intelligent Textbooks (Vol. 2384, pp. 104\u2013117). CEUR-WS."},{"key":"CIT0004","unstructured":"Alpizar-Chacon, I., van der Hart, M., Wiersma, Z. S., Theunissen, L. & Sosnovsky, S. (2020). Transformation of PDF textbooks into intelligent educational resources. In S. Sosnovsky, P. Brusilovsky, R. Baraniuk, & A. Lan (Eds.), Proceedings of the Second Workshop on Intelligent Textbooks (Vol. 2674, pp. 4\u201316). CEUR-WS."},{"key":"CIT0005","doi-asserted-by":"publisher","DOI":"10.1016\/B978-081551481-7.50002-0"},{"key":"CIT0006","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-02614-0_19"},{"key":"CIT0007","doi-asserted-by":"publisher","DOI":"10.1109\/JCDL.2017.7991564"},{"key":"CIT0008","unstructured":"Bayomi, M. & Lawless, S. (2018). C-HTS: A concept-based hierarchical text segmentation approach. In N. Calzolari (Conference chair), K. Choukri, C. Cieri, T. Declerck, S. Goggi, K. Hasida, H. Isahara, B. Maegaard, J. Mariani, H. Mazo, A. Moreno, J. Odijk, S. Piperidis, & T. Tokunaga (Eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018). European Language Resources Association."},{"key":"CIT0009","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2009.07.002"},{"key":"CIT0010","first-page":"993","volume":"3","author":"Blei D. M.","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"CIT0011","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-77703-0_71"},{"key":"CIT0012","unstructured":"Chambliss, M. J. (2002). The characteristics of well-designed science textbooks. In J. Otero, J. A. Le\u00f3n, & A. C. Graesser (Eds.), The psychology of science text comprehension (pp. 51\u201372)."},{"key":"CIT0013","doi-asserted-by":"publisher","DOI":"10.17265\/2159-5313\/2016.09.003"},{"key":"CIT0014","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-28640-0_20"},{"key":"CIT0015","unstructured":"Councill, I. G. & Giles, C. L. (2008). ParsCit: An open-source CRF reference string parsing package. In N. Calzolari, K. Choukri, B. Maegaard, J. Mariani, J. Odijk, S. Piperidis, & D. Tapias (Eds.), International language resources and evaluation. European Language Resources Association."},{"volume-title":"Introductory statistics with R","year":"2011","author":"Dalgaard P.","key":"CIT0016"},{"key":"CIT0017","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-009-0078-8"},{"key":"CIT0018","doi-asserted-by":"publisher","DOI":"10.1007\/1-84628-168-7"},{"key":"CIT0019","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-013-0324-z"},{"key":"CIT0020","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4614-0391-3"},{"key":"CIT0021","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2013.290"},{"key":"CIT0022","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-010-0127-3"},{"key":"CIT0023","doi-asserted-by":"publisher","DOI":"10.1109\/RE.2013.6636736"},{"key":"CIT0024","doi-asserted-by":"publisher","DOI":"10.1201\/b18587"},{"key":"CIT0025","doi-asserted-by":"publisher","DOI":"10.1007\/978-94-007-4056-3"},{"key":"CIT0026","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2011.304"},{"issue":"1","key":"CIT0027","first-page":"1","volume":"1","author":"F\u00e4rber M.","year":"2015","journal-title":"Semantic Web Journal"},{"key":"CIT0028","doi-asserted-by":"publisher","DOI":"10.1109\/MS.2011.122"},{"key":"CIT0029","doi-asserted-by":"publisher","DOI":"10.1007\/b105519"},{"key":"CIT0030","doi-asserted-by":"publisher","DOI":"10.1145\/1998076.1998079"},{"key":"CIT0031","doi-asserted-by":"publisher","DOI":"10.1109\/DAS.2008.30"},{"key":"CIT0032","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2009.143"},{"key":"CIT0033","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24309-2_28"},{"key":"CIT0034","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40814-4_11"},{"key":"CIT0035","unstructured":"Hahm, Y., Park, J., Lim, K., Kim, Y., Hwang, D. & Choi, K. S. (2014). Named entity corpus construction using wikipedia and DBpedia ontology. In N. Calzolari, K. Choukri, T. Declerck, H. Loftsson, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, & S. Piperidis (Eds.), Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014) (pp. 2565\u20132569). European Language Resources Association."},{"key":"CIT0036","doi-asserted-by":"publisher","DOI":"10.1080\/02702710590962550"},{"key":"CIT0037","unstructured":"Han, X. & Sun, L. (2011). A generative entity-mention model for linking entities with knowledge base. In D. Lin, Y. Matsumoto, & R. Mihalcea (Eds.), Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (Vol. 1, pp. 945\u2013954). Association for Computational Linguistics."},{"key":"CIT0038","doi-asserted-by":"publisher","DOI":"10.1145\/1600193.1600206"},{"key":"CIT0039","doi-asserted-by":"publisher","DOI":"10.1145\/2396761.2396832"},{"key":"CIT0040","unstructured":"Hollingsworth, B., Lewin, I. & Tidhar, D. (2005). Retrieving hierarchical text structure from typeset scientific articles: A prerequisite for e-science text mining. In S. Cox & D. W. Walker (Eds.), Proceedings of the 4th UK E-science All Hands Meeting (pp. 267\u2013273). EPSRC."},{"key":"CIT0041","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-25007-6_26"},{"volume-title":"Universal artificial intelligence sequential decisions based on algorithmic probability","year":"2010","author":"Hutter M.","key":"CIT0042"},{"key":"CIT0043","doi-asserted-by":"publisher","DOI":"10.1145\/582415.582418"},{"key":"CIT0044","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-23502-3"},{"key":"CIT0045","doi-asserted-by":"publisher","DOI":"10.1045\/july2012-kern"},{"key":"CIT0046","doi-asserted-by":"publisher","DOI":"10.1045\/september2013-kern"},{"key":"CIT0047","doi-asserted-by":"crossref","unstructured":"Kobilarov, G., Scott, T., Raimond, Y., Oliver, S., Sizemore, C., Smethurst, M., Bizer, C. & Lee, R. (2009). Media meets semantic web: How the BBC uses DBpedia and linked data to make connections. In L. Aroyo, P. Traverso, F. Ciravegna, P. Cimiano, T. Heath, E. Hyv\u00f6nen, R. Mizoguchi, E. Oren, M. Sabou, & E. Simperl (Eds.), Proceedings of the 6th European Semantic Web Conference on the Semantic Web: Research and Applications (pp. 723\u2013737). Springer.","DOI":"10.1007\/978-3-642-02121-3_53"},{"key":"CIT0048","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2013.36"},{"key":"CIT0049","doi-asserted-by":"crossref","unstructured":"Larra\u00f1aga, M., Rueda, U., Elorriaga, J. A. & Arruarte, A. (2004). Acquisition of the domain structure from document indexes using heuristic reasoning. In J. C. Lester, R. M. Vicari, & F. Paragua\u00e7u (Eds.), Intelligent tutoring systems (pp. 175\u2013186). Springer Berlin Heidelberg.","DOI":"10.1007\/978-3-540-30139-4_17"},{"key":"CIT0050","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2011.285"},{"key":"CIT0051","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-73165-8_5"},{"key":"CIT0052","doi-asserted-by":"publisher","DOI":"10.1007\/s13173-010-0020-4"},{"key":"CIT0053","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-17656-2"},{"volume-title":"An introduction to information retrieval","year":"2009","author":"Manning C. D.","key":"CIT0054"},{"key":"CIT0055","doi-asserted-by":"publisher","DOI":"10.1145\/2506182.2506185"},{"key":"CIT0056","doi-asserted-by":"publisher","DOI":"10.1145\/1860559.1860576"},{"key":"CIT0057","unstructured":"Medelyan, O., Witten, I. H. & Milne, D. (2008). Topic indexing with Wikipedia. In R. Bunescu, E. Gabrilovich, & R. Mihalcea (Eds.), Proceedings of the AAAI Wikiai Workshop (Vol. 1, pp. 19\u201324). Association for the Advancement of Artificial Intelligence."},{"key":"CIT0058","doi-asserted-by":"publisher","DOI":"10.1145\/2063518.2063519"},{"key":"CIT0059","doi-asserted-by":"publisher","DOI":"10.1145\/1458082.1458150"},{"key":"CIT0060","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-13911-6_23"},{"key":"CIT0061","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-16985-4_13"},{"key":"CIT0062","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00179"},{"key":"CIT0063","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2012.07.001"},{"key":"CIT0064","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2009.12"},{"key":"CIT0065","doi-asserted-by":"publisher","DOI":"10.1186\/1751-0473-7-7"},{"key":"CIT0066","doi-asserted-by":"crossref","unstructured":"Ramanathan, C., Jayabal, Y. & Sheth, M. J. (2012). Challenges in generating bookmarks from TOC entries in e-books. In Proceedings of the 2012 ACM Symposium on Document Engineering (p. 37). ACM.","DOI":"10.1145\/2361354.2361363"},{"key":"CIT0067","doi-asserted-by":"publisher","DOI":"10.1136\/adc.88.5.408"},{"key":"CIT0068","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-91464-0_43"},{"key":"CIT0069","unstructured":"Ruiz Fabo, P., Berm\u00fadez Sabel, H., Mart\u00ednez Cant\u00f3n, C. I., Gonz\u00e1lez-Blanco Garc\u00eda, E. & Navarro Colorado, B. (2018). The diachronic Spanish sonnet corpus (DISCO): TEI and linked open data encoding, data distribution and metrical findings. In J. Gir\u00f3n Palau & I. Galina Russell (Eds.), Digital humanities 2018, book of abstracts. Red de Humanidades Digitales A. C."},{"key":"CIT0070","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90021-0"},{"key":"CIT0071","doi-asserted-by":"publisher","DOI":"10.1145\/361219.361220"},{"key":"CIT0072","doi-asserted-by":"publisher","DOI":"10.1002\/9781119047063"},{"key":"CIT0073","doi-asserted-by":"crossref","unstructured":"Shao, M. & Futrelle, R. P. (2005). Recognition and classification of figures in PDF documents. In W. Liu & J. Llad\u00f3s (Eds.), International Workshop on Graphics Recognition (pp. 231\u2013242). Springer.","DOI":"10.1007\/11767978_21"},{"key":"CIT0074","unstructured":"Slabbekoorn, K., Hollink, L. & Houben, G. J. (2012, November). Domain-aware ontology matching. In P. Cudr\u00e9-Mauroux, J. Heflin, E. Sirin, T. Tudorache, J. Euzenat, M. Hauswirth, J. Xavier Parreira, J. Hendler, G. Schreiber, A. Bernstein, & E. Blomqvist (Eds.), The Semantic Web-ISWC 2012 (pp. 542\u2013558). Springer."},{"key":"CIT0075","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33263-0_38"},{"key":"CIT0076","doi-asserted-by":"publisher","DOI":"10.17265\/2159-5313\/2016.09.003"},{"key":"CIT0077","unstructured":"Tittel, S., Berm\u00fadez-Sabel, H. & Chiarcos, C. (2018). Using RDFa to link text and dictionary data for medieval French. In N. Calzolari, K. Choukri, C. Cieri, T. Declerck, S. Goggi, K. Hasida, H. Isahara, B. Maegaard, J. Mariani, H. Mazo, A. Moreno, J. Odijk, S. Piperidis, & T. Tokunaga (Eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (pp. 7\u201312). European Language Resources Association."},{"key":"CIT0078","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-015-0249-8"},{"key":"CIT0079","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2013.151"},{"key":"CIT0080","doi-asserted-by":"publisher","DOI":"10.1109\/TBDATA.2016.2546302"},{"key":"CIT0081","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2015.7333927"},{"key":"CIT0082","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-70936-9"},{"key":"CIT0083","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-11964-9_29"},{"key":"CIT0084","doi-asserted-by":"publisher","DOI":"10.1145\/2629489"},{"volume-title":"Probability & statistics for engineers & scientists","year":"2012","author":"Walpole R. E.","key":"CIT0085"},{"key":"CIT0086","doi-asserted-by":"publisher","DOI":"10.1145\/2682571.2797062"},{"key":"CIT0087","unstructured":"Whitington, J. (2011). PDF explained. In (chap. I Introduction). O'Reilly Media."},{"key":"CIT0088","doi-asserted-by":"publisher","DOI":"10.1145\/2494266.2494282"},{"key":"CIT0089","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2013.244"},{"key":"CIT0090","doi-asserted-by":"publisher","DOI":"10.1609\/aimag.v36i3.2601"}],"container-title":["New Review of Hypermedia and Multimedia"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.tandfonline.com\/doi\/pdf\/10.1080\/13614568.2021.1889692","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,11,11]],"date-time":"2021-11-11T20:06:05Z","timestamp":1636661165000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.tandfonline.com\/doi\/full\/10.1080\/13614568.2021.1889692"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,28]]},"references-count":90,"journal-issue":{"issue":"1-2","published-print":{"date-parts":[[2021,4,3]]}},"alternative-id":["10.1080\/13614568.2021.1889692"],"URL":"https:\/\/doi.org\/10.1080\/13614568.2021.1889692","relation":{},"ISSN":["1361-4568","1740-7842"],"issn-type":[{"type":"print","value":"1361-4568"},{"type":"electronic","value":"1740-7842"}],"subject":[],"published":{"date-parts":[[2021,2,28]]},"assertion":[{"value":"The publishing and review policy for this title is described in its Aims & Scope.","order":1,"name":"peerreview_statement","label":"Peer Review Statement"},{"value":"http:\/\/www.tandfonline.com\/action\/journalInformation?show=aimsScope&journalCode=tham20","URL":"http:\/\/www.tandfonline.com\/action\/journalInformation?show=aimsScope&journalCode=tham20","order":2,"name":"aims_and_scope_url","label":"Aim & Scope"},{"value":"2020-06-05","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-02-09","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-02-28","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}