{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T08:23:50Z","timestamp":1765268630591,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":34,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,5,8]],"date-time":"2019-05-08T00:00:00Z","timestamp":1557273600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,5,8]]},"DOI":"10.1145\/3322905.3322909","type":"proceedings-article","created":{"date-parts":[[2019,10,23]],"date-time":"2019-10-23T15:44:57Z","timestamp":1571845497000},"page":"117-122","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Curation Technologies for Cultural Heritage Archives"],"prefix":"10.1145","author":[{"given":"Georg","family":"Rehm","sequence":"first","affiliation":[{"name":"DFKI GmbH, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Martin","family":"Lee","sequence":"additional","affiliation":[{"name":"Freie Universit\u00e4t Berlin, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Juli\u00e1n","family":"Moreno-Schneider","sequence":"additional","affiliation":[{"name":"DFKI GmbH, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peter","family":"Bourgonje","sequence":"additional","affiliation":[{"name":"DFKI GmbH, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2019,5,8]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Apache. 2010. OpenNLP. http:\/\/opennlp.apache.org  Apache. 2010. OpenNLP. http:\/\/opennlp.apache.org"},{"key":"e_1_3_2_1_2_1","volume-title":"Workshop Report Non-Latin Scripts in Multilingual Environments: Research Data and Digital Humanities in Area Studies. https:\/\/blogs.fu-berlin.de\/bibliotheken\/2019\/01\/18\/workshop-nls2018\/.","author":"Asef Esther","year":"2019","unstructured":"Esther Asef , Cosima Wagner , and Martin Lee . 2019 . Workshop Report Non-Latin Scripts in Multilingual Environments: Research Data and Digital Humanities in Area Studies. https:\/\/blogs.fu-berlin.de\/bibliotheken\/2019\/01\/18\/workshop-nls2018\/. Esther Asef, Cosima Wagner, and Martin Lee. 2019. Workshop Report Non-Latin Scripts in Multilingual Environments: Research Data and Digital Humanities in Area Studies. https:\/\/blogs.fu-berlin.de\/bibliotheken\/2019\/01\/18\/workshop-nls2018\/."},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of the Language Resources and Evaluation Conference 2012 (LREC 2012","author":"Bank Mathias","year":"2012","unstructured":"Mathias Bank and Martin Schierle . 2012 . A Survey of Text Mining Architectures and the UIMA Standard .. In Proceedings of the Language Resources and Evaluation Conference 2012 (LREC 2012 ), Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Ugur Dogan, Bente Maegaard, Joseph Mariani, Jan Odijk, and Stelios Piperidis (Eds.). European Language Resources Association (ELRA), 3479--3486. http:\/\/dblp.uni-trier.de\/db\/conf\/lrec\/lrec 2012.html#BankS12 Mathias Bank and Martin Schierle. 2012. A Survey of Text Mining Architectures and the UIMA Standard.. In Proceedings of the Language Resources and Evaluation Conference 2012 (LREC 2012), Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Ugur Dogan, Bente Maegaard, Joseph Mariani, Jan Odijk, and Stelios Piperidis (Eds.). European Language Resources Association (ELRA), 3479--3486. http:\/\/dblp.uni-trier.de\/db\/conf\/lrec\/lrec2012.html#BankS12"},{"key":"e_1_3_2_1_4_1","volume-title":"The Semantic Web (Lecture Notes in Computer Science), Harald Sack, Giuseppe Rizzo, Nadine Steinmetz, Dunja Mladenia, S\u00f6ren Auer","author":"Bourgonje Peter","year":"2016","unstructured":"Peter Bourgonje , Julian Moreno-Schneider , Jan Nehring , Georg Rehm , Felix Sasaki , and Ankit Srivastava . 2016. Towards a Platform for Curation Technologies: Enriching Text Collections with a Semantic-Web Layer . In The Semantic Web (Lecture Notes in Computer Science), Harald Sack, Giuseppe Rizzo, Nadine Steinmetz, Dunja Mladenia, S\u00f6ren Auer , and Christoph Lange (Eds.). Springer , 65--68. ESWC 2016 Satellite Events. Heraklion, Crete, Greece, 2016 Revised Selected Papers. Peter Bourgonje, Julian Moreno-Schneider, Jan Nehring, Georg Rehm, Felix Sasaki, and Ankit Srivastava. 2016. Towards a Platform for Curation Technologies: Enriching Text Collections with a Semantic-Web Layer. In The Semantic Web (Lecture Notes in Computer Science), Harald Sack, Giuseppe Rizzo, Nadine Steinmetz, Dunja Mladenia, S\u00f6ren Auer, and Christoph Lange (Eds.). Springer, 65--68. ESWC 2016 Satellite Events. Heraklion, Crete, Greece, 2016 Revised Selected Papers."},{"key":"e_1_3_2_1_5_1","volume-title":"Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014","author":"Cassidy Steve","year":"2014","unstructured":"Steve Cassidy , Dominique Estival , Timothy Jones , Denis Burnham , and Jared Burghold . 2014 . The Alveo Virtual Laboratory: A Web Based Repository API . In Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014 , Reykjavik, Iceland , May 26-31, 2014., Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asunci\u00f3n Moreno, Jan Odijk, and Stelios Piperidis (Eds.). European Language Resources Association (ELRA), 1--7. http:\/\/www.lrec-conf.org\/proceedings\/lrec2014\/summaries\/628.html Steve Cassidy, Dominique Estival, Timothy Jones, Denis Burnham, and Jared Burghold. 2014. The Alveo Virtual Laboratory: A Web Based Repository API. In Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, Reykjavik, Iceland, May 26-31, 2014., Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asunci\u00f3n Moreno, Jan Odijk, and Stelios Piperidis (Eds.). European Language Resources Association (ELRA), 1--7. http:\/\/www.lrec-conf.org\/proceedings\/lrec2014\/summaries\/628.html"},{"key":"e_1_3_2_1_6_1","volume-title":"Espresso: Korean Part of Speech Tagger. https:\/\/doi.org\/10.5281\/zenodo.884606","author":"Cha Jeong-Won","year":"2017","unstructured":"Jeong-Won Cha , Jeen-Pyo Hong , and Chang-Uk Shin . 2017 . Espresso: Korean Part of Speech Tagger. https:\/\/doi.org\/10.5281\/zenodo.884606 10.5281\/zenodo.884606 Jeong-Won Cha, Jeen-Pyo Hong, and Chang-Uk Shin. 2017. Espresso: Korean Part of Speech Tagger. https:\/\/doi.org\/10.5281\/zenodo.884606"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118935.1118956"},{"volume-title":"WebLicht: Web-Based Linguistic Chaining Tool. Online. Date Accessed","year":"2019","key":"e_1_3_2_1_8_1","unstructured":"CLARIN-D\/SfS-Uni. T\u00fcbingen. 2012. WebLicht: Web-Based Linguistic Chaining Tool. Online. Date Accessed : 18 Jan 2019 . URL https:\/\/weblicht.sfs.unituebingen.de\/. CLARIN-D\/SfS-Uni. T\u00fcbingen. 2012. WebLicht: Web-Based Linguistic Chaining Tool. Online. Date Accessed: 18 Jan 2019. URL https:\/\/weblicht.sfs.unituebingen.de\/."},{"key":"e_1_3_2_1_9_1","series-title":"Version 6","volume-title":"Text Processing with GATE","author":"Cunningham Hamish","unstructured":"Hamish Cunningham , Diana Maynard , Kalina Bontcheva , Valentin Tablan , Niraj Aswani , Ian Roberts , Genevieve Gorrell , Adam Funk , Angus Roberts , Danica Damljanovic , Thomas Heitz , Mark A. Greenwood , Horacio Saggion , Johann Petrak , Yaoyong Li , and Wim Peters . 2011. Text Processing with GATE ( Version 6 ). http:\/\/tinyurl.com\/gatebook Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan, Niraj Aswani, Ian Roberts, Genevieve Gorrell, Adam Funk, Angus Roberts, Danica Damljanovic, Thomas Heitz, Mark A. Greenwood, Horacio Saggion, Johann Petrak, Yaoyong Li, and Wim Peters. 2011. Text Processing with GATE (Version 6). http:\/\/tinyurl.com\/gatebook"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2015.06.003"},{"key":"e_1_3_2_1_11_1","volume-title":"Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH) at COLING","author":"de Castilho Richard Eckart","year":"2016","unstructured":"Richard Eckart de Castilho , \u00c9va M\u00fajdricza-Maydt , Seid Muhie Yimam , Silvana Hartmann , Iryna Gurevych , Anette Frank , and Chris Biemann . 2016 . A Web-based Tool for the Integrated Annotation of Semantic and Syntactic Structures . In Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH) at COLING 2016. 76--84. http:\/\/tubiblio.ulb.tu-darmstadt.de\/97939\/ Richard Eckart de Castilho, \u00c9va M\u00fajdricza-Maydt, Seid Muhie Yimam, Silvana Hartmann, Iryna Gurevych, Anette Frank, and Chris Biemann. 2016. A Web-based Tool for the Integrated Annotation of Semantic and Syntactic Structures. In Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH) at COLING 2016. 76--84. http:\/\/tubiblio.ulb.tu-darmstadt.de\/97939\/"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324904003523"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-41338-4_7"},{"key":"e_1_3_2_1_15_1","first-page":"10","volume-title":"Proceedings of the ACL 2010 System Demonstrations. 25--29","author":"Hinrichs Erhard","year":"2010","unstructured":"Erhard Hinrichs , Marie Hinrichs , and Thomas Zastrow . 2010 . WebLicht: Web-Based LRT Services for German . In Proceedings of the ACL 2010 System Demonstrations. 25--29 . http:\/\/www.aclweb.org\/anthology\/P 10 - 4005 Erhard Hinrichs, Marie Hinrichs, and Thomas Zastrow. 2010. WebLicht: Web-Based LRT Services for German. In Proceedings of the ACL 2010 System Demonstrations. 25--29. http:\/\/www.aclweb.org\/anthology\/P10-4005"},{"key":"e_1_3_2_1_16_1","volume-title":"Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014)","author":"Hinrichs Erhard","year":"2014","unstructured":"Erhard Hinrichs and Steven Krauwer . 2014 . The CLARIN Research Infrastructure: Resources and Tools for e-Humanities Scholars . Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014) (May 2014), 1525--1531. http:\/\/dspace.library.uu.nl\/handle\/1874\/307981 Erhard Hinrichs and Steven Krauwer. 2014. The CLARIN Research Infrastructure: Resources and Tools for e-Humanities Scholars. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014) (May 2014), 1525--1531. http:\/\/dspace.library.uu.nl\/handle\/1874\/307981"},{"key":"e_1_3_2_1_17_1","volume-title":"The Language Application Grid. In Revised Selected Papers of the Second International Workshop on Worldwide Language Service Infrastructure -","volume":"9442","author":"Ide Nancy","year":"2016","unstructured":"Nancy Ide , James Pustejovsky , Christopher Cieri , Eric Nyberg , Denise Dipersio , Chunqi Shi , Keith Suderman , Marc Verhagen , Di Wang , and Jonathan Wright . 2016 . The Language Application Grid. In Revised Selected Papers of the Second International Workshop on Worldwide Language Service Infrastructure - Volume 9442 (WLSI 2015). Springer-Verlag New York, Inc., New York, NY, USA, 51--70. https:\/\/doi.org\/10.1007\/978-3-319-31468-6_4 10.1007\/978-3-319-31468-6_4 Nancy Ide, James Pustejovsky, Christopher Cieri, Eric Nyberg, Denise Dipersio, Chunqi Shi, Keith Suderman, Marc Verhagen, Di Wang, and Jonathan Wright. 2016. The Language Application Grid. In Revised Selected Papers of the Second International Workshop on Worldwide Language Service Infrastructure - Volume 9442 (WLSI 2015). Springer-Verlag New York, Inc., New York, NY, USA, 51--70. https:\/\/doi.org\/10.1007\/978-3-319-31468-6_4"},{"key":"#cr-split#-e_1_3_2_1_18_1.1","unstructured":"Park Jungyeul. 2017. Berkeley parser model for Korean: Sejong treebank Berkeley parser model for Korean: Sejong treebank. https:\/\/doi.org\/10.5281\/zenodo.891267 10.5281\/zenodo.891267"},{"key":"#cr-split#-e_1_3_2_1_18_1.2","unstructured":"Park Jungyeul. 2017. Berkeley parser model for Korean: Sejong treebank Berkeley parser model for Korean: Sejong treebank. https:\/\/doi.org\/10.5281\/zenodo.891267"},{"key":"e_1_3_2_1_19_1","volume-title":"Digitale Kuratierungstechnologien f\u00fcr Bibliotheken. Zeitschrift f\u00fcr Bibliothekskultur 027.7 4, 2 (November","author":"Neudecker Clemens","year":"2016","unstructured":"Clemens Neudecker and Georg Rehm . 2016. Digitale Kuratierungstechnologien f\u00fcr Bibliotheken. Zeitschrift f\u00fcr Bibliothekskultur 027.7 4, 2 (November 2016 ). http:\/\/0277.ch\/ojs\/index.php\/cdrs_0277\/article\/view\/158 Clemens Neudecker and Georg Rehm. 2016. Digitale Kuratierungstechnologien f\u00fcr Bibliotheken. Zeitschrift f\u00fcr Bibliothekskultur 027.7 4, 2 (November 2016). http:\/\/0277.ch\/ojs\/index.php\/cdrs_0277\/article\/view\/158"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2012.03.006"},{"key":"#cr-split#-e_1_3_2_1_21_1.1","unstructured":"Jungyeul Park. 2017. JHE Korean-English evaluation data. https:\/\/doi.org\/10.5281\/zenodo.891295 10.5281\/zenodo.891295"},{"key":"#cr-split#-e_1_3_2_1_21_1.2","unstructured":"Jungyeul Park. 2017. JHE Korean-English evaluation data. https:\/\/doi.org\/10.5281\/zenodo.891295"},{"key":"#cr-split#-e_1_3_2_1_22_1.1","unstructured":"Jungyeul Park. 2017. MaltParser model for Korean: Sejong treebank. https:\/\/doi.org\/10.5281\/zenodo.891273 10.5281\/zenodo.891273"},{"key":"#cr-split#-e_1_3_2_1_22_1.2","unstructured":"Jungyeul Park. 2017. MaltParser model for Korean: Sejong treebank. https:\/\/doi.org\/10.5281\/zenodo.891273"},{"key":"e_1_3_2_1_23_1","volume-title":"Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)","author":"Mark Greenwood Petr Knoth Antonis Lempesis","year":"2018","unstructured":"Antonis Lempesis Mark Greenwood Petr Knoth Richard Eckart de Castilho Stavros Sachtouris Byron Georgantopoulos Stefania Martziou Lucas Anastasiou Katerina Gkirtzou Natalia Manola Penny Labropoulou , Dimitris Galanis and Stelios Piperidis . 2018 . OpenMinTeD: A Platform Facilitating Text Mining of Scholarly Content . In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (7-12). European Language Resources Association (ELRA), Paris, France. Antonis Lempesis Mark Greenwood Petr Knoth Richard Eckart de Castilho Stavros Sachtouris Byron Georgantopoulos Stefania Martziou Lucas Anastasiou Katerina Gkirtzou Natalia Manola Penny Labropoulou, Dimitris Galanis and Stelios Piperidis. 2018. OpenMinTeD: A Platform Facilitating Text Mining of Scholarly Content. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (7-12). European Language Resources Association (ELRA), Paris, France."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.scico.2016.01.001"},{"key":"e_1_3_2_1_25_1","volume-title":"Special Issue of the Baltic Journal of Modern Computing (Vol. 4, No. 2) - Proceedings of the 19th Annual Conference of the European Association for Machine Translation (EAMT","author":"Rehm Georg","year":"2016","unstructured":"Georg Rehm and Felix Sasaki . 2016. Digital Curation Technologies . In Special Issue of the Baltic Journal of Modern Computing (Vol. 4, No. 2) - Proceedings of the 19th Annual Conference of the European Association for Machine Translation (EAMT 2016 ). Riga , Latvia , 399. Georg Rehm and Felix Sasaki. 2016. Digital Curation Technologies. In Special Issue of the Baltic Journal of Modern Computing (Vol. 4, No. 2) - Proceedings of the 19th Annual Conference of the European Association for Machine Translation (EAMT 2016). Riga, Latvia, 399."},{"key":"e_1_3_2_1_26_1","volume-title":"Peter Bourgonje, Ankit Srivastava, Rolf Fricke, Jan Thomsen, Jing He, Joachim Quantz, Armin Berger, Luca K\u00f6nig, S\u00f6ren R\u00e4uchle, Jens Gerth, and David Wabnitz.","author":"Rehm Georg","year":"2018","unstructured":"Georg Rehm , Juli\u00e1n Moreno Schneider , Peter Bourgonje, Ankit Srivastava, Rolf Fricke, Jan Thomsen, Jing He, Joachim Quantz, Armin Berger, Luca K\u00f6nig, S\u00f6ren R\u00e4uchle, Jens Gerth, and David Wabnitz. 2018 . Different Types of Automated and Semi-Automated Semantic Storytelling: Curation Technologies for Different Sectors. In Language Technologies for the Challenges of the Digital Age: 27th International Conference, GSCL 2017, Berlin, Germany, September 13-14, 2017, Proceedings (Lecture Notes in Artificial Intelligence (LNAI)), Georg Rehm and Thierry Declerck (Eds.). Gesellschaft f\u00fcr Sprachtechnologie und Computerlinguistik e.V., Springer , Cham, Switzerland, 232--247. 13\/14 September 2017. Georg Rehm, Juli\u00e1n Moreno Schneider, Peter Bourgonje, Ankit Srivastava, Rolf Fricke, Jan Thomsen, Jing He, Joachim Quantz, Armin Berger, Luca K\u00f6nig, S\u00f6ren R\u00e4uchle, Jens Gerth, and David Wabnitz. 2018. Different Types of Automated and Semi-Automated Semantic Storytelling: Curation Technologies for Different Sectors. In Language Technologies for the Challenges of the Digital Age: 27th International Conference, GSCL 2017, Berlin, Germany, September 13-14, 2017, Proceedings (Lecture Notes in Artificial Intelligence (LNAI)), Georg Rehm and Thierry Declerck (Eds.). Gesellschaft f\u00fcr Sprachtechnologie und Computerlinguistik e.V., Springer, Cham, Switzerland, 232--247. 13\/14 September 2017."},{"key":"e_1_3_2_1_27_1","volume-title":"Peter Bourgonje, Ankit Srivastava, Jan Nehring, Armin Berger, Luca K\u00f6nig, S\u00f6ren R\u00e4uchle, and Jens Gerth.","author":"Rehm Georg","year":"2017","unstructured":"Georg Rehm , Julian Moreno Schneider , Peter Bourgonje, Ankit Srivastava, Jan Nehring, Armin Berger, Luca K\u00f6nig, S\u00f6ren R\u00e4uchle, and Jens Gerth. 2017 . Event Detection and Semantic Storytelling: Generating a Travelogue from a large Collection of Personal Letters. In Proceedings of the Events and Stories in the News Workshop, Tommaso Caselli, Ben Miller, Marieke van Erp, Piek Vossen, Martha Palmer, Eduard Hovy, and Teruko Mitamura (Eds.). Association for Computational Linguistics , Vancouver, Canada, 42--51. Co-located with ACL 2017. Georg Rehm, Julian Moreno Schneider, Peter Bourgonje, Ankit Srivastava, Jan Nehring, Armin Berger, Luca K\u00f6nig, S\u00f6ren R\u00e4uchle, and Jens Gerth. 2017. Event Detection and Semantic Storytelling: Generating a Travelogue from a large Collection of Personal Letters. In Proceedings of the Events and Stories in the News Workshop, Tommaso Caselli, Ben Miller, Marieke van Erp, Piek Vossen, Martha Palmer, Eduard Hovy, and Teruko Mitamura (Eds.). Association for Computational Linguistics, Vancouver, Canada, 42--51. Co-located with ACL 2017."},{"key":"e_1_3_2_1_28_1","volume-title":"Proceedings of the LREC 2018 Workshop on Language Resources and Technologies for the Legal Knowledge Graph, Georg Rehm, V\u00edctor Rodr\u00edguez-Doncel, and Julian Moreno Schneider (Eds.)","author":"Schneider Julian Moreno","year":"2018","unstructured":"Julian Moreno Schneider and Georg Rehm . 2018 . Towards a Workflow Manager for Curation Technologies in the Legal Domain . In Proceedings of the LREC 2018 Workshop on Language Resources and Technologies for the Legal Knowledge Graph, Georg Rehm, V\u00edctor Rodr\u00edguez-Doncel, and Julian Moreno Schneider (Eds.) . Miyazaki, Japan, 30--35. 12 May 2018. Julian Moreno Schneider and Georg Rehm. 2018. Towards a Workflow Manager for Curation Technologies in the Legal Domain. In Proceedings of the LREC 2018 Workshop on Language Resources and Technologies for the Legal Knowledge Graph, Georg Rehm, V\u00edctor Rodr\u00edguez-Doncel, and Julian Moreno Schneider (Eds.). Miyazaki, Japan, 30--35. 12 May 2018."},{"key":"e_1_3_2_1_29_1","volume-title":"Proc. 9th IEEE Intl. Conf. on Document Analysis and Recognition (ICDAR. 629--633","author":"Ray Smith and Google Inc.","year":"2007","unstructured":"Ray Smith and Google Inc. 2007 . An overview of the Tesseract OCR Engine . In Proc. 9th IEEE Intl. Conf. on Document Analysis and Recognition (ICDAR. 629--633 . Ray Smith and Google Inc. 2007. An overview of the Tesseract OCR Engine. In Proc. 9th IEEE Intl. Conf. on Document Analysis and Recognition (ICDAR. 629--633."},{"volume-title":"HMM-Based Korean Named Entity Recognition for Information Extraction","author":"Yun Bo-Hyun","key":"e_1_3_2_1_30_1","unstructured":"Bo-Hyun Yun . 2007. HMM-Based Korean Named Entity Recognition for Information Extraction . In Knowledge Science, Engineering and Management, Zili Zhang and J\u00f6rg Siekmann (Eds.). Springer Berlin Heidelberg , Berlin, Heidelberg , 526--531. Bo-Hyun Yun. 2007. HMM-Based Korean Named Entity Recognition for Information Extraction. In Knowledge Science, Engineering and Management, Zili Zhang and J\u00f6rg Siekmann (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 526--531."},{"key":"e_1_3_2_1_31_1","volume-title":"Proceedings of Corpus Linguistics. https:\/\/doi.org\/10","author":"Zeldes Amir","year":"2009","unstructured":"Amir Zeldes , Anke L\u00fcdeling , Julia Ritz , and Christian Chiarcos . 2009 . ANNIS: a search tool for multi-layer annotated corpora . In Proceedings of Corpus Linguistics. https:\/\/doi.org\/10 .18452\/13437 10.18452\/13437 Amir Zeldes, Anke L\u00fcdeling, Julia Ritz, and Christian Chiarcos. 2009. ANNIS: a search tool for multi-layer annotated corpora. In Proceedings of Corpus Linguistics. https:\/\/doi.org\/10.18452\/13437"}],"event":{"name":"DATeCH2019: 3rd International Conference on Digital Access to Textual Cultural Heritage","acronym":"DATeCH2019","location":"Brussels Belgium"},"container-title":["Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3322905.3322909","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3322905.3322909","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T01:02:26Z","timestamp":1750208546000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3322905.3322909"}},"subtitle":["Analysing and transforming a heterogeneous data set into an interactive curation workbench"],"short-title":[],"issued":{"date-parts":[[2019,5,8]]},"references-count":34,"alternative-id":["10.1145\/3322905.3322909","10.1145\/3322905"],"URL":"https:\/\/doi.org\/10.1145\/3322905.3322909","relation":{},"subject":[],"published":{"date-parts":[[2019,5,8]]},"assertion":[{"value":"2019-05-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}