{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:17:38Z","timestamp":1750306658375,"version":"3.41.0"},"reference-count":32,"publisher":"Association for Computing Machinery (ACM)","issue":"Summer","license":[{"start":{"date-parts":[[2014,7,1]],"date-time":"2014-07-01T00:00:00Z","timestamp":1404172800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGWEB Newsl."],"published-print":{"date-parts":[[2014,7]]},"abstract":"<jats:p>Information Extraction (IE) is the technique for transforming textual data into structured representation that can be understood by machines. It is a crucial technique in enabling the Semantic Web, where increasing interest has been seen in recent years. This article reports recent progress in the LODIE project - Linked Open Data for Information Extraction, aimed at advancing Web IE to a new frontier by exploiting largely available, semantically annotated, Linked Open Data as background knowledge. We cover topics of wrapper induction, IE from semi-structured content such as tables and lists, and IE from free-text. We describe new challenges in the research and methods proposed to address them, together with summaries of recent evaluations showing encouraging results.<\/jats:p>","DOI":"10.1145\/2641730.2641735","type":"journal-article","created":{"date-parts":[[2014,7,29]],"date-time":"2014-07-29T12:28:17Z","timestamp":1406636897000},"page":"1-9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["\"Linked data as background knowledge for information extraction on the web\" by Ziqi Zhang, Anna Lisa Gentile and Isabelle Augenstein with Martin Vesely as coordinator"],"prefix":"10.1145","volume":"2014","author":[{"given":"Ziqi","family":"Zhang","sequence":"first","affiliation":[{"name":"University of Sheffield"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anna Lisa","family":"Gentile","sequence":"additional","affiliation":[{"name":"University of Sheffield"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Isabelle","family":"Augenstein","sequence":"additional","affiliation":[{"name":"University of Sheffield"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2014,7]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/375663.375774"},{"key":"e_1_2_1_2_1","volume-title":"Exploiting Linked Data for Web-Scale Relation Extraction. Submitted to ISWC Research Track","author":"Augenstein I.","year":"2014","unstructured":"Augenstein , I. 2014a. Exploiting Linked Data for Web-Scale Relation Extraction. Submitted to ISWC Research Track 2014 . http:\/\/staffwww.dcs.shef.ac.uk\/people\/I.Augenstein\/ISWC2014-Augenstein.pdf. Augenstein, I. 2014a. Exploiting Linked Data for Web-Scale Relation Extraction. Submitted to ISWC Research Track 2014. http:\/\/staffwww.dcs.shef.ac.uk\/people\/I.Augenstein\/ISWC2014-Augenstein.pdf."},{"key":"e_1_2_1_3_1","volume-title":"Seed Selection for Self-Supervised Web-Based Relation Extraction. Submitted to SWAIE","author":"Augenstein I.","year":"2014","unstructured":"Augenstein , I. 2014b. Seed Selection for Self-Supervised Web-Based Relation Extraction. Submitted to SWAIE 2014 . http:\/\/staffwww.dcs.shef.ac.uk\/people\/I.Augenstein\/SWAIE2014-Augenstein.pdf. Augenstein, I. 2014b. Seed Selection for Self-Supervised Web-Based Relation Extraction. Submitted to SWAIE 2014. http:\/\/staffwww.dcs.shef.ac.uk\/people\/I.Augenstein\/SWAIE2014-Augenstein.pdf."},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the Nineteenth Text REtrieval Conference (TREC","author":"Balog K.","year":"2010","unstructured":"Balog , K. and Serdyukov , P . 2011. Overview of the TREC 2010 Entity Track . In Proceedings of the Nineteenth Text REtrieval Conference (TREC 2010 ). NIST. Balog, K. and Serdyukov, P. 2011. Overview of the TREC 2010 Entity Track. In Proceedings of the Nineteenth Text REtrieval Conference (TREC 2010). NIST."},{"volume-title":"Proceedings of the Workshop on Ontology and Semantic Web Patterns (4th edition) - WOP2013","author":"Blomqvist E.","key":"e_1_2_1_5_1","unstructured":"Blomqvist , E. , Zhang , Z. , Gentile , A. L. , Augenstein , I. , and Ciravegna , F . 2013. Statistical knowledge patterns for characterizing linked data . In Proceedings of the Workshop on Ontology and Semantic Web Patterns (4th edition) - WOP2013 . Lecture Notes in Computer Science. Springer. Blomqvist, E., Zhang, Z., Gentile, A. L., Augenstein, I., and Ciravegna, F. 2013. Statistical knowledge patterns for characterizing linked data. In Proceedings of the Workshop on Ontology and Semantic Web Patterns (4th edition) - WOP2013. Lecture Notes in Computer Science. Springer."},{"volume-title":"Proceedings of the Conference on Artificial Intelligence (AAAI). AAAI Press, 1306--1313","author":"Carlson A.","key":"e_1_2_1_6_1","unstructured":"Carlson , A. , Betteridge , J. , Kisiel , B. , Settles , B., Jr., E. H. , and Mitchell , T . 2010. Toward an architecture for never-ending language learning . In Proceedings of the Conference on Artificial Intelligence (AAAI). AAAI Press, 1306--1313 . Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Jr., E. H., and Mitchell, T. 2010. Toward an architecture for never-ending language learning. In Proceedings of the Conference on Artificial Intelligence (AAAI). AAAI Press, 1306--1313."},{"key":"e_1_2_1_7_1","volume-title":"Eds. CEUR Workshop Proceedings","volume":"925","author":"Ciravegna F.","unstructured":"Ciravegna , F. , Gentile , A. L. , and Zhang , Z . 2012. Lodie: Linked open data for web-scale information extraction. In SWAIE, D. Maynard, M. van Erp, and B. Davis , Eds. CEUR Workshop Proceedings , vol. 925 . CEUR-WS.org, 11--22. Ciravegna, F., Gentile, A. L., and Zhang, Z. 2012. Lodie: Linked open data for web-scale information extraction. In SWAIE, D. Maynard, M. van Erp, and B. Davis, Eds. CEUR Workshop Proceedings, vol. 925. CEUR-WS.org, 11--22."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1409360.1409378"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/988672.988687"},{"volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. EMNLP '11","author":"Freedman M.","key":"e_1_2_1_10_1","unstructured":"Freedman , M. , Ramshaw , L. , Boschee , E. , Gabbard , R. , Kratkiewicz , G. , Ward , N. , and Weischedel , R . 2011. Extreme extraction: Machine reading in a week . In Proceedings of the Conference on Empirical Methods in Natural Language Processing. EMNLP '11 . Association for Computational Linguistics, Stroudsburg, PA, USA, 1437--1446. Freedman, M., Ramshaw, L., Boschee, E., Gabbard, R., Kratkiewicz, G., Ward, N., and Weischedel, R. 2011. Extreme extraction: Machine reading in a week. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. EMNLP '11. Association for Computational Linguistics, Stroudsburg, PA, USA, 1437--1446."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2013.08.003"},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Gangemi A. and Presutti V. 2010. Towards a pattern science for the semantic web. Semant. web 1 1 2 (Apr.) 61--68. Gangemi A. and Presutti V. 2010. Towards a pattern science for the semantic web. Semant. web 1 1 2 (Apr.) 61--68.","DOI":"10.3233\/SW-2010-0020"},{"volume-title":"AAAI Fall Symposium Series. AAAI, 24--27","author":"Gentile A.","key":"e_1_2_1_13_1","unstructured":"Gentile , A. , Zhang , Z. , and Ciravegna , F . 2013. Web scale information extraction with lodie . In AAAI Fall Symposium Series. AAAI, 24--27 . Gentile, A., Zhang, Z., and Ciravegna, F. 2013. Web scale information extraction with lodie. In AAAI Fall Symposium Series. AAAI, 24--27."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2479832.2479845"},{"volume-title":"Self Training Wrapper Induction with Linked Data. In 17th International Conference on Text, Speech and Dialogue. Springer, To appear.","author":"Gentile A. L.","key":"e_1_2_1_15_1","unstructured":"Gentile , A. L. , Zhang , Z. , and Fabio , C . 2014 . Self Training Wrapper Induction with Linked Data. In 17th International Conference on Text, Speech and Dialogue. Springer, To appear. Gentile, A. L., Zhang, Z., and Fabio, C. 2014. Self Training Wrapper Induction with Linked Data. In 17th International Conference on Text, Speech and Dialogue. Springer, To appear."},{"volume-title":"ESWC2010","author":"Halpin H.","key":"e_1_2_1_16_1","unstructured":"Halpin , H. and Hayes , P. J . 2010. When owl:sameAs isnt the same: An analysis of identity links on the semantic web . In ESWC2010 . Halpin, H. and Hayes, P. J. 2010. When owl:sameAs isnt the same: An analysis of identity links on the semantic web. In ESWC2010."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1921005"},{"key":"e_1_2_1_19_1","unstructured":"Lu C. Bing L. Lam W. Chan K. and Gu Y. 2013. Web entity detection for semi-structured text data records with unlabeled data. International Journal of Computational Linguistics and Applications To appear. Lu C. Bing L. Lam W. Chan K. and Gu Y. 2013. Web entity detection for semi-structured text data records with unlabeled data. International Journal of Computational Linguistics and Applications To appear ."},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP","volume":"2","author":"Mintz M.","unstructured":"Mintz , M. , Bills , S. , Snow , R. , and Jurafsky , D . 2009. Distant supervision for relation extraction without labeled data . In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP : Volume 2-Volume 2 . Association for Computational Linguistics, 1003--1011. Mintz, M., Bills, S., Snow, R., and Jurafsky, D. 2009. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2-Volume 2. Association for Computational Linguistics, 1003--1011."},{"key":"e_1_2_1_21_1","unstructured":"Mulwad V. Finin T. and Joshi A. 2013. Semantic message passing for generating linked data from tables. In International Semantic Web Conference (1) H. Alani L. Kagal A. Fokoue P. T. Groth C. Biemann J. X. Parreira L. Aroyo N. F. Noy C. Welty and K. Janowicz Eds. Lecture Notes in Computer Science vol. 8218. Springer 363--378. Mulwad V. Finin T. and Joshi A. 2013. Semantic message passing for generating linked data from tables. In International Semantic Web Conference (1) H. Alani L. Kagal A. Fokoue P. T. Groth C. Biemann J. X. Parreira L. Aroyo N. F. Noy C. Welty and K. Janowicz Eds. Lecture Notes in Computer Science vol. 8218. Springer 363--378."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1075\/li.30.1.01for"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1935826.1935869"},{"volume-title":"Proc. of the 10th international conference on The semantic web -","author":"Nuzzolese A. G.","key":"e_1_2_1_24_1","unstructured":"Nuzzolese , A. G. , Gangemi , A. , Presutti , V. , and Ciancarini , P . 2011. Encyclopedic knowledge patterns from wikipedia links . In Proc. of the 10th international conference on The semantic web - Volume Part I . ISWC'11. Springer-Verlag, Berlin, Heidelberg, 520--536. Nuzzolese, A. G., Gangemi, A., Presutti, V., and Ciancarini, P. 2011. Encyclopedic knowledge patterns from wikipedia links. In Proc. of the 10th international conference on The semantic web - Volume Part I. ISWC'11. Springer-Verlag, Berlin, Heidelberg, 520--536."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/1889788.1889799"},{"key":"e_1_2_1_26_1","doi-asserted-by":"crossref","unstructured":"Roth B. and Klakow D. 2013. Combining Generative and Discriminative Model Scores for Distant Supervision. In EMNLP. ACL 24--29. Roth B. and Klakow D. 2013. Combining Generative and Discriminative Model Scores for Distant Supervision. In EMNLP . ACL 24--29.","DOI":"10.18653\/v1\/D13-1003"},{"key":"e_1_2_1_27_1","unstructured":"Surdeanu M. Tibshirani J. Nallapati R. and Manning C. D. 2012. Multi-instance Multi-label Learning for Relation Extraction. In EMNLP-CoNLL. ACL 455--465. Surdeanu M. Tibshirani J. Nallapati R. and Manning C. D. 2012. Multi-instance Multi-label Learning for Relation Extraction. In EMNLP-CoNLL . ACL 455--465."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2505515.2505559"},{"key":"e_1_2_1_29_1","unstructured":"Zhang Z. 2014. Start small build complete: Effective and efficient semantic table interpretation using tableminer. In Under transparent review: The Semantic Web Journal. http:\/\/www.semantic-web-journal.net\/content\/start-small-build-complete-effective-and-efficient-semantic-table-interpretation-using. Zhang Z. 2014. Start small build complete: Effective and efficient semantic table interpretation using tableminer. In Under transparent review: The Semantic Web Journal . http:\/\/www.semantic-web-journal.net\/content\/start-small-build-complete-effective-and-efficient-semantic-table-interpretation-using."},{"volume-title":"Proc. of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics","author":"Zhang Z.","key":"e_1_2_1_30_1","unstructured":"Zhang , Z. , Gentile , A. L. , Augenstein , I. , Blomqvist , E. , and Ciravegna , F . 2013. Mining equivalent relations from linked data . In Proc. of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics , Sofia, Bulgaria, 289--293. Zhang, Z., Gentile, A. L., Augenstein, I., Blomqvist, E., and Ciravegna, F. 2013. Mining equivalent relations from linked data. In Proc. of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, Sofia, Bulgaria, 289--293."},{"key":"e_1_2_1_31_1","unstructured":"Zhang Z. Gentile A. L. Blomqvist E. Augenstein I. and Ciravegna F. 2013. Statistical knowledge patterns: Identifying synonymous relations in large linked datasets. In International Semantic Web Conference (1) H. Alani L. Kagal A. Fokoue P. T. Groth C. Biemann J. X. Parreira L. Aroyo N. F. Noy C. Welty and K. Janowicz Eds. Lecture Notes in Computer Science vol. 8218. Springer 703--719. Zhang Z. Gentile A. L. Blomqvist E. Augenstein I. and Ciravegna F. 2013. Statistical knowledge patterns: Identifying synonymous relations in large linked datasets. In International Semantic Web Conference (1) H. Alani L. Kagal A. Fokoue P. T. Groth C. Biemann J. X. Parreira L. Aroyo N. F. Noy C. Welty and K. Janowicz Eds. Lecture Notes in Computer Science vol. 8218. Springer 703--719."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.3115\/1219840.1219893"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1526709.1526724"}],"container-title":["ACM SIGWEB Newsletter"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2641730.2641735","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2641730.2641735","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T06:56:19Z","timestamp":1750229779000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2641730.2641735"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,7]]},"references-count":32,"journal-issue":{"issue":"Summer","published-print":{"date-parts":[[2014,7]]}},"alternative-id":["10.1145\/2641730.2641735"],"URL":"https:\/\/doi.org\/10.1145\/2641730.2641735","relation":{},"ISSN":["1931-1745","1931-1435"],"issn-type":[{"type":"print","value":"1931-1745"},{"type":"electronic","value":"1931-1435"}],"subject":[],"published":{"date-parts":[[2014,7]]},"assertion":[{"value":"2014-07-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}