{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T21:58:00Z","timestamp":1767909480501,"version":"3.49.0"},"reference-count":127,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2006,7,25]],"date-time":"2006-07-25T00:00:00Z","timestamp":1153785600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2006,7,25]]},"abstract":"<jats:p>The growing availability of online textual sources and the potential number of applications of knowledge acquisition from textual data has lead to an increase in Information Extraction (IE) research. Some examples of these applications are the generation of data bases from documents, as well as the acquisition of knowledge useful for emerging technologies like question answering, information integration, and others related to text mining. However, one of the main drawbacks of the application of IE refers to its intrinsic domain dependence. For the sake of reducing the high cost of manually adapting IE applications to new domains, experiments with different Machine Learning (ML) techniques have been carried out by the research community. This survey describes and compares the main approaches to IE and the different ML techniques used to achieve Adaptive IE technology.<\/jats:p>","DOI":"10.1145\/1132956.1132957","type":"journal-article","created":{"date-parts":[[2006,7,25]],"date-time":"2006-07-25T14:14:26Z","timestamp":1153836866000},"page":"4","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":96,"title":["Adaptive information extraction"],"prefix":"10.1145","volume":"38","author":[{"given":"Jordi","family":"Turmo","sequence":"first","affiliation":[{"name":"TALP Research Center, Universitat Polit\u00e8cnica de Catalunya, Barcelona, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alicia","family":"Ageno","sequence":"additional","affiliation":[{"name":"TALP Research Center, Universitat Polit\u00e8cnica de Catalunya, Barcelona, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Neus","family":"Catal\u00e0","sequence":"additional","affiliation":[{"name":"TALP Research Center, Universitat Polit\u00e8cnica de Catalunya, Barcelona, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2006,7,25]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072017.1072033"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072399.1072413"},{"key":"e_1_2_1_3_1","volume-title":"Principle-Based Parsing: Computation and Psycholinguistics","author":"Abney S."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/336597.336644"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.3115\/992628.992635"},{"key":"e_1_2_1_6_1","volume-title":"Lecture Notes in Artificial Intelligence","volume":"1040","author":"Aone C."},{"key":"e_1_2_1_7_1","volume-title":"Proceedings of the 7th Message Understanding Conference (MUC--7).]]","author":"Aone C."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072017.1072039"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the 4th Message Understanding Conference (MUC--4).]]","author":"Appelt D."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072399.1072412"},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of the 13th International Joint Conference On Artificial Intelligence (IJCAI).]]","author":"Appelt D."},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the AAAI Workshop on Machine Learning for Information Extraction.]]","author":"Aseltine J.","year":"1999"},{"key":"e_1_2_1_13_1","volume-title":"1999. Modern Information Retrieval","author":"Baeza-Yates R."},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the International Conference of Pacific Association for Computational Linguistics (PACLING).]]","author":"Baluja S."},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of the ECAI Workshop on Machine Learning for Information Extraction.]]","author":"Basili R."},{"key":"e_1_2_1_16_1","first-page":"1","article-title":"An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process","volume":"3","author":"Baum L.","year":"1972","journal-title":"Inequalities"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.3115\/974557.974586"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the 6th ACL Workshop on Very Large Corpora.]]","author":"Borthwick A."},{"key":"e_1_2_1_20_1","volume-title":"WebDB Workshop at 6th International Conference on Extending Database Technology, (EDBT'98)","author":"Brin S.","year":"1998"},{"key":"e_1_2_1_22_1","volume-title":"Proceeding of the 4th Conference on Computational Natural Language Learning.]]","author":"Cardie C."},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP\/VLC).]]","author":"Cardie C."},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of 1st International Conference on Language Resources and Evaluation (LREC)","author":"Carroll J."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the 14th European Conference on Artificial Intelligence (ECAI), 411--415","author":"Catal\u00e0 N."},{"key":"e_1_2_1_27_1","volume-title":"Proceedings of the ACL Workshop on Natural Language Learning.]]","author":"Chai J."},{"key":"e_1_2_1_28_1","volume-title":"Proceedings of the 16th AAAI National Conference on Artificial Intelligence (AAAI).]]","author":"Chai J."},{"key":"e_1_2_1_29_1","volume-title":"Proceedings of the ECAI Workshop on Machine Learning for Information Extraction.]]","author":"Chidlovskii B.","year":"2000"},{"key":"e_1_2_1_30_1","volume-title":"Proceedings of the 18th National Conference on Artificial Intelligence (AAAI).]]","author":"Chieu H. L."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.3115\/1075096.1075124"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072399.1072410"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the IJCAI Workshop on Adaptive Text Extraction and Mining.]]","author":"Ciravegna F.","year":"2001"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(99)00102-2"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/511446.511477"},{"key":"e_1_2_1_36_1","volume-title":"Proceedings of the IJCAI Workshop on Adaptive Text Extraction and Mining.]]","author":"Cohen W."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.3115\/976909.979620"},{"key":"e_1_2_1_38_1","volume-title":"First PASCAL Challenges Workshop.]]","author":"Cox C."},{"key":"e_1_2_1_39_1","volume-title":"Proceedings of the AAAI Workshop on Machine Learning for Information Extraction.]]","author":"Craven M.","year":"1999"},{"key":"e_1_2_1_40_1","volume-title":"Proceedings of the 15th AAAI National Conference on Artificial Intelligence (AAAI).]]","author":"Craven M."},{"key":"e_1_2_1_41_1","unstructured":"Eikvil L. 1999. Information extraction from World Wide Web---A survey. Tech. rep. 945 http:\/\/www.nr.no\/documents\/samba\/research_areas\/BAMG\/Publications\/webIE_rep945.ps.]]  Eikvil L. 1999. Information extraction from World Wide Web---A survey. Tech. rep. 945 http:\/\/www.nr.no\/documents\/samba\/research_areas\/BAMG\/Publications\/webIE_rep945.ps.]]"},{"key":"e_1_2_1_42_1","volume-title":"Proceedings of the AAAI Workshop on Adaptive Text Extraction and Mining.]]","author":"Finn A."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072399.1072412"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.3115\/980845.980914"},{"key":"e_1_2_1_46_1","volume-title":"Proceedings of the ECAI Workshop on Machine Learning for Information Extraction.]]","author":"Freitag D."},{"key":"e_1_2_1_47_1","volume-title":"Proceedings of the AAAI Workshop on Machine Learning for Information Extraction.]]","author":"Freitag D.","year":"1999"},{"key":"e_1_2_1_48_1","volume-title":"Proceedings of the 17th AAAI National Conference on Artificial Intelligence (AAAI).]]","author":"Freitag D.","year":"2000"},{"key":"e_1_2_1_49_1","volume-title":"Department of Computer Science","author":"Gaizauskas R."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072399.1072418"},{"key":"e_1_2_1_51_1","volume-title":"Proceedings of the 7th Message Understanding Conference (MUC--7).]]","author":"Garigliano R."},{"key":"e_1_2_1_52_1","first-page":"59","article-title":"MITA: An information extraction approach to analyses of free-form text in life insurance applications","volume":"19","author":"Glasgow B.","year":"1998","journal-title":"Artif. Intell."},{"key":"e_1_2_1_53_1","volume-title":"Proceedings of the AAAI Workshop on Machine Learning for Information Extraction.]]","author":"Glickman O."},{"key":"e_1_2_1_54_1","volume-title":"1998. Cross-Language Information Retrieval","author":"Grefenstette G."},{"key":"e_1_2_1_55_1","volume-title":"Where is the syntax&quest","author":"Grishman R."},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072017.1072036"},{"key":"e_1_2_1_57_1","volume-title":"Proceedings of 3rd International Conference on Language Resources and Evaluation (LREC).]]","author":"Harabagiu S."},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072017.1072029"},{"key":"e_1_2_1_59_1","volume-title":"Proceedings of the 14th AAAI National Conference on Artificial Intelligence (AAAI). 992--999","author":"Holowczak R."},{"key":"e_1_2_1_60_1","volume-title":"Proceedings of the AAAI Workshop on AI and Information Integration.]]","author":"Hsu C.-N.","year":"1998"},{"key":"e_1_2_1_61_1","volume-title":"Proceedings of the Conference on Automated Learning and Discovering.]]","author":"Hsu C.-N."},{"key":"e_1_2_1_62_1","volume-title":"Proceedings of the IJCAI Workshop on New Approaches to Learn for NLP.]]","author":"Huffman S.","year":"1995"},{"key":"e_1_2_1_63_1","volume-title":"Proceedings of the 7th Message Understanding Conference (MUC--7).]]","author":"Humphreys K."},{"key":"e_1_2_1_64_1","doi-asserted-by":"crossref","volume-title":"Text-Based Intelligent Systems: Current Research and Practice in Information Extraction and Retrieval","author":"Jacobs P.","DOI":"10.4324\/9781315806952"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.3115\/1219044.1219066"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/69.469825"},{"key":"e_1_2_1_67_1","volume-title":"Proceedings of the IJCAI Workshop on Adaptive Text Extraction and Mining.]]","author":"Knoblock C. A."},{"key":"e_1_2_1_68_1","unstructured":"Ko H. 1998. Empirical assembly sequence planning: A multistrategy constructive learning approach. In Machine Learning and Data Mining I. B. R. S. Michalsky and M. Kubat Eds. John Wiley & Sons.]]  Ko H. 1998. Empirical assembly sequence planning: A multistrategy constructive learning approach. In Machine Learning and Data Mining I. B. R. S. Michalsky and M. Kubat Eds. John Wiley & Sons.]]"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072399.1072419"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(99)00100-9"},{"key":"e_1_2_1_72_1","volume-title":"Proceedings of the 18th International Conference on Machine Learning (ICML).]]","author":"Lafferty J."},{"key":"e_1_2_1_73_1","volume-title":"Proceedings of the AAAI Workshop on Adaptive Text Extraction and Mining.]]","author":"Lavelli A."},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072064.1072105"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.3115\/1071958.1071994"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072017.1072043"},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072399.1072411"},{"key":"e_1_2_1_78_1","unstructured":"Manning C. and Sch\u00fctze H. 1999. Foundations of Statistical Natural Language Processing. MIT Press Cambridge MA.]]   Manning C. and Sch\u00fctze H. 1999. Foundations of Statistical Natural Language Processing. MIT Press Cambridge MA.]]"},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.5555\/972470.972475"},{"key":"e_1_2_1_80_1","volume-title":"Proceedings of the 17th International Conference on Machine Learning (ICML).]]","author":"McCallum A."},{"key":"e_1_2_1_81_1","volume-title":"Proceedings of the IJCAI-03 Workshop on Learning Statistical Models from Relational Data.]]","author":"McCallum A."},{"key":"e_1_2_1_82_1","volume-title":"Proceedings of the 14th International Join Conference on Artificial Intelligence (IJCAI).]]","author":"McCarthy J."},{"key":"e_1_2_1_83_1","volume-title":"Readings in Knowledge Acquisition and Learning","author":"Michalski R."},{"key":"e_1_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1093\/ijl\/3.4.235"},{"key":"e_1_2_1_85_1","volume-title":"Proceedings of the 7th Message Understanding Conference (MUC--7).]]","author":"Miller S."},{"key":"e_1_2_1_86_1","volume-title":"Proceedings of the 1st Meeting of the North American Chapter of the Association for Computational Linguistics.]]","author":"Miller S."},{"key":"e_1_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.3115\/980691.980712"},{"key":"e_1_2_1_88_1","volume-title":"Tutorial (AAAI) in Workshop on Machine Learning for Information Extraction.]]","author":"Mooney R."},{"key":"e_1_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072399.1072408"},{"key":"e_1_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF03037227"},{"key":"e_1_2_1_91_1","volume-title":"Proceedings of the 5th International Conference on Machine Learning (ICML).]]","author":"Muggleton S."},{"key":"e_1_2_1_92_1","doi-asserted-by":"crossref","unstructured":"Muggleton S. and Feng C. 1992. Efficient induction of logic programs. In Inductive Logic Programming S. Muggleton Ed. Academic Press New York NY.]]  Muggleton S. and Feng C. 1992. Efficient induction of logic programs. In Inductive Logic Programming S. Muggleton Ed. Academic Press New York NY.]]","DOI":"10.1007\/BF03037089"},{"key":"e_1_2_1_93_1","volume-title":"Proceedings of the AAAI Workshop on Machine Learning for Information Extraction.]]","author":"Muslea I.","year":"1999"},{"key":"e_1_2_1_94_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010022931168"},{"key":"e_1_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.1145\/301136.301191"},{"key":"e_1_2_1_96_1","doi-asserted-by":"publisher","DOI":"10.3115\/1119355.1119370"},{"key":"e_1_2_1_97_1","doi-asserted-by":"crossref","unstructured":"Pasca M. 2003. Large open-domain question answering from large text collections. CSLI Studies in Computational Linguistics.]]  Pasca M. 2003. Large open-domain question answering from large text collections. CSLI Studies in Computational Linguistics.]]","DOI":"10.1162\/089120103322753383"},{"key":"e_1_2_1_98_1","volume-title":"Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI03)","author":"Peshkin L."},{"key":"e_1_2_1_99_1","volume-title":"Proceedings of the European Conference on Machine Learning (ECML)","author":"Quinlan J."},{"key":"e_1_2_1_100_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1022699322624"},{"key":"e_1_2_1_101_1","volume-title":"Tutorial in 27st Annual International Conference on Research and Development in Information Retrieval (SIGIR).]]","author":"Radev D.","year":"2004"},{"key":"e_1_2_1_102_1","volume-title":"Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI01)","author":"Ray S."},{"key":"e_1_2_1_103_1","volume-title":"Proceedings of the 11th National Conference on Artificial Intelligence (AAAI). 811--816","author":"Riloff E.","year":"1993"},{"key":"e_1_2_1_104_1","volume-title":"Proceedings of the 13th National Conference on Artificial Intelligence (AAAI). 1044--1049","author":"Riloff E.","year":"1996"},{"key":"e_1_2_1_105_1","volume-title":"Proceedings of the 15th AAAI National Conference on Artificial Intelligence (AAAI). 806--813","author":"Roth D.","year":"1998"},{"key":"e_1_2_1_106_1","volume-title":"Proceedings of the 15th International Conference On Artificial Intelligence (IJCAI).]]","author":"Roth D."},{"key":"e_1_2_1_107_1","volume-title":"Proceedings of the SIG NL\/SI of Information Processing Society of Japan.]]","author":"Sekine S."},{"key":"e_1_2_1_108_1","volume-title":"Proceedings of the 16th AAAI National Conference on Artificial Intelligence (AAAI).]]","author":"Seymore K."},{"key":"e_1_2_1_109_1","volume-title":"Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI03)","author":"Skounakis M."},{"key":"e_1_2_1_110_1","volume-title":"Proceedings of the 3th International Conference on Knowledge Discovery and Data Mining (KDD).]]","author":"Soderland S.","year":"1997"},{"key":"e_1_2_1_111_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007562322031"},{"key":"e_1_2_1_112_1","volume-title":"Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI). 1314--1321","author":"Soderland S."},{"key":"e_1_2_1_113_1","volume-title":"1999. Natural Language Information Retrieval","author":"Strzalkowski T."},{"key":"e_1_2_1_114_1","volume-title":"Proceedings of the First NSF\/NIJ Symposium on Intelligence and Security Informatics (ISI03)","author":"Sun A."},{"key":"e_1_2_1_115_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118853.1118882"},{"key":"e_1_2_1_116_1","volume-title":"Proceedings of the AAAI Workshop on Machine Learning for Information Extraction.]]","author":"Thomas B.","year":"1999"},{"key":"e_1_2_1_117_1","volume-title":"Proceedings of Sixteenth International Machine Learning Conference. 406--414","author":"Thompson C."},{"key":"e_1_2_1_119_1","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324902002863"},{"key":"e_1_2_1_120_1","volume-title":"Information Extraction: Towards Scalability, Adaptable Systems","author":"Vilain M."},{"key":"e_1_2_1_121_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072399.1072407"},{"key":"e_1_2_1_122_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072064.1072091"},{"key":"e_1_2_1_123_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072017.1072030"},{"key":"e_1_2_1_124_1","doi-asserted-by":"publisher","DOI":"10.3115\/1071958.1071982"},{"key":"e_1_2_1_126_1","doi-asserted-by":"publisher","DOI":"10.3115\/1075096.1075140"},{"key":"e_1_2_1_127_1","volume-title":"Proceedings of the 7th Message Understanding Conference (MUC--5).]]","author":"Yangarber R."},{"key":"e_1_2_1_128_1","volume-title":"Proceedings of the ECAI Workshop on Machine Learning for Information Extraction.]]","author":"Yangarber R."},{"key":"e_1_2_1_129_1","doi-asserted-by":"publisher","DOI":"10.3115\/992730.992782"},{"key":"e_1_2_1_130_1","volume-title":"Proceedings of the ACL Workshop on Multilingual and Mixed-language Named Entity Recognition.]]","author":"Yarowsky D.","year":"2003"},{"key":"e_1_2_1_131_1","volume-title":"Eds","author":"Young S.","year":"1997"},{"key":"e_1_2_1_132_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944964"},{"key":"e_1_2_1_133_1","volume-title":"Proceedings of the 12th National Conference on Artificial Intelligence (AAAI). 748--753","author":"Zelle J."},{"key":"e_1_2_1_134_1","doi-asserted-by":"publisher","DOI":"10.3115\/1219840.1219892"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1132956.1132957","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1132956.1132957","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T15:06:13Z","timestamp":1750259173000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1132956.1132957"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,7,25]]},"references-count":127,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2006,7,25]]}},"alternative-id":["10.1145\/1132956.1132957"],"URL":"https:\/\/doi.org\/10.1145\/1132956.1132957","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,7,25]]},"assertion":[{"value":"2006-07-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}