{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,30]],"date-time":"2025-10-30T06:21:44Z","timestamp":1761805304901},"reference-count":96,"publisher":"MIT Press","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computational Linguistics"],"published-print":{"date-parts":[[2014,3]]},"abstract":"<jats:p>Finding the right representations for words is critical for building accurate NLP systems when domain-specific labeled data for the task is scarce. This article investigates novel techniques for extracting features from n-gram models, Hidden Markov Models, and other statistical language models, including a novel Partial Lattice Markov Random Field model. Experiments on part-of-speech tagging and information extraction, among other tasks, indicate that features taken from statistical language models, in combination with more traditional features, outperform traditional representations alone, and that graphical model representations outperform n-gram models, especially on sparse and polysemous words.<\/jats:p>","DOI":"10.1162\/coli_a_00167","type":"journal-article","created":{"date-parts":[[2013,6,26]],"date-time":"2013-06-26T14:39:53Z","timestamp":1372257593000},"page":"85-120","source":"Crossref","is-referenced-by-count":24,"title":["Learning Representations for Weakly Supervised Natural Language Processing Tasks"],"prefix":"10.1162","volume":"40","author":[{"given":"Fei","family":"Huang","sequence":"first","affiliation":[{"name":"Temple University"}]},{"given":"Arun","family":"Ahuja","sequence":"additional","affiliation":[{"name":"Northwestern University"}]},{"given":"Doug","family":"Downey","sequence":"additional","affiliation":[{"name":"Northwestern University"}]},{"given":"Yi","family":"Yang","sequence":"additional","affiliation":[{"name":"Northwestern University"}]},{"given":"Yuhong","family":"Guo","sequence":"additional","affiliation":[{"name":"Temple University"}]},{"given":"Alexander","family":"Yates","sequence":"additional","affiliation":[{"name":"Temple University"}]}],"member":"281","reference":[{"key":"R1","first-page":"225","volume-title":"Proceedings of the Annual Meeting of the North American Chapter of the Association of Computational Linguistics (NAACL-HLT)","author":"Ahuja Arun","year":"2010"},{"key":"R2","doi-asserted-by":"publisher","DOI":"10.3115\/1219840.1219841"},{"key":"R3","first-page":"2670","volume-title":"Proceedings of the IJCAI","author":"Banko Michele","year":"2007"},{"key":"R4","doi-asserted-by":"publisher","DOI":"10.3115\/1220355.1220435"},{"key":"R5","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-009-5152-4"},{"key":"R6","first-page":"127","volume-title":"Advances in Neural Information Processing Systems 20","author":"Ben-David Shai","year":"2007"},{"key":"R7","doi-asserted-by":"publisher","DOI":"10.4249\/scholarpedia.3881"},{"key":"R8","first-page":"1,137","volume":"3","author":"Bengio Yoshua","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"R9","doi-asserted-by":"publisher","DOI":"10.1145\/1553374.1553380"},{"key":"R10","first-page":"182","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing","author":"Bikel Daniel M.","year":"2004"},{"key":"R11","doi-asserted-by":"publisher","DOI":"10.1162\/0891201042544929"},{"key":"R12","doi-asserted-by":"publisher","DOI":"10.1162\/jmlr.2003.3.4-5.993"},{"key":"R13","unstructured":"Blitzer, John. 2008. Domain Adaptation of Natural Language Processing Systems. Ph.D. thesis, University of Pennsylvania, Philadelphia, PA."},{"key":"R14","first-page":"129","volume-title":"Advances in Neural Information Processing Systems","author":"Blitzer John","year":"2007"},{"key":"R15","first-page":"40","volume-title":"Association for Computational Linguistics (ACL)","author":"Blitzer John","year":"2007"},{"key":"R16","doi-asserted-by":"publisher","DOI":"10.3115\/1610075.1610094"},{"key":"R17","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-19488-6_110"},{"key":"R19","first-page":"467","volume":"18","author":"Brown Peter F.","year":"1992","journal-title":"Computational Linguistics"},{"key":"R20","doi-asserted-by":"publisher","DOI":"10.3115\/1697236.1697263"},{"key":"R21","doi-asserted-by":"publisher","DOI":"10.3115\/1220175.1220187"},{"key":"R22","first-page":"285","volume-title":"Proceedings of the EMNLP","author":"Chelba Ciprian","year":"2004"},{"key":"R23","doi-asserted-by":"publisher","DOI":"10.1145\/1390156.1390177"},{"key":"R24","first-page":"540","volume-title":"Proceedings of the National Conference on Artificial Intelligence (AAAI)","author":"Dai Wenyuan","year":"2007"},{"key":"R25","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1176345006"},{"key":"R26","first-page":"256","volume-title":"Proceedings of the ACL","author":"Daum\u00e9 Hal","year":"2007"},{"key":"R27","first-page":"53","volume-title":"Proceedings of the ACL Workshop on Domain Adaptation (DANLP)","author":"Daum\u00e9 Hal","year":"2010"},{"key":"R28","doi-asserted-by":"publisher","DOI":"10.1613\/jair.1872"},{"key":"R29","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"issue":"1","key":"R30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","volume":"39","author":"Dempster Arthur","year":"1977","journal-title":"Journal of the Royal Statistical Society, Series B"},{"key":"R31","doi-asserted-by":"publisher","DOI":"10.3115\/1699510.1699514"},{"key":"R32","first-page":"886","volume-title":"Proceedings of the Advances in Neural Information Processing Systems (NIPS)","volume":"24","author":"Dhillon Paramveer S.","year":"2011"},{"key":"R33","first-page":"2,733","volume-title":"Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007)","author":"Downey Doug","year":"2007"},{"key":"R34","doi-asserted-by":"publisher","DOI":"10.21236\/ADA534427"},{"key":"R35","doi-asserted-by":"publisher","DOI":"10.3115\/1613715.1613801"},{"key":"R36","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-009-5148-0"},{"key":"R37","doi-asserted-by":"publisher","DOI":"10.3115\/1620754.1620842"},{"key":"R38","doi-asserted-by":"publisher","DOI":"10.3115\/1609067.1609091"},{"key":"R39","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007425814087"},{"key":"R40","first-page":"167","volume-title":"Conference on Empirical Methods in Natural Language Processing","author":"Gildea Daniel","year":"2001"},{"key":"R41","first-page":"744","volume-title":"Proceedings of the ACL","author":"Goldwater Sharon","year":"2007"},{"key":"R42","first-page":"664","volume-title":"Proceedings of the Neural Information Processing Systems Conference (NIPS)","author":"Gra\u00e7a Jo\u00e3o V.","year":"2009"},{"key":"R43","doi-asserted-by":"publisher","DOI":"10.1080\/00437956.1954.11659520"},{"key":"R44","first-page":"268","volume-title":"Proceedings of the ACL","author":"Hindle Donald","year":"1990"},{"key":"R45","first-page":"401","volume-title":"Proceedings of the International ICSC Symposium on Soft Computing","author":"Honkela Timo","year":"1997"},{"key":"R46","doi-asserted-by":"publisher","DOI":"10.3115\/1687878.1687948"},{"key":"R47","first-page":"23","volume-title":"Proceedings of the ACL 2010 Workshop on Domain Adaptation for Natural Language Processing (DANLP)","author":"Huang Fei","year":"2010"},{"key":"R48","first-page":"125","volume-title":"Proceedings of the Conference on Natural Language Learning (CoNLL)","author":"Huang Fei","year":"2011"},{"key":"R49","first-page":"264","volume-title":"Proceedings of ACL","author":"Jiang Jing","year":"2007"},{"key":"R50","doi-asserted-by":"publisher","DOI":"10.1145\/1321440.1321498"},{"key":"R51","first-page":"296","volume-title":"Proceedings of the EMNLP","author":"Johnson Mark","year":"2007"},{"key":"R52","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.1998.682302"},{"key":"R53","first-page":"595","volume-title":"Proceedings of the Annual Meeting of the Association of Computational Linguistics (ACL)","author":"Koo Terry","year":"2008"},{"key":"R54","doi-asserted-by":"publisher","DOI":"10.3115\/1690219.1690290"},{"key":"R55","doi-asserted-by":"publisher","DOI":"10.1007\/BF01589116"},{"key":"R56","first-page":"1,041","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Mansour Y.","year":"2009"},{"issue":"2","key":"R57","first-page":"313","volume":"19","author":"Marcus Mitchell P.","year":"1993","journal-title":"Computational Linguistics"},{"key":"R58","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-6393(97)00062-9"},{"key":"R59","unstructured":"McClosky, David. 2010. Any Domain Parsing: Automatic Domain Adaptation for Parsing. Ph.D. thesis, Brown University, Providence, RI."},{"key":"R60","first-page":"28","volume-title":"North American Chapter of the Association for Computational Linguistics - Human Language Technologies 2010 Conference (NAACL-HLT 2010)","author":"McClosky David","year":"2010"},{"key":"R61","first-page":"337","volume-title":"Proceedings of the Annual Meeting of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL)","author":"Miller Scott","year":"2004"},{"key":"R62","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273577"},{"key":"R63","first-page":"1,081","volume-title":"Proceedings of the Neural Information Processing Systems (NIPS)","author":"Mnih Andriy","year":"2009"},{"key":"R64","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2008.12.025"},{"key":"R65","first-page":"246","volume-title":"Proceedings of the International Workshop on Artificial Intelligence and Statistics","author":"Morin Frederic","year":"2005"},{"key":"R66","doi-asserted-by":"publisher","DOI":"10.3115\/1699571.1699635"},{"key":"R68","doi-asserted-by":"publisher","DOI":"10.3115\/981574.981598"},{"key":"R69","first-page":"556","volume-title":"Proceedings of NAACL-HLT","author":"Pradhan Sameer","year":"2007"},{"key":"R70","doi-asserted-by":"publisher","DOI":"10.1109\/5.18626"},{"key":"R71","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273592"},{"key":"R72","doi-asserted-by":"publisher","DOI":"10.3115\/1596374.1596399"},{"key":"R73","doi-asserted-by":"publisher","DOI":"10.1007\/BF00203171"},{"key":"R74","volume-title":"Synactic Theory: A Formal Introduction.","author":"Sag Ivan A.","year":"2003"},{"key":"R75","first-page":"1","volume-title":"Proceedings of the Semantic Knowledge Acquisition and Categorization Workshop","author":"Sahlgren Magnus","year":"2001"},{"key":"R76","first-page":"1","volume-title":"Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering (TKE)","volume":"87","author":"Sahlgren Magnus","year":"2005"},{"key":"R77","unstructured":"Sahlgren, Magnus. 2006. The word-space model: Using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces. Ph.D. thesis, Stockholm University."},{"key":"R78","volume-title":"Introduction to Modern Information Retrieval.","author":"Salton Gerard","year":"1983"},{"key":"R79","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-74976-9_23"},{"key":"R80","doi-asserted-by":"publisher","DOI":"10.3115\/974557.974572"},{"key":"R81","first-page":"760","volume-title":"Proceedings of the ACL","author":"Shen Libin","year":"2007"},{"key":"R82","doi-asserted-by":"publisher","DOI":"10.3115\/1219840.1219884"},{"key":"R83","first-page":"693","volume":"8","author":"Sutton Charles","year":"2007","journal-title":"Journal of Machine Learning Research"},{"key":"R84","first-page":"665","volume-title":"Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL-HLT)","author":"Suzuki Jun","year":"2008"},{"key":"R85","doi-asserted-by":"publisher","DOI":"10.3115\/1699571.1699585"},{"key":"R87","first-page":"127","volume-title":"Proceedings of the 4th Conference on Computational Natural Language Learning","author":"Tjong Erik F.","year":"2000"},{"key":"R88","first-page":"1,521","volume-title":"Proceedings of the NIPS","author":"Toutanova Kristina","year":"2007"},{"key":"R89","first-page":"32","volume-title":"Proceedings of the Fourth SIGHAN Workshop","author":"Tseng Huihsin","year":"2005"},{"key":"R90","doi-asserted-by":"publisher","DOI":"10.3115\/1620853.1620921"},{"key":"R91","first-page":"384","volume-title":"Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL)","author":"Turian Joseph","year":"2010"},{"key":"R92","doi-asserted-by":"publisher","DOI":"10.1613\/jair.2934"},{"key":"R93","doi-asserted-by":"publisher","DOI":"10.3115\/993268.993390"},{"key":"R94","first-page":"173","volume-title":"Proceedings of the STePs 2004 Cognition + Cybernetics Symposium","author":"V\u00e4yrynen Jaakko","year":"2004"},{"key":"R95","first-page":"135","volume-title":"Proceedings of the International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning (AKRR)","author":"V\u00e4yrynen Jaakko","year":"2005"},{"key":"R96","first-page":"20","volume-title":"Proceedings of the Workshop Semantic Content Acquisition and Representation (SCAR)","author":"V\u00e4yrynen Jaakko","year":"2007"},{"key":"R97","doi-asserted-by":"publisher","DOI":"10.1145\/1390156.1390303"},{"key":"R98","first-page":"579","volume-title":"Proceedings of the NAACL-HLT","author":"Yang Yi","year":"2013"},{"key":"R99","doi-asserted-by":"publisher","DOI":"10.3115\/1596409.1596418"}],"container-title":["Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/COLI_a_00167","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,13]],"date-time":"2024-05-13T01:50:26Z","timestamp":1715565026000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/coli\/article\/40\/1\/85-120\/1460"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,3]]},"references-count":96,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2014,3]]}},"alternative-id":["10.1162\/COLI_a_00167"],"URL":"https:\/\/doi.org\/10.1162\/coli_a_00167","relation":{},"ISSN":["0891-2017","1530-9312"],"issn-type":[{"value":"0891-2017","type":"print"},{"value":"1530-9312","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,3]]}}}