{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T04:57:31Z","timestamp":1760245051213,"version":"3.38.0"},"reference-count":41,"publisher":"SAGE Publications","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IDA"],"published-print":{"date-parts":[[2017,10,10]]},"DOI":"10.3233\/ida-163148","type":"journal-article","created":{"date-parts":[[2017,10,17]],"date-time":"2017-10-17T16:19:12Z","timestamp":1508257152000},"page":"1213-1231","source":"Crossref","is-referenced-by-count":7,"title":["Distant supervised relation extraction via long short term memory networks with sentence embedding"],"prefix":"10.1177","volume":"21","author":[{"given":"Dengchao","family":"He","sequence":"first","affiliation":[]},{"given":"Hongjun","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Wenning","family":"Hao","sequence":"additional","affiliation":[]},{"given":"Rui","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Gang","family":"Chen","sequence":"additional","affiliation":[]},{"given":"Dawei","family":"Jin","sequence":"additional","affiliation":[]},{"given":"Kai","family":"Cheng","sequence":"additional","affiliation":[]}],"member":"179","reference":[{"key":"10.3233\/IDA-163148_ref1","doi-asserted-by":"crossref","unstructured":"A.P. Parikh, S.B. Cohen and E.P. Xing, Spectral unsupervised parsing with additive tree metrics, In: Proceedings of ACL, 2014, pages 1062\u20131072.","DOI":"10.3115\/v1\/P14-1100"},{"key":"10.3233\/IDA-163148_ref2","doi-asserted-by":"crossref","unstructured":"C.N. Dos Santos, B. Xiang and B. Zhou, Classifying relations by ranking with convolutional neural networks, In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics, 2015.","DOI":"10.3115\/v1\/P15-1061"},{"issue":"10","key":"10.3233\/IDA-163148_ref3","doi-asserted-by":"crossref","first-page":"1429","DOI":"10.1016\/S0893-6080(03)00138-2","article-title":"The general inefficiency of batch training for gradient descent learning","volume":"16","author":"Wilson","year":"2003","journal-title":"Neural Networks"},{"key":"10.3233\/IDA-163148_ref4","first-page":"1083","article-title":"Kernel methods for relation extraction","volume":"3","author":"Zelenko","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"10.3233\/IDA-163148_ref5","first-page":"2335","article-title":"Relation Classification via Convolutional Deep Neural Network","author":"Zeng","year":"2014","journal-title":"COLING"},{"key":"10.3233\/IDA-163148_ref6","doi-asserted-by":"crossref","unstructured":"D. Zeng, K. Liu, Y. Chen et al., Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks, In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015, pp. 17\u201321.","DOI":"10.18653\/v1\/D15-1203"},{"issue":"1","key":"10.3233\/IDA-163148_ref7","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1162\/COLI_a_00167","article-title":"Learning representations for weakly supervised natural language processing tasks","volume":"40","author":"Huang","year":"2014","journal-title":"Journal of Computational Linguistics"},{"key":"10.3233\/IDA-163148_ref8","doi-asserted-by":"crossref","unstructured":"F. Suchanek, G. Ifrim and G. Weikum, Combining linguistic and statistical analysis to extract relations from web documents, In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006, pp. 712\u2013717.","DOI":"10.1145\/1150402.1150492"},{"key":"10.3233\/IDA-163148_ref9","unstructured":"F. Suchanek, J. Fan, R. Hoffmann et al., Advances in automated knowledge base construction, SIGMOD Records Journal, March 2013."},{"key":"10.3233\/IDA-163148_ref10","unstructured":"F. Wu and D. Weld, Open information extraction using Wikipedia, In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, 2010."},{"issue":"4","key":"10.3233\/IDA-163148_ref11","first-page":"212","article-title":"Improving Neural Networks by Preventing Co-adaptation of Feature Detectors","volume":"3","author":"Hinton","year":"2012","journal-title":"Computer Science"},{"key":"10.3233\/IDA-163148_ref12","unstructured":"G.D. Zhou, M. Zhang, D.H. Ji et al., Tree kernel-based relation extraction with context-sensitive structured parse tree information, In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2007."},{"key":"10.3233\/IDA-163148_ref13","doi-asserted-by":"crossref","unstructured":"H. Palangi, L. Deng, Y. Shen et al., Deep sentence embedding using the long short term memory network: Analysis and application to information retrieval, IEEE\/ACM Transactions on Audio, Speech, and Language Processing 24(4) (2016), 694\u2013707.","DOI":"10.1109\/TASLP.2016.2520371"},{"key":"10.3233\/IDA-163148_ref14","doi-asserted-by":"crossref","unstructured":"J. Ebrahimi and D.J. Dou, Chain based RNN for relation classification, In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2015, pp. 1244\u20131249.","DOI":"10.3115\/v1\/N15-1133"},{"key":"10.3233\/IDA-163148_ref15","first-page":"1532","article-title":"Glove: Global Vectors for Word Representation","volume":"14","author":"Pennington","year":"2014","journal-title":"EMNLP"},{"issue":"3","key":"10.3233\/IDA-163148_ref16","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1093\/bioinformatics\/btl616","article-title":"RelEx \u2013 Relation extraction using dependency parse trees","volume":"23","author":"Fundel","year":"2007","journal-title":"Bioinformatics"},{"key":"10.3233\/IDA-163148_ref17","first-page":"1372","article-title":"Simple Customization of Recursive Neural Networks for Semantic Relation Classification","author":"Hashimoto","year":"2013","journal-title":"EMNLP"},{"issue":"7","key":"10.3233\/IDA-163148_ref18","first-page":"941","article-title":"Semantic relation classification via convolutional neural networks with simple negative sampling","volume":"71","author":"Xu","year":"2015","journal-title":"Computer Science"},{"key":"10.3233\/IDA-163148_ref19","doi-asserted-by":"crossref","unstructured":"L.H. Qian, G.D. Zhou, F. Kong et al., Exploiting constituent dependencies for tree kernel-based semantic relation extraction, In: Proceedings of COLING, 2008, pp. 697\u2013704.","DOI":"10.3115\/1599081.1599169"},{"key":"10.3233\/IDA-163148_ref20","doi-asserted-by":"crossref","unstructured":"M. Lyyer et al., A neural network for factoid question answering over paragraphs, In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014, pp. 633\u2013644.","DOI":"10.3115\/v1\/D14-1070"},{"key":"10.3233\/IDA-163148_ref21","doi-asserted-by":"crossref","unstructured":"M. Mintz, S. Bills, R. Snow et al., Distant supervision for relation extraction without labeled data, In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2-Volume 2. Association for Computational Linguistics, 2009, pp. 1003\u20131011.","DOI":"10.3115\/1690219.1690287"},{"key":"10.3233\/IDA-163148_ref23","unstructured":"M. Surdeanu, J. Tibshirani, R. Nallapati et al., Multi-instance multi-label learning for relation extraction, In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Association for Computational Linguistics, 2012, pp. 455\u2013465."},{"key":"10.3233\/IDA-163148_ref24","unstructured":"M. Zeiler, Adadelta: An adaptive learning rate method, Computer Science, 2012."},{"key":"10.3233\/IDA-163148_ref25","doi-asserted-by":"crossref","unstructured":"M.L. Zhang and Z.H. Zhou, Adapting RBF neural networks to multi-instance learning, Neural Processing Letters 23(1) (2006), 1\u201326.","DOI":"10.1007\/s11063-005-2192-z"},{"key":"10.3233\/IDA-163148_ref26","doi-asserted-by":"crossref","unstructured":"N. Kambhatla, Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations, In: Proceedings of the ACL 2004 on Interactive Poster and Demonstration Sessions, 2004.","DOI":"10.3115\/1219044.1219066"},{"key":"10.3233\/IDA-163148_ref28","first-page":"1188","article-title":"Distributed representations of sentences and documents","volume":"14","author":"Le","year":"2014","journal-title":"ICML"},{"key":"10.3233\/IDA-163148_ref29","doi-asserted-by":"crossref","unstructured":"R. Bunescu and R. Mooney, A shortest path dependency kernel for relation extraction, In: Proceedings of HLT\/EMNLP, 2005, pp. 724\u2013731.","DOI":"10.3115\/1220575.1220666"},{"key":"10.3233\/IDA-163148_ref30","unstructured":"R. Bunescu and R. Mooney, Subsequence kernels for relation extraction, In: Proceedings of NIPS 18 (2006), 171\u2013178."},{"key":"10.3233\/IDA-163148_ref31","first-page":"2493","article-title":"Natural language processing (almost) from scratch","volume":"12","author":"Collobert","year":"2011","journal-title":"The Journal of Machine Learning Research"},{"key":"10.3233\/IDA-163148_ref32","doi-asserted-by":"crossref","unstructured":"R. Grishman, Information Extraction: Capabilities and Challenges, Oxford University Press, August 2011.","DOI":"10.1093\/oxfordhb\/9780199276349.013.0030"},{"issue":"2","key":"10.3233\/IDA-163148_ref33","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1142\/S0218488598000094","article-title":"The vanishing gradient problem during learning recurrent neural nets and problem solutions","volume":"6","author":"Hochreiter","year":"1998","journal-title":"International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems"},{"key":"10.3233\/IDA-163148_ref34","unstructured":"R. Hoffmann, C. Zhang, X. Ling et al., Knowledge-based weak supervision for information extraction of overlapping relations, In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1. Association for Computational Linguistics, 2011, pp. 541\u2013550."},{"key":"10.3233\/IDA-163148_ref35","first-page":"122","article-title":"Characterizing the Errors of Data-Driven Dependency Parsing Models","author":"McDonald","year":"2007","journal-title":"EMNLP-CoNLL"},{"key":"10.3233\/IDA-163148_ref36","first-page":"148","article-title":"Modeling relations and their mentions without labeled text","author":"Riedel","year":"2010","journal-title":"Machine Learning and Knowledge Discovery in Databases"},{"key":"10.3233\/IDA-163148_ref37","unstructured":"S. Takamatsu, I. Sato and H. Nakagawa, Reducing wrong labels in distant supervision for relation extraction, In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1. Association for Computational Linguistics, 2012, pp. 721\u2013729."},{"key":"10.3233\/IDA-163148_ref38","unstructured":"R. Socher, J. Pennington, E.H. Huang et al., Semi-supervised recursive autoencoders for predicting sentiment distributions, In: Proceedings of the Conference on Empirical Methods in Natural Language Association for Computational Linguistics, 2011, pp. 151\u2013161."},{"issue":"1","key":"10.3233\/IDA-163148_ref39","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1016\/S0004-3702(96)00034-3","article-title":"Solving the multiple instance problem with axis-parallel rectangles","volume":"89","author":"Dietterich","year":"1997","journal-title":"Artificial Intelligence"},{"key":"10.3233\/IDA-163148_ref40","unstructured":"T. Mikolov, K. Chen, G. Corrado et al., Efficient estimation of word representations in vector space, Computer Science, 2013."},{"issue":"4","key":"10.3233\/IDA-163148_ref41","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1145\/2333112.2333115","article-title":"Ontology learning from text: A look back and into the future","volume":"44","author":"Wong","year":"2012","journal-title":"ACM Computing Surveys (CSUR)"},{"key":"10.3233\/IDA-163148_ref42","unstructured":"W. Xu, R.L. Zhao and R. Grishman, Filling Knowledge Base Gaps for Distant Supervision of Relation Extraction, In: Proceedings of Association for Computational Linguistics, 2013."},{"key":"10.3233\/IDA-163148_ref44","doi-asserted-by":"crossref","unstructured":"Y. Xu, L. Mou, G. Li et al., Classifying relations via long short term memory networks along shortest dependency paths, In: Proceedings of Conference on Empirical Methods in Natural Language Processing, 2015.","DOI":"10.18653\/v1\/D15-1206"}],"container-title":["Intelligent Data Analysis"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/IDA-163148","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,11]],"date-time":"2025-03-11T05:29:06Z","timestamp":1741670946000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/IDA-163148"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,10,10]]},"references-count":41,"journal-issue":{"issue":"5"},"URL":"https:\/\/doi.org\/10.3233\/ida-163148","relation":{},"ISSN":["1088-467X","1571-4128"],"issn-type":[{"type":"print","value":"1088-467X"},{"type":"electronic","value":"1571-4128"}],"subject":[],"published":{"date-parts":[[2017,10,10]]}}}