{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T04:26:49Z","timestamp":1777696009795,"version":"3.51.4"},"reference-count":35,"publisher":"SAGE Publications","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IDA"],"published-print":{"date-parts":[[2021,4,20]]},"abstract":"<jats:p>Storyline extraction aims to generate concise summaries of related events unfolding over time from a collection of temporally-ordered news articles. Some existing approaches to storyline extraction are typically built on probabilistic graphical models that jointly model the extraction of events and the storylines from news published in different periods. However, their parameter inference procedures are often complex and require a long time to converge, which hinders their use in practical applications. More recently, a neural network-based approach has been proposed to tackle such limitations. However, event representations of documents, which are important for the quality of the generated storylines, are not learned. In this paper, we propose a novel unsupervised neural network-based approach to extract latent events and link patterns of storylines jointly from documents over time. Specifically, event representations are learned by a stacked autoencoder and clustered for event extraction, then a fusion component is incorporated to link the related events across consecutive periods for storyline extraction. The proposed model has been evaluated on three news corpora and the experimental results show that it outperforms state-of-the-art approaches with significant improvements.<\/jats:p>","DOI":"10.3233\/ida-195061","type":"journal-article","created":{"date-parts":[[2021,4,23]],"date-time":"2021-04-23T14:40:35Z","timestamp":1619188835000},"page":"589-603","source":"Crossref","is-referenced-by-count":1,"title":["Unsupervised latent event representation learning and storyline extraction from news articles based on neural networks"],"prefix":"10.1177","volume":"25","author":[{"given":"Jiasheng","family":"Si","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Key Laboratory of Computer Network and Information Integration, Ministry of Education, Southeast University, China"}]},{"given":"Linsen","family":"Guo","sequence":"additional","affiliation":[{"name":"Meituan-Dianping Group, China"}]},{"given":"Deyu","family":"Zhou","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Key Laboratory of Computer Network and Information Integration, Ministry of Education, Southeast University, China"}]}],"member":"179","reference":[{"key":"10.3233\/IDA-195061_ref1","doi-asserted-by":"crossref","unstructured":"A. Ahmed, Q. Ho, J. Eisenstein, E. Xing, A.J. Smola and C.H. Teo, Unified analysis of streaming news, in: Proceedings of the 20th International Conference on World Wide Web, ACM, 2011, pp. 267\u2013276.","DOI":"10.1145\/1963405.1963445"},{"key":"10.3233\/IDA-195061_ref2","unstructured":"A. Ahmed, Q. Ho, C.H. Teo, J. Eisenstein, A. Smola and E. Xing, Online inference for the infinite topic-cluster model: Storylines from streaming text, in: Proceedings of the 14th International Conference on Artificial Intelligence and Statistics, 2011, pp. 101\u2013109."},{"key":"10.3233\/IDA-195061_ref3","doi-asserted-by":"crossref","unstructured":"R. Asadi and A. Regan, Spatio-temporal clustering of traffic data with deep embedded clustering, in: Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Prediction of Human Mobility, ACM, 2019, pp. 45\u201352.","DOI":"10.1145\/3356995.3364537"},{"key":"10.3233\/IDA-195061_ref4","doi-asserted-by":"crossref","unstructured":"D.M. Blei and J.D. Lafferty, Dynamic topic models, in: Proceedings of the 23rd International Conference on Machine Learning, ACM, 2006, pp. 113\u2013120.","DOI":"10.1145\/1143844.1143859"},{"key":"10.3233\/IDA-195061_ref5","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"10.3233\/IDA-195061_ref6","doi-asserted-by":"crossref","unstructured":"Z. Cao, S. Li, Y. Liu, W. Li and H. Ji, A novel neural topic model and its supervised extension, in: Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015, pp. 2210\u20132216.","DOI":"10.1609\/aaai.v29i1.9499"},{"key":"10.3233\/IDA-195061_ref7","doi-asserted-by":"crossref","unstructured":"Q. Diao and J. Jiang, Recurrent chinese restaurant process with a duration-based discount for event identification from twitter, in: Proceedings of the 2014 SIAM International Conference on Data Mining, SIAM, 2014, pp. 388\u2013397.","DOI":"10.1137\/1.9781611973440.45"},{"key":"10.3233\/IDA-195061_ref8","doi-asserted-by":"crossref","unstructured":"N. Du, M. Farajtabar, A. Ahmed, A.J. Smola and L. Song, Dirichlet-hawkes processes with applications to clustering continuous-time document streams, in: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2015, pp. 219\u2013228.","DOI":"10.1145\/2783258.2783411"},{"key":"10.3233\/IDA-195061_ref9","doi-asserted-by":"crossref","unstructured":"K. Ghasedi Dizaji, A. Herandi, C. Deng, W. Cai and H. Huang, Deep clustering via joint convolutional autoencoder embedding and relative entropy minimization, in: Proceedings of the IEEE International Conference on Computer Vision, 2017, pp. 5736\u20135745.","DOI":"10.1109\/ICCV.2017.612"},{"key":"10.3233\/IDA-195061_ref10","doi-asserted-by":"crossref","unstructured":"X. Guo, L. Gao, X. Liu and J. Yin, Improved deep embedded clustering with local structure preservation, in: Proceedings of Conference on International Joint Conference on Artificial Intelligence, 2017, pp. 1753\u20131759.","DOI":"10.24963\/ijcai.2017\/243"},{"key":"10.3233\/IDA-195061_ref11","doi-asserted-by":"crossref","unstructured":"X. Guo, X. Liu, E. Zhu, X. Zhu, M. Li, X. Xu and J. Yin, Adaptive self-paced deep clustering with data augmentation, in: IEEE Transactions on Knowledge and Data Engineering, 2019.","DOI":"10.1109\/TKDE.2019.2911833"},{"key":"10.3233\/IDA-195061_ref12","doi-asserted-by":"crossref","unstructured":"A. Hadifar, L. Sterckx, T. Demeester and C. Develder, A self-training approach for short text clustering, in: Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019), 2019, pp. 194\u2013199.","DOI":"10.18653\/v1\/W19-4322"},{"key":"10.3233\/IDA-195061_ref13","unstructured":"L. Huang and L. Huang, Optimized event storyline generation based on mixture-event-aspect model, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2013, pp. 726\u2013735."},{"key":"10.3233\/IDA-195061_ref15","doi-asserted-by":"crossref","unstructured":"N. Kawamae, Trend analysis model: trend consists of temporal words, topics, and timestamps, in: Proceedings of the fourth ACM International Conference on Web Search and Data Mining, ACM, 2011, pp. 317\u2013326.","DOI":"10.1145\/1935826.1935880"},{"key":"10.3233\/IDA-195061_ref16","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1016\/j.patcog.2018.05.019","article-title":"Discriminatively boosted image clustering with fully convolutional auto-encoders","volume":"83","author":"Li","year":"2018","journal-title":"Pattern Recognition"},{"key":"10.3233\/IDA-195061_ref17","doi-asserted-by":"crossref","unstructured":"J. Li and C. Cardie, Timeline generation: Tracking individuals on twitter, in: Proceedings of the 23rd International Conference on World Wide Web, ACM, 2014, pp. 643\u2013652.","DOI":"10.1145\/2566486.2567969"},{"key":"10.3233\/IDA-195061_ref18","unstructured":"J. Li and S. Li, Evolutionary hierarchical dirichlet process for timeline summarization, in: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2013, pp. 556\u2013560."},{"key":"10.3233\/IDA-195061_ref20","doi-asserted-by":"crossref","unstructured":"C. Lin, C. Lin, J. Li, D. Wang, Y. Chen and T. Li, Generating event storylines from microblogs, in: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, ACM, 2012, pp. 175\u2013184.","DOI":"10.1145\/2396761.2396787"},{"key":"10.3233\/IDA-195061_ref21","doi-asserted-by":"crossref","unstructured":"B. Liu, D. Niu, K. Lai, L. Kong and Y. Xu, Growing story forest online from massive breaking news, in: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, ACM, 2017, pp. 777\u2013785.","DOI":"10.1145\/3132847.3132852"},{"key":"10.3233\/IDA-195061_ref22","first-page":"2579","article-title":"Visualizing data using t-sne","volume":"9","author":"Maaten","year":"2008","journal-title":"Journal of Machine Learning Research"},{"key":"10.3233\/IDA-195061_ref23","unstructured":"Y. Miao, L. Yu and P. Blunsom, Neural variational inference for text processing, in: International Conference on Machine Learning, 2016, pp. 1727\u20131736."},{"key":"10.3233\/IDA-195061_ref24","doi-asserted-by":"crossref","unstructured":"K. Nigam and R. Ghani, Analyzing the effectiveness and applicability of co-training, in: International Conference on Information &. Knowledge Management, 2000.","DOI":"10.1145\/354756.354805"},{"key":"10.3233\/IDA-195061_ref25","doi-asserted-by":"crossref","unstructured":"J. Pennington, R. Socher and C. Manning, Glove: Global vectors for word representation, in: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014, pp. 1532\u20131543.","DOI":"10.3115\/v1\/D14-1162"},{"key":"10.3233\/IDA-195061_ref26","doi-asserted-by":"crossref","unstructured":"K. Radinsky and E. Horvitz, Mining the web to predict future events, in: Proceedings of the sixth ACM International Conference on Web Search and Data Mining, ACM, 2013, pp. 255\u2013264.","DOI":"10.1145\/2433396.2433431"},{"key":"10.3233\/IDA-195061_ref28","unstructured":"S. Tang, F. Wu, S. Li, W. Lu, Z. Zhang and Y. Zhuang, Sketch the storyline with charcoal: a non-parametric approach, in: Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015."},{"key":"10.3233\/IDA-195061_ref30","first-page":"3371","article-title":"Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion","volume":"11","author":"Vincent","year":"2010","journal-title":"Journal of Machine Learning Research"},{"key":"10.3233\/IDA-195061_ref31","doi-asserted-by":"crossref","first-page":"102098","DOI":"10.1016\/j.ipm.2019.102098","article-title":"Atm: adversarial-neural topic model","volume":"56","author":"Wang","year":"2019","journal-title":"Information Processing & Management"},{"key":"10.3233\/IDA-195061_ref32","unstructured":"J. Xie, R. Girshick and A. Farhadi, Unsupervised deep embedding for clustering analysis, in: International Conference on Machine Learning, 2016, pp. 478\u2013487."},{"key":"10.3233\/IDA-195061_ref33","unstructured":"R. Yan, L. Kong, C. Huang, X. Wan, X. Li and Y. Zhang, Timeline generation through evolutionary trans-temporal summarization, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2011, pp. 433\u2013443."},{"key":"10.3233\/IDA-195061_ref34","doi-asserted-by":"crossref","first-page":"1076","DOI":"10.1016\/j.neucom.2015.05.016","article-title":"Tracking news article evolution by dense subgraph learning","volume":"168","author":"Yu","year":"2015","journal-title":"Neurocomputing"},{"key":"10.3233\/IDA-195061_ref35","doi-asserted-by":"crossref","unstructured":"J. Yuan, F. Gao, Q. Ho, W. Dai, J. Wei, X. Zheng, E.P. Xing, T.-Y. Liu and W.-Y. Ma, Lightlda: Big topic models on modest computer clusters, in: Proceedings of the 24th International Conference on World Wide Web, International World Wide Web Conferences Steering Committee, 2015, pp. 1351\u20131361.","DOI":"10.1145\/2736277.2741115"},{"key":"10.3233\/IDA-195061_ref36","doi-asserted-by":"crossref","unstructured":"J. Zhang, Y. Song, C. Zhang and S. Liu, Evolutionary hierarchical dirichlet processes for multiple correlated time-varying corpora, in: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2010, pp. 1079\u20131088.","DOI":"10.1145\/1835804.1835940"},{"key":"10.3233\/IDA-195061_ref37","doi-asserted-by":"crossref","unstructured":"D. Zhou, L. Guo and Y. He, Neural storyline extraction model for storyline generation from news articles, in: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), Vol. 1, 2018, pp. 1727\u20131736.","DOI":"10.18653\/v1\/N18-1156"},{"key":"10.3233\/IDA-195061_ref38","unstructured":"D. Zhou, H. Xu, X.-Y. Dai and Y. He, Unsupervised storyline extraction from news articles, in: Proceedings of Conference on International Joint Conference on Artificial Intelligence, 2016, pp. 3014\u20133021."},{"key":"10.3233\/IDA-195061_ref39","doi-asserted-by":"crossref","unstructured":"D. Zhou, H. Xu and Y. He, An unsupervised bayesian modelling approach for storyline detection on news articles, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2015, pp. 1943\u20131948.","DOI":"10.18653\/v1\/D15-1225"}],"container-title":["Intelligent Data Analysis"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/IDA-195061","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:19:02Z","timestamp":1777454342000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/IDA-195061"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,20]]},"references-count":35,"journal-issue":{"issue":"3"},"URL":"https:\/\/doi.org\/10.3233\/ida-195061","relation":{},"ISSN":["1088-467X","1571-4128"],"issn-type":[{"value":"1088-467X","type":"print"},{"value":"1571-4128","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,4,20]]}}}