{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,20]],"date-time":"2025-12-20T22:13:41Z","timestamp":1766268821368,"version":"3.40.5"},"reference-count":62,"publisher":"Cambridge University Press (CUP)","issue":"4","license":[{"start":{"date-parts":[[2019,8,15]],"date-time":"2019-08-15T00:00:00Z","timestamp":1565827200000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2020,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Twitter and other social media platforms are often used for sharing interest in products. The identification of purchase decision stages, such as in the AIDA model (Awareness, Interest, Desire, and Action), can enable more personalized e-commerce services and a finer-grained targeting of advertisements than predicting purchase intent only. In this paper, we propose and analyze neural models for identifying the purchase stage of single tweets in a user\u2019s tweet sequence. In particular, we identify three challenges of purchase stage identification: imbalanced label distribution with a high number of non-purchase-stage instances, limited amount of training data, and domain adaptation with no or only little target domain data. Our experiments reveal that the imbalanced label distribution is the main challenge for our models. We address it with ranking loss and perform detailed investigations of the performance of our models on the different output classes. In order to improve the generalization of the models and augment the limited amount of training data, we examine the use of sentiment analysis as a complementary, secondary task in a multitask framework. For applying our models to tweets from another product domain, we consider two scenarios: for the first scenario without any labeled data in the target product domain, we show that learning domain-invariant representations with adversarial training is most promising, while for the second scenario with a small number of labeled target examples, fine-tuning the source model weights performs best. Finally, we conduct several analyses, including extracting attention weights and representative phrases for the different purchase stages. The results suggest that the model is learning features indicative of purchase stages and that the confusion errors are sensible.<\/jats:p>","DOI":"10.1017\/s1351324919000433","type":"journal-article","created":{"date-parts":[[2019,8,15]],"date-time":"2019-08-15T05:49:56Z","timestamp":1565848196000},"page":"383-411","source":"Crossref","is-referenced-by-count":2,"title":["Tackling challenges of neural purchase stage identification from imbalanced twitter data"],"prefix":"10.1017","volume":"26","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2787-0084","authenticated-orcid":false,"given":"Heike","family":"Adel","sequence":"first","affiliation":[]},{"given":"Francine","family":"Chen","sequence":"additional","affiliation":[]},{"given":"Yan-Ying","family":"Chen","sequence":"additional","affiliation":[]}],"member":"56","published-online":{"date-parts":[[2019,8,15]]},"reference":[{"key":"S1351324919000433_ref36","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939729"},{"key":"S1351324919000433_ref53","unstructured":"Theano Development Team (2016). Theano: A Python framework for fast computation of mathematical expressions. In arXiv:1605.02688."},{"key":"S1351324919000433_ref31","unstructured":"Korpusik, M. , Sakaki, S. , Chen, F. and Chen, Y. (2016). Recurrent neural networks for customer purchase prediction on Twitter. In Proceedings of the 3rd Workshop on New Trends in Content-Based Recommender Systems co-located with ACM Conference on Recommender Systems (RecSys 2016), Boston, MA, USA, pp. 47\u201350."},{"key":"S1351324919000433_ref18","first-page":"2096","article-title":"Domain-adversarial training of neural networks","volume":"17","author":"Ganin","year":"2016","journal-title":"Journal of Machine Learning Research"},{"key":"S1351324919000433_ref7","doi-asserted-by":"publisher","DOI":"10.1016\/B978-1-55860-307-3.50012-5"},{"key":"S1351324919000433_ref38","doi-asserted-by":"publisher","DOI":"10.1145\/2856767.2856800"},{"key":"S1351324919000433_ref21","first-page":"2411","volume-title":"Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing","author":"Gui","year":"2017"},{"key":"S1351324919000433_ref24","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"S1351324919000433_ref61","first-page":"1480","volume-title":"Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Yang","year":"2016"},{"key":"S1351324919000433_ref16","first-page":"626","volume-title":"Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","author":"Dos Santos","year":"2015"},{"key":"S1351324919000433_ref57","first-page":"2764","volume-title":"Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence","author":"Weston","year":"2011"},{"key":"S1351324919000433_ref1","first-page":"592","volume-title":"Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers","author":"Adel","year":"2017"},{"key":"S1351324919000433_ref17","first-page":"14","article-title":"Three natural fields of salesmanship","volume":"2","author":"Dukesmith","year":"1904","journal-title":"Salesmanship"},{"key":"S1351324919000433_ref15","doi-asserted-by":"crossref","unstructured":"Ding, X. , Liu, T. , Duan, J. and Nie, J. (2015). Mining user consumption intention from social media using domain adaptive convolutional neural network. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, TX, USA, pp. 2389\u20132395.","DOI":"10.1609\/aaai.v29i1.9529"},{"key":"S1351324919000433_ref27","unstructured":"Kharratzadeh, M. and Coates, M. (2012). Weblog analysis for predicting correlations in stock price evolutions. In Proceedings of the Sixth International Conference on Weblogs and Social Media, Dublin, Ireland."},{"key":"S1351324919000433_ref10","doi-asserted-by":"crossref","unstructured":"Chen, X. , Tan, T. , Liu, X. , Lanchantin, P. , Wan, M. , Gales, M.J.F. and Woodland, P.C. (2015). Recurrent neural network language model adaptation for multi-genre broadcast speech recognition. In INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, pp. 3511\u20133515.","DOI":"10.21437\/Interspeech.2015-696"},{"key":"S1351324919000433_ref60","doi-asserted-by":"crossref","unstructured":"Yan, Z. , Duan, N. , Chen, P. , Zhou, M. , Zhou, J. and Li, Z. (2017). Building task-oriented dialogue systems for online shopping. In Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, San Francisco, CA, USA, pp. 4618\u20134626.","DOI":"10.1609\/aaai.v31i1.11182"},{"key":"S1351324919000433_ref29","first-page":"1528","volume-title":"Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Klerke","year":"2016"},{"key":"S1351324919000433_ref20","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W15-4322"},{"key":"S1351324919000433_ref2","first-page":"22","volume-title":"Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers","author":"Adel","year":"2017"},{"key":"S1351324919000433_ref50","unstructured":"Sakaki, S. , Chen, F. , Korpusik, M. and Chen, Y. (2016). Corpus for customer purchase behavior prediction in social media. In Language Resources and Evaluation Conference, Portoro\u017e, Slovenia, pp. 2976\u20132980."},{"key":"S1351324919000433_ref62","unstructured":"Zeiler, M.D. (2012). ADADELTA: An adaptive learning rate method. In arXiv:1212.5701."},{"key":"S1351324919000433_ref11","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1179"},{"key":"S1351324919000433_ref3","doi-asserted-by":"publisher","DOI":"10.1109\/WI-IAT.2010.63"},{"key":"S1351324919000433_ref4","unstructured":"Bahdanau, D. , Cho, K. and Bengio, Y. (2015). Neural machine translation by jointly learning to align and translate. In 3rd International Conference on Learning Representations (ICLR), San Diego, CA, USA."},{"volume-title":"CEAS 2010 - Seventh annual Collaboration, Electronic messaging, Anti-Abuse and Spam Conference","year":"2010","author":"Benevenuto","key":"S1351324919000433_ref5"},{"key":"S1351324919000433_ref56","doi-asserted-by":"publisher","DOI":"10.1109\/5.58337"},{"key":"S1351324919000433_ref6","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2011.323"},{"key":"S1351324919000433_ref8","doi-asserted-by":"publisher","DOI":"10.1145\/1007730.1007733"},{"key":"S1351324919000433_ref9","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00039"},{"key":"S1351324919000433_ref43","first-page":"380","volume-title":"Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Owoputi","year":"2013"},{"key":"S1351324919000433_ref13","first-page":"2493","article-title":"Natural language processing (almost) from scratch","volume":"12","author":"Collobert","year":"2011","journal-title":"Journal of Machine Learning Research"},{"key":"S1351324919000433_ref14","first-page":"84","article-title":"Know what your customers want before they do","volume":"89","author":"Davenport","year":"2011","journal-title":"Harvard Business Review"},{"key":"S1351324919000433_ref34","unstructured":"Lebret, R. , Pinheiro, P. and Collobert, R. (2015). Phrase-based image captioning. In Proceedings of the 32th International Conference on Machine Learning, ICML, Lille, France, pp. 2085\u20132094."},{"key":"S1351324919000433_ref19","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1002"},{"key":"S1351324919000433_ref22","doi-asserted-by":"crossref","unstructured":"Gupta, V. , Varshney, D. , Jhamtani, H. , Kedia, D. and Karwa, S. 2014. Identifying purchase intent from social posts. In Proceedings of the Eighth International Conference on Weblogs and Social Media, ICWSM 2014, Ann Arbor, Michigan.","DOI":"10.1609\/icwsm.v8i1.14505"},{"key":"S1351324919000433_ref51","doi-asserted-by":"publisher","DOI":"10.1109\/78.650093"},{"key":"S1351324919000433_ref23","unstructured":"Hermann, K.M. , Kocisk\u00fd, T. , Grefenstette, E. , Espeholt, L. , Kay, W. , Suleyman, M. and Blunsom, P. (2015). Teaching machines to read and comprehend. In Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, Montreal, QC, Canada, pp. 1693\u2013701."},{"key":"S1351324919000433_ref12","unstructured":"Chung, J. , Gulcehre, C. , Cho, K. and Bengio, Y. (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. In NIPS 2014 Workshop on Deep Learning, Montreal, QC, Canada."},{"key":"S1351324919000433_ref25","doi-asserted-by":"publisher","DOI":"10.1145\/2487788.2488009"},{"key":"S1351324919000433_ref26","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-1062"},{"key":"S1351324919000433_ref28","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1181"},{"key":"S1351324919000433_ref30","doi-asserted-by":"crossref","unstructured":"Kombrink, S. , Mikolov, T. , Karafi\u00e1t, M. and Burget, L. (2011). Recurrent neural network based language modeling in meeting recognition. In INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, pp. 2877\u20132780.","DOI":"10.21437\/Interspeech.2011-720"},{"key":"S1351324919000433_ref52","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1167"},{"key":"S1351324919000433_ref32","doi-asserted-by":"publisher","DOI":"10.1109\/EDOC.2014.20"},{"key":"S1351324919000433_ref33","unstructured":"Le, Q.V. and Mikolov, T. (2014). Distributed representations of sentences and documents. In Proceedings of the 31th International Conference on Machine Learning, ICML, Beijing, China, pp. 1188\u20131196."},{"key":"S1351324919000433_ref35","first-page":"124","article-title":"Catch-line and argument","volume":"15","author":"Lewis","year":"1903","journal-title":"The Book-Keeper"},{"key":"S1351324919000433_ref37","first-page":"590","volume-title":"International Conference on Management Science and Engineering (ICMSE)","author":"Lv","year":"2014"},{"key":"S1351324919000433_ref39","unstructured":"Mikolov, T. (2012). Statistical language models based on neural networks. PhD thesis. Brno University of Technology."},{"key":"S1351324919000433_ref40","unstructured":"Mikolov, T. , Chen, K. , Corrado, G. and Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of Workshop at 1st International Conference on Learning Representations (ICLR), Scottsdale, AZ, USA."},{"key":"S1351324919000433_ref41","unstructured":"Morris, M.R. , Teevan, J. and Panovich, K. (2010). What do people ask their social networks, and why? A survey study of status message Q&A behavior. In Proceedings of the 28th International Conference on Human Factors in Computing Systems, CHI 2010, Atlanta, GA, USA, pp. 1739\u20131748."},{"key":"S1351324919000433_ref42","first-page":"1","volume-title":"Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)","author":"Nakov","year":"2016"},{"volume-title":"Proceedings of the 30th International Conference on International Conference on Machine Learning - Volume 28","year":"2013","author":"Pascanu","key":"S1351324919000433_ref44"},{"key":"S1351324919000433_ref45","first-page":"2825","article-title":"Scikit-learn: Machine learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"Journal of Machine Learning Research"},{"key":"S1351324919000433_ref46","first-page":"54","volume-title":"Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text","author":"Ramanand","year":"2010"},{"key":"S1351324919000433_ref49","first-page":"49","article-title":"How to write a sales-making letter","volume":"115","author":"Russell","year":"1921","journal-title":"Printers\u2019 Ink"},{"key":"S1351324919000433_ref47","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939778"},{"key":"S1351324919000433_ref48","unstructured":"Ruder, S. (2017). An overview of multi-task learning in deep neural networks. arXiv preprint arXiv:1706.05098."},{"key":"S1351324919000433_ref54","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.316"},{"key":"S1351324919000433_ref55","unstructured":"Vieira, A. (2015). Predicting online user behaviour using deep learning algorithms. In arXiv:1511.06247."},{"key":"S1351324919000433_ref58","unstructured":"Wijaya, B.S. (2015). The development of hierarchy of effects model in advertising. International Research Journal of Business Studies 5(1)."},{"key":"S1351324919000433_ref59","first-page":"183","volume-title":"Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)","author":"Xu","year":"2016"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324919000433","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,19]],"date-time":"2023-09-19T03:57:14Z","timestamp":1695095834000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324919000433\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,8,15]]},"references-count":62,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2020,7]]}},"alternative-id":["S1351324919000433"],"URL":"https:\/\/doi.org\/10.1017\/s1351324919000433","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"type":"print","value":"1351-3249"},{"type":"electronic","value":"1469-8110"}],"subject":[],"published":{"date-parts":[[2019,8,15]]}}}