{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,8]],"date-time":"2025-09-08T06:47:56Z","timestamp":1757314076373,"version":"3.40.3"},"publisher-location":"Cham","reference-count":29,"publisher":"Springer Nature Switzerland","isbn-type":[{"type":"print","value":"9783031264375"},{"type":"electronic","value":"9783031264382"}],"license":[{"start":{"date-parts":[[2023,1,1]],"date-time":"2023-01-01T00:00:00Z","timestamp":1672531200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,2,23]],"date-time":"2023-02-23T00:00:00Z","timestamp":1677110400000},"content-version":"vor","delay-in-days":53,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Industry classification schemes provide a taxonomy for segmenting companies based on their business activities. They are relied upon in industry and academia as an integral component of many types of financial and economic analysis. However, even modern classification schemes have failed to embrace the era of big data and remain a largely subjective undertaking prone to inconsistency and misclassification. To address this, we propose a multimodal neural model for training company embeddings, which harnesses the dynamics of both historical pricing data and financial news to learn objective company representations that capture nuanced relationships. We explain our approach in detail and highlight the utility of the embeddings through several case studies and application to the downstream task of industry classification.<\/jats:p>","DOI":"10.1007\/978-3-031-26438-2_7","type":"book-chapter","created":{"date-parts":[[2023,2,22]],"date-time":"2023-02-22T06:32:56Z","timestamp":1677047576000},"page":"81-94","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["A Machine Learning Approach to\u00a0Industry Classification in\u00a0Financial Markets"],"prefix":"10.1007","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5607-9948","authenticated-orcid":false,"given":"Rian","family":"Dolphin","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0962-3362","authenticated-orcid":false,"given":"Barry","family":"Smyth","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2509-1370","authenticated-orcid":false,"given":"Ruihai","family":"Dong","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,2,23]]},"reference":[{"issue":"1","key":"7_CR1","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/srep03578","volume":"3","author":"M Alanyali","year":"2013","unstructured":"Alanyali, M., Moat, H.S., Preis, T.: Quantifying the relationship between financial news and the stock market. Sci. Rep. 3(1), 1\u20136 (2013)","journal-title":"Sci. Rep."},{"key":"7_CR2","doi-asserted-by":"crossref","unstructured":"Ang, G., Lim, E.P.: Learning knowledge-enriched company embeddings for investment management. In: Proceedings of the Second ACM ICAIF, pp. 1\u20139 (2021)","DOI":"10.1145\/3490354.3494390"},{"key":"7_CR3","doi-asserted-by":"crossref","unstructured":"Bachelier, L.: Th\u00e9orie de la sp\u00e9culation. In: Annales scientifiques de l\u2019\u00c9cole normale sup\u00e9rieure, vol. 17, pp. 21\u201386 (1900)","DOI":"10.24033\/asens.476"},{"issue":"2","key":"7_CR4","doi-asserted-by":"publisher","first-page":"283","DOI":"10.1016\/j.jfineco.2004.04.003","volume":"75","author":"N Barberis","year":"2005","unstructured":"Barberis, N., Shleifer, A., Wurgler, J.: Comovemen. J. Financ. Econ. 75(2), 283\u2013317 (2005)","journal-title":"J. Financ. Econ."},{"issue":"4","key":"7_CR5","doi-asserted-by":"publisher","first-page":"703","DOI":"10.1086\/261703","volume":"98","author":"JB De Long","year":"1990","unstructured":"De Long, J.B., Shleifer, A., Summers, L.H., Waldmann, R.J.: Noise trader risk in financial markets. J. Polit. Econ. 98(4), 703\u2013738 (1990)","journal-title":"J. Polit. Econ."},{"key":"7_CR6","unstructured":"Devlin, J., et al.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)"},{"key":"7_CR7","doi-asserted-by":"crossref","unstructured":"Ding, X., et al.: Using structured events to predict stock price movement: an empirical investigation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1415\u20131425 (2014)","DOI":"10.3115\/v1\/D14-1148"},{"key":"7_CR8","unstructured":"Dolphin, R., Smyth, B., Dong, R.: Stock embeddings: Learning distributed representations for financial assets. arXiv preprint arXiv:2202.08968 (2022)"},{"issue":"1","key":"7_CR9","doi-asserted-by":"publisher","first-page":"34","DOI":"10.1086\/294743","volume":"38","author":"EF Fama","year":"1965","unstructured":"Fama, E.F.: The behavior of stock-market prices. J. Bus. 38(1), 34\u2013105 (1965)","journal-title":"J. Bus."},{"key":"7_CR10","unstructured":"Firth, J.R.: A synopsis of linguistic theory, 1930\u20131955. Studies in linguistic analysis (1957)"},{"key":"7_CR11","unstructured":"Gopikrishnan, P., Rosenow, B., Plerou, V., Stanley, H.E.: Identifying business sectors from stock price fluctuations. arXiv preprint cond-mat\/0011145 (2000)"},{"issue":"1","key":"7_CR12","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1016\/0165-4101(94)90021-3","volume":"18","author":"DA Guenther","year":"1994","unstructured":"Guenther, D.A., Rosman, A.J.: Differences between COMPUSTAT and CRSP SIC codes and related effects on research. J. Account. Econ. 18(1), 115\u2013128 (1994)","journal-title":"J. Account. Econ."},{"key":"7_CR13","unstructured":"Ito, T., et al.: Learning company embeddings from annual reports for fine-grained industry characterization. In: Proceedings of the Second Workshop on Financial Technology and Natural Language Processing, Kyoto, Japan (2020)"},{"issue":"3","key":"7_CR14","doi-asserted-by":"publisher","first-page":"309","DOI":"10.2307\/2331394","volume":"31","author":"KM Kahle","year":"1996","unstructured":"Kahle, K.M., Walkling, R.A.: The impact of industry classifications on financial research. J. Financ. Quant. Anal. 31(3), 309\u2013335 (1996)","journal-title":"J. Financ. Quant. Anal."},{"issue":"2","key":"7_CR15","doi-asserted-by":"publisher","first-page":"406","DOI":"10.1108\/INTR-05-2020-0299","volume":"32","author":"D Kim","year":"2021","unstructured":"Kim, D., Kang, H.G., Bae, K., Jeon, S.: An artificial intelligence-enabled industry classification and its interpretation. Internet Res. 32(2), 406\u2013424 (2021)","journal-title":"Internet Res."},{"key":"7_CR16","doi-asserted-by":"crossref","unstructured":"Li, W., et al.: Modeling the stock relation with graph network for overnight stock movement prediction. In: Proceedings of the Twenty-Ninth IJCAI (2021)","DOI":"10.24963\/ijcai.2020\/626"},{"issue":"2","key":"7_CR17","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1111\/j.1540-6261.1970.tb00518.x","volume":"25","author":"BG Malkiel","year":"1970","unstructured":"Malkiel, B.G., Fama, E.F.: Efficient capital markets: a review of theory and empirical work. J. Finance 25(2), 383\u2013417 (1970)","journal-title":"J. Finance"},{"key":"7_CR18","unstructured":"Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)"},{"key":"7_CR19","unstructured":"Parker, R.: Restoring the enterprise statistics program (esp) for the 2012 economic census. Reports of the Census Bureau (2012)"},{"issue":"1","key":"7_CR20","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/08963568.2015.1110229","volume":"21","author":"RL Phillips","year":"2016","unstructured":"Phillips, R.L., Ormsby, R.: Industry classification schemes: an analysis and review. J. Bus. Finance Librarianship 21(1), 1\u201325 (2016)","journal-title":"J. Bus. Finance Librarianship"},{"issue":"3","key":"7_CR21","first-page":"61","volume":"10","author":"J Platt","year":"1999","unstructured":"Platt, J., et al.: Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Adv. Large Margin Classifiers 10(3), 61\u201374 (1999)","journal-title":"Adv. Large Margin Classifiers"},{"key":"7_CR22","unstructured":"Rong, X.: word2vec parameter learning explained. arXiv:1411.2738 (2014)"},{"key":"7_CR23","unstructured":"Sarmah, B., Nair, N., Mehta, D., Pasquali, S.: Learning embedded representation of the stock correlation matrix using graph machine learning. arXiv preprint arXiv:2207.07183 (2022)"},{"key":"7_CR24","doi-asserted-by":"crossref","unstructured":"Satone, V., Desai, D., Mehta, D.: Fund2vec: mutual funds similarity using graph learning. In: Proceedings of the Second ACM ICAIF, pp. 1\u20138 (2021)","DOI":"10.1145\/3490354.3494381"},{"key":"7_CR25","series-title":"Lecture Notes on Data Engineering and Communications Technologies","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1007\/978-3-030-38040-3_2","volume-title":"Innovative Data Communication Technologies and Application","author":"H Vachhani","year":"2020","unstructured":"Vachhani, H., et al.: Machine learning based stock market analysis: a short survey. In: Raj, J.S., Bashar, A., Ramson, S.R.J. (eds.) ICIDCA 2019. LNDECT, vol. 46, pp. 12\u201326. Springer, Cham (2020). https:\/\/doi.org\/10.1007\/978-3-030-38040-3_2"},{"issue":"1","key":"7_CR26","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-021-82338-6","volume":"11","author":"X Wan","year":"2021","unstructured":"Wan, X., et al.: Sentiment correlation in financial news networks and associated market movements. Sci. Rep. 11(1), 1\u201312 (2021)","journal-title":"Sci. Rep."},{"key":"7_CR27","doi-asserted-by":"crossref","unstructured":"Weiner, C.: The impact of industry classification schemes on financial research. Available at SSRN 871173 (2005)","DOI":"10.2139\/ssrn.871173"},{"key":"7_CR28","doi-asserted-by":"crossref","unstructured":"Wu, Qiong, e.a.: Equity2vec: end-to-end deep learning framework for cross-sectional asset pricing. In: Proceedings of the Second ACM ICAIF (2021)","DOI":"10.1145\/3490354.3494409"},{"issue":"1","key":"7_CR29","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1007\/s10462-017-9588-9","volume":"50","author":"FZ Xing","year":"2018","unstructured":"Xing, F.Z., Cambria, E., Welsch, R.E.: Natural language based financial forecasting: a survey. Artif. Intell. Rev. 50(1), 49\u201373 (2018)","journal-title":"Artif. Intell. Rev."}],"container-title":["Communications in Computer and Information Science","Artificial Intelligence and Cognitive Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-031-26438-2_7","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,22]],"date-time":"2023-02-22T06:35:12Z","timestamp":1677047712000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-031-26438-2_7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023]]},"ISBN":["9783031264375","9783031264382"],"references-count":29,"URL":"https:\/\/doi.org\/10.1007\/978-3-031-26438-2_7","relation":{},"ISSN":["1865-0929","1865-0937"],"issn-type":[{"type":"print","value":"1865-0929"},{"type":"electronic","value":"1865-0937"}],"subject":[],"published":{"date-parts":[[2023]]},"assertion":[{"value":"23 February 2023","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"AICS","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Irish Conference on Artificial Intelligence and Cognitive Science","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Munster","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Ireland","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2022","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"8 December 2022","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"9 December 2022","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"30","order":9,"name":"conference_number","label":"Conference Number","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"aics2022","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"https:\/\/aics2022.mtu.ie\/","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Single-blind","order":1,"name":"type","label":"Type","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"EasyChair","order":2,"name":"conference_management_system","label":"Conference Management System","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"102","order":3,"name":"number_of_submissions_sent_for_review","label":"Number of Submissions Sent for Review","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"41","order":4,"name":"number_of_full_papers_accepted","label":"Number of Full Papers Accepted","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"0","order":5,"name":"number_of_short_papers_accepted","label":"Number of Short Papers Accepted","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"40% - The value is computed by the equation \"Number of Full Papers Accepted \/ Number of Submissions Sent for Review * 100\" and then rounded to a whole number.","order":6,"name":"acceptance_rate_of_full_papers","label":"Acceptance Rate of Full Papers","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"3","order":7,"name":"average_number_of_reviews_per_paper","label":"Average Number of Reviews per Paper","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"3","order":8,"name":"average_number_of_papers_per_reviewer","label":"Average Number of Papers per Reviewer","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"No","order":9,"name":"external_reviewers_involved","label":"External Reviewers Involved","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}}]}}