{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:44:56Z","timestamp":1760147096865,"version":"build-2065373602"},"reference-count":35,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2023,1,6]],"date-time":"2023-01-06T00:00:00Z","timestamp":1672963200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Informatics"],"abstract":"<jats:p>In the last decade, the techniques of news aggregation and summarization have been increasingly gaining relevance for providing users on the web with condensed and unbiased information. Indeed, the recent development of successful machine learning algorithms, such as those based on the transformers architecture, have made it possible to create effective tools for capturing and elaborating news from the Internet. In this regard, this work proposes, for the first time in the literature to the best of the authors\u2019 knowledge, a methodology for the application of such techniques in news related to cryptocurrencies and the blockchain, whose quick reading can be deemed as extremely useful to operators in the financial sector. Specifically, cutting-edge solutions in the field of natural language processing were employed to cluster news by topic and summarize the corresponding articles published by different newspapers. The results achieved on 22,282 news articles show the effectiveness of the proposed methodology in most of the cases, with 86.8% of the examined summaries being considered as coherent and 95.7% of the corresponding articles correctly aggregated. This methodology was implemented in a freely accessible web application.<\/jats:p>","DOI":"10.3390\/informatics10010005","type":"journal-article","created":{"date-parts":[[2023,1,6]],"date-time":"2023-01-06T03:31:28Z","timestamp":1672975888000},"page":"5","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Cryptoblend: An AI-Powered Tool for Aggregation and Summarization of Cryptocurrency News"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9808-5123","authenticated-orcid":false,"given":"Andrea","family":"Pozzi","sequence":"first","affiliation":[{"name":"Department of Mathematics and Physics, Catholic University of the Sacred Heart, Via della Garzetta 48, 25133 Brescia, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1466-0248","authenticated-orcid":false,"given":"Enrico","family":"Barbierato","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Physics, Catholic University of the Sacred Heart, Via della Garzetta 48, 25133 Brescia, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9668-6961","authenticated-orcid":false,"given":"Daniele","family":"Toti","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Physics, Catholic University of the Sacred Heart, Via della Garzetta 48, 25133 Brescia, Italy"}]}],"member":"1968","published-online":{"date-parts":[[2023,1,6]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Sethi, P., Sonawane, S., Khanwalker, S., and Keskar, R. (2017, January 20\u201322). Automatic text summarization of news articles. Proceedings of the 2017 International Conference on Big Data, IoT and Data Science (BID), Pune, India.","DOI":"10.1109\/BID.2017.8336568"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Saggion, H., and Poibeau, T. (2013). Automatic text summarization: Past, present and future. Multi-Source, Multilingual Information Extraction and Summarization, Springer.","DOI":"10.1007\/978-3-642-28569-1_1"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Hamborg, F., Meuschke, N., and Gipp, B. (2017, January 19\u201323). Matrix-based news aggregation: Exploring different news perspectives. Proceedings of the 2017 ACM\/IEEE Joint Conference on Digital Libraries (JCDL), Toronto, ON, Canada.","DOI":"10.1109\/JCDL.2017.7991561"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1007\/s00799-018-0261-y","article-title":"Automated identification of media bias in news articles: An interdisciplinary literature review","volume":"20","author":"Hamborg","year":"2019","journal-title":"Int. J. Digit. Libr."},{"key":"ref_5","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems 30 (NIPS 2017), Curran Associates, Inc."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"60","DOI":"10.14505\/\/jasf.v9.2(18).03","article-title":"Crypto Currency and its Susceptibility to Speculative Bubbles, Manipulation, Scams and Fraud","volume":"9","author":"Barnes","year":"2018","journal-title":"J. Adv. Stud. Financ."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Bratulescu, R.A., Vatasoiu, R.I., Mitroi, S.A., Suciu, G., Sachian, M.A., Dutu, D.M., and Calescu, S.E. (2022, January 23\u201326). Fraudulent Activities in the Cyber Realm: DEFRAUDify Project: Fraudulent Activities in the Cyber Realm: DEFRAUDify Project. Proceedings of the 17th International Conference on Availability, Reliability and Security, ARES \u201922, Vienna, Austria.","DOI":"10.1145\/3538969.3544434"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Sureshbhai, P.N., Bhattacharya, P., and Tanwar, S. (2020, January 7\u201311). KaRuNa: A Blockchain-Based Sentiment Analysis Framework for Fraud Cryptocurrency Schemes. Proceedings of the 2020 IEEE International Conference on Communications Workshops (ICC Workshops), Dublin, Ireland.","DOI":"10.1109\/ICCWorkshops49005.2020.9145151"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Sawhney, R., Agarwal, S., Mittal, V., Rosso, P., Nanda, V., and Chava, S. (2022). Cryptocurrency Bubble Detection: A New Stock Market Dataset, Financial Task and Hyperbolic Models. arXiv.","DOI":"10.18653\/v1\/2022.naacl-main.405"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"860","DOI":"10.1177\/19401612211009160","article-title":"Avenues to news and diverse news exposure online: Comparing direct navigation, social media, news aggregators, search queries, and article hyperlinks","volume":"27","author":"Wojcieszak","year":"2022","journal-title":"Int. J. Press."},{"key":"ref_11","unstructured":"Liotsiou, D., Kollanyi, B., and Howard, P.N. (2019). The Junk News Aggregator: Examining junk news posted on Facebook, starting with the 2018 US Midterm Elections. arXiv."},{"key":"ref_12","unstructured":"Hong, K., Conroy, J., Favre, B., Kulesza, A., Lin, H., and Nenkova, A. (2014, January 26\u201331). A repository of state of the art and competitive baseline summaries for generic news summarization. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC\u201914), Reykjavik, Iceland."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Fabbri, A.R., Li, I., She, T., Li, S., and Radev, D.R. (2019). Multi-News: A Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model. arXiv.","DOI":"10.18653\/v1\/P19-1102"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Varab, D., and Schluter, N. (2021, January 7\u201311). MassiveSumm: A very large-scale, very multilingual, news summarisation dataset. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Online, Punta Cana, Dominican Republic.","DOI":"10.18653\/v1\/2021.emnlp-main.797"},{"key":"ref_15","unstructured":"He, J., Kry\u015bci\u0144ski, W., McCann, B., Rajani, N., and Xiong, C. (2020). CTRLsum: Towards Generic Controllable Text Summarization. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Lewis, M., Liu, Y., Goyal, N., Ghazvininejad, M., Mohamed, A., Levy, O., Stoyanov, V., and Zettlemoyer, L. (2019). BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. arXiv.","DOI":"10.18653\/v1\/2020.acl-main.703"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Gupta, A., Chugh, D., and Katarya, R. (2022). Automated news summarization using transformers. Sustainable Advanced Computing, Springer.","DOI":"10.1007\/978-981-16-9012-9_21"},{"key":"ref_18","first-page":"636","article-title":"News Aggregator and Efficient Summarization System","volume":"11","author":"Mohamed","year":"2020","journal-title":"Int. J. Adv. Comput. Sci. Appl."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Balcerzak, B., Jaworski, W., and Wierzbicki, A. (2014, January 11\u201314). Application of TextRank Algorithm for Credibility Assessment. Proceedings of the 2014 IEEE\/WIC\/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), Warsaw, Poland.","DOI":"10.1109\/WI-IAT.2014.70"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Gadi, M.F.A., and Sicilia, M.\u00c1. Cryptocurrency Curated News Event Database From GDELT 2022.","DOI":"10.21203\/rs.3.rs-2145757\/v1"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"101462","DOI":"10.1016\/j.irfa.2020.101462","article-title":"News sentiment in the cryptocurrency market: An empirical comparison with Forex","volume":"69","author":"Rognone","year":"2020","journal-title":"Int. Rev. Financ. Anal."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1162\/tacl_a_00373","article-title":"SummEval: Re-evaluating Summarization Evaluation","volume":"9","author":"Fabbri","year":"2021","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_23","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv."},{"key":"ref_24","unstructured":"M\u00fcllner, D. (2011). Modern hierarchical, agglomerative clustering algorithms. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Carbonell, J., and Goldstein, J. (1998, January 24\u201328). The use of MMR, diversity-based reranking for reordering documents and producing summaries. Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia.","DOI":"10.1145\/290941.291025"},{"key":"ref_26","unstructured":"Richardson, L. (2018, July 07). Beautiful Soup Documentation. Available online: Https:\/\/www. crummy. com\/software\/BeautifulSoup\/bs4\/doc\/."},{"key":"ref_27","unstructured":"Banker, K., Garrett, D., Bakkum, P., and Verch, S. (2016). MongoDB in Action: Covers MongoDB Version 3.0, Simon and Schuster."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Reimers, N., and Gurevych, I. (2019). Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv.","DOI":"10.18653\/v1\/D19-1410"},{"key":"ref_29","first-page":"2825","article-title":"Scikit-learn: Machine Learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J. Mach. Learn. Res."},{"key":"ref_30","unstructured":"Lie, H.W., and Bos, B. (1997). Cascading Style Sheets: Designing for the Web, Addison-Wesley Longman Publishing Co., Inc."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Guha, A., Saftoiu, C., and Krishnamurthi, S. (2010). The essence of JavaScript. Proceedings of the European Conference on Object-Oriented Programming, Springer.","DOI":"10.1007\/978-3-642-14107-2_7"},{"key":"ref_32","unstructured":"Spurlock, J. (2013). Bootstrap: Responsive Web Development, O\u2019Reilly Media, Inc."},{"key":"ref_33","unstructured":"Vasiliev, Y. (2020). Natural Language Processing with Python and SpaCy: A Practical Introduction, No Starch Press."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Heimerl, F., Lohmann, S., Lange, S., and Ertl, T. (2014, January 6\u20139). Word cloud explorer: Text analytics based on word clouds. Proceedings of the 2014 47th Hawaii International Conference on System Sciences, Waikoloa, HI, USA.","DOI":"10.1109\/HICSS.2014.231"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Allahyari, M., Pouriyeh, S., Assefi, M., Safaei, S., Trippe, E.D., Gutierrez, J.B., and Kochut, K. (2017). Text summarization techniques: A brief survey. arXiv.","DOI":"10.14569\/IJACSA.2017.081052"}],"container-title":["Informatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2227-9709\/10\/1\/5\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T18:01:09Z","timestamp":1760119269000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2227-9709\/10\/1\/5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,6]]},"references-count":35,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,3]]}},"alternative-id":["informatics10010005"],"URL":"https:\/\/doi.org\/10.3390\/informatics10010005","relation":{},"ISSN":["2227-9709"],"issn-type":[{"type":"electronic","value":"2227-9709"}],"subject":[],"published":{"date-parts":[[2023,1,6]]}}}