{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,29]],"date-time":"2025-11-29T08:03:49Z","timestamp":1764403429785,"version":"3.41.0"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"17","license":[{"start":{"date-parts":[[2024,12,7]],"date-time":"2024-12-07T00:00:00Z","timestamp":1733529600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"},{"start":{"date-parts":[[2024,12,7]],"date-time":"2024-12-07T00:00:00Z","timestamp":1733529600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Neural Comput &amp; Applic"],"published-print":{"date-parts":[[2025,6]]},"DOI":"10.1007\/s00521-024-10617-0","type":"journal-article","created":{"date-parts":[[2024,12,7]],"date-time":"2024-12-07T05:23:36Z","timestamp":1733549016000},"page":"10675-10688","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Residual connections improve click-through rate and conversion rate prediction performance"],"prefix":"10.1007","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2293-2031","authenticated-orcid":false,"given":"Ergun","family":"Bi\u00e7ici","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,12,7]]},"reference":[{"key":"10617_CR1","doi-asserted-by":"publisher","unstructured":"Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR). IEEE Computer Society, Los Alamitos, CA, USA, pp 1\u20139. https:\/\/doi.org\/10.1109\/CVPR.2015.7298594","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"10617_CR2","doi-asserted-by":"publisher","unstructured":"Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: Bengio Y, LeCun Y. (eds.) 3rd international conference on learning representations, ICLR 2015, San Diego, CA, USA, 2015, conference track proceedings. https:\/\/doi.org\/10.48550\/arXiv.1409.1556","DOI":"10.48550\/arXiv.1409.1556"},{"key":"10617_CR3","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Proceedings of the 31st international conference on neural information processing systems. NIPS\u201917. Curran Associates Inc., Red Hook, NY, USA, pp 6000\u20136010."},{"key":"10617_CR4","doi-asserted-by":"publisher","unstructured":"Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, Neelakantan A, Shyam P, Sastry G, Askell A, Agarwal S, Herbert-Voss A, Krueger G, Henighan T, Child R, Ramesh A, Ziegler DM, Wu J, Winter C, Hesse C, Chen M, Sigler E, Litwin M, Gray S, Chess B, Clark J, Berner C, McCandlish S, Radford A, Sutskever I, Amodei D (2020). Language models are few-shot learners. https:\/\/doi.org\/10.48550\/arXiv.2005.14165","DOI":"10.48550\/arXiv.2005.14165"},{"key":"10617_CR5","unstructured":"Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, Neelakantan A, Shyam P, Sastry G, Askell A, Agarwal S, Herbert-Voss A, Krueger G, Henighan T, Child R, Ramesh A, Ziegler DM, Wu J, Winter C, Hesse C, Chen M, Sigler E, Litwin M, Gray S, Chess B, Clark J, Berner C, McCandlish S, Radford A, Sutskever I, Amodei D (2020) Language models are few-shot learners. In: Proceedings of the 34th international conference on neural information processing systems. NIPS\u201920. Curran Associates Inc., Red Hook, NY, USA."},{"key":"10617_CR6","doi-asserted-by":"publisher","unstructured":"He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp 770\u2013778. https:\/\/doi.org\/10.1109\/CVPR.2016.90","DOI":"10.1109\/CVPR.2016.90"},{"key":"10617_CR7","doi-asserted-by":"publisher","unstructured":"He K, Zhang X, Ren S, Sun J (2016) Identity mappings in deep residual networks. In: Leibe B, Matas J, Sebe N, Welling M (eds.) computer vision\u2013ECCV 2016. Springer, Cham, pp 630\u2013645. https:\/\/doi.org\/10.1007\/978-3-319-46493-0_38","DOI":"10.1007\/978-3-319-46493-0_38"},{"key":"10617_CR8","unstructured":"Ramesh A, Pavlov M, Goh G, Gray S, Voss C, Radford A, Chen M, Sutskever I (2021) Zero-shot text-to-image generation. In: Meila M, Zhang T (eds.) Proceedings of the 38th international conference on machine learning. Proceedings of machine learning research, PMLR, Virtual Only, vol. 139, pp 8821\u20138831."},{"key":"10617_CR9","unstructured":"Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I (2019) Language models are unsupervised multitask learners. https:\/\/www.semanticscholar.org\/paper\/Language-Models-are-Unsupervised-Multitask-Learners-Radford-Wu\/9405cc0d6169988371b2755e573cc28650d14dfe"},{"key":"10617_CR10","unstructured":"Li H, Xu Z, Taylor G, Studer C, Goldstein T (2018) Visualizing the loss landscape of neural nets. In: Bengio S, Wallach H, Larochelle H, Grauman K, Cesa-Bianchi N, Garnett R (eds.) Proceedings of the 32st international conference on neural information processing systems. NIPS\u201918, vol. 31. Curran Associates, Inc., Montreal, Canada."},{"key":"10617_CR11","unstructured":"Veit A, Wilber M, Belongie S (2016) Residual networks behave like ensembles of relatively shallow networks. In: Proceedings of the 30th international conference on neural information processing systems. NIPS\u201916. Curran Associates Inc., Red Hook, NY, USA, pp 550\u2013558."},{"key":"10617_CR12","doi-asserted-by":"publisher","unstructured":"Zagoruyko S, Komodakis N (2016) Wide residual networks. In: Wilson RC, Hancock ER, Smith WAP (eds.) Proceedings of the british machine vision conference 2016, BMVC 2016. BMVA Press, York, UK. https:\/\/doi.org\/10.48550\/arXiv.1605.07146","DOI":"10.48550\/arXiv.1605.07146"},{"key":"10617_CR13","doi-asserted-by":"publisher","unstructured":"Huang G, Liu Z, Van Der\u00a0Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp 2261\u20132269. https:\/\/doi.org\/10.1109\/CVPR.2017.243","DOI":"10.1109\/CVPR.2017.243"},{"key":"10617_CR14","doi-asserted-by":"publisher","unstructured":"Huang G, Liu S, Maaten Lvd, Weinberger KQ (2018) Condensenet: an efficient densenet using learned group convolutions. In: 2018 IEEE\/CVF conference on computer vision and pattern recognition, pp 2752\u20132761. https:\/\/doi.org\/10.1109\/CVPR.2018.00291","DOI":"10.1109\/CVPR.2018.00291"},{"key":"10617_CR15","doi-asserted-by":"publisher","unstructured":"Bachmann G, Anagnostidis S, Hofmann T (2023). Scaling MLPs: a tale of inductive bias. https:\/\/doi.org\/10.48550\/arXiv.2306.13575","DOI":"10.48550\/arXiv.2306.13575"},{"key":"10617_CR16","unstructured":"Srivastava RK, Greff K, Schmidhuber J (2015) Training very deep networks. In: Proceedings of the 28th international conference on neural information processing systems\u2013vol. 2. NIPS\u201915. MIT Press, Cambridge, MA, USA, pp 2377\u20132385."},{"key":"10617_CR17","doi-asserted-by":"publisher","unstructured":"Clevert D, Unterthiner T, Hochreiter S (2016) Fast and accurate deep network learning by exponential linear units (elus). In: Bengio, Y, LeCun Y. (eds.) 4th International Conference on Learning Representations, ICLR 2016, San Juan, Puerto Rico, May 2-4, 2016, conference track proceedings. https:\/\/doi.org\/10.48550\/arXiv.1511.07289","DOI":"10.48550\/arXiv.1511.07289"},{"key":"10617_CR18","doi-asserted-by":"publisher","unstructured":"Shah A, Kadam E, Shah H, Shinde S, Shingade S (2016) Deep residual networks with exponential linear unit. In: Proceedings of the 3rd international symposium on computer vision and the internet. VisionNet\u201916. Association for Computing Machinery, New York, NY, USA, pp 59\u201365. https:\/\/doi.org\/10.1145\/2983402.2983406","DOI":"10.1145\/2983402.2983406"},{"key":"10617_CR19","unstructured":"Nair V, Hinton GE (2010) Rectified linear units improve restricted Boltzmann machines. In: Proceedings of the 27th international conference on international conference on machine learning. ICML\u201910. Omnipress, Madison, WI, USA, pp 807\u2013814."},{"key":"10617_CR20","unstructured":"Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd international conference on international conference on machine learning\u2013vol. 37. ICML\u201915. JMLR.org, Lille, France, pp 448\u2013456"},{"key":"10617_CR21","doi-asserted-by":"crossref","unstructured":"Huang G, Sun Y, Liu Z, Sedra D, Weinberger KQ (2016) Deep networks with stochastic depth. In: Leibe B, Matas J, Sebe N, Welling M (eds.) computer vision\u2013ECCV 2016. Springer, Cham, pp 646\u2013661.","DOI":"10.1007\/978-3-319-46493-0_39"},{"key":"10617_CR22","doi-asserted-by":"publisher","unstructured":"Wang F, Jiang M, Qian C, Yang S, Li C, Zhang H, Wang X, Tang X (2017) Residual attention network for image classification. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp 6450\u20136458. https:\/\/doi.org\/10.1109\/CVPR.2017.683","DOI":"10.1109\/CVPR.2017.683"},{"key":"10617_CR23","doi-asserted-by":"crossref","unstructured":"Newell A, Yang K, Deng J (2016) Stacked hourglass networks for human pose estimation. In: Leibe B, Matas J, Sebe N, Welling M (eds.) computer vision\u2013ECCV 2016. Springer, Cham, pp 483\u2013499.","DOI":"10.1007\/978-3-319-46484-8_29"},{"issue":"6","key":"10617_CR24","doi-asserted-by":"publisher","first-page":"1303","DOI":"10.1109\/TCSVT.2017.2654543","volume":"28","author":"K Zhang","year":"2018","unstructured":"Zhang K, Sun M, Han TX, Yuan X, Guo L, Liu T (2018) Residual networks of residual networks: multilevel residual networks. IEEE Trans Circ Syst Video Technol 28(6):1303\u20131314. https:\/\/doi.org\/10.1109\/TCSVT.2017.2654543","journal-title":"IEEE Trans Circ Syst Video Technol"},{"key":"10617_CR25","doi-asserted-by":"publisher","unstructured":"Wang Z, She Q, Zhang J (2021) Masknet: Introducing feature-wise multiplication to CTR ranking models by instance-guided mask. In: DLP-KDD \u201921: Proceedings of the 3rd international workshop on deep learning practice for high-dimensional sparse data. Association for Computing Machinery, Singapore. https:\/\/doi.org\/10.48550\/arXiv.2102.07619","DOI":"10.48550\/arXiv.2102.07619"},{"key":"10617_CR26","doi-asserted-by":"publisher","unstructured":"Guo H, Tang R, Ye Y, Li Z, He X (2017) Deepfm: a factorization-machine based neural network for CTR prediction. In: Proceedings of the Twenty-Sixth international joint conference on artificial intelligence, IJCAI-17, pp 1725\u20131731. https:\/\/doi.org\/10.24963\/ijcai.2017\/239","DOI":"10.24963\/ijcai.2017\/239"},{"key":"10617_CR27","doi-asserted-by":"publisher","unstructured":"Song W, Shi C, Xiao Z, Duan Z, Xu Y, Zhang M, Tang J (2019) Autoint: automatic feature interaction learning via self-attentive neural networks. In: Proceedings of the 28th ACM international conference on information and knowledge management. CIKM \u201919. Association for Computing Machinery, New York, NY, USA, pp 1161\u20131170. https:\/\/doi.org\/10.1145\/3357384.3357925","DOI":"10.1145\/3357384.3357925"},{"key":"10617_CR28","doi-asserted-by":"publisher","unstructured":"Mao K, Zhu J, Su L, Cai G, Li Y, Dong Z (2023) FinalMLP: an enhanced two-stream MLP model for CTR prediction. In: Proceedings of the AAAI conference on artificial intelligence 37(4):4552\u20134560. https:\/\/doi.org\/10.1609\/aaai.v37i4.25577","DOI":"10.1609\/aaai.v37i4.25577"},{"key":"10617_CR29","doi-asserted-by":"publisher","unstructured":"Zhao Z, Yang S, Liu G, Feng D, Xu K (2022) FINT: field-aware interaction neural network for click-through rate prediction. In: IEEE international conference on acoustics, speech and signal processing, ICASSP 2022, 23-27 May 2022. IEEE, Virtual and Singapore, pp 3913\u20133917. https:\/\/doi.org\/10.1109\/ICASSP43922.2022.9747247","DOI":"10.1109\/ICASSP43922.2022.9747247"},{"key":"10617_CR30","doi-asserted-by":"publisher","unstructured":"Li Z, Cui Z, Wu S, Zhang X, Wang L (2019) Fi-gnn: modeling feature interactions via graph neural networks for CTR prediction. In: Zhu W, Tao D, Cheng X, Cui P, Rundensteiner EA, Carmel D, He Q, Yu JX (eds.) Proceedings of the 28th ACM international conference on information and knowledge management, CIKM 2019, November 3-7, 2019. ACM, Beijing, China, pp 539\u2013548. https:\/\/doi.org\/10.1145\/3357384.3357951","DOI":"10.1145\/3357384.3357951"},{"key":"10617_CR31","unstructured":"Kaggle (2022) Avazu dataset. https:\/\/www.kaggle.com\/competitions\/avazu-ctr-prediction\/data"},{"key":"10617_CR32","unstructured":"BARS (2022) Avazu dataset BARS X4 Split. https:\/\/github.com\/reczoo\/Datasets\/tree\/main\/Avazu\/Avazu_x4"},{"key":"10617_CR33","doi-asserted-by":"publisher","unstructured":"Zhu J, Dai Q, Su L, Ma R, Liu J, Cai G, Xiao X, Zhang R (2022) BARS: towards open benchmarking for recommender systems. In: Amig\u00f3 E, Castells P, Gonzalo J, Carterette B, Culpepper JS, Kazai G (eds.) SIGIR \u201922: The 45th international ACM SIGIR conference on research and development in information retrieval, July 11 - 15, 2022, pp 2912\u20132923. ACM, Madrid, Spain. https:\/\/doi.org\/10.1145\/3477495.3531723","DOI":"10.1145\/3477495.3531723"},{"key":"10617_CR34","unstructured":"BARS (2022) Criteo dataset BARS X1 split. https:\/\/github.com\/reczoo\/Datasets\/tree\/main\/Criteo\/Criteo_x1"},{"key":"10617_CR35","doi-asserted-by":"publisher","unstructured":"Yu F, Liu Z, Liu Q, Zhang H, Wu S, Wang L (2020) Deep interaction machine: A simple but effective model for high-order feature interactions. In: Proceedings of the 29th ACM international conference on information and knowledge management. CIKM \u201920, pp 2285\u20132288. Association for Computing Machinery, New York, NY, USA. https:\/\/doi.org\/10.1145\/3340531.3412077","DOI":"10.1145\/3340531.3412077"},{"key":"10617_CR36","unstructured":"Developers G (2023) AUC interpretation. https:\/\/developers.google.com\/machine-learning\/crash-course\/classification\/roc-and-auc"},{"key":"10617_CR37","doi-asserted-by":"publisher","unstructured":"Wang R, Fu B, Fu G, Wang M (2017) Deep and cross network for ad click predictions. In: Proceedings of the ADKDD\u201917. ADKDD\u201917. Association for Computing Machinery, New York, NY, USA. https:\/\/doi.org\/10.1145\/3124749.3124754","DOI":"10.1145\/3124749.3124754"},{"key":"10617_CR38","doi-asserted-by":"publisher","unstructured":"Zhu J, Liu J, Yang S, Zhang Q, He X (2022) BarsCTR: open benchmarking for click-through rate prediction. https:\/\/doi.org\/10.48550\/arXiv.2009.05794","DOI":"10.48550\/arXiv.2009.05794"}],"container-title":["Neural Computing and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-024-10617-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00521-024-10617-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-024-10617-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,5,31]],"date-time":"2025-05-31T09:15:21Z","timestamp":1748682921000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00521-024-10617-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,7]]},"references-count":38,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2025,6]]}},"alternative-id":["10617"],"URL":"https:\/\/doi.org\/10.1007\/s00521-024-10617-0","relation":{},"ISSN":["0941-0643","1433-3058"],"issn-type":[{"type":"print","value":"0941-0643"},{"type":"electronic","value":"1433-3058"}],"subject":[],"published":{"date-parts":[[2024,12,7]]},"assertion":[{"value":"4 February 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 October 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 December 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have no relevant financial or non-financial interests to disclose.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}