{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,10]],"date-time":"2026-05-10T00:23:31Z","timestamp":1778372611627,"version":"3.51.4"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2025,12,24]],"date-time":"2025-12-24T00:00:00Z","timestamp":1766534400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"},{"start":{"date-parts":[[2025,12,24]],"date-time":"2025-12-24T00:00:00Z","timestamp":1766534400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"}],"funder":[{"DOI":"10.13039\/100020409","name":"Analytical Center for the Government of the Russian Federation","doi-asserted-by":"publisher","award":["000000C313925P4B0002"],"award-info":[{"award-number":["000000C313925P4B0002"]}],"id":[{"id":"10.13039\/100020409","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Optim Theory Appl"],"published-print":{"date-parts":[[2026,2]]},"DOI":"10.1007\/s10957-025-02893-0","type":"journal-article","created":{"date-parts":[[2025,12,24]],"date-time":"2025-12-24T05:27:36Z","timestamp":1766554056000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Accelerated Methods with Compression for Horizontal and Vertical Federated Learning"],"prefix":"10.1007","volume":"208","author":[{"given":"Sergey","family":"Stanko","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Timur","family":"Karimullin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-0635-6960","authenticated-orcid":false,"given":"Aleksandr","family":"Beznosikov","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexander","family":"Gasnikov","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,12,24]]},"reference":[{"key":"2893_CR1","unstructured":"Alacaoglu, A., Malitsky, Y.: Stochastic variance reduction for variational inequality methods. In: P.L. Loh, M.\u00a0Raginsky (eds.) Proceedings of Thirty Fifth Conference on Learning Theory, vol. 178, pp. 778\u2013816. PMLR (2022)"},{"key":"2893_CR2","unstructured":"Alistarh, D., Grubic, D., Li, J., Tomioka, R., Vojnovic, M.: QSGD: Communication-efficient SGD via gradient quantization and encoding. In: I.\u00a0Guyon, U.V. Luxburg, S.\u00a0Bengio, H.\u00a0Wallach, R.\u00a0Fergus, S.\u00a0Vishwanathan, R.\u00a0Garnett (eds.) Advances in Neural Information Processing Systems, vol.\u00a030. Curran Associates, Inc. (2017)"},{"issue":"221","key":"2893_CR3","first-page":"1","volume":"18","author":"Z Allen-Zhu","year":"2018","unstructured":"Allen-Zhu, Z.: Katyusha: The first direct acceleration of stochastic gradient methods. J. Mach. Learn. Res. 18(221), 1\u201351 (2018)","journal-title":"J. Mach. Learn. Res."},{"issue":"276","key":"2893_CR4","first-page":"1","volume":"24","author":"A Beznosikov","year":"2023","unstructured":"Beznosikov, A., Horv\u00e1th, S., Richt\u00e1rik, P., Safaryan, M.: On biased compression for distributed learning. J. Mach. Learn. Res. 24(276), 1\u201350 (2023)","journal-title":"J. Mach. Learn. Res."},{"key":"2893_CR5","doi-asserted-by":"crossref","unstructured":"Beznosikov, A., Richtarik, P., Diskin, M., Ryabinin, M., Gasnikov, A.: Distributed methods with compressed communication for solving variational inequalities, with theoretical guarantees. In: S.\u00a0Koyejo, S.\u00a0Mohamed, A.\u00a0Agarwal, D.\u00a0Belgrave, K.\u00a0Cho, A.\u00a0Oh (eds.) Advances in Neural Information Processing Systems, vol.\u00a035, pp. 14,013\u201314,029. Curran Associates, Inc. (2022)","DOI":"10.52202\/068431-1019"},{"key":"2893_CR6","doi-asserted-by":"crossref","unstructured":"Beznosikov, A., Tak\u00e1\u010d, M., Gasnikov, A.: Similarity, compression and local steps: Three pillars of efficient communications for distributed variational inequalities. In: A.\u00a0Oh, T.\u00a0Naumann, A.\u00a0Globerson, K.\u00a0Saenko, M.\u00a0Hardt, S.\u00a0Levine (eds.) Advances in Neural Information Processing Systems, vol.\u00a036, pp. 28,663\u201328,677. Curran Associates, Inc. (2023)","DOI":"10.52202\/075280-1246"},{"issue":"6","key":"2893_CR7","doi-asserted-by":"publisher","first-page":"752","DOI":"10.1109\/TBDATA.2022.3192898","volume":"10","author":"D Cai","year":"2024","unstructured":"Cai, D., Fan, T., Kang, Y., Fan, L., Xu, M., Wang, S., Yang, Q.: Accelerating vertical federated learning. IEEE Transactions on Big Data 10(6), 752\u2013760 (2024). https:\/\/doi.org\/10.1109\/TBDATA.2022.3192898","journal-title":"IEEE Transactions on Big Data"},{"key":"2893_CR8","unstructured":"Castiglia, T.J., Das, A., Wang, S., Patterson, S.: Compressed-VFL: Communication-efficient learning with vertically partitioned data. In: K.\u00a0Chaudhuri, S.\u00a0Jegelka, L.\u00a0Song, C.\u00a0Szepesvari, G.\u00a0Niu, S.\u00a0Sabato (eds.) Proceedings of the 39th International Conference on Machine Learning, vol. 162, pp. 2738\u20132766. PMLR (2022)"},{"issue":"13","key":"2893_CR9","doi-asserted-by":"publisher","first-page":"1749","DOI":"10.1002\/cpe.1206","volume":"19","author":"E Chan","year":"2007","unstructured":"Chan, E., Heimlich, M., Purkayastha, A., Van De Geijn, R.: Collective communication: theory, practice, and experience. Concurrency and Computation: Practice and Experience 19(13), 1749\u20131783 (2007)","journal-title":"Concurrency and Computation: Practice and Experience"},{"key":"2893_CR10","doi-asserted-by":"crossref","unstructured":"Chang, C.C., Lin, C.J.: LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2 (2007)","DOI":"10.1145\/1961189.1961199"},{"key":"2893_CR11","unstructured":"Chilimbi, T., Suzue, Y., Apacible, J., Kalyanaraman, K.: Project adam: Building an efficient and scalable deep learning training system. In: 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI 14), pp. 571\u2013582. USENIX Association (2014)"},{"key":"2893_CR12","doi-asserted-by":"crossref","unstructured":"Fan, T., Chen, W., Ma, G., Kang, Y., Fan, L., Yang, Q.: Secureboost+: Large scale and high-performance vertical federated gradient boosting decision tree. In: D.N. Yang, X.\u00a0Xie, V.S. Tseng, J.\u00a0Pei, J.W. Huang, J.C.W. Lin (eds.) Advances in Knowledge Discovery and Data Mining, pp. 237\u2013249. Springer Nature Singapore (2024)","DOI":"10.1007\/978-981-97-2259-4_18"},{"key":"2893_CR13","unstructured":"Gorbunov, E.: Distributed and stochastic optimization methods with gradient compression and local steps. CoRR arXiv:2112.10645 (2021)"},{"key":"2893_CR14","unstructured":"Gorbunov, E., Burlachenko, K., Li, Z., Richt\u00e1rik, P.: MARINA: faster non-convex distributed learning with compression. In: M.\u00a0Meila, T.\u00a0Zhang (eds.) Proceedings of the 38th International Conference on Machine Learning, vol. 139, pp. 3788\u20133798. PMLR (2021)"},{"key":"2893_CR15","unstructured":"Goyal, P., Doll\u00e1r, P., Girshick, R.B., Noordhuis, P., Wesolowski, L., Kyrola, A., Tulloch, A., Jia, Y., He, K.: Accurate, large minibatch SGD: training imagenet in 1 hour. CoRR arXiv:1706.02677 (2017)"},{"issue":"11","key":"2893_CR16","doi-asserted-by":"publisher","first-page":"6103","DOI":"10.1109\/TNNLS.2021.3072238","volume":"33","author":"B Gu","year":"2022","unstructured":"Gu, B., Xu, A., Huo, Z., Deng, C., Huang, H.: Privacy-preserving asynchronous vertical federated learning algorithms for multiparty collaborative learning. IEEE Transactions on Neural Networks and Learning Systems 33(11), 6103\u20136115 (2022). https:\/\/doi.org\/10.1109\/TNNLS.2021.3072238","journal-title":"IEEE Transactions on Neural Networks and Learning Systems"},{"key":"2893_CR17","doi-asserted-by":"crossref","unstructured":"He, Y., Huang, X., Yuan, K.: Unbiased compression saves communication in distributed optimization: When and how much? In: A.\u00a0Oh, T.\u00a0Naumann, A.\u00a0Globerson, K.\u00a0Saenko, M.\u00a0Hardt, S.\u00a0Levine (eds.) Advances in Neural Information Processing Systems, vol.\u00a036, pp. 47,991\u201348,020. Curran Associates, Inc. (2023)","DOI":"10.52202\/075280-2081"},{"issue":"1","key":"2893_CR18","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1080\/10556788.2022.2117355","volume":"38","author":"S Horv\u00e1th","year":"2023","unstructured":"Horv\u00e1th, S., Kovalev, D., Mishchenko, K., Richt\u00e1rik, P., Stich, S.: Stochastic distributed learning with gradient quantization and double-variance reduction. Optimization Methods and Software 38(1), 91\u2013106 (2023). https:\/\/doi.org\/10.1080\/10556788.2022.2117355","journal-title":"Optimization Methods and Software"},{"key":"2893_CR19","doi-asserted-by":"crossref","unstructured":"Huang, L., Li, Z., Sun, J., Zhao, H.: Coresets for vertical federated learning: Regularized linear regression and $$k$$-means clustering. In: S.\u00a0Koyejo, S.\u00a0Mohamed, A.\u00a0Agarwal, D.\u00a0Belgrave, K.\u00a0Cho, A.\u00a0Oh (eds.) Advances in Neural Information Processing Systems, vol.\u00a035 (2022)","DOI":"10.52202\/068431-2144"},{"key":"2893_CR20","unstructured":"Johnson, R., Zhang, T.: Accelerating stochastic gradient descent using predictive variance reduction. In: C.\u00a0Burges, L.\u00a0Bottou, M.\u00a0Welling, Z.\u00a0Ghahramani, K.\u00a0Weinberger (eds.) Advances in Neural Information Processing Systems, vol.\u00a026. Curran Associates, Inc. (2013)"},{"key":"2893_CR21","unstructured":"Kone\u010dn\u00fd, J., McMahan, H.B., Yu, F.X., Richt\u00e1rik, P., Suresh, A.T., Bacon, D.: Federated learning: Strategies for improving communication efficiency. CoRR arXiv:1610.05492 (2016)"},{"key":"2893_CR22","unstructured":"Kovalev, D., Horv\u00e1th, S., Richt\u00e1rik, P.: Don\u2019t jump through hoops and remove those loops: SVRG and Katyusha are better without the outer loop. In: A.\u00a0Kontorovich, G.\u00a0Neu (eds.) Proceedings of the 31st International Conference on Algorithmic Learning Theory, vol. 117, pp. 451\u2013467. PMLR (2020)"},{"key":"2893_CR23","unstructured":"Li, Z., Bao, H., Zhang, X., Richtarik, P.: PAGE: A simple and optimal probabilistic gradient estimator for nonconvex optimization. In: M.\u00a0Meila, T.\u00a0Zhang (eds.) Proceedings of the 38th International Conference on Machine Learning, vol. 139, pp. 6286\u20136295. PMLR (2021)"},{"key":"2893_CR24","unstructured":"Li, Z., Kovalev, D., Qian, X., Richtarik, P.: Acceleration for compressed gradient descent in distributed and federated optimization. In: H.D. III, A.\u00a0Singh (eds.) Proceedings of the 37th International Conference on Machine Learning, vol. 119, pp. 5895\u20135904. PMLR (2020)"},{"key":"2893_CR25","unstructured":"Mishchenko, K., Gorbunov, E., Tak\u00e1c, M., Richt\u00e1rik, P.: Distributed learning with compressed gradient differences. CoRR arXiv:1901.09269 (2019)"},{"key":"2893_CR26","unstructured":"Nesterov, Y.: A method for solving the convex programming problem with convergence rate $$O(\\frac{1}{k^2})$$. In: Dokl Akad Nauk SSSR, vol. 269, p. 543 (1983)"},{"issue":"2","key":"2893_CR27","doi-asserted-by":"publisher","first-page":"341","DOI":"10.1137\/100802001","volume":"22","author":"Y Nesterov","year":"2012","unstructured":"Nesterov, Y.: Efficiency of coordinate descent methods on huge-scale optimization problems. SIAM Journal on Optimization 22(2), 341\u2013362 (2012). https:\/\/doi.org\/10.1137\/100802001","journal-title":"SIAM Journal on Optimization"},{"key":"2893_CR28","unstructured":"Nguyen, L.M., Liu, J., Scheinberg, K., Tak\u00e1\u010d, M.: Sarah: A novel method for machine learning problems using stochastic recursive gradient. In: International Conference on Machine Learning, pp. 2613\u20132621. PMLR (2017)"},{"issue":"5","key":"2893_CR29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/0041-5553(64)90137-5","volume":"4","author":"BT Polyak","year":"1964","unstructured":"Polyak, B.T.: Some methods of speeding up the convergence of iteration methods. USSR Comput. Math. Math. Phys. 4(5), 1\u201317 (1964)","journal-title":"USSR Comput. Math. Math. Phys."},{"key":"2893_CR30","unstructured":"Richtarik, P., Sokolov, I., Fatkhullin, I.: EF21: A new, simpler, theoretically better, and practically faster error feedback. In: M.\u00a0Ranzato, A.\u00a0Beygelzimer, Y.\u00a0Dauphin, P.\u00a0Liang, J.W. Vaughan (eds.) Advances in Neural Information Processing Systems, vol.\u00a034, pp. 4384\u20134396. Curran Associates, Inc. (2021)"},{"issue":"75","key":"2893_CR31","first-page":"1","volume":"17","author":"P Richt\u00e1rik","year":"2016","unstructured":"Richt\u00e1rik, P., Tak\u00e1\u010d, M.: Distributed coordinate descent method for learning with big data. J. Mach. Learn. Res. 17(75), 1\u201325 (2016)","journal-title":"J. Mach. Learn. Res."},{"issue":"230","key":"2893_CR32","first-page":"1","volume":"18","author":"V Smith","year":"2018","unstructured":"Smith, V., Forte, S., Ma, C., Tak\u00e1\u010d, M., Jordan, M.I., Jaggi, M.: CoCoA: A general framework for communication-efficient distributed optimization. J. Mach. Learn. Res. 18(230), 1\u201349 (2018)","journal-title":"J. Mach. Learn. Res."},{"key":"2893_CR33","unstructured":"Stich, S.U.: Unified optimal analysis of the (stochastic) gradient method. CoRR arXiv:1907.04232 (2019)"},{"key":"2893_CR34","doi-asserted-by":"publisher","unstructured":"Sun, J., Xu, Z., Yang, D., Nath, V., Li, W., Zhao, C., Xu, D., Chen, Y., Roth, H.R.: Communication-efficient vertical federated learning with limited overlapping samples. In: 2023 IEEE\/CVF International Conference on Computer Vision (ICCV), pp. 5180\u20135189 (2023). https:\/\/doi.org\/10.1109\/ICCV51070.2023.00480","DOI":"10.1109\/ICCV51070.2023.00480"},{"key":"2893_CR35","unstructured":"Szlendak, R., Tyurin, A., Richt\u00e1rik, P.: Permutation compressors for provably faster distributed nonconvex optimization. In: International Conference on Learning Representations (2022)"},{"issue":"2","key":"2893_CR36","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3377454","volume":"53","author":"J Verbraeken","year":"2020","unstructured":"Verbraeken, J., Wolting, M., Katzy, J., Kloppenburg, J., Verbelen, T., Rellermeyer, J.S.: A survey on distributed machine learning. ACM Comput. Surv. 53(2), 1\u201333 (2020)","journal-title":"ACM Comput. Surv."},{"key":"2893_CR37","unstructured":"Xu, W., Fan, H., Li, K., Yang, K.: Efficient batch homomorphic encryption for vertically federated XGBoost. CoRR arXiv:2112.04261 (2021)"},{"key":"2893_CR38","doi-asserted-by":"publisher","unstructured":"Zhang, Q., Gu, B., Deng, C., Gu, S., Bo, L., Pei, J., Huang, H.: AsySQN: Faster vertical federated learning algorithms with better computation resource utilization. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, p. 3917\u20133927. Association for Computing Machinery (2021). https:\/\/doi.org\/10.1145\/3447548.3467169","DOI":"10.1145\/3447548.3467169"}],"container-title":["Journal of Optimization Theory and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10957-025-02893-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10957-025-02893-0","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10957-025-02893-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,21]],"date-time":"2026-04-21T04:53:05Z","timestamp":1776747185000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10957-025-02893-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,24]]},"references-count":38,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2026,2]]}},"alternative-id":["2893"],"URL":"https:\/\/doi.org\/10.1007\/s10957-025-02893-0","relation":{},"ISSN":["0022-3239","1573-2878"],"issn-type":[{"value":"0022-3239","type":"print"},{"value":"1573-2878","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,12,24]]},"assertion":[{"value":"6 January 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 November 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 December 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"68"}}