{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T20:07:13Z","timestamp":1773950833944,"version":"3.50.1"},"reference-count":40,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2017,9,21]],"date-time":"2017-09-21T00:00:00Z","timestamp":1505952000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>In order to utilize the distributed characteristic of sensors, distributed machine learning has become the mainstream approach, but the different computing capability of sensors and network delays greatly influence the accuracy and the convergence rate of the machine learning model. Our paper describes a reasonable parameter communication optimization strategy to balance the training overhead and the communication overhead. We extend the fault tolerance of iterative-convergent machine learning algorithms and propose the Dynamic Finite Fault Tolerance (DFFT). Based on the DFFT, we implement a parameter communication optimization strategy for distributed machine learning, named Dynamic Synchronous Parallel Strategy (DSP), which uses the performance monitoring model to dynamically adjust the parameter synchronization strategy between worker nodes and the Parameter Server (PS). This strategy makes full use of the computing power of each sensor, ensures the accuracy of the machine learning model, and avoids the situation that the model training is disturbed by any tasks unrelated to the sensors.<\/jats:p>","DOI":"10.3390\/s17102172","type":"journal-article","created":{"date-parts":[[2017,9,21]],"date-time":"2017-09-21T12:17:40Z","timestamp":1505996260000},"page":"2172","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["A Parameter Communication Optimization Strategy for Distributed Machine Learning in Sensors"],"prefix":"10.3390","volume":"17","author":[{"given":"Jilin","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China"},{"name":"Key Laboratory of Complex Systems Modeling and Simulation, Ministry of Education, Hangzhou 310018, China"},{"name":"College of Electrical Engineering, Zhejiang University, Hangzhou 310058, China"},{"name":"School of Information and Electronic engineering, Zhejiang University of Science & Technology, Hangzhou 310023, China"},{"name":"Zhejiang Provincial Engineering Center on Media Data Cloud Processing and Analysis, Hangzhou 310018, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hangdi","family":"Tu","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China"},{"name":"Key Laboratory of Complex Systems Modeling and Simulation, Ministry of Education, Hangzhou 310018, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yongjian","family":"Ren","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China"},{"name":"Key Laboratory of Complex Systems Modeling and Simulation, Ministry of Education, Hangzhou 310018, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jian","family":"Wan","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China"},{"name":"Key Laboratory of Complex Systems Modeling and Simulation, Ministry of Education, Hangzhou 310018, China"},{"name":"School of Information and Electronic engineering, Zhejiang University of Science & Technology, Hangzhou 310023, China"},{"name":"Zhejiang Provincial Engineering Center on Media Data Cloud Processing and Analysis, Hangzhou 310018, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Li","family":"Zhou","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China"},{"name":"Key Laboratory of Complex Systems Modeling and Simulation, Ministry of Education, Hangzhou 310018, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mingwei","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China"},{"name":"Key Laboratory of Complex Systems Modeling and Simulation, Ministry of Education, Hangzhou 310018, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jue","family":"Wang","sequence":"additional","affiliation":[{"name":"Supercomputing Center of Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lifeng","family":"Yu","sequence":"additional","affiliation":[{"name":"Hithink RoyalFlush Information Network Co., Ltd., Hangzhou 310023, Zhejiang, China"},{"name":"Financial Information Engineering Technology Research Center of Zhejiang Province, Hangzhou 310023, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chang","family":"Zhao","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China"},{"name":"Key Laboratory of Complex Systems Modeling and Simulation, Ministry of Education, Hangzhou 310018, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lei","family":"Zhang","sequence":"additional","affiliation":[{"name":"Computer Science Department, Beijing University of Civil Engineering and Architecture, Beijing 100044, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2017,9,21]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"971","DOI":"10.1016\/j.asoc.2015.04.037","article-title":"Towards the fast and robust optimal design of wireless body area networks","volume":"37","author":"Nardin","year":"2015","journal-title":"Appl. Soft Comput."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Tsouri, G.R., Prieto, A., and Argade, N. (2012). On Increasing Network Lifetime in Body Area Networks Using Global Routing with Energy Consumption Balancing. Sensors, 12.","DOI":"10.3390\/s121013088"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Natalizio, E., Loscri, V., and Viterbo, E. (2008). Optimal placement of wireless nodes for maximizing path lifetime. IEEE Commun. Lett., 12.","DOI":"10.1109\/LCOMM.2008.080168"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2292","DOI":"10.1016\/j.comnet.2008.04.002","article-title":"Wireless sensor network survey","volume":"52","author":"Yick","year":"2008","journal-title":"Comput. Netw."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"788","DOI":"10.1109\/JSAC.2005.843547","article-title":"Near-optimal reinforcement learning framework for energy-aware sensor communications","volume":"23","author":"Pandana","year":"2005","journal-title":"IEEE J. Sel. Areas Commun."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Junejo, K.N., and Goh, J. (2016, January 30\u201330). Behaviour-Based Attack Detection and Classification in Cyber Physical Systems Using Machine Learning. Proceedings of the ACM International Workshop on Cyber-Physical System Security, Xi\u2019an, China.","DOI":"10.1145\/2899015.2899016"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Van Norden, W., de Jong, J., Bolderheij, F., and Rothkrantz, L. (2005, January 25\u201328). Intelligent Task Scheduling in Sensor Networks. Proceedings of the 2005 8th International Conference on Information Fusion, Philadelphia, PA, USA.","DOI":"10.1109\/ICIF.2005.1592013"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"668","DOI":"10.1109\/TPDS.2014.2315204","article-title":"BURSE: A bursty and self-similar workload generator for cloud computing","volume":"26","author":"Yin","year":"2015","journal-title":"IEEE Trans. Parall Distrib. Syst."},{"key":"ref_9","unstructured":"Di, M., and Joo, E.M. (2007, January 10\u201313). A Survey of Machine Learning in Wireless Sensor Netoworks from Networking and Application Perspectives. Proceedings of the 2007 6th International Conference on Information, Communications & Signal Processing, Singapore."},{"key":"ref_10","unstructured":"McColl, W.F. (1995). Bulk synchronous parallel computing. Abstract Machine Models for Highly Parallel Computers, Oxford University Press."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1006\/jpdc.1994.1085","article-title":"Direct bulk-synchronous parallel algorithms","volume":"22","author":"Gerbessiotis","year":"1994","journal-title":"J. Parallel Distrib. Commun."},{"key":"ref_12","unstructured":"Dean, J., Corrado, G., Monga, R., Chen, K., Devin, M., Mao, M., Senior, A., Tucker, P., Yang, K., and Le, Q.V. (2012). Large Scale Distributed Deep Networks. Advances in Neural Information Processing Systems, Neural Information Processing Systems Foundation, Inc."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Ahmed, A., Aly, M., Gonzalez, J., Narayanamurthy, S., and Smola, A. (2012, January 8\u201312). Scalable Inference in Latent Variable Models. Proceedings of the International conference on Web search and data mining (WSDM), Seattle, DC, USA.","DOI":"10.1145\/2124295.2124312"},{"key":"ref_14","unstructured":"Chilimbi, T., Suzue, Y., Apacible, J., and Kalyanaraman, K. (2014, January 6\u20138). Project Adam: Building an Efficient and Scalable Deep Learning Training System. Proceedings of the Usenix Conference on Operating Systems Design and Implementation, Broomfield, CO, USA."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Cui, H., Tumanov, A., Wei, J., Xu, L., Dai, W., Haber-Kucharsky, J., Ho, Q., Ganger, G.R., Gibbons, P.B., and Gibson, G.A. (2014, January 3\u20135). Exploiting Iterative-Ness for Parallel ML Computations. Proceedings of the ACM Symposium on Cloud Computing, Seattle, WA, USA.","DOI":"10.1145\/2670979.2670984"},{"key":"ref_16","unstructured":"Ho, Q., Cipar, J., Cui, H., Lee, S., Kim, J.K., Gibbons, P.B., Gibson, G.A., Ganger, G., and Xing, E.P. (2013). More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server, Neural Information Processing Systems Foundation, Inc."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1016\/J.ENG.2016.02.008","article-title":"Strategies and principles of distributed machine learning on big data","volume":"2","author":"Xing","year":"2016","journal-title":"Engineering"},{"key":"ref_18","unstructured":"Chen, T., Li, M., Li, Y., Lin, M., Wang, N., Wang, M., Xiao, T., Xu, B., Zhang, C., and Zhang, Z. (arXiv, 2015). Mxnet: A flexible and efficient machine learning library for heterogeneous distributed systems, arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., and Darrell, T. (2014, January 3\u20137). Caffe: Convolutional Architecture for Fast Feature Embedding. Proceedings of the 22nd ACM international conference on Multimedia, Orlando, FL, USA.","DOI":"10.1145\/2647868.2654889"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going Deeper with Convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_21","unstructured":"Coates, A., Huval, B., Wang, T., Wu, D.J., Ng, A.Y., and Catanzaro, B. (2013, January 23\u201328). Deep Learning with COTS HPC Systems. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Sparks, E.R., Talwalkar, A., Smith, V., Kottalam, J., Pan, X., Gonzalez, J., Franklin, M.J., Jordan, M.I., and Kraska, T. (2013, January 7\u201310). MLI: An API for Distributed Machine Learning. Proceedings of the 2013 IEEE 13th International Conference on Data Mining (ICDM), Dallas, TX, USA.","DOI":"10.1109\/ICDM.2013.158"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1016\/j.ins.2013.12.007","article-title":"Colbar: A collaborative location-based regularization framework for QoS prediction","volume":"265","author":"Yin","year":"2014","journal-title":"Inf. Sci."},{"key":"ref_24","unstructured":"Yu, K. (October, January 27). Large-Scale Deep Learning at Baidu. Proceedings of the ACM International Conference on Information & Knowledge Management, San Francisco, CA, USA."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1772","DOI":"10.14778\/2733004.2733082","article-title":"Mariana: Tencent deep learning platform and its applications","volume":"7","author":"Zou","year":"2014","journal-title":"Proc. VLDB Endow."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Le, Q.V. (2013, January 26\u201331). Building High-Level Features Using Large Scale Unsupervised Learning. Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, BC, Canada.","DOI":"10.1109\/ICASSP.2013.6639343"},{"key":"ref_27","unstructured":"Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G.S., Davis, A., Dean, J., and Devin, M. (arXiv, 2016). Tensorflow: Large-scale machine learning on heterogeneous distributed systems, arXiv."},{"key":"ref_28","unstructured":"Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., and Isard, M. (2016, January 2\u20134). TensorFlow: A System for Large-Scale Machine Learning. Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Savannah, GA, USA."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"703","DOI":"10.14778\/1920841.1920931","article-title":"An architecture for parallel topic models","volume":"3","author":"Smola","year":"2010","journal-title":"Proc. VLDB Endow."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Li, M., Andersen, D.G., Park, J.W., Smola, A.J., Ahmed, A., Josifovski, V., Long, J., Shekita, E.J., and Su, B.Y. (2014, January 6\u20138). Scaling Distributed Machine Learning with the Parameter Server. Proceedings of the Usenix Conference on Operating Systems Design and Implementation, Broomfield, CO, USA.","DOI":"10.1145\/2640087.2644155"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Li, M., Andersen, D.G., Smola, A., and Yu, K. (2014, January 8\u201313). Communication Efficient Distributed Machine Learning with the Parameter Server. Proceedings of the International Conference on Neural Information Processing Systems, Montreal, QC, Canada.","DOI":"10.1145\/2640087.2644155"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Wei, J., Dai, W., Qiao, A., Ho, Q., Cui, H., Ganger, G.R., Gibbons, P.B., Gibson, G.A., and Xing, E.P. (2015, January 27\u201329). Managed Communication and Consistency for Fast Data-Parallel Iterative Analytics. Proceedings of the Proceedings of the Sixth ACM Symposium on Cloud Computing, Kohala Coast, HI, USA.","DOI":"10.1145\/2806777.2806778"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Sra, S., Nowozin, S., and Wright, S.J. (2012). Optimization for Machine Learning, MIT Press.","DOI":"10.7551\/mitpress\/8996.001.0001"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1287\/opre.1030.0065","article-title":"The price of robustness","volume":"52","author":"Bertsimas","year":"2004","journal-title":"Oper. Res."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1007\/978-3-642-30850-5_7","article-title":"New Results about Multi-Band Uncertainty in Robust Optimization","volume":"Volume 7276","year":"2012","journal-title":"Experimental Algorithms, Proceedings of the 11th International Symposium, SEA 2012, Bordeaux, France, 7\u20139 June 2012"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"178","DOI":"10.1109\/MCOM.2014.6736760","article-title":"Network planning under demand uncertainty with robust optimization","volume":"52","author":"Bauschert","year":"2014","journal-title":"IEEE Commun. Mag."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Gemulla, R., Nijkamp, E., Haas, P.J., and Sismanis, Y. (2011, January 21\u201324). Large-Scale Matrix Factorization with Distributed Stochastic Gradient Descent. Proceedings of the 17th ACM SIGKDD international conference on Knowledge Discovery and Data Mining, San Diego, CA, USA.","DOI":"10.1145\/2020408.2020426"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Dai, W., Kumar, A., Wei, J., Ho, Q., Gibson, G., and Xing, E.P. (2015, January 25\u201330). High-Performance Distributed ML at Scale through Parameter Server Consistency Models. Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, TX, USA.","DOI":"10.1609\/aaai.v29i1.9195"},{"key":"ref_39","unstructured":"Li, M., Zhou, L., Yang, Z., Li, A., Xia, F., Andersen, D.G., and Smola, A. (2013, January 9). Parameter Server for Distributed Machine Learning. Proceedings of the Big Learning NIPS Workshop, Lake Tahoe, NV, USA."},{"key":"ref_40","unstructured":"Cun, Y.L., Boser, B., Denker, J.S., Howard, R.E., Habbard, W., Jackel, L.D., and Henderson, D. (1990). Handwritten Digit Recognition with a Back-Propagation Network. Advances in Neural Information Processing Systems, Neural Information Processing Systems Foundation, Inc."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/17\/10\/2172\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T18:45:34Z","timestamp":1760208334000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/17\/10\/2172"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,9,21]]},"references-count":40,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2017,10]]}},"alternative-id":["s17102172"],"URL":"https:\/\/doi.org\/10.3390\/s17102172","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,9,21]]}}}