{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T05:12:02Z","timestamp":1755839522999,"version":"3.40.3"},"publisher-location":"Cham","reference-count":14,"publisher":"Springer International Publishing","isbn-type":[{"type":"print","value":"9783031304415"},{"type":"electronic","value":"9783031304422"}],"license":[{"start":{"date-parts":[[2023,1,1]],"date-time":"2023-01-01T00:00:00Z","timestamp":1672531200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,4,28]],"date-time":"2023-04-28T00:00:00Z","timestamp":1682640000000},"content-version":"vor","delay-in-days":117,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This paper studies the utility of using data analytics and machine learning techniques for identifying, classifying, and characterizing the dynamics of large-scale parallel (MPI) programs. To this end, we run microbenchmarks and realistic proxy applications with the regular compute-communicate structure on two different supercomputing platforms and choose the per-process performance and MPI time per time step as relevant observables. Using principal component analysis, clustering techniques, correlation functions, and a new \u201cphase space plot,\u201d we show how desynchronization patterns (or lack thereof) can be readily identified from a data set that is much smaller than a full MPI trace. Our methods also lead the way towards a more general classification of parallel program dynamics.<\/jats:p>","DOI":"10.1007\/978-3-031-30442-2_12","type":"book-chapter","created":{"date-parts":[[2023,4,27]],"date-time":"2023-04-27T10:02:09Z","timestamp":1682589729000},"page":"155-170","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Exploring Techniques for\u00a0the\u00a0Analysis of\u00a0Spontaneous Asynchronicity in\u00a0MPI-Parallel Applications"],"prefix":"10.1007","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5061-0438","authenticated-orcid":false,"given":"Ayesha","family":"Afzal","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8723-2781","authenticated-orcid":false,"given":"Georg","family":"Hager","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7371-3026","authenticated-orcid":false,"given":"Gerhard","family":"Wellein","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0639-0639","authenticated-orcid":false,"given":"Stefano","family":"Markidis","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,4,28]]},"reference":[{"key":"12_CR1","doi-asserted-by":"publisher","unstructured":"Afzal, A., Hager, G., Wellein, G.: Propagation and decay of injected one-off delays on clusters: a case study. In: 2019 IEEE International Conference on Cluster Computing, CLUSTER 2019, Albuquerque, NM, USA, 23\u201326 September 2019, pp. 1\u201310 (2019). https:\/\/doi.org\/10.1109\/CLUSTER.2019.8890995","DOI":"10.1109\/CLUSTER.2019.8890995"},{"key":"12_CR2","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1007\/978-3-030-50743-5_20","volume-title":"High Performance Computing","author":"A Afzal","year":"2020","unstructured":"Afzal, A., Hager, G., Wellein, G.: Desynchronization and wave pattern formation in MPI-parallel and hybrid memory-bound programs. In: Sadayappan, P., Chamberlain, B.L., Juckeland, G., Ltaief, H. (eds.) ISC High Performance 2020. LNCS, vol. 12151, pp. 391\u2013411. Springer, Cham (2020). https:\/\/doi.org\/10.1007\/978-3-030-50743-5_20"},{"key":"12_CR3","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"351","DOI":"10.1007\/978-3-030-78713-4_19","volume-title":"High Performance Computing","author":"A Afzal","year":"2021","unstructured":"Afzal, A., Hager, G., Wellein, G.: Analytic modeling of idle waves in parallel programs: communication, cluster topology, and noise impact. In: Chamberlain, B.L., Varbanescu, A.-L., Ltaief, H., Luszczek, P. (eds.) ISC High Performance 2021. LNCS, vol. 12728, pp. 351\u2013371. Springer, Cham (2021). https:\/\/doi.org\/10.1007\/978-3-030-78713-4_19"},{"key":"12_CR4","doi-asserted-by":"publisher","unstructured":"Afzal, A., Hager, G., Wellein, G.: The role of idle waves, desynchronization, and bottleneck evasion in the performance of parallel programs. IEEE Trans. Parallel Distrib. Syst. TPDS (2022). https:\/\/doi.org\/10.1109\/TPDS.2022.3221085","DOI":"10.1109\/TPDS.2022.3221085"},{"key":"12_CR5","doi-asserted-by":"publisher","unstructured":"Afzal, A., Wellein, G., Hager, G.: Addressing white-box modeling and simulation challenges in parallel computing. In: Proceedings of the 2022 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, SIGSIM-PADS 2022, pp. 25\u201326. Association for Computing Machinery, New York (2022). https:\/\/doi.org\/10.1145\/3518997.3534986","DOI":"10.1145\/3518997.3534986"},{"key":"12_CR6","doi-asserted-by":"publisher","unstructured":"Bhatnagar, P.L., Gross, E.P., Krook, M.: A model for collision processes in gases. I. Small amplitude processes in charged and neutral one-component systems. Phys. Rev. 94(3), 511\u2013525 (1954). https:\/\/doi.org\/10.1103\/PhysRev.94.511","DOI":"10.1103\/PhysRev.94.511"},{"issue":"16","key":"12_CR7","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.69.165115","volume":"69","author":"H Fehske","year":"2004","unstructured":"Fehske, H., Wellein, G., Hager, G., Wei\u00dfe, A., Bishop, A.: Quantum lattice dynamical effects on single-particle excitations in one-dimensional Mott and Peierls insulators. Phys. Rev. B 69(16), 165115 (2004). https:\/\/doi.org\/10.1103\/PhysRevB.69.165115","journal-title":"Phys. Rev. B"},{"issue":"2065","key":"12_CR8","doi-asserted-by":"publisher","first-page":"20150202","DOI":"10.1098\/rsta.2015.0202","volume":"374","author":"IT Jolliffe","year":"2016","unstructured":"Jolliffe, I.T., Cadima, J.: Principal component analysis: a review and recent developments. Philos. Trans. Roy. Soc. A Math. Phys. Eng. Sci. 374(2065), 20150202 (2016). https:\/\/doi.org\/10.1098\/rsta.2015.0202","journal-title":"Philos. Trans. Roy. Soc. A Math. Phys. Eng. Sci."},{"key":"12_CR9","doi-asserted-by":"publisher","unstructured":"Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, Hoboken (2009). https:\/\/doi.org\/10.1002\/9780470316801","DOI":"10.1002\/9780470316801"},{"issue":"1","key":"12_CR10","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.91.013306","volume":"91","author":"S Markidis","year":"2015","unstructured":"Markidis, S., Vencels, J., Peng, I.B., Akhmetova, D., Laure, E., Henri, P.: Idle waves in high-performance computing. Phys. Rev. E 91(1), 013306 (2015). https:\/\/doi.org\/10.1103\/PhysRevE.91.013306","journal-title":"Phys. Rev. E"},{"key":"12_CR11","unstructured":"McCalpin, J.D., et al.: Memory bandwidth and machine balance in current high performance computers. IEEE Comput. Soc. Tech. Committee Comput. Archit. (TCCA) Newsl. 2(19\u201325) (1995)"},{"key":"12_CR12","doi-asserted-by":"crossref","unstructured":"Qian, Y.H., d\u2019Humi\u00e8res, D., Lallemand, P.: Lattice BGK models for Navier-Stokes equation. Europhys. Lett. (EPL) 17(6), 479\u2013484 (1992)","DOI":"10.1209\/0295-5075\/17\/6\/001"},{"key":"12_CR13","unstructured":"Vassilvitskii, S., Arthur, D.: k-means++: the advantages of careful seeding. In: Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 1027\u20131035 (2006). https:\/\/dl.acm.org\/doi\/10.5555\/1283383.1283494"},{"key":"12_CR14","unstructured":"Vetterling, W.T., et al.: Numerical Recipes: Example book C. Cambridge University Press, Cambridge (1992)"}],"container-title":["Lecture Notes in Computer Science","Parallel Processing and Applied Mathematics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-031-30442-2_12","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,4,27]],"date-time":"2023-04-27T10:05:28Z","timestamp":1682589928000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-031-30442-2_12"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023]]},"ISBN":["9783031304415","9783031304422"],"references-count":14,"URL":"https:\/\/doi.org\/10.1007\/978-3-031-30442-2_12","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"type":"print","value":"0302-9743"},{"type":"electronic","value":"1611-3349"}],"subject":[],"published":{"date-parts":[[2023]]},"assertion":[{"value":"28 April 2023","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"PPAM","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"International Conference on Parallel Processing and Applied Mathematics","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Gdansk","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Poland","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2022","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"11 September 2022","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"14 September 2022","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"14","order":9,"name":"conference_number","label":"Conference Number","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"ppam2022","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"https:\/\/ppam.edu.pl\/","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Single-blind","order":1,"name":"type","label":"Type","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"Easychair","order":2,"name":"conference_management_system","label":"Conference Management System","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"132","order":3,"name":"number_of_submissions_sent_for_review","label":"Number of Submissions Sent for Review","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"77","order":4,"name":"number_of_full_papers_accepted","label":"Number of Full Papers Accepted","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"0","order":5,"name":"number_of_short_papers_accepted","label":"Number of Short Papers Accepted","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"58% - The value is computed by the equation \"Number of Full Papers Accepted \/ Number of Submissions Sent for Review * 100\" and then rounded to a whole number.","order":6,"name":"acceptance_rate_of_full_papers","label":"Acceptance Rate of Full Papers","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"3","order":7,"name":"average_number_of_reviews_per_paper","label":"Average Number of Reviews per Paper","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"2","order":8,"name":"average_number_of_papers_per_reviewer","label":"Average Number of Papers per Reviewer","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"No","order":9,"name":"external_reviewers_involved","label":"External Reviewers Involved","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}}]}}