{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,17]],"date-time":"2025-12-17T08:31:21Z","timestamp":1765960281874,"version":"3.41.0"},"reference-count":54,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2024,1,19]],"date-time":"2024-01-19T00:00:00Z","timestamp":1705622400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62272167"],"award-info":[{"award-number":["62272167"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2024,3,31]]},"abstract":"<jats:p>Collecting sufficient microarchitecture performance data is essential for performance evaluation and workload characterization. There are many events to be monitored in a modern processor while only a few hardware performance monitoring counters (PMCs) can be used, so multiplexing is commonly adopted. However, inefficiency commonly exists in state-of-the-art profiling tools when grouping events for multiplexing PMCs. It has the risk of inaccurate measurement and misleading analysis. Commercial tools can leverage PMCs, but they are closed source and only support their specified platforms. To this end, we propose an approach for efficient cross-platform microarchitecture performance measurement via adaptive grouping, aiming to improve the metrics\u2019 sampling ratios. The approach generates event groups based on the number of available PMCs detected on arbitrary machines while avoiding the scheduling pitfall of Linux perf_event subsystem. We evaluate our approach with SPEC CPU 2017 on four mainstream x86-64 and AArch64 processors and conduct comparative analyses of efficiency with two other state-of-the-art tools, LIKWID and ARM Top-down Tool. The experimental results indicate that our approach gains around 50% improvement in the average sampling ratio of metrics without compromising the correctness and reliability.<\/jats:p>","DOI":"10.1145\/3629525","type":"journal-article","created":{"date-parts":[[2023,10,21]],"date-time":"2023-10-21T19:20:37Z","timestamp":1697916037000},"page":"1-26","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Efficient Cross-platform Multiplexing of Hardware Performance Counters via Adaptive Grouping"],"prefix":"10.1145","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7795-4553","authenticated-orcid":false,"given":"Tong-Yu","family":"Liu","sequence":"first","affiliation":[{"name":"East China Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5787-6781","authenticated-orcid":false,"given":"Jianmei","family":"Guo","sequence":"additional","affiliation":[{"name":"East China Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5126-7192","authenticated-orcid":false,"given":"Bo","family":"Huang","sequence":"additional","affiliation":[{"name":"East China Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,1,19]]},"reference":[{"key":"e_1_3_2_2_2","unstructured":"Advanced Micro Devices Inc.2023. AMD64 Architecture Programmer\u2019s Manual Volumes 1-5. Retrieved June 18 2023 from https:\/\/www.amd.com\/en\/support\/tech-docs\/amd64-architecture-programmers-manual-volumes-1-5"},{"key":"e_1_3_2_3_2","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1145\/1088149.1088163","volume-title":"Proceedings of the 19th Annual International Conference on Supercomputing","author":"Azimi Reza","year":"2005","unstructured":"Reza Azimi, Michael Stumm, and Robert W. Wisniewski. 2005. Online performance analysis by statistical sampling of microprocessor performance counters. In Proceedings of the 19th Annual International Conference on Supercomputing. Association for Computing Machinery, New York, NY, 101\u2013110. DOI:10.1145\/1088149.1088163"},{"key":"e_1_3_2_4_2","volume-title":"Performance Analysis and Tuning on Modern CPUs","author":"Bakhvalov Denis","year":"2020","unstructured":"Denis Bakhvalov. 2020. Performance Analysis and Tuning on Modern CPUs. Retrieved fromhttps:\/\/faculty.cs.niu.edu\/winans\/notes\/patmc.pdf"},{"key":"e_1_3_2_5_2","doi-asserted-by":"crossref","first-page":"832","DOI":"10.1145\/3445814.3446739","volume-title":"Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems","author":"Banerjee Subho S.","year":"2021","unstructured":"Subho S. Banerjee, Saurabh Jha, Zbigniew Kalbarczyk, and Ravishankar K. Iyer. 2021. BayesPerf: Minimizing performance monitoring errors using Bayesian statistics. In Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems. Association for Computing Machinery, New York, NY, 832\u2013844. DOI:10.1145\/3445814.3446739"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1177\/109434200001400303"},{"key":"e_1_3_2_7_2","first-page":"1","volume-title":"Proceedings of the IEEE 35th International Performance Computing and Communications Conference (IPCCC\u201916)","author":"Chadha Mohak","year":"2016","unstructured":"Mohak Chadha, Abhishek Srivastava, and Santonu Sarkar. 2016. Unified power and energy measurement API for HPC co-processors. In Proceedings of the IEEE 35th International Performance Computing and Communications Conference (IPCCC\u201916). IEEE Computer Society, Los Alamitos, CA, 1\u20138. DOI:10.1109\/PCCC.2016.7820633"},{"key":"e_1_3_2_8_2","first-page":"42","volume-title":"Proceedings of the 8th Annual IEEE\/ACM International Symposium on Code Generation and Optimization (CGO\u201910)","author":"Chen Dehao","year":"2010","unstructured":"Dehao Chen, Neil Vachharajani, Robert Hundt, Shih-wei Liao, Vinodha Ramasamy, Paul Yuan, Wenguang Chen, and Weimin Zheng. 2010. Taming hardware event samples for FDO compilation. In Proceedings of the 8th Annual IEEE\/ACM International Symposium on Code Generation and Optimization (CGO\u201910). Association for Computing Machinery, New York, NY, 42\u201352. DOI:10.1145\/1772954.1772963"},{"key":"e_1_3_2_9_2","unstructured":"Intel Corporation. 2019. EMON User Guide. Retrieved May 31 2023 from https:\/\/www.intel.com\/content\/dam\/develop\/external\/us\/en\/documents\/emon-users-guide.pdf"},{"key":"e_1_3_2_10_2","unstructured":"Intel Corporation. 2023. Intel 64 and IA-32 Architectures Software Developer Manuals. Retrieved May 31 2023 from https:\/\/www.intel.cn\/content\/www\/cn\/zh\/developer\/articles\/technical\/intel-sdm.html"},{"key":"e_1_3_2_11_2","unstructured":"Intel Corporation. 2023. Intel VTune Profiler Performance Analysis Cookbook. Retrieved August 5 2023 from https:\/\/www.intel.com\/content\/www\/us\/en\/docs\/vtune-profiler\/cookbook\/2023-1\/overview.html"},{"key":"e_1_3_2_12_2","unstructured":"Intel Corporation. 2023. Intel VTune Profiler User Guide. Retrieved May 31 2023 from https:\/\/www.intel.com\/content\/www\/us\/en\/docs\/vtune-profiler\/user-guide\/2023-1\/overview.html"},{"key":"e_1_3_2_13_2","unstructured":"Intel Corporation. 2023. PerfMon Events. Retrieved May 31 2023 from https:\/\/perfmon-events.intel.com"},{"key":"e_1_3_2_14_2","unstructured":"Standard Performance Evaluation Corporation. 2022. SPEC CPU 2017. Retrieved May 31 2023 from https:\/\/www.spec.org\/cpu2017\/"},{"key":"e_1_3_2_15_2","first-page":"20","volume-title":"Proceedings of the IEEE Symposium on Security and Privacy (SP\u201919)","author":"Das Sanjeev","year":"2019","unstructured":"Sanjeev Das, Jan Werner, Manos Antonakakis, Michalis Polychronakis, and Fabian Monrose. 2019. SoK: The challenges, pitfalls, and perils of using hardware performance counters for security. In Proceedings of the IEEE Symposium on Security and Privacy (SP\u201919). IEEE Computer Society, Los Alamitos, CA, 20\u201338. DOI:10.1109\/SP.2019.00021"},{"key":"e_1_3_2_16_2","first-page":"396","volume-title":"Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC\u201916)","author":"Dimakopoulou Maria","year":"2016","unstructured":"Maria Dimakopoulou, St\u00e9phane Eranian, Nectarios Koziris, and Nicholas Bambos. 2016. Reliable and efficient performance monitoring in linux. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC\u201916). IEEE Computer Society, Los Alamitos, CA, 396\u2013408. DOI:10.1109\/SC.2016.33"},{"key":"e_1_3_2_17_2","first-page":"269","volume-title":"Proceedings of the Ottawa Linux Symposium","author":"Eranian St\u00e9phane","year":"2006","unstructured":"St\u00e9phane Eranian. 2006. Perfmon2: A flexible performance monitoring interface for linux. In Proceedings of the Ottawa Linux Symposium. Citeseer, USA, 269\u2013288."},{"key":"e_1_3_2_18_2","first-page":"37","volume-title":"Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XVII)","author":"Ferdman Michael","year":"2012","unstructured":"Michael Ferdman, Almutaz Adileh, Onur Kocberber, Stavros Volos, Mohammad Alisafaee, Djordje Jevdjic, Cansu Kaynak, Adrian Daniel Popescu, Anastasia Ailamaki, and Babak Falsafi. 2012. Clearing the clouds: A study of emerging scale-out workloads on modern hardware. In Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XVII). Association for Computing Machinery, New York, NY, 37\u201348. DOI:10.1145\/2150976.2150982"},{"key":"e_1_3_2_19_2","unstructured":"Eric Gouriou Tipp Moseley and Willem de Bruijn. 2022. Tutorial\u2014Perf Wiki. Retrieved May 31 2023 from https:\/\/perf.wiki.kernel.org\/index.php\/Tutorial"},{"key":"e_1_3_2_20_2","volume-title":"Systems Performance: Enterprise and the Cloud (2nd ed.)","author":"Gregg Brendan","year":"2020","unstructured":"Brendan Gregg. 2020. Systems Performance: Enterprise and the Cloud (2nd ed.). Addison-Wesley, Boston, MA."},{"key":"e_1_3_2_21_2","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1145\/1094811.1094834","volume-title":"Proceedings of the 20th Annual ACM SIGPLAN Conference on Object-oriented Programming, Systems, Languages, and Applications","author":"Hauswirth Matthias","year":"2005","unstructured":"Matthias Hauswirth, Amer Diwan, Peter F. Sweeney, and Michael C. Mozer. 2005. Automating vertical profiling. In Proceedings of the 20th Annual ACM SIGPLAN Conference on Object-oriented Programming, Systems, Languages, and Applications. Association for Computing Machinery, New York, NY, 281\u2013296. DOI:10.1145\/1094811.1094834"},{"key":"e_1_3_2_22_2","first-page":"1341","volume-title":"Proceedings of the IEEE 26th International Parallel and Distributed Processing Symposium","author":"He Lulu","year":"2012","unstructured":"Lulu He, Zhibin Yu, and Hai Jin. 2012. FractalMRC: Online cache miss rate curve prediction on commodity systems. In Proceedings of the IEEE 26th International Parallel and Distributed Processing Symposium. IEEE Computer Society, Los Alamitos, CA, 1341\u20131351. DOI:10.1109\/IPDPS.2012.121"},{"key":"e_1_3_2_23_2","volume-title":"Computer Architecture: A Quantitative Approach","author":"Hennessy John L.","year":"2017","unstructured":"John L. Hennessy and David A. Patterson. 2017. Computer Architecture: A Quantitative Approach. (6th ed.) Morgan Kaufmann, San Francisco, CA."},{"key":"e_1_3_2_24_2","volume-title":"USENIX Workshop on Cool Topics on Sustainable Data Centers (CoolDC\u201916)","author":"Hirki Mikael","year":"2016","unstructured":"Mikael Hirki, Zhonghong Ou, Kashif Nizam Khan, Jukka K. Nurminen, and Tapio Niemi. 2016. Empirical study of the power consumption of the x86-64 instruction decoder. In USENIX Workshop on Cool Topics on Sustainable Data Centers (CoolDC\u201916). USENIX Association, Berkeley, CA. https:\/\/www.usenix.org\/conference\/cooldc16\/workshop-program\/presentation\/hirki"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cose.2018.08.010"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/2007477.1952692"},{"key":"e_1_3_2_27_2","unstructured":"John Levon. 2020. Oprofile. Retrieved May 31 2023 from https:\/\/oprofile.sourceforge.io"},{"key":"e_1_3_2_28_2","first-page":"101","volume-title":"Proceedings of Ottawa Linux Symposium","author":"Lim Robert V.","year":"2014","unstructured":"Robert V. Lim, David Carrillo-Cisneros, Wail Y. Alkowaileet, and Isaac D. Scherson. 2014. Computationally efficient multiplexing of events on hardware counters. In Proceedings of Ottawa Linux Symposium. Citeseer, USA, 101\u2013110."},{"key":"e_1_3_2_29_2","first-page":"149","volume-title":"Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS\u201918)","author":"Limaye Ankur","year":"2018","unstructured":"Ankur Limaye and Tosiron Adegbija. 2018. A workload characterization of the SPEC CPU2017 benchmark suite. In Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS\u201918). IEEE Computer Society, Los Alamitos, CA, 149\u2013158. DOI:10.1109\/ISPASS.2018.00028"},{"key":"e_1_3_2_30_2","unstructured":"Arm Limited. 2022. Arm Architecture Reference Manual for A-profile Architecture. Retrieved May 31 2023 from https:\/\/developer.arm.com\/documentation\/ddi0487\/ia"},{"key":"e_1_3_2_31_2","unstructured":"Arm Limited. 2022. Arm Neoverse N1 PMU Guide. Retrieved May 21 2023 from https:\/\/developer.arm.com\/documentation\/PJDOC-466751330-547673\/r4p1"},{"key":"e_1_3_2_32_2","unstructured":"Arm Limited. 2023. Arm Neoverse N1 Software Optimization Guide. Retrieved May 31 2023 from https:\/\/developer.arm.com\/documentation\/PJDOC-466751330-9707\/r4p1"},{"key":"e_1_3_2_33_2","first-page":"613","volume-title":"Proceedings of the 51st Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201918)","author":"Lv Yirong","year":"2018","unstructured":"Yirong Lv, Bin Sun, Qinyi Luo, Jing Wang, Zhibin Yu, and Xuehai Qian. 2018. CounterMiner: Mining big performance data from hardware counters. In Proceedings of the 51st Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201918). IEEE Computer Society, Los Alamitos, CA, 613\u2013626. DOI:10.1109\/MICRO.2018.00056"},{"key":"e_1_3_2_34_2","first-page":"23","volume-title":"Proceedings of the 13th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems","author":"Mathur Wiplove","year":"2005","unstructured":"Wiplove Mathur and Jeanine Cook. 2005. Improved estimation for software multiplexing of performance counters. In Proceedings of the 13th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems. IEEE Computer Society, Los Alamitos, CA, 23\u201332. DOI:10.1109\/MASCOTS.2005.34"},{"key":"e_1_3_2_35_2","volume-title":"Proceedings of the 15th International Parallel and Distributed Processing Symposium (IPDPS\u201901)","author":"May John M.","year":"2001","unstructured":"John M. May. 2001. MPX: Software for multiplexing hardware performance counters in multithreaded programs. In Proceedings of the 15th International Parallel and Distributed Processing Symposium (IPDPS\u201901). IEEE Computer Society, Los Alamitos, CA. DOI:10.1109\/IPDPS.2001.924955"},{"key":"e_1_3_2_36_2","first-page":"254","volume-title":"Proceedings of the 7th Annual IEEE\/ACM International Symposium on Code Generation and Optimization (CGO\u201909)","author":"Moseley Tipp","year":"2009","unstructured":"Tipp Moseley, Dirk Grunwald, and Ramesh Peri. 2009. OptiScope: Performance accountability for optimizing compilers. In Proceedings of the 7th Annual IEEE\/ACM International Symposium on Code Generation and Optimization (CGO\u201909). IEEE Computer Society, CA, 254\u2013264. DOI:10.1109\/CGO.2009.26"},{"key":"e_1_3_2_37_2","unstructured":"Jumana Mundichipparakkal. 2023. Telemetry Solution\u2014Topdown Tool. Retrieved May 31 2023 from https:\/\/gitlab.arm.com\/telemetry-solution\/telemetry-solution\/-\/tree\/main\/tools\/topdown_tool"},{"key":"e_1_3_2_38_2","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1109\/MICRO.2007.27","volume-title":"Proceedings of the 40th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201907)","author":"Mytkowicz Todd","year":"2007","unstructured":"Todd Mytkowicz, Peter F. Sweeney, Matthias Hauswirth, and Amer Diwan. 2007. Time interpolation: So many metrics, so few registers. In Proceedings of the 40th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201907). IEEE Computer Society, Los Alamitos, CA, 286\u2013300. DOI:10.1109\/MICRO.2007.27"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1145\/3148054"},{"key":"e_1_3_2_40_2","first-page":"271","volume-title":"Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA\u201918)","author":"Panda Reena","year":"2018","unstructured":"Reena Panda, Shuang Song, Joseph Dean, and Lizy K. John. 2018. Wait of a decade: Did spec CPU 2017 broaden the performance horizon? In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA\u201918). IEEE Computer Society, Los Alamitos, CA, 271\u2013282. DOI:10.1109\/HPCA.2018.00032"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2010.68"},{"key":"e_1_3_2_42_2","first-page":"121","volume-title":"Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XIV)","author":"Tam David K.","year":"2009","unstructured":"David K. Tam, Reza Azimi, Livio B. Soares, and Michael Stumm. 2009. RapidMRC: Approximating L2 miss rate curves on commodity systems for online optimizations. In Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XIV). Association for Computing Machinery, New York, NY, 121\u2013132. DOI:10.1145\/1508244.1508259"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-11261-4_11"},{"key":"e_1_3_2_44_2","first-page":"207","volume-title":"Proceedings of the 39th International Conference on Parallel Processing Workshops","author":"Treibig Jan","year":"2010","unstructured":"Jan Treibig, Georg Hager, and Gerhard Wellein. 2010. LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments. In Proceedings of the 39th International Conference on Parallel Processing Workshops. IEEE Computer Society, Los Alamitos, CA, 207\u2013216. DOI:10.1109\/ICPPW.2010.38"},{"key":"e_1_3_2_45_2","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1145\/3431379.3460646","volume-title":"Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing (HPDC\u201921)","author":"Wang Bin","year":"2021","unstructured":"Bin Wang, Ahmed Ali-Eldin, and Prashant Shenoy. 2021. LaSS: Running latency sensitive serverless computations at the edge. In Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing (HPDC\u201921). Association for Computing Machinery, New York, NY, 239\u2013251. DOI:10.1145\/3431379.3460646"},{"key":"e_1_3_2_46_2","first-page":"488","volume-title":"Proceedings of the IEEE 20th International Symposium on High Performance Computer Architecture (HPCA\u201914)","author":"Wang Lei","year":"2014","unstructured":"Lei Wang, Jianfeng Zhan, Chunjie Luo, Yuqing Zhu, Qiang Yang, Yongqiang He, Wanling Gao, Zhen Jia, Yingjie Shi, Shujie Zhang, Chen Zheng, Gang Lu, Kent Zhan, Xiaona Li, and Bizhu Qiu. 2014. BigDataBench: A big data benchmark suite from internet services. In Proceedings of the IEEE 20th International Symposium on High Performance Computer Architecture (HPCA\u201914). IEEE Computer Society, Los Alamitos, CA, 488\u2013499. DOI:10.1109\/HPCA.2014.6835958"},{"key":"e_1_3_2_47_2","first-page":"47","volume-title":"Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER\u201920)","author":"Wang Yi-Chao","year":"2020","unstructured":"Yi-Chao Wang, Jie Wang, Jin-Kun Chen, Si-Cheng Zuo, Xiao-Ming Su, and James Lin. 2020. NeoMPX: Characterizing and improving estimation of multiplexing hardware counters for PAPI. In Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER\u201920). IEEE Computer Society, Los Alamitos, CA, 47\u201356. DOI:10.1109\/CLUSTER49012.2020.00015"},{"key":"e_1_3_2_48_2","volume-title":"Proceedings of the 2nd International Workshop on Performance Analysis of Workload Optimized Systems, FastPath","volume":"13","author":"Weaver Vincent M.","year":"2013","unstructured":"Vincent M. Weaver. 2013. Linux perf_event features and overhead. In Proceedings of the 2nd International Workshop on Performance Analysis of Workload Optimized Systems, FastPath, Vol. 13. IEEE Computer Society, Los Alamitos, CA."},{"key":"e_1_3_2_49_2","first-page":"102","volume-title":"Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS\u201915)","author":"Weaver Vincent M.","year":"2015","unstructured":"Vincent M. Weaver. 2015. Self-monitoring overhead of the linux perf_event performance counter interface. In Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS\u201915). IEEE Computer Society, Los Alamitos, CA, 102\u2013111. DOI:10.1109\/ISPASS.2015.7095789"},{"key":"e_1_3_2_50_2","first-page":"35","volume-title":"Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS\u201914)","author":"Yasin Ahmad","year":"2014","unstructured":"Ahmad Yasin. 2014. A top-down method for performance analysis and counters architecture. In Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS\u201914). IEEE Computer Society, Los Alamitos, CA, 35\u201344. DOI:10.1109\/ISPASS.2014.6844459"},{"key":"e_1_3_2_51_2","first-page":"202","volume-title":"Proceedings of the IEEE International Symposium on Workload Characterization (IISWC\u201914)","author":"Yasin Ahmad","year":"2014","unstructured":"Ahmad Yasin, Yosi Ben-Asher, and Avi Mendelson. 2014. Deep-dive analysis of the data analytics workload in CloudSuite. In Proceedings of the IEEE International Symposium on Workload Characterization (IISWC\u201914). IEEE Computer Society, Los Alamitos, CA, 202\u2013211. DOI:10.1109\/IISWC.2014.6983059"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/3369383"},{"key":"e_1_3_2_53_2","first-page":"106","volume-title":"Proceedings of the IEEE International Symposium on Workload Characterization (IISWC\u201920)","author":"Yi Li","year":"2020","unstructured":"Li Yi, Cong Li, and Jianmei Guo. 2020. CPI for runtime performance measurement: The good, the bad, and the ugly. In Proceedings of the IEEE International Symposium on Workload Characterization (IISWC\u201920). IEEE Computer Society, Los Alamitos, CA, 106\u2013113. DOI:10.1109\/IISWC50251.2020.00019"},{"key":"e_1_3_2_54_2","volume-title":"Proceedings of the 7th ACM SIGOPS Asia-Pacific Workshop on Systems (APSys\u201916)","author":"Zellweger Gerd","year":"2016","unstructured":"Gerd Zellweger, Denny Lin, and Timothy Roscoe. 2016. So many performance events, so little time. In Proceedings of the 7th ACM SIGOPS Asia-Pacific Workshop on Systems (APSys\u201916). Association for Computing Machinery, New York, NY, Article 14, 9 pages. DOI:10.1145\/2967360.2967375"},{"key":"e_1_3_2_55_2","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1145\/2465351.2465388","volume-title":"Proceedings of the 8th ACM European Conference on Computer Systems (EuroSys \u201913)","author":"Zhang Xiao","year":"2013","unstructured":"Xiao Zhang, Eric Tune, Robert Hagmann, Rohit Jnagal, Vrigo Gokhale, and John Wilkes. 2013. CPI2: CPU performance isolation for shared compute clusters. In Proceedings of the 8th ACM European Conference on Computer Systems (EuroSys \u201913). Association for Computing Machinery, New York, NY, 379\u2013391. DOI:10.1145\/2465351.2465388"}],"container-title":["ACM Transactions on Architecture and Code Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3629525","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3629525","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:36:01Z","timestamp":1750178161000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3629525"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,19]]},"references-count":54,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,3,31]]}},"alternative-id":["10.1145\/3629525"],"URL":"https:\/\/doi.org\/10.1145\/3629525","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"type":"print","value":"1544-3566"},{"type":"electronic","value":"1544-3973"}],"subject":[],"published":{"date-parts":[[2024,1,19]]},"assertion":[{"value":"2023-06-02","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-09-27","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-01-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}