{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:28:52Z","timestamp":1750220932588,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":74,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,6,8]],"date-time":"2019-06-08T00:00:00Z","timestamp":1559952000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1453086"],"award-info":[{"award-number":["1453086"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,6,8]]},"DOI":"10.1145\/3314221.3314621","type":"proceedings-article","created":{"date-parts":[[2019,6,7]],"date-time":"2019-06-07T21:02:18Z","timestamp":1559941338000},"page":"485-501","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Parallelism-centric what-if and differential analyses"],"prefix":"10.1145","author":[{"given":"Adarsh","family":"Yoga","sequence":"first","affiliation":[{"name":"Rutgers University, USA"}]},{"given":"Santosh","family":"Nagarakatte","sequence":"additional","affiliation":[{"name":"Rutgers University, USA"}]}],"member":"320","published-online":{"date-parts":[[2019,6,8]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"[n. d.]. Coral benchmarks. https:\/\/asc.llnl.gov\/CORAL-benchmarks\/.  [n. d.]. Coral benchmarks. https:\/\/asc.llnl.gov\/CORAL-benchmarks\/."},{"volume-title":"Proceedings of the Twelfth Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA). 1-12","author":"Acar Umut A.","key":"e_1_3_2_2_2_1","unstructured":"Umut A. Acar , Guy E. Blelloch , and Robert D. Blumofe . 2000. The Data Locality of Work Stealing . In Proceedings of the Twelfth Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA). 1-12 . Umut A. Acar, Guy E. Blelloch, and Robert D. Blumofe. 2000. The Data Locality of Work Stealing. In Proceedings of the Twelfth Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA). 1-12."},{"key":"e_1_3_2_2_3_1","unstructured":"U. A. Acar A. Chargu\u00e9raud and M. Rainey. 2017. Parallel Work Inflation Memory Effects and their Empirical Analysis. ArXiv e-prints (2017).  U. A. Acar A. Chargu\u00e9raud and M. Rainey. 2017. Parallel Work Inflation Memory Effects and their Empirical Analysis. ArXiv e-prints (2017)."},{"volume-title":"Proceedings of the Second International Workshop on Modeling, Analysis, and Simulation On Computer and Telecommunication Systems (MASCOTS). 308-317","author":"Alexander Cedell","key":"e_1_3_2_2_4_1","unstructured":"Cedell Alexander , Donna Reese , and James C. Harden . 1994. Near-Critical Path Analysis of Program Activity Graphs . In Proceedings of the Second International Workshop on Modeling, Analysis, and Simulation On Computer and Telecommunication Systems (MASCOTS). 308-317 . Cedell Alexander, Donna Reese, and James C. Harden. 1994. Near-Critical Path Analysis of Program Activity Graphs. In Proceedings of the Second International Workshop on Modeling, Analysis, and Simulation On Computer and Telecommunication Systems (MASCOTS). 308-317."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1454115.1454128"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/209936.209958"},{"key":"e_1_3_2_2_7_1","first-page":"1","article-title":"A Parallelism Profiler with What-if Analyses for OpenMP Programs. In Proceedings of the International Conference for High Performance Computing","volume":"16","author":"Boushehrinejadmoradi Nader","year":"2018","unstructured":"Nader Boushehrinejadmoradi , Adarsh Yoga , and Santosh Nagarakatte . 2018 . A Parallelism Profiler with What-if Analyses for OpenMP Programs. In Proceedings of the International Conference for High Performance Computing , Networking, Storage, and Analysis (SC). 16 : 1 - 16 :14. Nader Boushehrinejadmoradi, Adarsh Yoga, and Santosh Nagarakatte. 2018. A Parallelism Profiler with What-if Analyses for OpenMP Programs. In Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC). 16:1-16:14.","journal-title":"Networking, Storage, and Analysis (SC)."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2013.49"},{"key":"e_1_3_2_2_9_1","first-page":"1","article-title":"Using Automated Performance Modeling to Find Scalability Bugs in Complex Codes. In Proceedings of the International Conference on High Performance Computing","volume":"45","author":"Calotoiu Alexandru","year":"2013","unstructured":"Alexandru Calotoiu , Torsten Hoefler , Marius Poke , and Felix Wolf . 2013 . Using Automated Performance Modeling to Find Scalability Bugs in Complex Codes. In Proceedings of the International Conference on High Performance Computing , Networking, Storage and Analysis (SC). 45 : 1 - 45 :12. Alexandru Calotoiu, Torsten Hoefler, Marius Poke, and Felix Wolf. 2013. Using Automated Performance Modeling to Find Scalability Bugs in Complex Codes. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC). 45:1-45:12.","journal-title":"Networking, Storage and Analysis (SC)."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178487.3178499"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1094811.1094852"},{"volume-title":"Proceedings of the 10th ACM Symposium on Parallel Algorithms and Architectures (SPAA). 298-309","author":"Cheng Guang-Ien","key":"e_1_3_2_2_12_1","unstructured":"Guang-Ien Cheng , Mingdong Feng , Charles E. Leiserson , Keith H. Randall , and Andrew F. Stark . 1998. Detecting Data Races in Cilk Programs That Use Locks . In Proceedings of the 10th ACM Symposium on Parallel Algorithms and Architectures (SPAA). 298-309 . Guang-Ien Cheng, Mingdong Feng, Charles E. Leiserson, Keith H. Randall, and Andrew F. Stark. 1998. Detecting Data Races in Cilk Programs That Use Locks. In Proceedings of the 10th ACM Symposium on Parallel Algorithms and Architectures (SPAA). 298-309."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1274971.1274976"},{"key":"e_1_3_2_2_14_1","volume-title":"Retrieved","author":"Intel Corporation","year":"2019","unstructured":"Intel Corporation . 2019 . Intel Advisor . Retrieved March 20, 2019 from https:\/\/software.intel.com\/en-us\/advisor. Intel Corporation. 2019. Intel Advisor. Retrieved March 20, 2019 from https:\/\/software.intel.com\/en-us\/advisor."},{"key":"e_1_3_2_2_15_1","volume-title":"Retrieved","author":"Intel Corporation","year":"2019","unstructured":"Intel Corporation . 2019 . Intel VTune Amplifier . Retrieved March 20, 2019 from https:\/\/software.intel.com\/en-us\/intel-vtune-amplifier-xe. Intel Corporation. 2019. Intel VTune Amplifier. Retrieved March 20, 2019 from https:\/\/software.intel.com\/en-us\/intel-vtune-amplifier-xe."},{"key":"e_1_3_2_2_16_1","unstructured":"Intel Corporation. 2019. Official Intel(R) Threading Building Blocks (Intel TBB) GitHub repository. Retrieved Apr 5 2019 from https:\/\/github.com\/01org\/tbb.  Intel Corporation. 2019. Official Intel(R) Threading Building Blocks (Intel TBB) GitHub repository. Retrieved Apr 5 2019 from https:\/\/github.com\/01org\/tbb."},{"volume-title":"Proceedings of the 25th Symposium on Operating Systems Principles (SOSP). 184-197","author":"Curtsinger Charlie","key":"e_1_3_2_2_17_1","unstructured":"Charlie Curtsinger and Emery D. Berger . 2015. Coz: Finding Code That Counts with Causal Profiling . In Proceedings of the 25th Symposium on Operating Systems Principles (SOSP). 184-197 . Charlie Curtsinger and Emery D. Berger. 2015. Coz: Finding Code That Counts with Causal Profiling. In Proceedings of the 25th Symposium on Operating Systems Principles (SOSP). 184-197."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/2391541.2391560"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2485922.2485966"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2509136.2509529"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2908080.2908090"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2012.6189221"},{"volume-title":"Proceedings of the ACM SIGPLAN 1998 Conference on Programming Language Design and Implementation (PLDI). 212-223","author":"Frigo Matteo","key":"e_1_3_2_2_23_1","unstructured":"Matteo Frigo , Charles E. Leiserson , and Keith H. Randall . 1998. The Implementation of the Cilk-5 Multithreaded Language . In Proceedings of the ACM SIGPLAN 1998 Conference on Programming Language Design and Implementation (PLDI). 212-223 . Matteo Frigo, Charles E. Leiserson, and Keith H. Randall. 1998. The Implementation of the Cilk-5 Multithreaded Language. In Proceedings of the ACM SIGPLAN 1998 Conference on Programming Language Design and Implementation (PLDI). 212-223."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1993498.1993553"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3079079.3079102"},{"volume-title":"Proceedings of the Twenty-second Annual ACM Symposium on Parallelism in Algorithms and Architectures (SPAA). 145-156","author":"He Yuxiong","key":"e_1_3_2_2_26_1","unstructured":"Yuxiong He , Charles E. Leiserson , and William M. Leiserson . 2010. The Cilkview Scalability Analyzer . In Proceedings of the Twenty-second Annual ACM Symposium on Parallelism in Algorithms and Architectures (SPAA). 145-156 . Yuxiong He, Charles E. Leiserson, and William M. Leiserson. 2010. The Cilkview Scalability Analyzer. In Proceedings of the Twenty-second Annual ACM Symposium on Parallelism in Algorithms and Architectures (SPAA). 145-156."},{"key":"e_1_3_2_2_27_1","volume-title":"Miller","author":"Hollingsworth Jeffrey K.","year":"1994","unstructured":"Jeffrey K. Hollingsworth and Barton P . Miller . 1994 . Slack : A New Performance Metric for Parallel Programs. Technical Report. University of Wisconsin-Madison . Jeffrey K. Hollingsworth and Barton P. Miller. 1994. Slack: A New Performance Metric for Parallel Programs. Technical Report. University of Wisconsin-Madison."},{"volume-title":"2010 IEEE International Symposium on Parallel Distributed Processing (IPDPS). 1-12","author":"Hood R.","key":"e_1_3_2_2_28_1","unstructured":"R. Hood , H. Jin , P. Mehrotra , J. Chang , J. Djomehri , S. Gavali , D. Jespersen , K. Taylor , and R. Biswas . 2010. Performance impact of resource contention in multicore systems . In 2010 IEEE International Symposium on Parallel Distributed Processing (IPDPS). 1-12 . R. Hood, H. Jin, P. Mehrotra, J. Chang, J. Djomehri, S. Gavali, D. Jespersen, K. Taylor, and R. Biswas. 2010. Performance impact of resource contention in multicore systems. In 2010 IEEE International Symposium on Parallel Distributed Processing (IPDPS). 1-12."},{"key":"e_1_3_2_2_29_1","volume-title":"Cache-aware Roofline Model: Upgrading the Loft","author":"Ilic Aleksandar","year":"2014","unstructured":"Aleksandar Ilic , Frederico Pratas , and Leonel Sousa . 2014. Cache-aware Roofline Model: Upgrading the Loft . IEEE Computer Architecture Letters ( 2014 ), 21-24. Aleksandar Ilic, Frederico Pratas, and Leonel Sousa. 2014. Cache-aware Roofline Model: Upgrading the Loft. IEEE Computer Architecture Letters (2014), 21-24."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2967938.2967968"},{"key":"e_1_3_2_2_31_1","first-page":"1","article-title":"Detection of False Sharing Using Machine Learning. In Proceedings of the International Conference on High Performance Computing","volume":"30","author":"Jayasena Sanath","year":"2013","unstructured":"Sanath Jayasena , Saman Amarasinghe , Asanka Abeyweera , Gayashan Amarasinghe , Himeshi De Silva , Sunimal Rathnayake , Xiaoqiao Meng , and Yanbin Liu . 2013 . Detection of False Sharing Using Machine Learning. In Proceedings of the International Conference on High Performance Computing , Networking, Storage and Analysis (SC). 30 : 1 - 30 :9. Sanath Jayasena, Saman Amarasinghe, Asanka Abeyweera, Gayashan Amarasinghe, Himeshi De Silva, Sunimal Rathnayake, Xiaoqiao Meng, and Yanbin Liu. 2013. Detection of False Sharing Using Machine Learning. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC). 30:1-30:9.","journal-title":"Networking, Storage and Analysis (SC)."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2048066.2048108"},{"key":"e_1_3_2_2_33_1","volume-title":"Retrieved","author":"Lawrence Livermore National Labs.","year":"2018","unstructured":"Lawrence Livermore National Labs. 2018 . Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) . Retrieved November 17, 2018 from https:\/\/computation.llnl.gov\/projects\/co-design\/lulesh. Lawrence Livermore National Labs. 2018. Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH). Retrieved November 17, 2018 from https:\/\/computation.llnl.gov\/projects\/co-design\/lulesh."},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/337449.337465"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/1640089.1640106"},{"volume-title":"Proceedings of the 2011 ACM International Conference on Object Oriented Programming Systems Languages and Applications (OOPSLA). 3-18","author":"Liu Tongping","key":"e_1_3_2_2_36_1","unstructured":"Tongping Liu and Emery D. Berger . 2011. SHERIFF: Precise Detection and Automatic Mitigation of False Sharing . In Proceedings of the 2011 ACM International Conference on Object Oriented Programming Systems Languages and Applications (OOPSLA). 3-18 . Tongping Liu and Emery D. Berger. 2011. SHERIFF: Precise Detection and Automatic Mitigation of False Sharing. In Proceedings of the 2011 ACM International Conference on Object Oriented Programming Systems Languages and Applications (OOPSLA). 3-18."},{"volume-title":"Proceedings of the 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). 3-14","author":"Liu Tongping","key":"e_1_3_2_2_37_1","unstructured":"Tongping Liu , Chen Tian , Ziang Hu , and Emery D. Berger . 2014. PREDATOR: Predictive False Sharing Detection . In Proceedings of the 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). 3-14 . Tongping Liu, Chen Tian, Ziang Hu, and Emery D. Berger. 2014. PREDATOR: Predictive False Sharing Detection. In Proceedings of the 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). 3-14."},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.5555\/2190025.2190064"},{"key":"e_1_3_2_2_39_1","first-page":"1","article-title":"A Data-centric Profiler for Parallel Programs. In Proceedings of the International Conference on High Performance Computing","volume":"28","author":"Liu Xu","year":"2013","unstructured":"Xu Liu and John Mellor-Crummey . 2013 . A Data-centric Profiler for Parallel Programs. In Proceedings of the International Conference on High Performance Computing , Networking, Storage and Analysis (SC). 28 : 1 - 28 :12. Xu Liu and John Mellor-Crummey. 2013. A Data-centric Profiler for Parallel Programs. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC). 28:1-28:12.","journal-title":"Networking, Storage and Analysis (SC)."},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2013.6557169"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2555243.2555271"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2628071.2628102"},{"key":"e_1_3_2_2_43_1","first-page":"1","article-title":"ScaAnalyzer: A Tool to Identify Memory Scalability Bottlenecks in Parallel Programs. In Proceedings of the International Conference for High Performance Computing","volume":"47","author":"Liu Xu","year":"2015","unstructured":"Xu Liu and Bo Wu . 2015 . ScaAnalyzer: A Tool to Identify Memory Scalability Bottlenecks in Parallel Programs. In Proceedings of the International Conference for High Performance Computing , Networking, Storage and Analysis (SC). 47 : 1 - 47 :12. Xu Liu and Bo Wu. 2015. ScaAnalyzer: A Tool to Identify Memory Scalability Bottlenecks in Parallel Programs. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC). 47:1-47:12.","journal-title":"Networking, Storage and Analysis (SC)."},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2016.7446070"},{"volume-title":"Performance Analysis with Cache-Aware Roofline Model in Intel Advisor. In 2017 International Conference on High Performance Computing Simulation (HPCS). 898-907","author":"Marques D.","key":"e_1_3_2_2_45_1","unstructured":"D. Marques , H. Duarte , A. Ilic , L. Sousa , R. Belenov , P. Thierry , and Z. A. Matveev . 2017 . Performance Analysis with Cache-Aware Roofline Model in Intel Advisor. In 2017 International Conference on High Performance Computing Simulation (HPCS). 898-907 . D. Marques, H. Duarte, A. Ilic, L. Sousa, R. Belenov, P. Thierry, and Z. A. Matveev. 2017. Performance Analysis with Cache-Aware Roofline Model in Intel Advisor. In 2017 International Conference on High Performance Computing Simulation (HPCS). 898-907."},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2010.5452060"},{"key":"e_1_3_2_2_47_1","volume-title":"219-234","author":"McKenney Paul E.","year":"1999","unstructured":"Paul E. McKenney . 1999. Differential Profiling . Software - Practice & Experience ( 1999 ), 219-234 . Paul E. McKenney. 1999. Differential Profiling. Software - Practice & Experience (1999), 219-234."},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"crossref","unstructured":"B. P. Miller M. Clark J. Hollingsworth S. Kierstead S. S. Lim and T. Torzewski. 1990. IPS-2: The Second Generation of a Parallel Program Measurement System. IEEE Transactions on Parallel and Distributed Systems (1990) 206-217.   B. P. Miller M. Clark J. Hollingsworth S. Kierstead S. S. Lim and T. Torzewski. 1990. IPS-2: The Second Generation of a Parallel Program Measurement System. IEEE Transactions on Parallel and Distributed Systems (1990) 206-217.","DOI":"10.1109\/71.80132"},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/2465351.2465366"},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/1985793.1985822"},{"volume-title":"Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC). 65:1-65:12","author":"Olivier Stephen L.","key":"e_1_3_2_2_51_1","unstructured":"Stephen L. Olivier , Bronis R. de Supinski , Martin Schulz , and Jan F. Prins . 2012. Characterizing and Mitigating Work Time Inflation in Task Parallel Programs . In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC). 65:1-65:12 . Stephen L. Olivier, Bronis R. de Supinski, Martin Schulz, and Jan F. Prins. 2012. Characterizing and Mitigating Work Time Inflation in Task Parallel Programs. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC). 65:1-65:12."},{"key":"e_1_3_2_2_52_1","unstructured":"OpenMP Architecture Review Board. 2015. OpenMP 4.5 Complete Specification. http:\/\/www.openmp.org\/wp-content\/uploads\/openmp-4.5.pdf.  OpenMP Architecture Review Board. 2015. OpenMP 4.5 Complete Specification. http:\/\/www.openmp.org\/wp-content\/uploads\/openmp-4.5.pdf."},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.5555\/645612.662674"},{"key":"e_1_3_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/2254064.2254127"},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"crossref","unstructured":"Patrick Reisert Alexandru Calotoiu Sergei Shudler and Felix Wolf. 2017. Following the Blind Seer - Creating Better Performance Models Using Less Information. In Euro-Par 2017: Parallel Processing.  Patrick Reisert Alexandru Calotoiu Sergei Shudler and Felix Wolf. 2017. Following the Blind Seer - Creating Better Performance Models Using Less Information. In Euro-Par 2017: Parallel Processing .","DOI":"10.1007\/978-3-319-64203-1_8"},{"key":"e_1_3_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/3168828"},{"volume-title":"2017 24th Asia-Pacific Software Engineering Conference (APSEC). 570-575","author":"Rosales E.","key":"e_1_3_2_2_57_1","unstructured":"E. Rosales , A. Ros\u00e0 , and W. Binder . 2017. tgp: A Task-Granularity Profiler for the Java Virtual Machine . In 2017 24th Asia-Pacific Software Engineering Conference (APSEC). 570-575 . E. Rosales, A. Ros\u00e0, and W. Binder. 2017. tgp: A Task-Granularity Profiler for the Java Virtual Machine. In 2017 24th Asia-Pacific Software Engineering Conference (APSEC). 570-575."},{"key":"e_1_3_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2012.6402901"},{"volume-title":"Proceedings of the 27th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA). 89-100","author":"Schardl Tao B.","key":"e_1_3_2_2_59_1","unstructured":"Tao B. Schardl , Bradley C. Kuszmaul , I- Ting Angelina Lee , William M. Leiserson , and Charles E. Leiserson . 2015. The Cilkprof Scalability Profiler . In Proceedings of the 27th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA). 89-100 . Tao B. Schardl, Bradley C. Kuszmaul, I-Ting Angelina Lee, William M. Leiserson, and Charles E. Leiserson. 2015. The Cilkprof Scalability Profiler. In Proceedings of the 27th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA). 89-100."},{"key":"e_1_3_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTR.2005.347035"},{"volume-title":"Practical Differential Profiling. In Euro-Par 2007 Parallel Processing: 13th International Euro-Par Conference, Rennes, France, August 28-31, 2007. Proceedings. 97-106","author":"Schulz Martin","key":"e_1_3_2_2_61_1","unstructured":"Martin Schulz and Bronis R . de Supinski. 2007 . Practical Differential Profiling. In Euro-Par 2007 Parallel Processing: 13th International Euro-Par Conference, Rennes, France, August 28-31, 2007. Proceedings. 97-106 . Martin Schulz and Bronis R. de Supinski. 2007. Practical Differential Profiling. In Euro-Par 2007 Parallel Processing: 13th International Euro-Par Conference, Rennes, France, August 28-31, 2007. Proceedings. 97-106."},{"key":"e_1_3_2_2_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/2312005.2312018"},{"volume-title":"Proceedings of the 2010 ACM\/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC). 1-11","author":"Tallent Nathan R.","key":"e_1_3_2_2_63_1","unstructured":"Nathan R. Tallent , Laksono Adhianto , and John M . Mellor-Crummey. 2010. Scalable Identification of Load Imbalance in Parallel Executions Using Call Path Profiles . In Proceedings of the 2010 ACM\/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC). 1-11 . Nathan R. Tallent, Laksono Adhianto, and John M. Mellor-Crummey. 2010. Scalable Identification of Load Imbalance in Parallel Executions Using Call Path Profiles. In Proceedings of the 2010 ACM\/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC). 1-11."},{"volume-title":"Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). 229-240","author":"Nathan","key":"e_1_3_2_2_64_1","unstructured":"Nathan R. Tallent and John M. Mellor-Crummey. 2009. Effective Performance Measurement and Analysis of Multithreaded Applications . In Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). 229-240 . Nathan R. Tallent and John M. Mellor-Crummey. 2009. Effective Performance Measurement and Analysis of Multithreaded Applications. In Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). 229-240."},{"key":"e_1_3_2_2_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173162.3177159"},{"key":"e_1_3_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1145\/1498765.1498785"},{"key":"e_1_3_2_2_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/2854038.2854063"},{"key":"e_1_3_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/3106237.3106254"},{"key":"e_1_3_2_2_69_1","volume-title":"Retrieved","author":"Yoga Adarsh","year":"2019","unstructured":"Adarsh Yoga and Santosh Nagarakatte . 2019 . Issue 103: Where to place progress points . Retrieved Mar 28, 2019 from https:\/\/github.com\/plasma-umass\/coz\/issues\/103. Adarsh Yoga and Santosh Nagarakatte. 2019. Issue 103: Where to place progress points. Retrieved Mar 28, 2019 from https:\/\/github.com\/plasma-umass\/coz\/issues\/103."},{"key":"e_1_3_2_2_70_1","volume-title":"Retrieved","author":"Yoga Adarsh","year":"2019","unstructured":"Adarsh Yoga and Santosh Nagarakatte . 2019 . Issue 104: Inconsistent results for throughput and latency profiling . Retrieved Mar 28, 2019 from https:\/\/github.com\/plasma-umass\/coz\/issues\/104. Adarsh Yoga and Santosh Nagarakatte. 2019. Issue 104: Inconsistent results for throughput and latency profiling. Retrieved Mar 28, 2019 from https:\/\/github.com\/plasma-umass\/coz\/issues\/104."},{"key":"e_1_3_2_2_71_1","doi-asserted-by":"crossref","unstructured":"Adarsh Yoga and Santosh Nagarakatte. 2019. TaskProf2. Retrieved Apr 5 2019 from https:\/\/github.com\/rutgers-apl\/TaskProf2.git.  Adarsh Yoga and Santosh Nagarakatte. 2019. TaskProf2. Retrieved Apr 5 2019 from https:\/\/github.com\/rutgers-apl\/TaskProf2.git.","DOI":"10.1145\/3325965"},{"key":"e_1_3_2_2_72_1","volume-title":"TaskProf2-Evaluation data. Retrieved","author":"Yoga Adarsh","year":"2019","unstructured":"Adarsh Yoga and Santosh Nagarakatte . 2019. TaskProf2-Evaluation data. Retrieved Apr 5, 2019 from https:\/\/github.com\/rutgers-apl\/TaskProf2\/tree\/master\/pldi_comparison_results. Adarsh Yoga and Santosh Nagarakatte. 2019. TaskProf2-Evaluation data. Retrieved Apr 5, 2019 from https:\/\/github.com\/rutgers-apl\/TaskProf2\/tree\/master\/pldi_comparison_results."},{"key":"e_1_3_2_2_73_1","doi-asserted-by":"publisher","DOI":"10.1145\/2950290.2950329"},{"key":"e_1_3_2_2_74_1","doi-asserted-by":"publisher","DOI":"10.1145\/1952682.1952688"}],"event":{"name":"PLDI '19: 40th ACM SIGPLAN Conference on Programming Language Design and Implementation","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages"],"location":"Phoenix AZ USA","acronym":"PLDI '19"},"container-title":["Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3314221.3314621","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3314221.3314621","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3314221.3314621","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:53:22Z","timestamp":1750204402000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3314221.3314621"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,6,8]]},"references-count":74,"alternative-id":["10.1145\/3314221.3314621","10.1145\/3314221"],"URL":"https:\/\/doi.org\/10.1145\/3314221.3314621","relation":{},"subject":[],"published":{"date-parts":[[2019,6,8]]},"assertion":[{"value":"2019-06-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}