{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T15:31:20Z","timestamp":1759332680712,"version":"3.37.3"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2020,11,4]],"date-time":"2020-11-04T00:00:00Z","timestamp":1604448000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springer.com\/tdm"},{"start":{"date-parts":[[2020,11,4]],"date-time":"2020-11-04T00:00:00Z","timestamp":1604448000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Des Autom Embed Syst"],"published-print":{"date-parts":[[2021,6]]},"DOI":"10.1007\/s10617-020-09243-5","type":"journal-article","created":{"date-parts":[[2020,11,4]],"date-time":"2020-11-04T08:02:53Z","timestamp":1604476973000},"page":"135-160","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Dynamic concurrency throttling on NUMA systems and data migration impacts"],"prefix":"10.1007","volume":"25","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7070-1297","authenticated-orcid":false,"given":"Janaina","family":"Schwarzrock","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael Guilherme","family":"Jordan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Guilherme","family":"Korol","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Charles C. de","family":"Oliveira","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2412-3027","authenticated-orcid":false,"given":"Arthur F.","family":"Lorenzon","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mateus","family":"Beck\u00a0Rutzig","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4492-1747","authenticated-orcid":false,"given":"Antonio Carlos","family":"S.\u00a0Beck","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,11,4]]},"reference":[{"key":"9243_CR1","first-page":"219","volume-title":"Application-level energy awareness for OpenMP","author":"F Alessi","year":"2015","unstructured":"Alessi F, Thoman P, Georgakoudis G, Fahringer T, Nikolopoulos DS (2015) Application-level energy awareness for OpenMP. Springer, Cham, pp 219\u2013232"},{"key":"9243_CR2","doi-asserted-by":"publisher","unstructured":"Bailey DH, Barszcz E, Barton JT, Browning DS, Carter RL, Dagum L, Fatoohi RA, Frederickson PO, Lasinski TA, Schreiber RS, Simon HD, Venkatakrishnan V, Weeratunga SK (1991) The NAS parallel benchmarks\u2014summary and preliminary results. In: ACM\/IEEE CS. ACM, NY, USA , pp 158\u2013165. https:\/\/doi.org\/10.1145\/125826.125925","DOI":"10.1145\/125826.125925"},{"key":"9243_CR3","doi-asserted-by":"crossref","unstructured":"Bari MAS, Chaimov N, Malik AM, Huck KA, Chapman B, Malony AD, Sarood O (2016) Arcs: adaptive runtime configuration selection for power-constrained openmp applications. In: 2016 IEEE international conference on cluster computing (CLUSTER), pp 461\u2013470","DOI":"10.1109\/CLUSTER.2016.39"},{"key":"9243_CR4","volume-title":"Adaptable embedded systems","author":"ACS Beck","year":"2012","unstructured":"Beck ACS, Lisb\u00f4a CAL, Carro L (2012) Adaptable embedded systems. Springer, Berlin"},{"key":"9243_CR5","doi-asserted-by":"crossref","unstructured":"Broquedis F, Aumage O, Goglin B, Thibault S, Wacrenier PA, Namyst R (2010) Structuring the execution of openmp applications for multicore architectures. In: 2010 IEEE international symposium on parallel & distributed processing (IPDPS). IEEE, pp 1\u201310 (2010)","DOI":"10.1109\/IPDPS.2010.5470442"},{"issue":"5\u20136","key":"9243_CR6","doi-asserted-by":"publisher","first-page":"418","DOI":"10.1007\/s10766-010-0136-3","volume":"38","author":"F Broquedis","year":"2010","unstructured":"Broquedis F, Furmento N, Goglin B, Wacrenier PA, Namyst R (2010) Forestgomp: an efficient openmp environment for numa architectures. Int J Parallel Prog 38(5\u20136):418\u2013439","journal-title":"Int J Parallel Prog"},{"key":"9243_CR7","doi-asserted-by":"crossref","unstructured":"Chadha G, Mahlke S, Narayanasamy S (2012) When less is more (LIMO): controlled parallelism for improved efficiency. In: CASES. USA, pp 141\u2013150 (2012)","DOI":"10.1145\/2380403.2380431"},{"key":"9243_CR8","unstructured":"Corbet J. Toward better numa scheduling. https:\/\/lwn.net\/Articles\/486858\/"},{"issue":"10","key":"9243_CR9","doi-asserted-by":"publisher","first-page":"1396","DOI":"10.1109\/TPDS.2007.70804","volume":"19","author":"M Curtis-Maury","year":"2008","unstructured":"Curtis-Maury M, Blagojevic F, Antonopoulos CD, Nikolopoulos DS (2008) Prediction-based power-performance adaptation of multithreaded scientific codes. IEEE Trans Parallel Distrib Syst 19(10):1396\u20131410","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"9243_CR10","doi-asserted-by":"crossref","unstructured":"Curtis-Maury M, Dzierwa J, Antonopoulos CD, Nikolopoulos DS (2006) Online power-performance adaptation of multithreaded programs using hardware event-based prediction. In: Int CS, pp 157\u2013166","DOI":"10.1145\/1183401.1183426"},{"issue":"1","key":"9243_CR11","doi-asserted-by":"publisher","first-page":"381","DOI":"10.1145\/2490301.2451157","volume":"41","author":"M Dashti","year":"2013","unstructured":"Dashti M, Fedorova A, Funston J, Gaud F, Lachaize R, Lepers B, Quema V, Roth M (2013) Traffic management: a holistic approach to memory placement on numa systems. ACM SIGARCH Comput Archit News 41(1):381\u2013394","journal-title":"ACM SIGARCH Comput Archit News"},{"key":"9243_CR12","doi-asserted-by":"publisher","unstructured":"De\u00a0Sensi D (2016) Predicting performance and power consumption of parallel applications. In: PDP, pp 200\u2013207. https:\/\/doi.org\/10.1109\/PDP.2016.41","DOI":"10.1109\/PDP.2016.41"},{"issue":"4","key":"9243_CR13","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3006385","volume":"49","author":"M Diener","year":"2016","unstructured":"Diener M, Cruz EH, Alves MA, Navaux PO, Koren I (2016) Affinity-based thread and data mapping in shared memory systems. ACM Comput Surv (CSUR) 49(4):1\u201338","journal-title":"ACM Comput Surv (CSUR)"},{"key":"9243_CR14","doi-asserted-by":"crossref","unstructured":"Diener M, Cruz EH, Navaux PO (2015) Locality vs. balance: Exploring data mapping policies on numa systems. In: 2015 23rd Euromicro international conference on parallel, distributed, and network-based processing. IEEE, pp 9\u201316 (2015)","DOI":"10.1109\/PDP.2015.11"},{"key":"9243_CR15","doi-asserted-by":"crossref","unstructured":"Diener M, Cruz EH, Navaux PO, Busse A, Hei\u00df HU (2014) kmaf: automatic kernel-level management of thread and data affinity. In: Proceedings of the 23rd international conference on parallel architectures and compilation. ACM, pp 277\u2013288 (2014)","DOI":"10.1145\/2628071.2628085"},{"key":"9243_CR16","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1016\/j.peva.2015.03.001","volume":"88","author":"M Diener","year":"2015","unstructured":"Diener M, Cruz EH, Pilla LL, Dupros F, Navaux PO (2015) Characterizing communication and page usage of parallel applications for thread and data mapping. Perform Eval 88:18\u201336","journal-title":"Perform Eval"},{"issue":"3","key":"9243_CR17","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1145\/2425248.2425252","volume":"40","author":"M H\u00e4hnel","year":"2012","unstructured":"H\u00e4hnel M, D\u00f6bel B, V\u00f6lp M, H\u00e4rtig H (2012) Measuring energy consumption for short code paths using RAPL. SIGMETRICS Perform Eval Rev 40(3):13\u201317. https:\/\/doi.org\/10.1145\/2425248.2425252","journal-title":"SIGMETRICS Perform Eval Rev"},{"key":"9243_CR18","doi-asserted-by":"publisher","unstructured":"Joao JA, Suleman MA, Mutlu O, Patt YN (202) Bottleneck identification and scheduling in multithreaded applications. In: ASPLOS. ACM, NY, USA, pp 223\u2013234. https:\/\/doi.org\/10.1145\/2150976.2151001","DOI":"10.1145\/2150976.2151001"},{"key":"9243_CR19","doi-asserted-by":"crossref","unstructured":"Jung C, Lim D, Lee J, Han S (2005) Adaptive execution techniques for SMT multiprocessor architectures. In: ACM symposium on principles and practice of parallel programming. USA , pp 236\u2013246","DOI":"10.1145\/1065944.1065976"},{"issue":"3","key":"9243_CR20","doi-asserted-by":"publisher","first-page":"270","DOI":"10.1145\/1816038.1815996","volume":"38","author":"J Lee","year":"2010","unstructured":"Lee J, Wu H, Ravichandran M, Clark N (2010) Thread tailor: dynamically weaving threads together for efficient, adaptive parallel applications. SIGARCH Comput Archit News 38(3):270\u2013279","journal-title":"SIGARCH Comput Archit News"},{"key":"9243_CR21","unstructured":"Lepers B, Qu\u00e9ma V, Fedorova A (2015) Thread and memory placement on numa systems: asymmetry matters. In: 2015 USENIX annual technical conference (USENIX ATC 15), pp 277\u2013289"},{"key":"9243_CR22","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-28719-1","volume-title":"Parallel computing hits the power wall: principles, challenges, and a survey of solutions","author":"AF Lorenzon","year":"2019","unstructured":"Lorenzon AF, Beck ACS (2019) Parallel computing hits the power wall: principles, challenges, and a survey of solutions. Springer, Berlin"},{"key":"9243_CR23","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1016\/j.jpdc.2016.04.003","volume":"95","author":"AF Lorenzon","year":"2016","unstructured":"Lorenzon AF, Cera MC, Beck ACS (2016) Investigating different general-purpose and embedded multicores to achieve optimal trade-offs between performance and energy. J Parallel Distrib Comput 95:107\u2013123","journal-title":"J Parallel Distrib Comput"},{"key":"9243_CR24","doi-asserted-by":"publisher","unstructured":"Lorenzon AF, Oliveira CCD, Souza JD, Filho ACSB (2018) Aurora: seamless optimization of OpenMP applications. In: IEEE transactions on parallel and distributed systems, pp 1\u20131. https:\/\/doi.org\/10.1109\/TPDS.2018.2872992","DOI":"10.1109\/TPDS.2018.2872992"},{"key":"9243_CR25","doi-asserted-by":"crossref","unstructured":"Lorenzon AF, Sartor AL, Cera MC, Beck ACS (2015) Optimized use of parallel programming interfaces in multithreaded embedded architectures. In: 2015 IEEE computer society annual symposium on VLSI. IEEE, pp 410\u2013415","DOI":"10.1109\/ISVLSI.2015.85"},{"key":"9243_CR26","doi-asserted-by":"publisher","unstructured":"Lorenzon AF, Souza JD, Beck ACS (2017) LAANT: a library to automatically optimize EDP for OpenMP applications. In: DATE, pp 1229\u20131232. https:\/\/doi.org\/10.23919\/DATE.2017.7927176","DOI":"10.23919\/DATE.2017.7927176"},{"key":"9243_CR27","unstructured":"McCalpin JD (1995) Memory bandwidth and machine balance in current high performance computers. In: IEEE computer society technical committee on computer architecture newsletter, pp 19\u201325 (1995)"},{"key":"9243_CR28","unstructured":"Mucci PJ, Browne S, Deane C, Ho G (1999) Papi: a portable interface to hardware performance counters. In: Proceedings of the department of defense HPCMP users group conference, vol 710 (1999)"},{"key":"9243_CR29","doi-asserted-by":"publisher","DOI":"10.1093\/oso\/9780198515760.001.0001","volume-title":"Introduction to parallel computing: a practical guide with examples in C. Oxford texts in applied and engineering mathematics","author":"W Petersen","year":"2004","unstructured":"Petersen W, Arbenz P (2004) Introduction to parallel computing: a practical guide with examples in C. Oxford texts in applied and engineering mathematics. OUP, Oxford"},{"key":"9243_CR30","doi-asserted-by":"crossref","unstructured":"Porterfield AK, Olivier SL, Bhalachandra S, Prins JF (2013) Power measurement and concurrency throttling for energy reduction in OpenMP programs. In: IEEE IPDPS, pp 884\u2013891","DOI":"10.1109\/IPDPSW.2013.15"},{"key":"9243_CR31","doi-asserted-by":"crossref","unstructured":"Pusukuri KK, Gupta R, Bhuyan LN (2011) Thread reinforcer: dynamically determining number of threads via OS level monitoring. In: IEEE ISWC. USA, pp 116\u2013125","DOI":"10.1109\/IISWC.2011.6114208"},{"key":"9243_CR32","volume-title":"Parallel programming in C with MPI and OpenMP","author":"M Quinn","year":"2004","unstructured":"Quinn M (2004) Parallel programming in C with MPI and OpenMP. McGraw-Hill Higher Education, New York City"},{"key":"9243_CR33","doi-asserted-by":"publisher","unstructured":"Raasch SE, Reinhardt SK (2003) The impact of resource partitioning on SMT processors. In: PACT, pp 15\u201325. https:\/\/doi.org\/10.1109\/PACT.2003.1237998","DOI":"10.1109\/PACT.2003.1237998"},{"key":"9243_CR34","doi-asserted-by":"crossref","unstructured":"Schwarzrock J, Lorenzon AF, Navaux PO, Beck ACS, de\u00a0Freitas EP (2017) Potential gains in EDP by dynamically adapting the number of threads for openmp applications in embedded systems. In: 2017 VII Brazilian symposium on computing systems engineering (SBESC). IEEE, pp 79\u201385","DOI":"10.1109\/SBESC.2017.17"},{"issue":"4","key":"9243_CR35","doi-asserted-by":"publisher","first-page":"43-1","DOI":"10.1145\/3004054","volume":"13","author":"DD Sensi","year":"2016","unstructured":"Sensi DD, Torquati M, Danelutto M (2016) A reconfiguration algorithm for power-aware parallel applications. TACO 13(4):43-1\u201343-25. https:\/\/doi.org\/10.1145\/3004054","journal-title":"TACO"},{"key":"9243_CR36","doi-asserted-by":"publisher","unstructured":"Seo S, Jo G, Lee J (2011) Performance characterization of the NAS parallel benchmarks in OpenCL. In: IEEE ISWC, pp 137\u2013148. https:\/\/doi.org\/10.1109\/IISWC.2011.6114174","DOI":"10.1109\/IISWC.2011.6114174"},{"key":"9243_CR37","doi-asserted-by":"crossref","unstructured":"Sridharan S, Gupta G, Sohi GS (2014) Adaptive, efficient, parallel execution of parallel programs. In: ACM SIGPLAN PLDI. ACM, NY, USA, pp 169\u2013180","DOI":"10.1145\/2666356.2594292"},{"key":"9243_CR38","doi-asserted-by":"crossref","unstructured":"Subramanian L, Seshadri V, Kim Y, Jaiyen B, Mutlu O (2013) MISE: providing performance predictability and improving fairness in shared main memory systems. In: IEEE HPCA, pp 639\u2013650","DOI":"10.1109\/HPCA.2013.6522356"},{"issue":"1","key":"9243_CR39","doi-asserted-by":"publisher","first-page":"277","DOI":"10.1145\/1353534.1346317","volume":"36","author":"MA Suleman","year":"2008","unstructured":"Suleman MA, Qureshi MK, Patt YN (2008) Feedback-driven threading: power-efficient and high-performance execution of multi-threaded workloads on CMPs. SIGARCH Comput Archit News 36(1):277\u2013286","journal-title":"SIGARCH Comput Archit News"},{"key":"9243_CR40","doi-asserted-by":"crossref","unstructured":"Wang W, Davidson JW, Soffa ML (2016) Predicting the memory bandwidth and optimal core allocations for multi-threaded applications on large-scale numa machines. In: 2016 IEEE international symposium on high performance computer architecture (HPCA). IEEE, pp 419\u2013431 (2016)","DOI":"10.1109\/HPCA.2016.7446083"}],"container-title":["Design Automation for Embedded Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10617-020-09243-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10617-020-09243-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10617-020-09243-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,6,3]],"date-time":"2021-06-03T16:07:02Z","timestamp":1622736422000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10617-020-09243-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,4]]},"references-count":40,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2021,6]]}},"alternative-id":["9243"],"URL":"https:\/\/doi.org\/10.1007\/s10617-020-09243-5","relation":{},"ISSN":["0929-5585","1572-8080"],"issn-type":[{"type":"print","value":"0929-5585"},{"type":"electronic","value":"1572-8080"}],"subject":[],"published":{"date-parts":[[2020,11,4]]},"assertion":[{"value":"28 April 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 October 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 November 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}