{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,12]],"date-time":"2025-11-12T03:19:25Z","timestamp":1762917565011,"version":"3.41.0"},"reference-count":33,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2015,1,21]],"date-time":"2015-01-21T00:00:00Z","timestamp":1421798400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Embed. Comput. Syst."],"published-print":{"date-parts":[[2015,1,21]]},"abstract":"<jats:p>The current trend to move from homogeneous to heterogeneous multicore systems provides compelling opportunities for achieving performance and energy efficiency goals. Running multiple threads in multicore systems poses challenges on meeting limited shared resources, such as memory bandwidth. We propose an optimization approach that includes an Integer Linear Programming (ILP) optimization model and a scheme to dynamically determine thread-to-core assignment. We present simulation analysis that shows energy savings and performance gains for a variety of workloads compared to state-of-the-art schemes. We implemented and evaluated a prototype of our thread assignment approach at user level, leveraging Linux scheduling and performance-monitoring capabilities.<\/jats:p>","DOI":"10.1145\/2566618","type":"journal-article","created":{"date-parts":[[2015,1,28]],"date-time":"2015-01-28T14:05:51Z","timestamp":1422453951000},"page":"1-26","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":31,"title":["Energy-Efficient Thread Assignment Optimization for Heterogeneous Multicore Systems"],"prefix":"10.1145","volume":"14","author":[{"given":"Vinicius","family":"Petrucci","sequence":"first","affiliation":[{"name":"University of Michigan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Orlando","family":"Loques","sequence":"additional","affiliation":[{"name":"Universidade Federal Fluminense, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Moss\u00e9","sequence":"additional","affiliation":[{"name":"University of Pittsburgh, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rami","family":"Melhem","sequence":"additional","affiliation":[{"name":"University of Pittsburgh, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Neven Abou","family":"Gazala","sequence":"additional","affiliation":[{"name":"Intel Corporation, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sameh","family":"Gobriel","sequence":"additional","affiliation":[{"name":"Intel Corporation"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2015,1,21]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/998685.1007004"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1128022.1128029"},{"key":"e_1_2_1_3_1","volume-title":"Proceedings of the Linux Symposium.","author":"Blagodurov Sergey","year":"2011","unstructured":"Sergey Blagodurov and Alexandra Fedorova . 2011 . User-level scheduling on NUMA multicore systems under Linux . In Proceedings of the Linux Symposium. Sergey Blagodurov and Alexandra Fedorova. 2011. User-level scheduling on NUMA multicore systems under Linux. In Proceedings of the Linux Symposium."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1880018.1880019"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/RTSS.2009.23"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/40.888701"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2012.6169046"},{"key":"e_1_2_1_8_1","volume-title":"Retrieved","author":"Devices Electronic Educational","year":"2010","unstructured":"Electronic Educational Devices . 2010 . Watts Up PRO . Retrieved October 28, 2014 from http:\/\/www.wattsupmeters.com\/. Electronic Educational Devices. 2010. Watts Up PRO. Retrieved October 28, 2014 from http:\/\/www.wattsupmeters.com\/."},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the Linux Symposium. 269--287","author":"Eranian Stephane","year":"2006","unstructured":"Stephane Eranian . 2006 . Perfmon2: A flexible performance monitoring interface for Linux . In Proceedings of the Linux Symposium. 269--287 . Stephane Eranian. 2006. Perfmon2: A flexible performance monitoring interface for Linux. In Proceedings of the Linux Symposium. 269--287."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1610252.1610270"},{"key":"e_1_2_1_11_1","unstructured":"P. Greenhalgh. 2011. Big.LITTLE Processing with ARM CortexTM-A15 and Cortex-A7. White Paper.  P. Greenhalgh. 2011. Big.LITTLE Processing with ARM CortexTM-A15 and Cortex-A7. White Paper."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1945023.1945026"},{"key":"e_1_2_1_13_1","unstructured":"Gurobi Optimization Inc. 2011. Gurobi Optimizer Version 4.5. Retrieved from http:\/\/www.gurobi.com\/.  Gurobi Optimization Inc. 2011. Gurobi Optimizer Version 4.5. Retrieved from http:\/\/www.gurobi.com\/."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"},{"key":"e_1_2_1_15_1","volume-title":"Retrieved","author":"Intel Corp.","year":"2011","unstructured":"Intel Corp. 2011 . Intel Processor Specifications . Retrieved October 28, 2014 from http:\/\/ark.intel.com\/. (2011). Intel Corp. 2011. Intel Processor Specifications. Retrieved October 28, 2014 from http:\/\/ark.intel.com\/. (2011)."},{"key":"e_1_2_1_16_1","unstructured":"Aamer Jaleel. 2011. Memory characterization of workloads using instrumentation-driven simulation. http:\/\/www.jaleels.org\/ajaleel\/workload\/.  Aamer Jaleel. 2011. Memory characterization of workloads using instrumentation-driven simulation. http:\/\/www.jaleels.org\/ajaleel\/workload\/."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/800057.808695"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2008.48"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1755913.1755928"},{"volume-title":"Proceedings of MICRO 36","author":"Kumar Rakesh","key":"e_1_2_1_20_1","unstructured":"Rakesh Kumar , Keith I. Farkas , Norman P. Jouppi , Parthasarathy Ranganathan , and Dean M. Tullsen . 2003. Single-ISA heterogeneous multi-core architectures: The potential for processor power reduction . In Proceedings of MICRO 36 . Rakesh Kumar, Keith I. Farkas, Norman P. Jouppi, Parthasarathy Ranganathan, and Dean M. Tullsen. 2003. Single-ISA heterogeneous multi-core architectures: The potential for processor power reduction. In Proceedings of MICRO 36."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1152154.1152162"},{"volume-title":"Proceedings of HPCA.","author":"Li Tong","key":"e_1_2_1_22_1","unstructured":"Tong Li , P. Brett , R. Knauerhase , D. Koufaty , D. Reddy , and S. Hahn . 2010. Operating system support for overlapping-ISA heterogeneous multi-core architectures . In Proceedings of HPCA. Tong Li, P. Brett, R. Knauerhase, D. Koufaty, D. Reddy, and S. Hahn. 2010. Operating system support for overlapping-ISA heterogeneous multi-core architectures. In Proceedings of HPCA."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1944862.1944887"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/98124"},{"volume-title":"Proceedings of the USENIX Annual Technical Conference (2002-01-03)","author":"Larry","key":"e_1_2_1_25_1","unstructured":"Larry W. McVoy and Carl Staelin. 1996. lmbench: Portable tools for performance analysis . In Proceedings of the USENIX Annual Technical Conference (2002-01-03) . 279--294. Larry W. McVoy and Carl Staelin. 1996. lmbench: Portable tools for performance analysis. In Proceedings of the USENIX Annual Technical Conference (2002-01-03). 279--294."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/RTAS.2012.13"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.5555\/2485288.2485400"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1555754.1555793"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1755913.1755929"},{"key":"e_1_2_1_30_1","volume-title":"Retrieved","author":"Electronics Samsung","year":"2013","unstructured":"Samsung Electronics . 2013 . SAMSUNG highlightsinnovations in mobile experiences driven by components, in CES keynote . Retrieved October 28, 2014 from http:\/\/www.samsung.com\/us\/news\/20353. Samsung Electronics. 2013. SAMSUNG highlightsinnovations in mobile experiences driven by components, in CES keynote. Retrieved October 28, 2014 from http:\/\/www.samsung.com\/us\/news\/20353."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1531793.1531804"},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the IEEE International Symposium on Industrial Embedded Systems (SIES\u201911)","author":"Sousa P. B.","year":"2011","unstructured":"P. B. Sousa , B. Andersson , and E. Tovar . 2011. Implementing slot-based task-splitting multiprocessor scheduling . In Proceedings of the IEEE International Symposium on Industrial Embedded Systems (SIES\u201911) . 256--265. DOI: http:\/\/dx.doi.org\/10.1109\/SIES. 2011 .5953669 10.1109\/SIES.2011.5953669 P. B. Sousa, B. Andersson, and E. Tovar. 2011. Implementing slot-based task-splitting multiprocessor scheduling. In Proceedings of the IEEE International Symposium on Industrial Embedded Systems (SIES\u201911). 256--265. DOI: http:\/\/dx.doi.org\/10.1109\/SIES.2011.5953669"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1945023.1945032"}],"container-title":["ACM Transactions on Embedded Computing Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2566618","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2566618","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T07:01:00Z","timestamp":1750230060000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2566618"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,1,21]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2015,1,21]]}},"alternative-id":["10.1145\/2566618"],"URL":"https:\/\/doi.org\/10.1145\/2566618","relation":{},"ISSN":["1539-9087","1558-3465"],"issn-type":[{"type":"print","value":"1539-9087"},{"type":"electronic","value":"1558-3465"}],"subject":[],"published":{"date-parts":[[2015,1,21]]},"assertion":[{"value":"2012-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-12-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-01-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}