{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,21]],"date-time":"2026-02-21T18:34:37Z","timestamp":1771698877338,"version":"3.50.1"},"reference-count":28,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2015,1,21]],"date-time":"2015-01-21T00:00:00Z","timestamp":1421798400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Spanish Ministry of Education"},{"name":"Swedish Research Council through the Linnaeus centre of excellence UPMARC"},{"name":"Uppsala Programming for Multicore Architectures; COST Action IC0805 \u201cOpen European Network for High Performance Computing on Complex Environments\u201d"},{"DOI":"10.13039\/501100002809","name":"Generalitat de Catalunya","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100002809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Embed. Comput. Syst."],"published-print":{"date-parts":[[2015,1,21]]},"abstract":"<jats:p>Dependency-aware task-based parallel programming models have proven to be successful for developing efficient application software for multicore-based computer architectures. The programming model is amenable to programmers, thereby supporting productivity, whereas hardware performance is achieved through a runtime system that dynamically schedules tasks onto cores in such a way that all dependencies are respected. However, even if the scheduling is completely successful with respect to load balancing, the scaling with the number of cores may be suboptimal due to resource contention. Here we consider the problem of scheduling tasks not only with respect to their interdependencies but also with respect to their usage of resources, such as memory and bandwidth. 
At the software level, this is achieved by user annotations of the task resource consumption. In the runtime system, the annotations are translated into scheduling constraints. Experimental results for different hardware, demonstrating performance gains both for model examples and real applications, are presented. Furthermore, we provide a set of tools to detect resource sensitivity and predict the performance improvements that can be achieved by resource-aware scheduling. These tools are solely based on parallel execution traces and require no instrumentation or modification of the application code.<\/jats:p>","DOI":"10.1145\/2638554","type":"journal-article","created":{"date-parts":[[2015,1,28]],"date-time":"2015-01-28T14:05:51Z","timestamp":1422453951000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":24,"title":["Resource-Aware Task Scheduling"],"prefix":"10.1145","volume":"14","author":[{"given":"Martin","family":"Tillenius","sequence":"first","affiliation":[{"name":"Uppsala University, Uppsala, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Elisabeth","family":"Larsson","sequence":"additional","affiliation":[{"name":"Uppsala University, Uppsala, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rosa M.","family":"Badia","sequence":"additional","affiliation":[{"name":"Barcelona Supercomputing Center, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xavier","family":"Martorell","sequence":"additional","affiliation":[{"name":"Barcelona Supercomputing Center, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2015,1,21]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/180\/1\/012037"},{"key":"e_1_2_1_2_1","unstructured":"R. Al-Omary Guillermo Miranda Xavier Martorell Jesus Labarta Rosa M. Badia D. 
Keyes and Hatem Ltaief. 2013. Dense Cholesky factorization on NUMA architectures with socket-aware work stealing. Submitted."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.1631"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10766-010-0135-4"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1810085.1810113"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2008.4771801"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1880018.1880019"},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the 6th Swedish Workshop on Multicore Computing. 61--64","author":"Ceballos G\u00e9rman","year":"2013","unstructured":"G\u00e9rman Ceballos and David Black-Schaffer. 2013. Shared resource sensitivity in task-based runtime systems. In Proceedings of the 6th Swedish Workshop on Multicore Computing. 
61--64."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.5555\/1413370.1413375"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1142\/S0129626411000151"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ECRTS.2012.24"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2011.15"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/1898953.1899072"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.5555\/1247360.1247386"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1629335.1629369"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11227-010-0405-3"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPA.2012.109"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTR.2008.4663765"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342009106195"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/378993.379244"},{"key":"e_1_2_1_21_1","volume-title":"Leveraging Multicore Processors for Scientific Computing. Licentiate thesis. Department of Information Technology","author":"Tillenius Martin","unstructured":"Martin Tillenius. 2012a. Leveraging Multicore Processors for Scientific Computing. Licentiate thesis. Department of Information Technology, Uppsala University."},{"key":"e_1_2_1_22_1","volume-title":"Retrieved","author":"Tillenius Martin","year":"2012","unstructured":"Martin Tillenius. 2012b. SuperGlue Project. 
Retrieved October 28, 2014, from http:\/\/www.it.uu.se\/research\/scicomp\/software\/superglue"},{"key":"e_1_2_1_23_1","volume-title":"PARA 2010: State of the Art in Scientific and Parallel Computing.","author":"Tillenius Martin","year":"2010","unstructured":"Martin Tillenius and Elisabeth Larsson. 2010. An efficient task-based approach for solving the n-body problem on multicore architectures. In PARA 2010: State of the Art in Scientific and Parallel Computing."},{"key":"e_1_2_1_24_1","unstructured":"Martin Tillenius Elisabeth Larsson Rosa M. Badia and Xavier Martorell. 2013. Resource aware task scheduling. In Proceedings of the 8th International Conference on High-Performance and Embedded Architectures and Compilers (Workshop on Parallel Programming and Run-Time Management Techniques for Many-Core Architectures). 
ACM New York NY."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/PACT.2011.7"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2008.12.006"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCSE.2012.45"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2379776.2379780"}],"container-title":["ACM Transactions on Embedded Computing Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2638554","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2638554","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T08:10:33Z","timestamp":1750234233000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2638554"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,1,21]]},"references-count":28,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2015,1,21]]}},"alternative-id":["10.1145\/2638554"],"URL":"https:\/\/doi.org\/10.1145\/2638554","relation":{},"ISSN":["1539-9087","1558-3465"],"issn-type":[{"value":"1539-9087","type":"print"},{"value":"1558-3465","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,1,21]]},"assertion":[{"value":"2013-05-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-01-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}