{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:15:31Z","timestamp":1750306531808,"version":"3.41.0"},"reference-count":30,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2015,1,9]],"date-time":"2015-01-09T00:00:00Z","timestamp":1420761600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"EU TERAFLUX project"},{"name":"University of Cyprus through a scholarship for George Matheou"},{"name":"IKYK foundation"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2015,1,9]]},"abstract":"<jats:p>The exponential growth of sequential processors has come to an end, and thus, parallel processing is probably the only way to achieve performance growth. We propose the development of parallel architectures based on data-driven scheduling. Data-driven scheduling enforces only a partial ordering as dictated by the true data dependencies, which is the minimum synchronization possible. This is very beneficial for parallel processing because it enables it to exploit the maximum possible parallelism. We provide architectural support for data-driven execution for the Data-Driven Multithreading (DDM) model. In the past, DDM has been evaluated mostly in the form of virtual machines. The main contribution of this work is the development of a highly efficient hardware support for data-driven execution and its integration into a multicore system with eight cores on a Virtex-6 FPGA. The DDM semantics make barriers and cache coherence unnecessary, which reduces the synchronization latencies significantly and makes the cache simpler. The performance evaluation has shown that the support for data-driven execution is very efficient with negligible overheads. Our prototype can support very small problem sizes (matrix 16\u00d716) and ultra-lightweight threads (block of 4x4) that achieve speedups close to linear. Such results cannot be achieved by software-based systems.<\/jats:p>","DOI":"10.1145\/2686874","type":"journal-article","created":{"date-parts":[[2015,1,12]],"date-time":"2015-01-12T20:02:10Z","timestamp":1421092930000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["Architectural Support for Data-Driven Execution"],"prefix":"10.1145","volume":"11","author":[{"given":"George","family":"Matheou","sequence":"first","affiliation":[{"name":"University of Cyprus"}]},{"given":"Paraskevas","family":"Evripidou","sequence":"additional","affiliation":[{"name":"University of Cyprus"}]}],"member":"320","published-online":{"date-parts":[[2015,1,9]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Samer Arandi. 2012. The Data-Driven Multithreading Virtual Machine. Ph.D. Dissertation.  Samer Arandi. 2012. The Data-Driven Multithreading Virtual Machine. Ph.D. Dissertation."},{"key":"e_1_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Samer Arandi and Paraskevas Evripidou. 2010. Programming multi-core architectures using Data-Flow techniques. IEEE 152--161. DOI:http:\/\/dx.doi.org\/10.1109\/ICSAMOS.2010.5642072  Samer Arandi and Paraskevas Evripidou. 2010. Programming multi-core architectures using Data-Flow techniques. IEEE 152--161. DOI:http:\/\/dx.doi.org\/10.1109\/ICSAMOS.2010.5642072","DOI":"10.1109\/ICSAMOS.2010.5642072"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1944862.1944869"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/DFM.2011.16"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.1982.1653940"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1188455.1188546"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1454115.1454128"},{"volume-title":"Retrieved","year":"2014","author":"BSC.","key":"e_1_2_1_8_1"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2004.65"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2012.135"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2010.13"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/511425.511428"},{"volume-title":"Proceedings of the 1990 International Conference on Parallel Processing.","year":"1990","author":"Evripidou Paraskevas","key":"e_1_2_1_13_1"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2011.15"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2155620.2155628"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2465.2468"},{"volume-title":"Euro-Par 2004 Parallel Processing, Marco Danelutto, Marco Vanneschi, and Domenico Laforenza (Eds.)","series-title":"Lecture Notes in Computer Science","author":"Kyriacou Costas","key":"e_1_2_1_17_1"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2006.136"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/SAMOS.2013.6621136"},{"volume-title":"Proceedings of the 2013 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'13)","year":"2013","author":"Michael George","key":"e_1_2_1_20_1"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1147\/rd.515.0593"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342009106195"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/859618.859667"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2008.74"},{"volume-title":"WaveScalar. In Proceedings of the 36th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO'36)","year":"2003","author":"Swanson Steven","key":"e_1_2_1_25_1"},{"key":"e_1_2_1_26_1","unstructured":"Kevin Bryan Theobald. 1999. Earth: An Efficient Architecture for Running Threads. Ph.D. Dissertation. Montreal Quebec Canada Canada. Advisor(s) Gao Guang R. AAINQ50269.  Kevin Bryan Theobald. 1999. Earth: An Efficient Architecture for Running Threads. Ph.D. Dissertation. Montreal Quebec Canada Canada. Advisor(s) Gao Guang R. AAINQ50269."},{"volume-title":"Proceedings of the 11th Workshop on Interaction between Compilers and Computer Architectures. Citeseer, 32","year":"2007","author":"Trancoso Pedro","key":"e_1_2_1_27_1"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2541228.2555316"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/225830.223990"},{"volume-title":"Retrieved","year":"2014","key":"e_1_2_1_30_1"}],"container-title":["ACM Transactions on Architecture and Code Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2686874","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2686874","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T06:12:13Z","timestamp":1750227133000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2686874"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,1,9]]},"references-count":30,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2015,1,9]]}},"alternative-id":["10.1145\/2686874"],"URL":"https:\/\/doi.org\/10.1145\/2686874","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"type":"print","value":"1544-3566"},{"type":"electronic","value":"1544-3973"}],"subject":[],"published":{"date-parts":[[2015,1,9]]},"assertion":[{"value":"2014-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-01-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}