{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,13]],"date-time":"2026-03-13T09:27:31Z","timestamp":1773394051997,"version":"3.50.1"},"reference-count":37,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2027,1,24]],"date-time":"2027-01-24T00:00:00Z","timestamp":1800748800000},"content-version":"vor","delay-in-days":327,"URL":"http:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000185","name":"Defense Advanced Research Projects Agency","doi-asserted-by":"publisher","award":["HR001123C0130"],"award-info":[{"award-number":["HR001123C0130"]}],"id":[{"id":"10.13039\/100000185","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Embed. Comput. Syst."],"published-print":{"date-parts":[[2026,3,31]]},"abstract":"<jats:p>Modern data-intensive applications demand accelerators that can adapt to dynamic and high-throughput workloads. Coarse-Grained Reconfigurable Arrays (CGRAs) have emerged as promising candidates for such workloads due to their spatial architecture and run-time reconfigurability. However, ad-hoc hardware configurations and traditional static compilation techniques struggle to cope with the run-time irregularity and control-flow dynamism. This article first presents a systematic design space exploration (DSE) to identify the optimized hardware configurations tailored to application-specific constraints, such as area budget, throughput requirement, and throughput efficiency. Then, it proposes a communication-aware dynamic scheduling approach built on a hardware\/software co-design that combines preloading and scoreboard mechanisms to minimize reconfiguration overhead while maximizing interconnect bandwidth utilization. Evaluated on the optimized configurations and the respective spectrum sensing benchmarks, the proposed scheduling method achieves up to 1.6\u00d7 performance improvement over a baseline and 1.3\u00d7 over an adapted state-of-the-art (SOTA) dynamic scheduling strategy.<\/jats:p>","DOI":"10.1145\/3793672","type":"journal-article","created":{"date-parts":[[2026,1,24]],"date-time":"2026-01-24T21:07:18Z","timestamp":1769288838000},"page":"1-27","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["CADAS: Communication-Aware Dynamic Scheduler on CGRAs for Large-Volume and Real-Time Processing"],"prefix":"10.1145","volume":"25","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-3618-2385","authenticated-orcid":false,"given":"Jiahao","family":"Lin","sequence":"first","affiliation":[{"name":"University of Wisconsin-Madison","place":["Madison, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-5398-5708","authenticated-orcid":false,"given":"Hasan Umut","family":"Suluhan","sequence":"additional","affiliation":[{"name":"University of Arizona","place":["Tucson, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9859-7778","authenticated-orcid":false,"given":"Chaitali","family":"Chakrabarti","sequence":"additional","affiliation":[{"name":"Electrical Engineering, Arizona State University","place":["Tempe, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7982-8991","authenticated-orcid":false,"given":"Ali","family":"Akoglu","sequence":"additional","affiliation":[{"name":"University of Arizona","place":["Tucson, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5045-5535","authenticated-orcid":false,"given":"Umit","family":"Ogras","sequence":"additional","affiliation":[{"name":"University of Wisconsin-Madison","place":["Madison, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2026,3,3]]},"reference":[{"key":"e_1_3_1_2_2","unstructured":"Manel Ammar Mouna Baklouti and Mohamed Abid. 2016. The performance-energy tradeoff in embedded systems design: A survey of existing design space exploration tools and trends. International Journal of Computer Science and Information Security 14 5 (2016) 381\u2013391."},{"key":"e_1_3_1_3_2","doi-asserted-by":"crossref","unstructured":"Samet E. Arda Anish Krishnakumar A. Alper Goksoy Nirmal Kumbhare Joshua Mack Anderson L. Sartor Ali Akoglu Radu Marculescu and Umit Y. Ogras. 2020. DS3: A system-level domain-specific system-on-chip simulation framework. IEEE Transactions on Computers 69 8 (2020) 1248\u20131262.","DOI":"10.1109\/TC.2020.2986963"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/3503222.3507772"},{"key":"e_1_3_1_5_2","doi-asserted-by":"crossref","unstructured":"Joao P. Cerqueira Thomas J. Repetti Yu Pu Shivam Priyadarshi Martha A. Kim and Mingoo Seok. 2020. Catena: A near-threshold sub-0.4-mW 16-core programmable spatial array accelerator for the ultralow-power mobile and embedded Internet of Things. IEEE Journal of Solid-State Circuits 55 8 (2020) 2270\u20132284.","DOI":"10.1109\/JSSC.2020.2978137"},{"key":"e_1_3_1_6_2","doi-asserted-by":"crossref","unstructured":"Kuan-Yu Chen Thomas Mason Nelson Alireza Khadem Morteza Fayazi Sanjay Sri Vallabh Singapuram Ronald Dreslinski Nishil Talati Hun-Seok Kim and David Blaauw. 2025. Canalis: A throughput-optimized framework for real-time stream processing of wireless communication. ACM Transactions on Reconfigurable Technology and Systems 17 4 (2025) 1\u201332.","DOI":"10.1145\/3695880"},{"key":"e_1_3_1_7_2","doi-asserted-by":"crossref","unstructured":"Kuan-Yu Chen Chi-Sheng Yang Yu-Hsiu Sun Chien-Wei Tseng Morteza Fayazi Xin He Siying Feng Yufan Yue Trevor Mudge Ronald Dreslinski Hun-Seok Kim and David Blaauw. 2025. DAP: A 507-GMACs\/J 256-core domain adaptive processor for wireless communication and linear algebra kernels in 12-nm FINFET. IEEE Journal of Solid-State Circuits 60 2 (2025) 672\u2013684.","DOI":"10.1109\/JSSC.2024.3438758"},{"key":"e_1_3_1_8_2","doi-asserted-by":"crossref","unstructured":"Longlong Chen Jianfeng Zhu Yangdong Deng Zhaoshi Li Jian Chen Xiaowei Jiang Shouyi Yin Shaojun Wei and Leibo Liu. 2021. An elastic task scheduling scheme on coarse-grained reconfigurable architectures. IEEE Transactions on Parallel and Distributed Systems 32 12 (2021) 3066\u20133080.","DOI":"10.1109\/TPDS.2021.3084804"},{"key":"e_1_3_1_9_2","doi-asserted-by":"crossref","unstructured":"Sichao Chen Chang Cai Su Zheng Jiangnan Li Guowei Zhu Jingyuan Li Yazhou Yan Yuan Dai Wenbo Yin and Lingli Wang. 2024. Hiercgra: A novel framework for large-scale CGRA with hierarchical modeling and automated design space exploration. ACM Transactions on Reconfigurable Technology and Systems 17 2 (2024) 1\u201331.","DOI":"10.1145\/3656176"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2018.00014"},{"key":"e_1_3_1_11_2","doi-asserted-by":"crossref","unstructured":"Yu-Hsin Chen Tushar Krishna Joel S. Emer and Vivienne Sze. 2016. Eyeriss: An energy-efficient reconfigurable accelerator for deep convolutional neural networks. IEEE Journal of Solid-State Circuits 52 1 (2016) 127\u2013138.","DOI":"10.1109\/JSSC.2016.2616357"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/ASAP.2017.7995277"},{"key":"e_1_3_1_13_2","unstructured":"Tri Dao Dan Fu Stefano Ermon Atri Rudra and Christopher R\u00e9. 2022. Flashattention: Fast and memory-efficient exact attention with IO-awareness. In Proceedings of the 36th International Conference on Neural Information Processing Systems (NIPS\u201922). Curran Associates Inc. Red Hook NY USA Article 1189 (2022) 16."},{"key":"e_1_3_1_14_2","doi-asserted-by":"crossref","unstructured":"Barry De Bruin Kanishkan Vadivel Mark Wijtvliet Pekka J\u00e4\u00e4skel\u00e4inen and Henk Corporaal. 2024. R-blocks: An energy-efficient flexible and programmable CGRA. ACM Transactions on Reconfigurable Technology and Systems 17 2 (2024) 1\u201334.","DOI":"10.1145\/3656642"},{"key":"e_1_3_1_15_2","doi-asserted-by":"crossref","unstructured":"Loris Duch Soumya Basu Rub\u00e9n Braojos Giovanni Ansaloni Laura Pozzi and David Atienza. 2017. HEAL-WEAR: An ultra-low power heterogeneous system for bio-signal analysis. IEEE Transactions on Circuits and Systems I: Regular Papers 64 9 (2017) 2448\u20132461.","DOI":"10.1109\/TCSI.2017.2701499"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358316"},{"key":"e_1_3_1_17_2","unstructured":"Andrei Ivanov Nikoli Dryden Tal Ben-Nun Shigang Li and Torsten Hoefler. 2021. Data movement is all you need: A case study on optimizing transformers. In Proceedings of Machine Learning and Systems A. Smola A. Dimakis and I. Stoica (Eds.). 3 (2021) 711\u2013732."},{"key":"e_1_3_1_18_2","doi-asserted-by":"crossref","unstructured":"Antoni Ivanov Albena Mihovska Krasimir Tonchev and Vladimir Poulkov. 2018. Real-time adaptive spectrum sensing for cyclostationary and energy detectors. IEEE Aerospace and Electronic Systems Magazine 33 5-6 (2018) 20\u201333.","DOI":"10.1109\/MAES.2018.170098"},{"key":"e_1_3_1_19_2","doi-asserted-by":"crossref","unstructured":"Dimpal Janu Kuldeep Singh and Sandeep Kumar. 2022. Machine learning for cooperative spectrum sensing and sharing: A survey. Transactions on Emerging Telecommunications Technologies 33 1 (2022) e4352.","DOI":"10.1002\/ett.4352"},{"key":"e_1_3_1_20_2","doi-asserted-by":"crossref","unstructured":"Yoonjin Kim Rabi N. Mahapatra and Kiyoung Choi. 2009. Design space exploration for efficient resource utilization in coarse-grained reconfigurable architecture. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 18 10 (2009) 1471\u20131482.","DOI":"10.1109\/TVLSI.2009.2025280"},{"key":"e_1_3_1_21_2","unstructured":"Taeyoung Kong Kalhan Koul Priyanka Raina Mark Horowitz and Christopher Torng. 2023. Hardware abstractions and hardware mechanisms to support multi-task execution on coarse-grained reconfigurable arrays. arXiv:2301.00861. Retrieved from https:\/\/arxiv.org\/abs\/2301.00861"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.23919\/EuCAP51087.2021.9411042"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1145\/3600006.3613165"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/DAC18074.2021.9586255"},{"key":"e_1_3_1_25_2","doi-asserted-by":"crossref","unstructured":"Jiahao Lin H. Umut Suluhan Hyunwon Chung Arindam Dutta Anish Vipperla Gerard Gubash Jacob Holtom Bernd-Peter Paris Chaitali Chakrabarti Daniel W. Bliss David Blaauw Hun-Seok Kim Ali Akoglu and Umit Y. Ogras. 2025. An overview of challenges and requirements for real-time spectrum sensing in modern RF autonomy systems. IEEE Design & Test 42 6 (2025) 111\u2013126.","DOI":"10.1109\/MDAT.2025.3594311"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-73699-0_5"},{"key":"e_1_3_1_27_2","doi-asserted-by":"crossref","unstructured":"Leibo Liu Jianfeng Zhu Zhaoshi Li Yanan Lu Yangdong Deng Jie Han Shouyi Yin and Shaojun Wei. 2019. A survey of coarse-grained reconfigurable architecture and design: Taxonomy challenges and applications. ACM Computing Surveys (CSUR) 52 6 (2019) 1\u201339.","DOI":"10.1145\/3357375"},{"key":"e_1_3_1_28_2","doi-asserted-by":"crossref","unstructured":"Sumeet Singh Nagi Uneeb Rathore Krutikesh Sahoo Tim Ling Subramanian S. Iyer and Dejan Markovi\u0107. 2022. A 16-nm 784-core digital signal processor array assembled as a 2\u00d7 2 dielet with 10- \\(\\mu\\) m pitch interdielet I\/O for runtime multiprogram reconfiguration. IEEE Journal of Solid-State Circuits 58 1 (2022) 111\u2013123.","DOI":"10.1109\/JSSC.2022.3212685"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1145\/3466752.3480134"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1145\/1454115.1454140"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/1669112.1669160"},{"key":"e_1_3_1_32_2","doi-asserted-by":"crossref","unstructured":"Zhijin Qin Yue Gao Mark D. Plumbley and Clive G. Parini. 2015. Wideband spectrum sensing on real-time signals at sub-Nyquist sampling rates in single and cooperative multiple nodes. IEEE Transactions on Signal Processing 64 12 (2015) 3106\u20133117.","DOI":"10.1109\/TSP.2015.2512562"},{"key":"e_1_3_1_33_2","doi-asserted-by":"crossref","unstructured":"Shahid Shayaa Noor Ismawati Jaafar Shamshul Bahri Ainin Sulaiman Phoong Seuk Wai Yeong Wai Chung Arsalan Zahid Piprani and Mohammed Ali Al-Garadi. 2018. Sentiment analysis of big data: Methods applications and open challenges. IEEE Access 6 1 (2018) 37807\u201337827.","DOI":"10.1109\/ACCESS.2018.2851311"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1145\/3658617.3697564"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCAD66269.2025.11240855"},{"key":"e_1_3_1_36_2","unstructured":"Daniel Vazquez Jose Miranda Alfonso Rodriguez Andres Otero Pascuale Davide Schiavone and David Atienza. 2024. STRELA: Streaming elastic CGRA accelerator for embedded systems. arXiv:2404.12503. Retrieved from https:\/\/arxiv.org\/abs\/2404.12503"},{"key":"e_1_3_1_37_2","doi-asserted-by":"crossref","unstructured":"Anish Vipperla Narayanan Murugan Ali Akoglu and Chaitali Chakrabarti. 2025. Compilation framework for dynamically reconfigurable array architectures. IEEE Access 13 1 (2025) 196415\u2013196432.","DOI":"10.1109\/ACCESS.2025.3633942"},{"key":"e_1_3_1_38_2","doi-asserted-by":"crossref","unstructured":"Tevfik Yucek and Huseyin Arslan. 2009. A survey of spectrum sensing algorithms for cognitive radio applications. IEEE Communications Surveys & Tutorials 11 1 (2009) 116\u2013130.","DOI":"10.1109\/SURV.2009.090109"}],"container-title":["ACM Transactions on Embedded Computing Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3793672","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3793672","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,13]],"date-time":"2026-03-13T08:15:41Z","timestamp":1773389741000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3793672"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,3]]},"references-count":37,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2026,3,31]]}},"alternative-id":["10.1145\/3793672"],"URL":"https:\/\/doi.org\/10.1145\/3793672","relation":{},"ISSN":["1539-9087","1558-3465"],"issn-type":[{"value":"1539-9087","type":"print"},{"value":"1558-3465","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3,3]]},"assertion":[{"value":"2025-06-03","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-01-20","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-03-03","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}