{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T18:02:30Z","timestamp":1759341750630,"version":"3.41.0"},"reference-count":59,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2024,12,10]],"date-time":"2024-12-10T00:00:00Z","timestamp":1733788800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Air Force Research Laboratory (AFRL) and Defense Advanced Research Projects Agency","award":["FA8650-18-2-7860"],"award-info":[{"award-number":["FA8650-18-2-7860"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Embed. Comput. Syst."],"published-print":{"date-parts":[[2025,1,31]]},"abstract":"<jats:p>As the landscape of computing advances, system designers are increasingly exploring methodologies that leverage higher levels of heterogeneity to enhance performance within constrained size, weight, power, and cost parameters. CEDR (Compiler-integrated Extensible DSSoC Runtime) stands as an ecosystem facilitating productive and efficient application development and deployment across heterogeneous computing systems. It fosters the co-design of applications, scheduling heuristics, and accelerators within a unified framework. Our goal is to present CEDR as a promising environment for lifting the barriers to research on heterogeneous systems and addressing the broader challenges within domain-specific architectures. We introduce CEDR and discuss the evolutionary design decisions underlying its programming model. Subsequently, we explore its utility for a broad range of users through design sweeps on off-the-shelf heterogeneous platforms across scheduling heuristics, hardware compositions, and workload scenarios.<\/jats:p>","DOI":"10.1145\/3687463","type":"journal-article","created":{"date-parts":[[2024,8,8]],"date-time":"2024-08-08T11:14:22Z","timestamp":1723115662000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Tutorial: A Novel Runtime Environment for Accelerator-Rich Heterogeneous Architectures"],"prefix":"10.1145","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1066-5578","authenticated-orcid":false,"given":"Joshua","family":"Mack","sequence":"first","affiliation":[{"name":"Electrical and Computer Engineering, The University of Arizona, Tucson, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2419-1860","authenticated-orcid":false,"given":"Anish","family":"Krishnakumar","sequence":"additional","affiliation":[{"name":"Electrical and Computer Engineering, University of Wisconsin-Madison, Madison, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5045-5535","authenticated-orcid":false,"given":"Umit","family":"Ogras","sequence":"additional","affiliation":[{"name":"Electrical and Computer Engineering, University of Wisconsin-Madison, Madison, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7982-8991","authenticated-orcid":false,"given":"Ali","family":"Akoglu","sequence":"additional","affiliation":[{"name":"Electrical and Computer Engineering, The University of Arizona, Tucson, United States"}]}],"member":"320","published-online":{"date-parts":[[2024,12,10]]},"reference":[{"key":"e_1_3_3_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3282307"},{"key":"e_1_3_3_3_2","doi-asserted-by":"publisher","DOI":"10.1145\/3563946"},{"key":"e_1_3_3_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/3358203"},{"key":"e_1_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/3199610.3199614"},{"key":"e_1_3_3_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/ReCoSoC.2018.8449373"},{"key":"e_1_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2019.2907493"},{"key":"e_1_3_3_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2017.2773424"},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/2228360.2228411"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/VLSI-SoC.2019.8920374"},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW50202.2020.00016"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/3529257"},{"key":"e_1_3_3_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW59300.2023.00016"},{"key":"e_1_3_3_14_2","unstructured":"Umit Ogras Joshua Mack and Ali Akoglu. (n. d.). CEDR: A Novel Runtime Environment for Accelerator-Rich Heterogeneous Architectures. https:\/\/esweek.org\/education-class\/ec62023 Embedded Systems Week (ESWEEK) https:\/\/esweek.org\/education-class\/ec6"},{"key":"e_1_3_3_15_2","volume-title":"Proceedings of the GNU Radio Conference","volume":"7","author":"Mack Joshua","year":"2022","unstructured":"Joshua Mack, Serhan Gener, Ali Akoglu, Jacob Holtom, Alex Chiriyath, Chaitali Chakrabarti, Daniel Bliss, Anish Krishnakumar, Alper Goksoy, and Umit Ogras. 2022. GNU radio and CEDR: Runtime scheduling to heterogeneous accelerators. In Proceedings of the GNU Radio Conference, Vol. 7."},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00121"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2021.3135876"},{"key":"e_1_3_3_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/VLSI-SoC54400.2022.9939623"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/FCCM.2014.12"},{"key":"e_1_3_3_20_2","volume-title":"BOOM v2: an open-source out-of-order RISC-V core","author":"Celio Christopher","year":"2017","unstructured":"Christopher Celio, Pi-Feng Chiu, Borivoje Nikolic, David A. Patterson, and Krste Asanovi\u0107. 2017. BOOM v2: an open-source out-of-order RISC-V core. Technical Report UCB\/EECS-2017-157. EECS Department, University of California, Berkeley. http:\/\/www2.eecs.berkeley.edu\/Pubs\/TechRpts\/2017\/EECS-2017-157.html"},{"key":"e_1_3_3_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/MDAT.2023.3291331"},{"key":"e_1_3_3_22_2","unstructured":"Timothy Roscoe. 2021. It\u2019s time for operating systems to rediscover hardware. USENIX Association. Retrieved August 13 2024 from https:\/\/www.usenix.org\/conference\/osdi21\/presentation\/fri-keynote"},{"key":"e_1_3_3_23_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0022-0000(75)80008-0"},{"key":"e_1_3_3_24_2","doi-asserted-by":"publisher","DOI":"10.3390\/jlpea13040056"},{"key":"e_1_3_3_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/LES.2021.3110426"},{"key":"e_1_3_3_26_2","doi-asserted-by":"publisher","unstructured":"H. Topcuoglu S. Hariri and Min-You Wu. 2002. Performance-effective and low-complexity task scheduling for heterogeneous computing. In IEEE Transactions on Parallel and Distributed Systems 13 3 (2002) 260\u2013274. DOI:10.1109\/71.993206","DOI":"10.1109\/71.993206"},{"key":"e_1_3_3_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/PDP.2010.56"},{"key":"e_1_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2013.57"},{"key":"e_1_3_3_29_2","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.3944"},{"key":"e_1_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2017.2730876"},{"key":"e_1_3_3_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/iThings-GreenCom-CPSCom-SmartData.2016.143"},{"key":"e_1_3_3_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2020.3012861"},{"key":"e_1_3_3_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/3005745.3005750"},{"key":"e_1_3_3_34_2","unstructured":"TraceAtlas. 2020. Retrieved January 20 2020 from https:\/\/github.com\/ruhrie\/TraceAtlas\/"},{"key":"e_1_3_3_35_2","unstructured":"Vivek Yadav. Small U-Net for vehicle detection. (n. d.). Retrieved June 5 2023 from github.com\/vxy10\/p5_VehicleDetection_Unet"},{"key":"e_1_3_3_36_2","unstructured":"Imagenette Dataset github.com\/fastai\/imagenette. (n. d.). Retrieved June 5 2023 from github.com\/fastai\/imagenette"},{"key":"e_1_3_3_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_3_38_2","article-title":"Speech commands: A dataset for limited-vocabulary speech recognition","author":"Warden P.","year":"2018","unstructured":"P. Warden. 2018. Speech commands: A dataset for limited-vocabulary speech recognition. ArXiv e-prints (April2018). arxiv:cs.CL\/1804.03209https:\/\/arxiv.org\/abs\/1804.03209","journal-title":"ArXiv e-prints"},{"key":"e_1_3_3_39_2","volume-title":"The Rocket Chip Generator","author":"Asanovi\u0107 Krste","year":"2016","unstructured":"Krste Asanovi\u0107, Rimas Avizienis, Jonathan Bachrach, Scott Beamer, David Biancolin, Christopher Celio, Henry Cook, Daniel Dabbelt, John Hauser, Adam Izraelevitz, Sagar Karandikar, Ben Keller, Donggyu Kim, John Koenig, Yunsup Lee, Eric Love, Martin Maas, Albert Magyar, Howard Mao, Miquel Moreto, Albert Ou, David A. Patterson, Brian Richards, Colin Schmidt, Stephen Twigg, Huy Vo, and Andrew Waterman. 2016. The Rocket Chip Generator. Technical Report UCB\/EECS-2016-17. EECS Department, University of California, Berkeley. http:\/\/www2.eecs.berkeley.edu\/Pubs\/TechRpts\/2016\/EECS-2016-17.html"},{"key":"e_1_3_3_40_2","unstructured":"CEDR Tutorial Set. (n. d.). Retrieved January 31 2024 from https:\/\/github.com\/UA-RCL\/CEDR\/blob\/tutorial\/CEDR_tutorial.md"},{"key":"e_1_3_3_41_2","unstructured":"Joshua Mack Sahil Hassan and Ali Akoglu. CEDR: A Holistic Software and Hardware Design Environment for FPGA-Integrated Heterogeneous Systems. (n. d.). Tutorial: 2024 International Symposium on Field Programmable Gate Arrays March 3-5 2024 Monterey CA.https:\/\/www.isfpga.org\/workshops-tutorials\/#t8"},{"key":"e_1_3_3_42_2","first-page":"1","volume-title":"2016 International Conference on Hardware\/Software Codesign and System Synthesis (CODES+ISSS)","author":"Donyanavard Bryan","year":"2016","unstructured":"Bryan Donyanavard, Tiago M\u00fcck, Santanu Sarma, and Nikil Dutt. 2016. SPARTA: Runtime task allocation for energy efficient heterogeneous manycores. In 2016 International Conference on Hardware\/Software Codesign and System Synthesis (CODES+ISSS). 1\u201310."},{"key":"e_1_3_3_43_2","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358312"},{"issue":"5","key":"e_1_3_3_44_2","first-page":"48:1\u201348:26","article-title":"SEAMS: Self-optimizing runtime manager for approximate memory hierarchies","volume":"20","author":"Maity Biswadip","year":"2021","unstructured":"Biswadip Maity, Bryan Donyanavard, Anmol Surhonne, Amir Rahmani, Andreas Herkersdorf, and Nikil Dutt. 2021. SEAMS: Self-optimizing runtime manager for approximate memory hierarchies. ACM Transactions on Embedded Computing Systems 20, 5 (July2021), 48:1\u201348:26. DOI:10\/gm3hnz","journal-title":"ACM Transactions on Embedded Computing Systems"},{"key":"e_1_3_3_45_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.sysarc.2019.01.006"},{"key":"e_1_3_3_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/VLSI-SoC.2019.8920374"},{"key":"e_1_3_3_47_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11265-021-01640-8"},{"key":"e_1_3_3_48_2","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.1631"},{"key":"e_1_3_3_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/3199610.3199614"},{"key":"e_1_3_3_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/HPEC49654.2021.9622873"},{"key":"e_1_3_3_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW55747.2022.00017"},{"key":"e_1_3_3_52_2","volume-title":"Advances in Neural Information Processing Systems","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alch\u00e9-Buc, E. Fox, and R. Garnett (Eds.), Vol. 32. Curran Associates, Inc.https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2019\/file\/bdbca288fee7f92f2bfa9f7012727740-Paper.pdf"},{"key":"e_1_3_3_53_2","unstructured":"GNU Radio Website. (n. d.). Retrieved January 31 2024 from https:\/\/www.gnuradio.org"},{"key":"e_1_3_3_54_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2021.3104255"},{"key":"e_1_3_3_55_2","unstructured":"Richard Uhrie Chaitali Chakrabarti and John Brunhaver. 2020. Automated parallel kernel extraction from dynamic application traces. (2020). arXiv:2001.09995"},{"issue":"7","key":"e_1_3_3_56_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3544102","article-title":"Microarchitectural attacks in heterogeneous systems: A survey","volume":"55","author":"Naghibijouybari Hoda","year":"2022","unstructured":"Hoda Naghibijouybari, Esmaeil Mohammadian Koruyeh, and Nael Abu-Ghazaleh. 2022. Microarchitectural attacks in heterogeneous systems: A survey. Comput. Surveys 55, 7 (2022), 1\u201340.","journal-title":"Comput. Surveys"},{"key":"e_1_3_3_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/CCGRID.2019.00038"},{"key":"e_1_3_3_58_2","unstructured":"ARM AMBA. (n. d.). AXI4-stream protocol specification. Volume IHI 51A."},{"key":"e_1_3_3_59_2","unstructured":"udmabuf. (n. d.). Udmabuf A Userspace mappable DMA Buffer. Retrieved October 13 2021 from https:\/\/github.com\/ikwzm\/udmabuf"},{"key":"e_1_3_3_60_2","first-page":"1","volume-title":"2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","author":"Suluhan H. Umut","year":"2024","unstructured":"H. Umut Suluhan, Serhan Gener, Fusco Alexander, Joshua Mack, Ismet Dagli, Mehmet Belviranli, Cagatay Edemen, and Ali Akoglu. 2024. A runtime manager integrated emulation environment for heterogeneous SoC design with RISC-V cores. In 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). 1\u20138."}],"container-title":["ACM Transactions on Embedded Computing Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3687463","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3687463","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:05:59Z","timestamp":1750291559000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3687463"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,10]]},"references-count":59,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2025,1,31]]}},"alternative-id":["10.1145\/3687463"],"URL":"https:\/\/doi.org\/10.1145\/3687463","relation":{},"ISSN":["1539-9087","1558-3465"],"issn-type":[{"type":"print","value":"1539-9087"},{"type":"electronic","value":"1558-3465"}],"subject":[],"published":{"date-parts":[[2024,12,10]]},"assertion":[{"value":"2024-04-02","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-07-24","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-12-10","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}