{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T16:21:18Z","timestamp":1781108478366,"version":"3.54.1"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,6,22]],"date-time":"2021-06-22T00:00:00Z","timestamp":1624320000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"NSF (National Science Foundation)","doi-asserted-by":"publisher","award":["1836752"],"award-info":[{"award-number":["1836752"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,6,22]]},"DOI":"10.1145\/3447555.3465326","type":"proceedings-article","created":{"date-parts":[[2021,6,23]],"date-time":"2021-06-23T04:49:35Z","timestamp":1624423775000},"page":"302-308","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":20,"title":["Design Considerations for Energy-efficient Inference on Edge Devices"],"prefix":"10.1145","author":[{"given":"Walid A.","family":"Hanafy","sequence":"first","affiliation":[{"name":"University of Massachusetts Amherst"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tergel","family":"Molom-Ochir","sequence":"additional","affiliation":[{"name":"University of Massachusetts Amherst"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Rohan","family":"Shenoy","sequence":"additional","affiliation":[{"name":"University of Massachusetts Amherst"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2021,6,22]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Process. Symp. 113--122","author":"Abe Yuki","year":"2014","unstructured":"Yuki Abe , Hiroshi Sasaki , Shinpei Kato , Koji Inoue , Masato Edahiro , and Martin Peres . 2014 . Power and Performance Characterization and Modeling of GPU-Accelerated Systems. In 2014 IEEE 28th Int. Parallel Distrib . Process. Symp. 113--122 . Yuki Abe, Hiroshi Sasaki, Shinpei Kato, Koji Inoue, Masato Edahiro, and Martin Peres. 2014. Power and Performance Characterization and Modeling of GPU-Accelerated Systems. In 2014 IEEE 28th Int. Parallel Distrib. Process. Symp. 113--122."},{"key":"e_1_3_2_1_2_1","volume-title":"Retrieved","year":"2021","unstructured":"Apple. 2021 . Apple Neural Engine . Retrieved January 15, 2021 from https:\/\/www.apple.com\/newsroom\/2020\/11\/apple-unleashes-m1\/ Apple. 2021. Apple Neural Engine. Retrieved January 15, 2021 from https:\/\/www.apple.com\/newsroom\/2020\/11\/apple-unleashes-m1\/"},{"key":"e_1_3_2_1_3_1","volume-title":"PredJoule: A Timing-Predictable Energy Optimization Framework for Deep Neural Networks. In 2018 IEEE Real-Time Systems Symposium (RTSS). 107--118","author":"Bateni S.","unstructured":"S. Bateni , H. Zhou , Y. Zhu , and C. Liu . 2018 . PredJoule: A Timing-Predictable Energy Optimization Framework for Deep Neural Networks. In 2018 IEEE Real-Time Systems Symposium (RTSS). 107--118 . S. Bateni, H. Zhou, Y. Zhu, and C. Liu. 2018. PredJoule: A Timing-Predictable Energy Optimization Framework for Deep Neural Networks. In 2018 IEEE Real-Time Systems Symposium (RTSS). 107--118."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2019.2921977"},{"key":"e_1_3_2_1_5_1","volume-title":"Proceedings of the 16th ACM Conference on Embedded Networked Sensor Systems","author":"Chen Kaifei","unstructured":"Kaifei Chen , Tong Li , Hyung-Sin Kim , David E. Culler , and Randy H. Katz . 2018. MARVEL: Enabling Mobile Augmented Reality with Low Energy and Low Latency . In Proceedings of the 16th ACM Conference on Embedded Networked Sensor Systems ( Shenzhen, China) (SenSys '18). Association for Computing Machinery, New York, NY, USA, 292--304. Kaifei Chen, Tong Li, Hyung-Sin Kim, David E. Culler, and Randy H. Katz. 2018. MARVEL: Enabling Mobile Augmented Reality with Low Energy and Low Latency. In Proceedings of the 16th ACM Conference on Embedded Networked Sensor Systems (Shenzhen, China) (SenSys '18). Association for Computing Machinery, New York, NY, USA, 292--304."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.351"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2954546"},{"key":"e_1_3_2_1_10_1","volume-title":"Irwin","author":"Liang Qianlin","year":"2020","unstructured":"Qianlin Liang , Prashant J. Shenoy , and David E . Irwin . 2020 . AI on the Edge : Characterizing AI-based IoT Applications Using Specialized Edge Architectures. In IISWC. 145--156. Qianlin Liang, Prashant J. Shenoy, and David E. Irwin. 2020. AI on the Edge: Characterizing AI-based IoT Applications Using Specialized Edge Architectures. In IISWC. 145--156."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-12087-4"},{"key":"e_1_3_2_1_12_1","volume-title":"Automating Deep Neural Network Model Selection for Edge Inference. In 2019 IEEE First International Conference on Cognitive Machine Intelligence","author":"Lu B.","unstructured":"B. Lu , J. Yang , L. Y. Chen , and S. Ren . 2019 . Automating Deep Neural Network Model Selection for Edge Inference. In 2019 IEEE First International Conference on Cognitive Machine Intelligence ( CogMI). 184--193. B. Lu, J. Yang, L. Y. Chen, and S. Ren. 2019. Automating Deep Neural Network Model Selection for Edge Inference. In 2019 IEEE First International Conference on Cognitive Machine Intelligence (CogMI). 184--193."},{"key":"e_1_3_2_1_13_1","volume-title":"Retrieved","year":"2020","unstructured":"Nvidia. 2020 . NVIDIA Jetson Modules . Retrieved October 19, 2020 from https:\/\/developer.nvidia.com\/embedded\/jetson-modules Nvidia. 2020. NVIDIA Jetson Modules. Retrieved October 19, 2020 from https:\/\/developer.nvidia.com\/embedded\/jetson-modules"},{"key":"e_1_3_2_1_14_1","first-page":"8024","article-title":"PyTorch: An Imperative Style, High-Performance Deep Learning Library","volume":"32","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke , Sam Gross , Francisco Massa , Adam Lerer , James Bradbury , Gregory Chanan , Trevor Killeen , Zeming Lin , Natalia Gimelshein , Luca Antiga , Alban Desmaison , Andreas Kopf , Edward Yang , Zachary DeVito , Martin Raison , Alykhan Tejani , Sasank Chilamkurthy , Benoit Steiner , Lu Fang , Junjie Bai , and Soumith Chintala . 2019 . PyTorch: An Imperative Style, High-Performance Deep Learning Library . In Advances in Neural Information Processing Systems 32. 8024 -- 8035 . Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32. 8024--8035.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.91"},{"key":"e_1_3_2_1_16_1","volume-title":"Proceedings of the 2nd International Workshop on Challenges in Artificial Intelligence and Machine Learning for Internet of Things","author":"Samplawski Colin","unstructured":"Colin Samplawski , Jin Huang , Deepak Ganesan , and Benjamin M. Marlin . 2020. Towards Objection Detection Under IoT Resource Constraints: Combining Partitioning, Slicing and Compression . In Proceedings of the 2nd International Workshop on Challenges in Artificial Intelligence and Machine Learning for Internet of Things ( Virtual Event, Japan) (AIChallengeIoT '20). Association for Computing Machinery, New York, NY, USA, 14--20. Colin Samplawski, Jin Huang, Deepak Ganesan, and Benjamin M. Marlin. 2020. Towards Objection Detection Under IoT Resource Constraints: Combining Partitioning, Slicing and Compression. In Proceedings of the 2nd International Workshop on Challenges in Artificial Intelligence and Machine Learning for Internet of Things (Virtual Event, Japan) (AIChallengeIoT '20). Association for Computing Machinery, New York, NY, USA, 14--20."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3381831"},{"key":"e_1_3_2_1_18_1","unstructured":"Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv:1409.1556 [cs.CV]  Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv:1409.1556 [cs.CV]"},{"key":"e_1_3_2_1_19_1","volume-title":"Rethinking the Inception Architecture for Computer Vision. In 2016 IEEE Conf. Comput. Vis. Pattern Recognit.","volume":"2826","author":"Szegedy Christian","year":"2016","unstructured":"Christian Szegedy , Vanhoucke Vincent , Sergey Ioffe , Jonathon Shlens , and Zbigniew Wojna . 2016 . Rethinking the Inception Architecture for Computer Vision. In 2016 IEEE Conf. Comput. Vis. Pattern Recognit. , Vol. 2016-Decem. 2818-- 2826 . Christian Szegedy, Vanhoucke Vincent, Sergey Ioffe, Jonathon Shlens, and Zbigniew Wojna. 2016. Rethinking the Inception Architecture for Computer Vision. In 2016 IEEE Conf. Comput. Vis. Pattern Recognit., Vol. 2016-Decem. 2818--2826."},{"key":"e_1_3_2_1_20_1","volume-title":"Le","author":"Tan M.","year":"2019","unstructured":"M. Tan and Quoc V . Le . 2019 . EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. ArXiv abs\/1905.11946 (2019). M. Tan and Quoc V. Le. 2019. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. ArXiv abs\/1905.11946 (2019)."},{"key":"e_1_3_2_1_21_1","first-page":"504","article-title":"Neural network accelerator with parameters resident on chip","volume":"10","author":"Olivier Temam","year":"2019","unstructured":"Olivier Temam et al. 2019 . Neural network accelerator with parameters resident on chip . US Patent 10 , 504 ,022. Olivier Temam et al. 2019. Neural network accelerator with parameters resident on chip. US Patent 10,504,022.","journal-title":"US Patent"},{"key":"e_1_3_2_1_22_1","volume-title":"ALERT: Accurate Learning for Energy and Timeliness. In (USENIX ATC 20). 353--369.","author":"Wan Chengcheng","year":"2020","unstructured":"Chengcheng Wan , Muhammad Santriaji , Eri Rogers , Henry Hoffmann , Michael Maire , and Shan Lu . 2020 . ALERT: Accurate Learning for Energy and Timeliness. In (USENIX ATC 20). 353--369. Chengcheng Wan, Muhammad Santriaji, Eri Rogers, Henry Hoffmann, Michael Maire, and Shan Lu. 2020. ALERT: Accurate Learning for Energy and Timeliness. In (USENIX ATC 20). 353--369."},{"key":"e_1_3_2_1_23_1","volume-title":"w. wang, and F. Yan","author":"Zhang C.","year":"2020","unstructured":"C. Zhang , M. Yu , w. wang, and F. Yan . 2020 . Enabling Cost-Effective, SLO- Aware Machine Learning Inference Serving on Public Cloud. IEEE Transactions on Cloud Computing ( 2020), 1--1. C. Zhang, M. Yu, w. wang, and F. Yan. 2020. Enabling Cost-Effective, SLO-Aware Machine Learning Inference Serving on Public Cloud. IEEE Transactions on Cloud Computing (2020), 1--1."}],"event":{"name":"e-Energy '21: The Twelfth ACM International Conference on Future Energy Systems","location":"Virtual Event Italy","acronym":"e-Energy '21"},"container-title":["Proceedings of the Twelfth ACM International Conference on Future Energy Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447555.3465326","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3447555.3465326","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3447555.3465326","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:18:32Z","timestamp":1750191512000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447555.3465326"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,6,22]]},"references-count":23,"alternative-id":["10.1145\/3447555.3465326","10.1145\/3447555"],"URL":"https:\/\/doi.org\/10.1145\/3447555.3465326","relation":{},"subject":[],"published":{"date-parts":[[2021,6,22]]},"assertion":[{"value":"2021-06-22","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}