{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,9]],"date-time":"2026-04-09T14:45:28Z","timestamp":1775745928951,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":51,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,11,7]],"date-time":"2019-11-07T00:00:00Z","timestamp":1573084800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["60053525"],"award-info":[{"award-number":["60053525"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,11,7]]},"DOI":"10.1145\/3318216.3363312","type":"proceedings-article","created":{"date-parts":[[2019,11,4]],"date-time":"2019-11-04T14:11:35Z","timestamp":1572876695000},"page":"195-208","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":94,"title":["Adaptive parallel execution of deep neural networks on heterogeneous edge devices"],"prefix":"10.1145","author":[{"given":"Li","family":"Zhou","sequence":"first","affiliation":[{"name":"The Ohio State University"}]},{"given":"Mohammad Hossein","family":"Samavatian","sequence":"additional","affiliation":[{"name":"The Ohio State University"}]},{"given":"Anys","family":"Bacha","sequence":"additional","affiliation":[{"name":"University of Michigan-Dearborn"}]},{"given":"Saikat","family":"Majumdar","sequence":"additional","affiliation":[{"name":"The Ohio State University"}]},{"given":"Radu","family":"Teodorescu","sequence":"additional","affiliation":[{"name":"The Ohio State University"}]}],"member":"320","published-online":{"date-parts":[[2019,11,7]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001138"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.5555\/3195638.3195664"},{"key":"e_1_3_2_1_3_1","unstructured":"Amazon. [n.d.]. Machine Learning on AWS. https:\/\/aws.amazon.com\/machine-learning\/.  Amazon. [n.d.]. Machine Learning on AWS. https:\/\/aws.amazon.com\/machine-learning\/."},{"key":"e_1_3_2_1_4_1","unstructured":"Apple. [n.d.]. Core ML. https:\/\/developer.apple.com\/documentation\/coreml.  Apple. [n.d.]. Core ML. https:\/\/developer.apple.com\/documentation\/coreml."},{"key":"e_1_3_2_1_5_1","volume-title":"3rd International Conference on Learning Representations (ICLR).","author":"Bahdanau Dzmitry","year":"2015","unstructured":"Dzmitry Bahdanau , Kyunghyun Cho , and Yoshua Bengio . 2015 . Neural Machine Translation by Jointly Learning to Align and Translate . In 3rd International Conference on Learning Representations (ICLR). Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. In 3rd International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_6_1","volume-title":"Davide Del Testa","author":"Bojarski Mariusz","year":"2016","unstructured":"Mariusz Bojarski , Davide Del Testa , Daniel Dworakowski, Bernhard Firner , Beat Flepp, Prasoon Goyal, Lawrence D Jackel, Mathew Monfort, Urs Muller, Jiakai Zhang, et al. 2016 . End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316 (2016). Mariusz Bojarski, Davide Del Testa, Daniel Dworakowski, Bernhard Firner, Beat Flepp, Prasoon Goyal, Lawrence D Jackel, Mathew Monfort, Urs Muller, Jiakai Zhang, et al. 2016. End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316 (2016)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Tianshi Chen Zidong Du Ninghui Sun Jia Wang Chengyong Wu Yunji Chen and Olivier Temam. 2014. DianNao: a small-footprint high-throughput accelerator for ubiquitous machine-learning. In Architectural Support for Programming Languages and Operating Systems (ASPLOS). 269--284.  Tianshi Chen Zidong Du Ninghui Sun Jia Wang Chengyong Wu Yunji Chen and Olivier Temam. 2014. DianNao: a small-footprint high-throughput accelerator for ubiquitous machine-learning. In Architectural Support for Programming Languages and Operating Systems (ASPLOS). 269--284.","DOI":"10.1145\/2644865.2541967"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001177"},{"key":"e_1_3_2_1_9_1","volume-title":"DaDianNao: A Machine-Learning Supercomputer. In 47th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO). 609--622","author":"Chen Yunji","year":"2014","unstructured":"Yunji Chen , Tao Luo , Shaoli Liu , Shijin Zhang , Liqiang He , Jia Wang , Ling Li , Tianshi Chen , Zhiwei Xu , Ninghui Sun , and Olivier Temam . 2014 . DaDianNao: A Machine-Learning Supercomputer. In 47th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO). 609--622 . Yunji Chen, Tao Luo, Shaoli Liu, Shijin Zhang, Liqiang He, Jia Wang, Ling Li, Tianshi Chen, Zhiwei Xu, Ninghui Sun, and Olivier Temam. 2014. DaDianNao: A Machine-Learning Supercomputer. In 47th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO). 609--622."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1390156.1390177"},{"key":"e_1_3_2_1_11_1","volume-title":"Training deep neural networks with low precision multiplications. arXiv preprint arXiv:1412.7024","author":"Courbariaux Matthieu","year":"2014","unstructured":"Matthieu Courbariaux , Yoshua Bengio , and Jean-Pierre David . 2014. Training deep neural networks with low precision multiplications. arXiv preprint arXiv:1412.7024 ( 2014 ). Matthieu Courbariaux, Yoshua Bengio, and Jean-Pierre David. 2014. Training deep neural networks with low precision multiplications. arXiv preprint arXiv:1412.7024 (2014)."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750389"},{"key":"e_1_3_2_1_13_1","unstructured":"Marat Dukhan. 2018. NNPACK. https:\/\/github.com\/Maratyszcza\/NNPACK.  Marat Dukhan. 2018. NNPACK. https:\/\/github.com\/Maratyszcza\/NNPACK."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3241539.3241559"},{"key":"e_1_3_2_1_15_1","unstructured":"Raspberry Pi Foundation. [n.d.]. Raspberry Pi. https:\/\/www.raspberrypi.org\/.  Raspberry Pi Foundation. [n.d.]. Raspberry Pi. https:\/\/www.raspberrypi.org\/."},{"key":"e_1_3_2_1_16_1","unstructured":"Google. [n.d.]. Cloud Machine Learning Engine. https:\/\/cloud.google.com\/ml-engine\/.  Google. [n.d.]. Cloud Machine Learning Engine. https:\/\/cloud.google.com\/ml-engine\/."},{"key":"e_1_3_2_1_17_1","volume-title":"Collaborative Execution of Deep Neural Networks on Internet of Things Devices. arXiv preprint arXiv:1901.02537","author":"Hadidi Ramyad","year":"2019","unstructured":"Ramyad Hadidi , Jiashen Cao , Micheal S Ryoo , and Hyesoon Kim . 2019. Collaborative Execution of Deep Neural Networks on Internet of Things Devices. arXiv preprint arXiv:1901.02537 ( 2019 ). Ramyad Hadidi, Jiashen Cao, Micheal S Ryoo, and Hyesoon Kim. 2019. Collaborative Execution of Deep Neural Networks on Internet of Things Devices. arXiv preprint arXiv:1901.02537 (2019)."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2018.2856261"},{"key":"e_1_3_2_1_19_1","volume-title":"Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149","author":"Han Song","year":"2015","unstructured":"Song Han , Huizi Mao , and William J Dally . 2015. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 ( 2015 ). Song Han, Huizi Mao, and William J Dally. 2015. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 (2015)."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_21_1","volume-title":"Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861","author":"Howard Andrew G","year":"2017","unstructured":"Andrew G Howard , Menglong Zhu , Bo Chen , Dmitry Kalenichenko , Weijun Wang , Tobias Weyand , Marco Andreetto , and Hartwig Adam . 2017 . Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017). Andrew G Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017)."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.243"},{"key":"e_1_3_2_1_23_1","unstructured":"Intel. [n.d.]. Movidius Neural Compute Stick. https:\/\/software.intel.com\/en-us\/movidius-ncs.  Intel. [n.d.]. Movidius Neural Compute Stick. https:\/\/software.intel.com\/en-us\/movidius-ncs."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3267809.3267828"},{"key":"e_1_3_2_1_25_1","volume-title":"Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks. arXiv preprint arXiv:1802.04924","author":"Jia Zhihao","year":"2018","unstructured":"Zhihao Jia , Sina Lin , Charles R Qi , and Alex Aiken . 2018. Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks. arXiv preprint arXiv:1802.04924 ( 2018 ). Zhihao Jia, Sina Lin, Charles R Qi, and Alex Aiken. 2018. Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks. arXiv preprint arXiv:1802.04924 (2018)."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037698"},{"key":"e_1_3_2_1_27_1","volume-title":"One weird trick for parallelizing convolutional neural networks. arXiv preprint arXiv:1404.5997","author":"Krizhevsky Alex","year":"2014","unstructured":"Alex Krizhevsky . 2014. One weird trick for parallelizing convolutional neural networks. arXiv preprint arXiv:1404.5997 ( 2014 ). Alex Krizhevsky. 2014. One weird trick for parallelizing convolutional neural networks. arXiv preprint arXiv:1404.5997 (2014)."},{"key":"e_1_3_2_1_28_1","unstructured":"Alex Krizhevsky Ilya Sutskever and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. 1097--1105.  Alex Krizhevsky Ilya Sutskever and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. 1097--1105."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/MNET.2018.1700202"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001164"},{"key":"e_1_3_2_1_31_1","volume-title":"Arnaud Arindra Adiyoso Setio, Francesco Ciompi, Mohsen Ghafoorian, Jeroen Awm Van Der Laak, Bram Van Ginneken, and Clara I S\u00e1nchez.","author":"Litjens Geert","year":"2017","unstructured":"Geert Litjens , Thijs Kooi , Babak Ehteshami Bejnordi , Arnaud Arindra Adiyoso Setio, Francesco Ciompi, Mohsen Ghafoorian, Jeroen Awm Van Der Laak, Bram Van Ginneken, and Clara I S\u00e1nchez. 2017 . A survey on deep learning in medical image analysis. Medical image analysis 42 (2017), 60--88. Geert Litjens, Thijs Kooi, Babak Ehteshami Bejnordi, Arnaud Arindra Adiyoso Setio, Francesco Ciompi, Mohsen Ghafoorian, Jeroen Awm Van Der Laak, Bram Van Ginneken, and Clara I S\u00e1nchez. 2017. A survey on deep learning in medical image analysis. Medical image analysis 42 (2017), 60--88."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2694344.2694358"},{"key":"e_1_3_2_1_33_1","volume-title":"Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 1396--1401","author":"Mao Jiachen","year":"2017","unstructured":"Jiachen Mao , Xiang Chen , Kent W Nixon , Christopher Krieger , and Yiran Chen . 2017 . Modnn: Local distributed mobile computing system for deep neural network. In 2017 Design , Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 1396--1401 . Jiachen Mao, Xiang Chen, Kent W Nixon, Christopher Krieger, and Yiran Chen. 2017. Modnn: Local distributed mobile computing system for deep neural network. In 2017 Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 1396--1401."},{"key":"e_1_3_2_1_34_1","unstructured":"Microsoft. [n.d.]. Azure Machine Learning service. https:\/\/azure.microsoft.com\/en-us\/services\/machine-learning-service\/.  Microsoft. [n.d.]. Azure Machine Learning service. https:\/\/azure.microsoft.com\/en-us\/services\/machine-learning-service\/."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMST.2018.2844341"},{"key":"e_1_3_2_1_36_1","unstructured":"Nvidia. [n.d.]. Jetson Nano. https:\/\/www.nvidia.com\/en-us\/autonomous-machines\/embedded-systems\/jetson-nano\/.  Nvidia. [n.d.]. Jetson Nano. https:\/\/www.nvidia.com\/en-us\/autonomous-machines\/embedded-systems\/jetson-nano\/."},{"key":"e_1_3_2_1_37_1","unstructured":"Joseph Redmon. 2013-2016. Darknet: Open Source Neural Networks in C. http:\/\/pjreddie.com\/darknet\/.  Joseph Redmon. 2013-2016. Darknet: Open Source Neural Networks in C. http:\/\/pjreddie.com\/darknet\/."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.91"},{"key":"e_1_3_2_1_39_1","volume-title":"YOLO9000: better, faster, stronger. arXiv preprint","author":"Redmon Joseph","year":"2017","unstructured":"Joseph Redmon and Ali Farhadi . 2017. YOLO9000: better, faster, stronger. arXiv preprint ( 2017 ). Joseph Redmon and Ali Farhadi. 2017. YOLO9000: better, faster, stronger. arXiv preprint (2017)."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0816-y"},{"key":"e_1_3_2_1_41_1","volume-title":"ISAAC: A Convolutional Neural Network Accelerator with In-Situ Analog Arithmetic in Crossbars. In 43rd ACM\/IEEE Annual International Symposium on Computer Architecture (ISCA). 14--26","author":"Shafiee Ali","year":"2016","unstructured":"Ali Shafiee , Anirban Nag , Naveen Muralimanohar , Rajeev Balasubramonian , John Paul Strachan , Miao Hu , R. Stanley Williams , and Vivek Srikumar . 2016 . ISAAC: A Convolutional Neural Network Accelerator with In-Situ Analog Arithmetic in Crossbars. In 43rd ACM\/IEEE Annual International Symposium on Computer Architecture (ISCA). 14--26 . Ali Shafiee, Anirban Nag, Naveen Muralimanohar, Rajeev Balasubramonian, John Paul Strachan, Miao Hu, R. Stanley Williams, and Vivek Srikumar. 2016. ISAAC: A Convolutional Neural Network Accelerator with In-Situ Analog Arithmetic in Crossbars. In 43rd ACM\/IEEE Annual International Symposium on Computer Architecture (ISCA). 14--26."},{"key":"e_1_3_2_1_42_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 ( 2014 ). Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDCS.2017.226"},{"key":"e_1_3_2_1_45_1","unstructured":"TensorFlowTM. 2019. TensorFlow for Mobile and IoT. https:\/\/www.tensorflow.org\/lite.  TensorFlow TM . 2019. TensorFlow for Mobile and IoT. https:\/\/www.tensorflow.org\/lite."},{"key":"e_1_3_2_1_46_1","volume-title":"in Deep Learning and Unsupervised Feature Learning Workshop, NIPS. Citeseer.","author":"Vanhoucke Vincent","year":"2011","unstructured":"Vincent Vanhoucke , Andrew Senior , and Mark Z Mao . 2011 . Improving the speed of neural networks on CPUs . In in Deep Learning and Unsupervised Feature Learning Workshop, NIPS. Citeseer. Vincent Vanhoucke, Andrew Senior, and Mark Z Mao. 2011. Improving the speed of neural networks on CPUs. In in Deep Learning and Unsupervised Feature Learning Workshop, NIPS. Citeseer."},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/3218603.3218652"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/3140659.3080215"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/s13174-010-0007-6"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00716"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2018.2858384"}],"event":{"name":"SEC '19: The Fourth ACM\/IEEE Symposium on Edge Computing","location":"Arlington Virginia","acronym":"SEC '19","sponsor":["SIGMOBILE ACM Special Interest Group on Mobility of Systems, Users, Data and Computing","IEEE-CS\\DATC IEEE Computer Society"]},"container-title":["Proceedings of the 4th ACM\/IEEE Symposium on Edge Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3318216.3363312","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3318216.3363312","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3318216.3363312","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:54:40Z","timestamp":1750204480000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3318216.3363312"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,7]]},"references-count":51,"alternative-id":["10.1145\/3318216.3363312","10.1145\/3318216"],"URL":"https:\/\/doi.org\/10.1145\/3318216.3363312","relation":{},"subject":[],"published":{"date-parts":[[2019,11,7]]},"assertion":[{"value":"2019-11-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}