{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,1,10]],"date-time":"2023-01-10T05:29:30Z","timestamp":1673328570171},"publisher-location":"New York, NY, USA","reference-count":36,"publisher":"ACM","funder":[{"name":"Horizon2020","award":["780681"]},{"name":"SNIC","award":["2018-05973"]},{"name":"Horizon2020 FET-HPC","award":["800962"]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,5,11]]},"DOI":"10.1145\/3457388.3458662","type":"proceedings-article","created":{"date-parts":[[2021,4,29]],"date-time":"2021-04-29T22:15:17Z","timestamp":1619734517000},"update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["An online guided tuning approach to run CNN pipelines on edge devices"],"prefix":"10.1145","author":[{"given":"Pirah Noor","family":"Soomro","sequence":"first","affiliation":[{"name":"Chalmers University of Technology, Gothenburg, Sweden"}]},{"given":"Mustafa","family":"Abduljabbar","sequence":"additional","affiliation":[{"name":"Chalmers University of Technology, Gothenburg, Sweden"}]},{"given":"Jeronimo","family":"Castrillon","sequence":"additional","affiliation":[{"name":"Technical University of Dresden, Dresden, Germany"}]},{"given":"Miquel","family":"Peric\u00e0s","sequence":"additional","affiliation":[{"name":"Chalmers University of Technology, Gothenburg, Sweden"}]}],"member":"320","published-online":{"date-parts":[[2021,5,11]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3065386"},{"key":"e_1_3_2_1_2_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 , 2014 . Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014."},{"key":"e_1_3_2_1_3_1","volume-title":"Deep learning applied to nlp. arXiv preprint arXiv:1703.03091","author":"Lopez Marc Moreno","year":"2017","unstructured":"Marc Moreno Lopez and Jugal Kalita . Deep learning applied to nlp. arXiv preprint arXiv:1703.03091 , 2017 . Marc Moreno Lopez and Jugal Kalita. Deep learning applied to nlp. arXiv preprint arXiv:1703.03091, 2017."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2699343.2699349"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMST.2018.2844341"},{"key":"e_1_3_2_1_6_1","volume-title":"ACM SysML","author":"Jiang Ziheng","year":"2018","unstructured":"Ziheng Jiang , Tianqi Chen , and Mu Li . Efficient deep learning inference on edge devices . ACM SysML , 2018 . Ziheng Jiang, Tianqi Chen, and Mu Li. Efficient deep learning inference on edge devices. ACM SysML, 2018."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMST.2020.2970550"},{"key":"e_1_3_2_1_8_1","volume-title":"et al. Tensorflow: Large-scale machine learning on heterogeneous systems","author":"Abadi Mart\u00edn","year":"2015","unstructured":"Mart\u00edn Abadi , Ashish Agarwal , Paul Barham , Eugene Brevdo , Zhifeng Chen , Craig Citro , Greg S Corrado , Andy Davis , Jeffrey Dean , Matthieu Devin , et al. Tensorflow: Large-scale machine learning on heterogeneous systems . 2015 . Mart\u00edn Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, et al. Tensorflow: Large-scale machine learning on heterogeneous systems. 2015."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2647868.2654889"},{"key":"e_1_3_2_1_10_1","unstructured":"Torch. http:\/\/torch.ch. Accessed: 2021-01-20. Torch. http:\/\/torch.ch. Accessed: 2021-01-20."},{"key":"e_1_3_2_1_11_1","unstructured":"Theano. http:\/\/deeplearning.net\/software\/theano\/. Accessed: 2021-01-20. Theano. http:\/\/deeplearning.net\/software\/theano\/. Accessed: 2021-01-20."},{"key":"e_1_3_2_1_12_1","volume-title":"Deep learning inference in facebook data centers: Characterization, performance optimizations and hardware implications. arXiv preprint arXiv:1811.09886","author":"Park Jongsoo","year":"2018","unstructured":"Jongsoo Park , Maxim Naumov , Protonu Basu , Summer Deng , Aravind Kalaiah , Daya Khudia , James Law , Parth Malani , Andrey Malevich , Satish Nadathur , Deep learning inference in facebook data centers: Characterization, performance optimizations and hardware implications. arXiv preprint arXiv:1811.09886 , 2018 . Jongsoo Park, Maxim Naumov, Protonu Basu, Summer Deng, Aravind Kalaiah, Daya Khudia, James Law, Parth Malani, Andrey Malevich, Satish Nadathur, et al. Deep learning inference in facebook data centers: Characterization, performance optimizations and hardware implications. arXiv preprint arXiv:1811.09886, 2018."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2019.00048"},{"key":"e_1_3_2_1_14_1","first-page":"225","volume-title":"A study of mobile device utilization. In 2015 ieee international symposium on performance analysis of systems and software (ispass)","author":"Gao Cao","year":"2015","unstructured":"Cao Gao , Anthony Gutierrez , Madhav Rajan , Ronald G Dreslinski , Trevor Mudge , and Carole-Jean Wu . A study of mobile device utilization. In 2015 ieee international symposium on performance analysis of systems and software (ispass) , pages 225 -- 234 . IEEE , 2015 . Cao Gao, Anthony Gutierrez, Madhav Rajan, Ronald G Dreslinski, Trevor Mudge, and Carole-Jean Wu. A study of mobile device utilization. In 2015 ieee international symposium on performance analysis of systems and software (ispass), pages 225--234. IEEE, 2015."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2003.1253185"},{"key":"e_1_3_2_1_16_1","unstructured":"Nvidia jetson tx2. https:\/\/www.nvidia.com\/en-us\/autonomous-machines\/embedded-systems\/jetson-tx2\/. Accessed: 2021-01-20. Nvidia jetson tx2. https:\/\/www.nvidia.com\/en-us\/autonomous-machines\/embedded-systems\/jetson-tx2\/. Accessed: 2021-01-20."},{"key":"e_1_3_2_1_17_1","unstructured":"The apple a14 soc: Firestorm & icestorm. https:\/\/www.anandtech.com\/show\/16192\/the-iphone-12-review\/2. Accessed: 2021-03-24. The apple a14 soc: Firestorm & icestorm. https:\/\/www.anandtech.com\/show\/16192\/the-iphone-12-review\/2. Accessed: 2021-03-24."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-60939-9_2"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICISC.2017.8068684"},{"key":"e_1_3_2_1_20_1","volume-title":"NNPACK","author":"Dukhan M","year":"2018","unstructured":"M Dukhan . Acceleration package for neural networks on multi-core cpus: Maratyszcza . NNPACK , Oct , 2018 . M Dukhan. Acceleration package for neural networks on multi-core cpus: Maratyszcza. NNPACK, Oct, 2018."},{"key":"e_1_3_2_1_21_1","volume-title":"Qnnpack: open source library for optimized mobile deep learning","author":"Dukhan Marat","year":"2018","unstructured":"Marat Dukhan , Yiming Wu , and Hao Lu . Qnnpack: open source library for optimized mobile deep learning , 2018 . Marat Dukhan, Yiming Wu, and Hao Lu. Qnnpack: open source library for optimized mobile deep learning, 2018."},{"key":"e_1_3_2_1_22_1","unstructured":"Compute library: A software library for computer vision and machine learning. https:\/\/developer.arm.com\/ip-products\/processors\/machine-learning\/compute-library. Accessed: 2021-01-20. Compute library: A software library for computer vision and machine learning. https:\/\/developer.arm.com\/ip-products\/processors\/machine-learning\/compute-library. Accessed: 2021-01-20."},{"key":"e_1_3_2_1_23_1","volume-title":"Beyond data and model parallelism for deep neural networks. arXiv preprint arXiv:1807.05358","author":"Jia Zhihao","year":"2018","unstructured":"Zhihao Jia , Matei Zaharia , and Alex Aiken . Beyond data and model parallelism for deep neural networks. arXiv preprint arXiv:1807.05358 , 2018 . Zhihao Jia, Matei Zaharia, and Alex Aiken. Beyond data and model parallelism for deep neural networks. arXiv preprint arXiv:1807.05358, 2018."},{"key":"e_1_3_2_1_24_1","article-title":"High-throughput cnn inference on embedded arm big. little multi-core processors","author":"Wang Siqi","year":"2019","unstructured":"Siqi Wang , Gayathri Ananthanarayanan , Yifan Zeng , Neeraj Goel , Anuj Pathania , and Tulika Mitra . High-throughput cnn inference on embedded arm big. little multi-core processors . IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems , 2019 . Siqi Wang, Gayathri Ananthanarayanan, Yifan Zeng, Neeraj Goel, Anuj Pathania, and Tulika Mitra. High-throughput cnn inference on embedded arm big. little multi-core processors. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2019.","journal-title":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems"},{"key":"e_1_3_2_1_25_1","volume-title":"Scheduling computation graphs of deep learning models on manycore cpus. arXiv preprint arXiv:1807.09667","author":"Tang Linpeng","year":"2018","unstructured":"Linpeng Tang , Yida Wang , Theodore L Willke , and Kai Li . Scheduling computation graphs of deep learning models on manycore cpus. arXiv preprint arXiv:1807.09667 , 2018 . Linpeng Tang, Yida Wang, Theodore L Willke, and Kai Li. Scheduling computation graphs of deep learning models on manycore cpus. arXiv preprint arXiv:1807.09667, 2018."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123389"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2005.1526010"},{"key":"e_1_3_2_1_28_1","volume-title":"Elastic places: An adaptive resource manager for scalable and portable performance. ACM Transactions on Architecture and Code Optimization (TACO), 15(2):1--26","author":"Peric\u00e0s Miquel","year":"2018","unstructured":"Miquel Peric\u00e0s . Elastic places: An adaptive resource manager for scalable and portable performance. ACM Transactions on Architecture and Code Optimization (TACO), 15(2):1--26 , 2018 . Miquel Peric\u00e0s. Elastic places: An adaptive resource manager for scalable and portable performance. ACM Transactions on Architecture and Code Optimization (TACO), 15(2):1--26, 2018."},{"key":"e_1_3_2_1_29_1","volume-title":"Dehao Chen, HyoukJoong Lee, Jiquan Ngiam, Quoc V Le, Yonghui Wu, et al. Gpipe: Efficient training of giant neural networks using pipeline parallelism. arXiv preprint arXiv:1811.06965","author":"Huang Yanping","year":"2018","unstructured":"Yanping Huang , Youlong Cheng , Ankur Bapna , Orhan Firat , Mia Xu Chen , Dehao Chen, HyoukJoong Lee, Jiquan Ngiam, Quoc V Le, Yonghui Wu, et al. Gpipe: Efficient training of giant neural networks using pipeline parallelism. arXiv preprint arXiv:1811.06965 , 2018 . Yanping Huang, Youlong Cheng, Ankur Bapna, Orhan Firat, Mia Xu Chen, Dehao Chen, HyoukJoong Lee, Jiquan Ngiam, Quoc V Le, Yonghui Wu, et al. Gpipe: Efficient training of giant neural networks using pipeline parallelism. arXiv preprint arXiv:1811.06965, 2018."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2005.51"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3315454.3329959"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-80328-4_13"},{"key":"e_1_3_2_1_33_1","unstructured":"Xitao. https:\/\/github.com\/CHART-Team\/xitao. Accessed: 2021-02-02. Xitao. https:\/\/github.com\/CHART-Team\/xitao. Accessed: 2021-02-02."},{"key":"e_1_3_2_1_34_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 , 2014 . Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014."},{"key":"e_1_3_2_1_35_1","volume-title":"Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems, 25:1097--1105","author":"Krizhevsky Alex","year":"2012","unstructured":"Alex Krizhevsky , Ilya Sutskever , and Geoffrey E Hinton . Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems, 25:1097--1105 , 2012 . Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems, 25:1097--1105, 2012."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"}],"event":{"name":"CF '21: Computing Frontiers Conference","location":"Virtual Event Italy","acronym":"CF '21","sponsor":["SIGMICRO ACM Special Interest Group on Microarchitectural Research and Processing"]},"container-title":["Proceedings of the 18th ACM International Conference on Computing Frontiers"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3457388.3458662","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,9]],"date-time":"2023-01-09T23:52:30Z","timestamp":1673308350000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3457388.3458662"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,11]]},"references-count":36,"alternative-id":["10.1145\/3457388.3458662","10.1145\/3457388"],"URL":"http:\/\/dx.doi.org\/10.1145\/3457388.3458662","relation":{},"published":{"date-parts":[[2021,5,11]]},"assertion":[{"value":"2021-05-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}