{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,5,14]],"date-time":"2025-05-14T04:47:29Z","timestamp":1747198049569,"version":"3.40.5"},"reference-count":27,"publisher":"Walter de Gruyter GmbH","issue":"9","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,9,27]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>The enormous potential of artificial intelligence, especially artificial neural networks, when used for edge computing applications in cars, traffic lights or smart watches, has not yet been fully exploited today. The reasons for this are the computing, energy and memory requirements of modern neural networks, which typically cannot be met by embedded devices. This article provides a detailed summary of today\u2019s challenges and gives a deeper insight into existing solutions that enable neural network performance with modern HW\/SW co-design techniques.<\/jats:p>","DOI":"10.1515\/auto-2022-0023","type":"journal-article","created":{"date-parts":[[2022,9,2]],"date-time":"2022-09-02T07:34:11Z","timestamp":1662104051000},"page":"767-776","source":"Crossref","is-referenced-by-count":0,"title":["Tools and methods for Edge-AI-systems"],"prefix":"10.1515","volume":"70","author":[{"given":"Nils","family":"Schwabe","sequence":"first","affiliation":[{"name":"FZI Research Center for Information Technology , Karlsruhe , Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yexu","family":"Zhou","sequence":"additional","affiliation":[{"name":"Karlsruher Institute of Technology (KIT) , Institute of Telematics , Karlsruhe , Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Leon","family":"Hielscher","sequence":"additional","affiliation":[{"name":"FZI Research Center for Information Technology , Karlsruhe , Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tobias","family":"R\u00f6ddiger","sequence":"additional","affiliation":[{"name":"Karlsruher Institute of Technology (KIT) , Institute of Telematics , Karlsruhe , Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4547-1984","authenticated-orcid":false,"given":"Till","family":"Riedel","sequence":"additional","affiliation":[{"name":"Karlsruher Institute of Technology (KIT) , Institute of Telematics , Karlsruhe , Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sebastian","family":"Reiter","sequence":"additional","affiliation":[{"name":"FZI Research Center for Information Technology , Karlsruhe , Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"374","published-online":{"date-parts":[[2022,9,3]]},"reference":[{"key":"2023033111053265210_j_auto-2022-0023_ref_001","doi-asserted-by":"crossref","unstructured":"Benmeziane, Hadjer, Kaoutar El Maghraoui, Hamza Ouarnoughi, Smail Niar, Martin Wistuba and Naigang Wang. 2021. A comprehensive survey on hardware-aware neural architecture search. arXiv preprint arXiv:2101.09336.","DOI":"10.24963\/ijcai.2021\/592"},{"key":"2023033111053265210_j_auto-2022-0023_ref_002","doi-asserted-by":"crossref","unstructured":"Cao, Shijie, Chen Zhang, Zhuliang Yao, Wencong Xiao, Lanshun Nie, Dechen Zhan, Yunxin Liu, Ming Wu and Lintao Zhang. 2019. Efficient and effective sparse LSTM on FPGA with bank-balanced sparsity. In: Proceedings of the 2019 ACM\/SIGDA International Symposium on Field-Programmable Gate Arrays. ACM, pp.\u200963\u201372.","DOI":"10.1145\/3289602.3293898"},{"key":"2023033111053265210_j_auto-2022-0023_ref_003","unstructured":"Cheng, Yu, Duo Wang, Pan Zhou and Tao Zhang. 2017. A survey of model compression and acceleration for deep neural networks. arXiv preprint arXiv:1710.09282."},{"key":"2023033111053265210_j_auto-2022-0023_ref_004","doi-asserted-by":"crossref","unstructured":"Farshchi, Farzad, Qijing Huang and Heechul Yun. 2019. Integrating NVIDIA deep learning accelerator (NVDLA) with RISC-V SoC on FireSim. In: 2019 2nd Workshop on Energy Efficient Machine Learning and Cognitive Computing for Embedded Applications (EMC2). IEEE, pp.\u200921\u201325.","DOI":"10.1109\/EMC249363.2019.00012"},{"key":"2023033111053265210_j_auto-2022-0023_ref_005","unstructured":"Feurer, Matthias, Aaron Klein, Katharina Eggensperger, Jost Springenberg, Manuel Blum and Frank Hutter. 2015. Efficient and robust automated machine learning. In: Advances in neural information processing systems 28. Curran Associates, Inc., pp.\u20092962\u20132970."},{"key":"2023033111053265210_j_auto-2022-0023_ref_006","doi-asserted-by":"crossref","unstructured":"Fleck, Tobias, Sven Ochs, Marc Ren\u00e8 Zofka and J. Marius Zollner. 2020. Robust tracking of reference trajectories for autonomous driving in intelligent roadside infrastructure. In: 2020 IEEE Intelligent Vehicles Symposium (IV). IEEE, pp.\u20091337\u20131342.","DOI":"10.1109\/IV47402.2020.9304620"},{"key":"2023033111053265210_j_auto-2022-0023_ref_007","doi-asserted-by":"crossref","unstructured":"Genc, Hasan, Seah Kim, Alon Amid, Ameer Haj-Ali, Vighnesh Iyer, Pranav Prakash, Jerry Zhao, Daniel Grubb, Harrison Liew, Howard Mao and et al. 2021. Gemmini: Enabling systematic deep-learning architecture evaluation via full-stack integration. In: 2021 58th ACM\/IEEE Design Automation Conference (DAC). IEEE, pp.\u2009769\u2013774.","DOI":"10.1109\/DAC18074.2021.9586216"},{"key":"2023033111053265210_j_auto-2022-0023_ref_008","unstructured":"Gonzalez, Abraham and Charles Hong. 2020. A Chipyard Comparison of NVDLA and Gemmini. Tech. Rep. EE, Berkeley, CA, USA, pp.\u2009290\u2013292."},{"key":"2023033111053265210_j_auto-2022-0023_ref_009","unstructured":"Guo, Yunhui. 2018. A survey on methods and theories of quantized neural networks. arXiv preprint arXiv:1808.04752."},{"key":"2023033111053265210_j_auto-2022-0023_ref_010","unstructured":"Gysel, Philipp, Mohammad Motamedi and Soheil Ghiasi. 2016. Hardware-oriented approximation of convolutional neural networks. arXiv preprint arXiv:1604.03168."},{"key":"2023033111053265210_j_auto-2022-0023_ref_011","doi-asserted-by":"crossref","unstructured":"HajiRassouliha, Amir, Andrew J. Taberner, Martyn P. Nash and Poul M.F. Nielsen. 2018. Suitability of recent hardware accelerators (DSPs, FPGAs, and GPUs) for computer vision and image processing algorithms. Signal Processing: Image Communication 68: 101\u2013119.","DOI":"10.1016\/j.image.2018.07.007"},{"key":"2023033111053265210_j_auto-2022-0023_ref_012","unstructured":"Hinton, Geoffrey, Oriol Vinyals and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531."},{"key":"2023033111053265210_j_auto-2022-0023_ref_013","unstructured":"Iandola, Forrest N., Song Han, Matthew W. Moskewicz, Khalid Ashraf, William J. Dally and Kurt Keutzer. 2016. SqueezeNet: AlexNet-level accuracy with 50\u00d7 fewer parameters and <\u20090.5\u2009MB model size. arXiv preprint arXiv:1602.07360."},{"key":"2023033111053265210_j_auto-2022-0023_ref_014","doi-asserted-by":"crossref","unstructured":"Khurana, Udayan, Deepak Turaga, Horst Samulowitz and Srinivasan Parthasrathy. 2016. Cognito: Automated feature engineering for supervised learning. In: 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW). IEEE, pp.\u20091304\u20131307.","DOI":"10.1109\/ICDMW.2016.0190"},{"key":"2023033111053265210_j_auto-2022-0023_ref_015","doi-asserted-by":"crossref","unstructured":"Lu, Liqiang and Yun Liang. 2018. SpWA: An efficient sparse winograd convolutional neural networks accelerator on FPGAs. In: Proceedings of the 55th Annual Design Automation Conference. IEEE, pp.\u20091\u20136.","DOI":"10.1145\/3195970.3196120"},{"key":"2023033111053265210_j_auto-2022-0023_ref_016","doi-asserted-by":"crossref","unstructured":"Ma, Ningning, Xiangyu Zhang, Hai-Tao Zheng and Jian Sun. 2018. Shufflenet v2: Practical guidelines for efficient cnn architecture design. In: Proceedings of the European conference on computer vision (ECCV), pp.\u2009116\u2013131.","DOI":"10.1007\/978-3-030-01264-9_8"},{"key":"2023033111053265210_j_auto-2022-0023_ref_017","doi-asserted-by":"crossref","unstructured":"Mollah, Muhammad Baqer, Md Abul Kalam Azad and Athanasios Vasilakos. 2017. Security and privacy challenges in mobile cloud computing: Survey and way ahead. Journal of Network and Computer Applications 84: 38\u201354.","DOI":"10.1016\/j.jnca.2017.02.001"},{"key":"2023033111053265210_j_auto-2022-0023_ref_018","doi-asserted-by":"crossref","unstructured":"Nakahara, Hiroki, Haruyoshi Yonekawa, Tomoya Fujii and Shimpei Sato. 2018. A lightweight YOLOv2: A binarized CNN with a parallel support vector regression for an FPGA. In: Proceedings of the 2018 ACM\/SIGDA International Symposium on field-programmable gate arrays. ACM, pp.\u200931\u201340.","DOI":"10.1145\/3174243.3174266"},{"key":"2023033111053265210_j_auto-2022-0023_ref_019","doi-asserted-by":"crossref","unstructured":"Ouyang, Zhenchao, Jianwei Niu, Yu Liu and Mohsen Guizani. 2019. Deep CNN-based real-time traffic light detector for self-driving vehicles. IEEE transactions on Mobile Computing 19(2): 300\u2013313.","DOI":"10.1109\/TMC.2019.2892451"},{"key":"2023033111053265210_j_auto-2022-0023_ref_020","doi-asserted-by":"crossref","unstructured":"Palffy, Andras, Jiaao Dong, Julian F.P. Kooij and Dariu M. Gavrila. 2020. CNN based road user detection using the 3D radar cube. IEEE Robotics and Automation Letters 5(2): 1263\u20131270.","DOI":"10.1109\/LRA.2020.2967272"},{"key":"2023033111053265210_j_auto-2022-0023_ref_021","unstructured":"Redmon, Joseph and Ali Farhadi. 2018. Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767."},{"key":"2023033111053265210_j_auto-2022-0023_ref_022","doi-asserted-by":"crossref","unstructured":"Sandler, Mark, Andrew Howard, Menglong Zhu, Andrey Zhmoginov and Liang-Chieh Chen. 2018. Mobilenetv2: Inverted residuals and linear bottlenecks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp.\u20094510\u20134520.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"2023033111053265210_j_auto-2022-0023_ref_023","doi-asserted-by":"crossref","unstructured":"Shrivastava, Ashish, Tomas Pfister, Oncel Tuzel, Joshua Susskind, Wenda Wang and Russell Webb. 2017. Learning from simulated and unsupervised images through adversarial training. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp.\u20092107\u20132116.","DOI":"10.1109\/CVPR.2017.241"},{"key":"2023033111053265210_j_auto-2022-0023_ref_024","doi-asserted-by":"crossref","unstructured":"Venkataramani, Swagath, Vijayalakshmi Srinivasan, Wei Wang, Sanchari Sen, Jintao Zhang, Ankur Agrawal, Monodeep Kar, Shubham Jain, Alberto Mannari, Hoang Tran and et al. 2021. RaPiD: AI accelerator for ultra-low precision training and inference. In: 2021 ACM\/IEEE 48th Annual International Symposium on Computer Architecture (ISCA). IEEE, pp.\u2009153\u2013166.","DOI":"10.1109\/ISCA52012.2021.00021"},{"key":"2023033111053265210_j_auto-2022-0023_ref_025","doi-asserted-by":"crossref","unstructured":"Vogelsang, Thomas. 2010. Understanding the energy consumption of dynamic random access memories. In: 2010 43rd Annual IEEE\/ACM International Symposium on Microarchitecture. IEEE, pp.\u2009363\u2013374.","DOI":"10.1109\/MICRO.2010.42"},{"key":"2023033111053265210_j_auto-2022-0023_ref_026","doi-asserted-by":"crossref","unstructured":"Wu, Bichen, Xiaoliang Dai, Peizhao Zhang, Yanghan Wang, Fei Sun, Yiming Wu, Yuandong Tian, Peter Vajda, Yangqing Jia and Kurt Keutzer. 2019. Fbnet: Hardware-aware efficient convnet design via differentiable neural architecture search. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. IEEE, pp.\u200910734\u201310742.","DOI":"10.1109\/CVPR.2019.01099"},{"key":"2023033111053265210_j_auto-2022-0023_ref_027","doi-asserted-by":"crossref","unstructured":"Yang, Zhifang, Lei Yan and Jiakai Yuan. 2020. Design and Implementation of Driverless Perceptual System Based on CPU+ FPGA. In: 2020 5th International Conference on Control, Robotics and Cybernetics (CRC). IEEE, pp.\u2009261\u2013265.","DOI":"10.1109\/CRC51253.2020.9253490"}],"container-title":["at - Automatisierungstechnik"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.degruyter.com\/document\/doi\/10.1515\/auto-2022-0023\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.degruyter.com\/document\/doi\/10.1515\/auto-2022-0023\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,3,31]],"date-time":"2023-03-31T13:56:12Z","timestamp":1680270972000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.degruyter.com\/document\/doi\/10.1515\/auto-2022-0023\/html"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,1]]},"references-count":27,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2022,9,3]]},"published-print":{"date-parts":[[2022,9,27]]}},"alternative-id":["10.1515\/auto-2022-0023"],"URL":"https:\/\/doi.org\/10.1515\/auto-2022-0023","relation":{},"ISSN":["2196-677X","0178-2312"],"issn-type":[{"type":"electronic","value":"2196-677X"},{"type":"print","value":"0178-2312"}],"subject":[],"published":{"date-parts":[[2022,9,1]]}}}