{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,3]],"date-time":"2026-06-03T05:46:41Z","timestamp":1780465601989,"version":"3.54.1"},"reference-count":144,"publisher":"Springer Science and Business Media LLC","issue":"13","license":[{"start":{"date-parts":[[2021,12,27]],"date-time":"2021-12-27T00:00:00Z","timestamp":1640563200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,12,27]],"date-time":"2021-12-27T00:00:00Z","timestamp":1640563200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100010801","name":"Xunta de Galicia","doi-asserted-by":"publisher","award":["ED431C2018\/55-GRC"],"award-info":[{"award-number":["ED431C2018\/55-GRC"]}],"id":[{"id":"10.13039\/501100010801","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008530","name":"European Regional Development Fund","doi-asserted-by":"publisher","award":["ED431G2019\/06"],"award-info":[{"award-number":["ED431G2019\/06"]}],"id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100006761","name":"Universidade de Vigo","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100006761","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Neural Comput &amp; Applic"],"published-print":{"date-parts":[[2022,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Convolutional neural networks have pushed forward image analysis research and computer vision over the last decade, constituting a state-of-the-art approach in object detection today. The design of increasingly deeper and wider architectures has made it possible to achieve unprecedented levels of detection accuracy, albeit at the cost of both a dramatic computational burden and a large memory footprint. In such a context, cloud systems have become a mainstream technological solution due to their tremendous scalability, providing researchers and practitioners with virtually unlimited resources. However, these resources are typically made available as remote services, requiring communication over the network to be accessed, thus compromising the speed of response, availability, and security of the implemented solution. In view of these limitations, the on-device paradigm has emerged as a recent yet widely explored alternative, pursuing more compact and efficient networks to ultimately enable the execution of the derived models directly on resource-constrained client devices. This study provides an up-to-date review of the more relevant scientific research carried out in this vein, circumscribed to the object detection problem. In particular, the paper contributes to the field with a comprehensive architectural overview of both the existing lightweight object detection frameworks targeted to mobile and embedded devices, and the underlying convolutional neural networks that make up their internal structure. More specifically, it addresses the main structural-level strategies used for conceiving the various components of a detection pipeline (i.e., backbone, neck, and head), as well as the most salient techniques proposed for adapting such structures and the resulting architectures to more austere deployment environments. Finally, the study concludes with a discussion of the specific challenges and next steps to be taken to move toward a more convenient accuracy\u2013speed trade-off.<\/jats:p>","DOI":"10.1007\/s00521-021-06830-w","type":"journal-article","created":{"date-parts":[[2021,12,27]],"date-time":"2021-12-27T09:03:54Z","timestamp":1640595834000},"page":"10469-10501","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":38,"title":["Optimized convolutional neural network architectures for efficient on-device vision-based object detection"],"prefix":"10.1007","volume":"34","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9619-4852","authenticated-orcid":false,"given":"Ivan","family":"Rodriguez-Conde","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8849-4989","authenticated-orcid":false,"given":"Celso","family":"Campos","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3943-8013","authenticated-orcid":false,"given":"Florentino","family":"Fdez-Riverola","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2021,12,27]]},"reference":[{"key":"6830_CR1","doi-asserted-by":"publisher","first-page":"172","DOI":"10.1109\/TPAMI.2019.2929257","volume":"43","author":"Z Cao","year":"2021","unstructured":"Cao Z, Hidalgo G, Simon T, Wei SE, Sheikh Y (2021) OpenPose: realtime multi-person 2D pose estimation using part affinity fields. IEEE Trans Pattern Anal Mach Intell 43:172\u2013186. https:\/\/doi.org\/10.1109\/TPAMI.2019.2929257","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6830_CR2","doi-asserted-by":"publisher","first-page":"743","DOI":"10.1109\/TPAMI.2011.155","volume":"34","author":"P Doll\u00e1r","year":"2012","unstructured":"Doll\u00e1r P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34:743\u2013761. https:\/\/doi.org\/10.1109\/TPAMI.2011.155","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6830_CR3","doi-asserted-by":"publisher","unstructured":"Yang S, Luo P, Loy CC, Tang X (2016) WIDER FACE: a face detection benchmark. In: 2016 IEEE Conference on computer vision and pattern recognition (CVPR), 27\u201330 June 2016, pp 5525\u20135533. https:\/\/doi.org\/10.1109\/CVPR.2016.596","DOI":"10.1109\/CVPR.2016.596"},{"key":"6830_CR4","doi-asserted-by":"publisher","first-page":"1005","DOI":"10.3390\/s19051005","volume":"19","author":"HB Zhang","year":"2019","unstructured":"Zhang HB, Zhang YX, Zhong B, Lei Q, Yang L, Du JX, Chen DS (2019) A comprehensive survey of vision-based human action recognition methods. Sensors (Switzerland) 19:1005. https:\/\/doi.org\/10.3390\/s19051005","journal-title":"Sensors (Switzerland)"},{"key":"6830_CR5","doi-asserted-by":"publisher","first-page":"1572","DOI":"10.1109\/TITS.2019.2910643","volume":"21","author":"J Wei","year":"2020","unstructured":"Wei J, He J, Zhou Y, Chen K, Tang Z, Xiong Z (2020) Enhanced object detection with deep convolutional neural networks for advanced driving assistance. IEEE Trans Intell Transp Syst 21:1572\u20131583. https:\/\/doi.org\/10.1109\/TITS.2019.2910643","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"6830_CR6","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.comcom.2020.03.012","volume":"156","author":"B Mishra","year":"2020","unstructured":"Mishra B, Garg D, Narang P, Mishra V (2020) Drone-surveillance for search and rescue in natural disaster. Comput Commun 156:1\u201310. https:\/\/doi.org\/10.1016\/j.comcom.2020.03.012","journal-title":"Comput Commun"},{"key":"6830_CR7","doi-asserted-by":"publisher","first-page":"541","DOI":"10.1162\/neco.1989.1.4.541","volume":"1","author":"Y LeCun","year":"1989","unstructured":"LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, Jackel LD (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1:541\u2013551. https:\/\/doi.org\/10.1162\/neco.1989.1.4.541","journal-title":"Neural Comput"},{"key":"6830_CR8","doi-asserted-by":"publisher","unstructured":"Kazemi FM, Samadi S, Poorreza HR, Akbarzadeh-T MR (2007) Vehicle recognition using curvelet transform and SVM. In: Fourth international conference on information technology (ITNG'07), 2\u20134 April 2007, pp 516\u2013521. https:\/\/doi.org\/10.1109\/ITNG.2007.205","DOI":"10.1109\/ITNG.2007.205"},{"key":"6830_CR9","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L (2001) Random forests. Mach Learn 45:5\u201332. https:\/\/doi.org\/10.1023\/A:1010933404324","journal-title":"Mach Learn"},{"key":"6830_CR10","doi-asserted-by":"publisher","first-page":"687","DOI":"10.1109\/LSP.2014.2313570","volume":"21","author":"S Wu","year":"2014","unstructured":"Wu S, Nagahashi H (2014) Parameterized adaboost: Introducing a parameter to speed up the training of real adaboost. IEEE Signal Process Lett 21:687\u2013691. https:\/\/doi.org\/10.1109\/LSP.2014.2313570","journal-title":"IEEE Signal Process Lett"},{"key":"6830_CR11","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","volume":"60","author":"DG Lowe","year":"2004","unstructured":"Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60:91\u2013110. https:\/\/doi.org\/10.1023\/B:VISI.0000029664.99615.94","journal-title":"Int J Comput Vis"},{"key":"6830_CR12","doi-asserted-by":"publisher","unstructured":"Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE Computer society conference on computer vision and pattern recognition (CVPR'05), 20\u201325 June 2005, vol 881, pp 886\u2013893. https:\/\/doi.org\/10.1109\/CVPR.2005.177","DOI":"10.1109\/CVPR.2005.177"},{"key":"6830_CR13","doi-asserted-by":"publisher","first-page":"971","DOI":"10.1109\/TPAMI.2002.1017623","volume":"24","author":"T Ojala","year":"2002","unstructured":"Ojala T, Pietik\u00e4inen M, M\u00e4enp\u00e4\u00e4 T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24:971\u2013987. https:\/\/doi.org\/10.1109\/TPAMI.2002.1017623","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6830_CR14","doi-asserted-by":"publisher","first-page":"84","DOI":"10.1145\/3065386","volume":"60","author":"A Krizhevsky","year":"2017","unstructured":"Krizhevsky A, Sutskever I, Hinton GE (2017) ImageNet classification with deep convolutional neural networks. Commun ACM 60:84\u201390. https:\/\/doi.org\/10.1145\/3065386","journal-title":"Commun ACM"},{"key":"6830_CR15","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","volume":"115","author":"O Russakovsky","year":"2015","unstructured":"Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg AC, Fei-Fei L (2015) ImageNet large scale visual recognition challenge. Int J Comput Vis 115:211\u2013252. https:\/\/doi.org\/10.1007\/s11263-015-0816-y","journal-title":"Int J Comput Vis"},{"key":"6830_CR16","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","volume":"88","author":"M Everingham","year":"2010","unstructured":"Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A (2010) The pascal visual object classes (VOC) challenge. Int J Comput Vis 88:303\u2013338. https:\/\/doi.org\/10.1007\/s11263-009-0275-4","journal-title":"Int J Comput Vis"},{"key":"6830_CR17","unstructured":"Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: 3rd International conference on learning representations (ICLR), San Diego, CA, USA, 7\u20139 May 2015"},{"key":"6830_CR18","doi-asserted-by":"publisher","unstructured":"He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE Conference on computer vision and pattern recognition (CVPR), 27\u201330 June 2016, pp 770\u2013778. https:\/\/doi.org\/10.1109\/CVPR.2016.90","DOI":"10.1109\/CVPR.2016.90"},{"key":"6830_CR19","doi-asserted-by":"publisher","first-page":"318","DOI":"10.1109\/TPAMI.2018.2858826","volume":"42","author":"TY Lin","year":"2020","unstructured":"Lin TY, Goyal P, Girshick R, He K, Dollar P (2020) Focal loss for dense object detection. IEEE Trans Pattern Anal Mach Intell 42:318\u2013327. https:\/\/doi.org\/10.1109\/TPAMI.2018.2858826","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6830_CR20","doi-asserted-by":"publisher","unstructured":"Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: 2015 IEEE Conference on computer vision and pattern recognition (CVPR), 7\u201312 June 2015, pp 1\u20139. https:\/\/doi.org\/10.1109\/CVPR.2015.7298594","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"6830_CR21","doi-asserted-by":"publisher","first-page":"615","DOI":"10.1145\/3037697.3037698","volume":"52","author":"Y Kang","year":"2017","unstructured":"Kang Y, Hauswald J, Gao C, Rovinski A, Mudge T, Mars J, Tang L (2017) Neurosurgeon: collaborative intelligence between the cloud and mobile edge. ACM SIGPLAN Not 52:615\u2013629. https:\/\/doi.org\/10.1145\/3037697.3037698","journal-title":"ACM SIGPLAN Not"},{"key":"6830_CR22","doi-asserted-by":"publisher","unstructured":"Teerapittayanon S, McDanel B, Kung HT (2017) Distributed deep neural networks over the cloud, the edge and end devices. In: 2017 IEEE 37th International conference on distributed computing systems (ICDCS), 5\u20138 June 2017, pp 328\u2013339. https:\/\/doi.org\/10.1109\/ICDCS.2017.226","DOI":"10.1109\/ICDCS.2017.226"},{"key":"6830_CR23","doi-asserted-by":"publisher","DOI":"10.1007\/s10514-021-09987-4","author":"S Chinchali","year":"2021","unstructured":"Chinchali S, Sharma A, Harrison J, Elhafsi A, Kang D, Pergament E, Cidon E, Katti S, Pavone M (2021) Network offloading policies for cloud robotics: a learning-based approach. Auton Robot. https:\/\/doi.org\/10.1007\/s10514-021-09987-4","journal-title":"Auton Robot"},{"key":"6830_CR24","doi-asserted-by":"publisher","first-page":"106582","DOI":"10.1016\/j.asoc.2020.106582","volume":"96","author":"F Jauro","year":"2020","unstructured":"Jauro F, Chiroma H, Gital AY, Almutairi M, SiM A, Abawajy JH (2020) Deep learning architectures in emerging cloud computing architectures: recent development, challenges and next research trend. Appl Soft Comput 96:106582. https:\/\/doi.org\/10.1016\/j.asoc.2020.106582","journal-title":"Appl Soft Comput"},{"issue":"1","key":"6830_CR25","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1186\/s13677-020-00168-9","volume":"9","author":"H Wu","year":"2020","unstructured":"Wu H, Li X, Deng Y (2020) Deep learning-driven wireless communication for edge-cloud computing: opportunities and challenges. J Cloud Comput 9(1):21. https:\/\/doi.org\/10.1186\/s13677-020-00168-9","journal-title":"J Cloud Comput"},{"issue":"43","key":"6830_CR26","doi-asserted-by":"publisher","first-page":"587139","DOI":"10.3389\/fdata.2020.587139","volume":"3","author":"A Qayyum","year":"2020","unstructured":"Qayyum A, Ijaz A, Usama M, Iqbal W, Qadir J, Elkhatib Y, Al-Fuqaha A (2020) Securing machine learning in the cloud: a systematic review of cloud machine learning security. Front Big Data 3(43):587139. https:\/\/doi.org\/10.3389\/fdata.2020.587139","journal-title":"Front Big Data"},{"issue":"9","key":"6830_CR27","doi-asserted-by":"publisher","first-page":"8099","DOI":"10.1109\/JIOT.2020.2996784","volume":"7","author":"H Wu","year":"2020","unstructured":"Wu H, Zhang Z, Guan C, Wolter K, Xu M (2020) Collaborate edge and cloud computing with distributed deep learning for smart city internet of things. IEEE Internet Things J 7(9):8099\u20138110. https:\/\/doi.org\/10.1109\/JIOT.2020.2996784","journal-title":"IEEE Internet Things J"},{"key":"6830_CR28","doi-asserted-by":"publisher","unstructured":"Choi H, Baji\u0107 IV (2018) Deep feature compression for collaborative object detection. In: 25th IEEE International conference on image processing (ICIP), 7\u201310 Oct 2018, pp 3743\u20133747. https:\/\/doi.org\/10.1109\/ICIP.2018.8451100","DOI":"10.1109\/ICIP.2018.8451100"},{"key":"6830_CR29","doi-asserted-by":"publisher","unstructured":"Ishakian V, Muthusamy V, Slominski A (2018) Serving deep learning models in a serverless platform. In: 2018 IEEE International conference on cloud engineering (IC2E), 17\u201320 April 2018, pp 257\u2013262. https:\/\/doi.org\/10.1109\/IC2E.2018.00052","DOI":"10.1109\/IC2E.2018.00052"},{"key":"6830_CR30","doi-asserted-by":"publisher","first-page":"849","DOI":"10.1016\/j.future.2017.09.020","volume":"79","author":"B Varghese","year":"2018","unstructured":"Varghese B, Buyya R (2018) Next generation cloud computing: new trends and research directions. Futur Gener Comput Syst 79:849\u2013861. https:\/\/doi.org\/10.1016\/j.future.2017.09.020","journal-title":"Futur Gener Comput Syst"},{"key":"6830_CR31","doi-asserted-by":"publisher","unstructured":"Wang J, Zhang J, Bao W, Zhu X, Cao B, Yu PS (2018) Not just privacy: improving performance of private deep learning in mobile cloud. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, London, United Kingdom, 2018. Association for Computing Machinery, pp 2407\u20132416. https:\/\/doi.org\/10.1145\/3219819.3220106","DOI":"10.1145\/3219819.3220106"},{"key":"6830_CR32","unstructured":"Dhar S, Guo J, Liu J, Tripathi S, Kurup U, Shah M (2019) On-device machine learning: an algorithms and learning theory perspective. arXiv preprint arXIv:1911.00623"},{"key":"6830_CR33","doi-asserted-by":"publisher","unstructured":"Chen T, Du Z, Sun N, Wang J, Wu C, Chen Y, Temam O (2014) DianNao: a small-footprint high-throughput accelerator for ubiquitous machine-learning. In: Proceedings of the 19th international conference on architectural support for programming languages and operating systems, Salt Lake City, Utah, USA, 2014, pp 269\u2013284. https:\/\/doi.org\/10.1145\/2541940.2541967","DOI":"10.1145\/2541940.2541967"},{"key":"6830_CR34","doi-asserted-by":"publisher","DOI":"10.1109\/JETCAS.2019.2910232","author":"YH Chen","year":"2019","unstructured":"Chen YH, Yang TJ, Emer JS, Sze V (2019) Eyeriss v2: a flexible accelerator for emerging deep neural networks on mobile devices. IEEE J Emerg Sel Top Circuits Syst. https:\/\/doi.org\/10.1109\/JETCAS.2019.2910232","journal-title":"IEEE J Emerg Sel Top Circuits Syst"},{"key":"6830_CR35","doi-asserted-by":"publisher","unstructured":"Yin X, Chen L, Zhang X, Gao Z (2018) Object detection implementation and optimization on embedded GPU system. In: 2018 IEEE International symposium on broadband multimedia systems and broadcasting (BMSB), 6\u20138 June 2018, pp 1\u20135. https:\/\/doi.org\/10.1109\/BMSB.2018.8436848","DOI":"10.1109\/BMSB.2018.8436848"},{"key":"6830_CR36","doi-asserted-by":"publisher","unstructured":"Andargie FA, Rose J, Austin T, Bertacco V (2017) Energy efficient object detection on the mobile GP-GPU. In: 2017 IEEE AFRICON, 18\u201320 Sept 2017, pp 945\u2013950. https:\/\/doi.org\/10.1109\/AFRCON.2017.8095609","DOI":"10.1109\/AFRCON.2017.8095609"},{"key":"6830_CR37","doi-asserted-by":"publisher","first-page":"206","DOI":"10.11591\/ijres.v8.i3.pp206-214","volume":"8","author":"YJ Wai","year":"2019","unstructured":"Wai YJ, Yussof ZM, Irwan S, Salim M (2019) A scalable FPGA based accelerator for Tiny-YOLO-v2 using openCL. Int J Reconfigurable Embed Syst (IJRES) 8:206\u2013214. https:\/\/doi.org\/10.11591\/ijres.v8.i3.pp206-214","journal-title":"Int J Reconfigurable Embed Syst (IJRES)"},{"issue":"1","key":"6830_CR38","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1145\/3289185","volume":"12","author":"K Guo","year":"2019","unstructured":"Guo K, Zeng S, Yu J, Wang Y, Yang H (2019) [DL] A survey of FPGA-based neural network inference accelerators. ACM Trans Reconfigurable Technol Syst 12(1):2. https:\/\/doi.org\/10.1145\/3289185","journal-title":"ACM Trans Reconfigurable Technol Syst"},{"key":"6830_CR39","doi-asserted-by":"publisher","unstructured":"Zhang C, Li P, Sun G, Guan Y, Xiao B, Cong J (2015) Optimizing FPGA-based accelerator design for deep convolutional neural networks. In: Proceedings of the 2015 ACM\/SIGDA international symposium on field-programmable gate arrays, Monterey, California, USA, 2015. Association for Computing Machinery, pp 161\u2013170. https:\/\/doi.org\/10.1145\/2684746.2689060","DOI":"10.1145\/2684746.2689060"},{"key":"6830_CR40","doi-asserted-by":"publisher","unstructured":"Kaarmukilan SP, Poddar S (2020) FPGA based deep learning models for object detection and recognition comparison of object detection comparison of object detection models using FPGA. In: 2020 Fourth international conference on computing methodologies and communication (ICCMC), 11\u201313 March 2020, pp 471\u2013474. https:\/\/doi.org\/10.1109\/ICCMC48092.2020.ICCMC-00088","DOI":"10.1109\/ICCMC48092.2020.ICCMC-00088"},{"key":"6830_CR41","doi-asserted-by":"publisher","unstructured":"Wu J, Leng C, Wang Y, Hu Q, Cheng J (2016) Quantized convolutional neural networks for mobile devices. In: 2016 IEEE Conference on computer vision and pattern recognition (CVPR), 27\u201330 June 2016, pp 4820\u20134828. https:\/\/doi.org\/10.1109\/CVPR.2016.521","DOI":"10.1109\/CVPR.2016.521"},{"issue":"6","key":"6830_CR42","doi-asserted-by":"publisher","first-page":"661","DOI":"10.3390\/electronics8060661","volume":"8","author":"T Simons","year":"2019","unstructured":"Simons T, Lee D-J (2019) A review of binarized neural networks. Electronics 8(6):661. https:\/\/doi.org\/10.3390\/electronics8060661","journal-title":"Electronics"},{"key":"6830_CR43","doi-asserted-by":"publisher","unstructured":"Bhattacharya S, Lane ND (2016) Sparsification and separation of deep learning layers for constrained resource inference on wearables. Paper presented at the Proceedings of the 14th ACM conference on embedded networked sensor systems (SenSys), Stanford, CA, USA. https:\/\/doi.org\/10.1145\/2994551.2994564","DOI":"10.1145\/2994551.2994564"},{"key":"6830_CR44","unstructured":"Fedorov I, Adams RP, Mattina M, Whatmough PN (2019) SpArSe: sparse architecture search for CNNs on resource-constrained microcontrollers. arXiv preprint https:\/\/arxiv.org\/abs\/1905.12107"},{"key":"6830_CR45","doi-asserted-by":"publisher","unstructured":"Yang TJ, Chen YH, Sze V (2017) Designing energy-efficient convolutional neural networks using energy-aware pruning. In: 2017 IEEE Conference on computer vision and pattern recognition (CVPR), 21\u201326 July 2017, pp 6071\u20136079. https:\/\/doi.org\/10.1109\/CVPR.2017.643","DOI":"10.1109\/CVPR.2017.643"},{"key":"6830_CR46","doi-asserted-by":"publisher","unstructured":"Zhang L, Song J, Gao A, Chen J, Bao C, Ma K (2019) Be your own teacher: improve the performance of convolutional neural networks via self distillation. In: 2019 IEEE\/CVF International conference on computer vision (ICCV), 27 Oct\u20132 Nov 2019, pp 3712\u20133721. https:\/\/doi.org\/10.1109\/ICCV.2019.00381","DOI":"10.1109\/ICCV.2019.00381"},{"key":"6830_CR47","unstructured":"Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) MobileNets: efficient convolutional neural networks for mobile vision applications. arXiv preprint https:\/\/arxiv.org\/abs\/1704.04861"},{"key":"6830_CR48","doi-asserted-by":"publisher","unstructured":"Sandler M, Howard A, Zhu M, Zhmoginov A, Chen L-CC (2018) MobileNetV2: inverted residuals and linear bottlenecks. In: 2018 IEEE\/CVF Conference on computer vision and pattern recognition, 18\u201323 June 2018, pp 4510\u20134520. https:\/\/doi.org\/10.1109\/CVPR.2018.00474","DOI":"10.1109\/CVPR.2018.00474"},{"key":"6830_CR49","unstructured":"Iandola FN, Han S, Moskewicz MW, Ashraf K, Dally WJ, Keutzer K (2016) SqueezeNet: AlexNet-level accuracy with 50\u00d7 fewer parameters and <\u00a00.5\u00a0MB model size. arXiv preprint https:\/\/arxiv.org\/abs\/1602.07360"},{"key":"6830_CR50","doi-asserted-by":"publisher","unstructured":"He Y, Liu X, Zhong H, Ma Y (2019) AddressNet: shift-based primitives for efficient convolutional neural networks. In: 2019 IEEE Winter conference on applications of computer vision (WACV), 7\u201311 Jan 2019, pp 1213\u20131222. https:\/\/doi.org\/10.1109\/WACV.2019.00134","DOI":"10.1109\/WACV.2019.00134"},{"key":"6830_CR51","doi-asserted-by":"publisher","unstructured":"Mehta S, Rastegari M, Shapiro L, Hajishirzi H (2019) ESPNetv2: a light-weight, power efficient, and general purpose convolutional neural network. In: 2019 IEEE\/CVF Conference on computer vision and pattern recognition (CVPR), 15\u201320 June 2019, pp 9182\u20139192. https:\/\/doi.org\/10.1109\/CVPR.2019.00941","DOI":"10.1109\/CVPR.2019.00941"},{"key":"6830_CR52","doi-asserted-by":"publisher","unstructured":"Ma N, Zhang X, Zheng H-T, Sun J (2018) ShuffleNet V2: practical guidelines for efficient CNN architecture design. In: Ferrari V, Hebert M, Sminchisescu C, Weiss Y (eds) Computer vision\u2014ECCV 2018. Springer International Publishing, Cham, pp 122\u2013138. https:\/\/doi.org\/10.1007\/978-3-030-01264-9_8","DOI":"10.1007\/978-3-030-01264-9_8"},{"key":"6830_CR53","doi-asserted-by":"publisher","unstructured":"Xie X, Zhou Y, Kung SY (2020) Exploring highly efficient compact neural networks for image classification. In: 2020 IEEE International conference on image processing (ICIP), 25\u201328 Oct 2020, pp 2930\u20132934. https:\/\/doi.org\/10.1109\/ICIP40778.2020.9191334","DOI":"10.1109\/ICIP40778.2020.9191334"},{"key":"6830_CR54","doi-asserted-by":"publisher","unstructured":"Zhang X, Zhou X, Lin M, Sun J (2018) ShuffleNet: an extremely efficient convolutional neural network for mobile devices. In: 2018 IEEE\/CVF Conference on computer vision and pattern recognition, 18\u201323 June 2018, pp 6848\u20136856. https:\/\/doi.org\/10.1109\/CVPR.2018.00716","DOI":"10.1109\/CVPR.2018.00716"},{"key":"6830_CR55","doi-asserted-by":"publisher","unstructured":"Huang G, Liu S, Maaten Lvd, Weinberger KQ (2018) CondenseNet: an efficient DenseNet using learned group convolutions. In: 2018 IEEE\/CVF Conference on computer vision and pattern recognition, 18\u201323 June 2018, pp 2752\u20132761. https:\/\/doi.org\/10.1109\/CVPR.2018.00291","DOI":"10.1109\/CVPR.2018.00291"},{"issue":"4","key":"6830_CR56","doi-asserted-by":"publisher","first-page":"485","DOI":"10.1109\/JPROC.2020.2976475","volume":"108","author":"L Deng","year":"2020","unstructured":"Deng L, Li G, Han S, Shi L, Xie Y (2020) Model compression and hardware acceleration for neural networks: a comprehensive survey. Proc IEEE 108(4):485\u2013532. https:\/\/doi.org\/10.1109\/JPROC.2020.2976475","journal-title":"Proc IEEE"},{"key":"6830_CR57","doi-asserted-by":"publisher","first-page":"107281","DOI":"10.1016\/j.patcog.2020.107281","volume":"105","author":"H Qin","year":"2020","unstructured":"Qin H, Gong R, Liu X, Bai X, Song J, Sebe N (2020) Binary neural networks: a survey. Pattern Recogn 105:107281. https:\/\/doi.org\/10.1016\/j.patcog.2020.107281","journal-title":"Pattern Recogn"},{"issue":"1","key":"6830_CR58","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1631\/FITEE.1700789","volume":"19","author":"J Cheng","year":"2018","unstructured":"Cheng J, Wang P-s, Li G, Hu Q-h, Lu H-q (2018) Recent advances in efficient computation of deep convolutional neural networks. Front Inf Technol Electron Eng 19(1):64\u201377. https:\/\/doi.org\/10.1631\/FITEE.1700789","journal-title":"Front Inf Technol Electron Eng"},{"key":"6830_CR59","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1016\/j.neucom.2020.01.085","volume":"396","author":"X Wu","year":"2020","unstructured":"Wu X, Sahoo D, Hoi SCH (2020) Recent advances in deep learning for object detection. Neurocomputing 396:39\u201364. https:\/\/doi.org\/10.1016\/j.neucom.2020.01.085","journal-title":"Neurocomputing"},{"key":"6830_CR60","unstructured":"Chahal K, Dey K (2018) A survey of modern object detection literature using deep learning. arXiv preprint https:\/\/arxiv.org\/abs\/1808.07256"},{"issue":"11","key":"6830_CR61","doi-asserted-by":"publisher","first-page":"3212","DOI":"10.1109\/TNNLS.2018.2876865","volume":"30","author":"Z Zhao","year":"2019","unstructured":"Zhao Z, Zheng P, Xu S, Wu X (2019) Object detection with deep learning: a review. IEEE Trans Neural Netw Learn Syst 30(11):3212\u20133232. https:\/\/doi.org\/10.1109\/TNNLS.2018.2876865","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"key":"6830_CR62","doi-asserted-by":"publisher","first-page":"128837","DOI":"10.1109\/ACCESS.2019.2939201","volume":"7","author":"L Jiao","year":"2019","unstructured":"Jiao L, Zhang F, Liu F, Yang S, Li L, Feng Z, Qu R (2019) A survey of deep learning-based object detection. IEEE Access 7:128837\u2013128868. https:\/\/doi.org\/10.1109\/ACCESS.2019.2939201","journal-title":"IEEE Access"},{"key":"6830_CR63","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1007\/s11263-019-01247-4","volume":"128","author":"L Liu","year":"2020","unstructured":"Liu L, Ouyang W, Wang X, Fieguth P, Chen J, Liu X, Pietik\u00e4inen M, Wang X, Fieguth P, Chen J, Liu X, Pietik\u00e4inen M (2020) Deep learning for generic object detection: a survey. Int J Comput Vision 128:261\u2013318. https:\/\/doi.org\/10.1007\/s11263-019-01247-4","journal-title":"Int J Comput Vision"},{"issue":"8","key":"6830_CR64","doi-asserted-by":"publisher","first-page":"5455","DOI":"10.1007\/s10462-020-09825-6","volume":"53","author":"A Khan","year":"2020","unstructured":"Khan A, Sohail A, Zahoora U, Qureshi AS (2020) A survey of the recent architectures of deep convolutional neural networks. Artif Intell Rev 53(8):5455\u20135516. https:\/\/doi.org\/10.1007\/s10462-020-09825-6","journal-title":"Artif Intell Rev"},{"key":"6830_CR65","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/978-981-15-4288-6_1","volume-title":"Intelligent computing: image processing based applications","author":"F Sultana","year":"2020","unstructured":"Sultana F, Sufian A, Dutta P (2020) A review of object detection models based on convolutional neural network. In: Mandal JK, Banerjee S (eds) Intelligent computing: image processing based applications. Springer Singapore, Singapore, pp 1\u201316. https:\/\/doi.org\/10.1007\/978-981-15-4288-6_1"},{"key":"6830_CR66","doi-asserted-by":"publisher","unstructured":"Carion N, Massa F, Synnaeve G, Usunier N, Kirillov A, Zagoruyko S (2020) End-to-end object detection with transformers. In: Vedaldi A, Bischof H, Brox T, Frahm J-M (eds) Computer vision\u2014ECCV 2020. Springer International Publishing, Cham, pp 213\u2013229. https:\/\/doi.org\/10.1007\/978-3-030-58452-8_13","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"6830_CR67","unstructured":"Tolstikhin I, Houlsby N, Kolesnikov A, Beyer L, Zhai X, Unterthiner T, Yung J, Steiner A, Keysers D, Uszkoreit J, Lucic M, Dosovitskiy A (2021) MLP-Mixer: an all-MLP architecture for vision. arXiv preprint https:\/\/arxiv.org\/abs\/2105.01601"},{"key":"6830_CR68","doi-asserted-by":"publisher","unstructured":"Ullah S, Kim D (2020) Benchmarking Jetson platform for 3D point-cloud and hyper-spectral image classification. In: 2020 IEEE International conference on big data and smart computing (BigComp), 19\u201322 Feb 2020, pp 477\u2013482. https:\/\/doi.org\/10.1109\/BigComp48618.2020.00-21","DOI":"10.1109\/BigComp48618.2020.00-21"},{"key":"6830_CR69","doi-asserted-by":"publisher","unstructured":"Qi CR, Litany O, He K, Guibas L (2019) Deep hough voting for 3D object detection in point clouds. In: 2019 IEEE\/CVF International conference on computer vision (ICCV), 27 Oct\u20132 Nov 2019, pp 9276\u20139285. https:\/\/doi.org\/10.1109\/ICCV.2019.00937","DOI":"10.1109\/ICCV.2019.00937"},{"key":"6830_CR70","doi-asserted-by":"publisher","unstructured":"Wang Y, Zell A (2021) Yolo+FPN: 2D and 3D fused object detection with an RGB-D camera. In: 2020 25th International conference on pattern recognition (ICPR), 10\u201315 Jan 2021, pp 4657\u20134664. https:\/\/doi.org\/10.1109\/ICPR48806.2021.9413066","DOI":"10.1109\/ICPR48806.2021.9413066"},{"issue":"1","key":"6830_CR71","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1007\/s41095-020-0199-z","volume":"7","author":"T Zhou","year":"2021","unstructured":"Zhou T, Fan D-P, Cheng M-M, Shen J, Shao L (2021) RGB-D salient object detection: a survey. Comput Vis Media 7(1):37\u201369. https:\/\/doi.org\/10.1007\/s41095-020-0199-z","journal-title":"Comput Vis Media"},{"key":"6830_CR72","doi-asserted-by":"publisher","unstructured":"Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) SSD: single shot MultiBox detector. In: Leibe B, Matas J, Sebe N, Welling M (eds) Computer vision\u2014ECCV 2016. Springer International Publishing, Cham, pp 21\u201337. https:\/\/doi.org\/10.1007\/978-3-319-46448-0_2","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"6830_CR73","doi-asserted-by":"publisher","unstructured":"Huang R, Pedoeem J, Chen C (2018) YOLO-LITE: a real-time object detection algorithm optimized for non-GPU computers. In: 2018 IEEE International conference on big data (Big Data), 10\u201313 Dec 2018, pp 2503\u20132510. https:\/\/doi.org\/10.1109\/BigData.2018.8621865","DOI":"10.1109\/BigData.2018.8621865"},{"issue":"16","key":"6830_CR74","doi-asserted-by":"publisher","first-page":"3225","DOI":"10.3390\/app9163225","volume":"9","author":"W He","year":"2019","unstructured":"He W, Huang Z, Wei Z, Li C, Guo B (2019) TF-YOLO: an improved incremental network for real-time object detection. Appl Sci 9(16):3225. https:\/\/doi.org\/10.3390\/app9163225","journal-title":"Appl Sci"},{"key":"6830_CR75","doi-asserted-by":"publisher","unstructured":"Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: 2017 IEEE Conference on computer vision and pattern recognition (CVPR), 21\u201326 July 2017, pp 6517\u20136525. https:\/\/doi.org\/10.1109\/CVPR.2017.690","DOI":"10.1109\/CVPR.2017.690"},{"key":"6830_CR76","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1049\/iet-cvi.2019.0897","volume":"14","author":"C Kyrkou","year":"2020","unstructured":"Kyrkou C (2020) YOLOpeds: efficient real-time single-shot pedestrian detection for smart camera applications. IET Comput Vis 14:417\u2013425. https:\/\/doi.org\/10.1049\/iet-cvi.2019.0897","journal-title":"IET Comput Vis"},{"key":"6830_CR77","unstructured":"Redmon J, Farhadi A (2018) YOLOv3: an incremental improvement. arXiv preprint https:\/\/arxiv.org\/pdf\/1804.02767.pdf"},{"issue":"6","key":"6830_CR78","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S Ren","year":"2017","unstructured":"Ren S, He K, Girshick R, Sun J (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137\u20131149. https:\/\/doi.org\/10.1109\/TPAMI.2016.2577031","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"2","key":"6830_CR79","doi-asserted-by":"publisher","first-page":"398","DOI":"10.1109\/TPAMI.2019.2922181","volume":"42","author":"Z Shen","year":"2020","unstructured":"Shen Z, Liu Z, Li J, Jiang YG, Chen Y, Xue X (2020) Object detection from scratch with deep supervision. IEEE Trans Pattern Anal Mach Intell 42(2):398\u2013412. https:\/\/doi.org\/10.1109\/TPAMI.2019.2922181","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6830_CR80","doi-asserted-by":"publisher","unstructured":"Zhang S, Wen L, Bian X, Lei Z, Li SZ (2018) Single-shot refinement neural network for object detection. In: 2018 IEEE\/CVF Conference on computer vision and pattern recognition, 18\u201323 June 2018, pp 4203\u20134212. https:\/\/doi.org\/10.1109\/CVPR.2018.00442","DOI":"10.1109\/CVPR.2018.00442"},{"issue":"3","key":"6830_CR81","doi-asserted-by":"publisher","first-page":"642","DOI":"10.1007\/s11263-019-01204-1","volume":"128","author":"H Law","year":"2020","unstructured":"Law H, Deng J (2020) CornerNet: detecting objects as paired keypoints. Int J Comput Vis 128(3):642\u2013656. https:\/\/doi.org\/10.1007\/s11263-019-01204-1","journal-title":"Int J Comput Vis"},{"key":"6830_CR82","doi-asserted-by":"publisher","unstructured":"Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE Conference on computer vision and pattern recognition, 23\u201328 June 2014, pp 580\u2013587. https:\/\/doi.org\/10.1109\/CVPR.2014.81","DOI":"10.1109\/CVPR.2014.81"},{"key":"6830_CR83","unstructured":"Li Z, Peng C, Yu G, Zhang X, Deng Y, Sun J (2017) Light-head R-CNN: in defense of two-stage object detector. arXiv preprint https:\/\/arxiv.org\/abs\/1711.07264"},{"key":"6830_CR84","doi-asserted-by":"publisher","unstructured":"Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: 2016 IEEE Conference on computer vision and pattern recognition (CVPR), 27\u201330 June 2016, pp 779\u2013788. https:\/\/doi.org\/10.1109\/CVPR.2016.91","DOI":"10.1109\/CVPR.2016.91"},{"issue":"11","key":"6830_CR85","doi-asserted-by":"publisher","first-page":"2278","DOI":"10.1109\/5.726791","volume":"86","author":"Y Lecun","year":"1998","unstructured":"Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278\u20132324. https:\/\/doi.org\/10.1109\/5.726791","journal-title":"Proc IEEE"},{"key":"6830_CR86","doi-asserted-by":"publisher","unstructured":"Lin TY, Doll\u00e1r P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: 2017 IEEE Conference on computer vision and pattern recognition (CVPR), 21\u201326 July 2017, pp 936\u2013944. https:\/\/doi.org\/10.1109\/CVPR.2017.106","DOI":"10.1109\/CVPR.2017.106"},{"key":"6830_CR87","doi-asserted-by":"publisher","unstructured":"Kong T, Yao A, Chen Y, Sun F (2016) HyperNet: towards accurate region proposal generation and joint object detection. In: 2016 IEEE Conference on computer vision and pattern recognition (CVPR), 27\u201330 June 2016, pp 845\u2013853. https:\/\/doi.org\/10.1109\/CVPR.2016.98","DOI":"10.1109\/CVPR.2016.98"},{"key":"6830_CR88","doi-asserted-by":"publisher","unstructured":"Newell A, Yang K, Deng J (2016) Stacked Hourglass networks for human pose estimation. In: 2016 European conference on computer vision (ECCV). Springer International Publishing, Cham, pp 483\u2013499. https:\/\/doi.org\/10.1007\/978-3-319-46484-8_29","DOI":"10.1007\/978-3-319-46484-8_29"},{"key":"6830_CR89","unstructured":"Li Z, Zhou F (2017) FSSD: feature fusion single shot Multibox detector. arXiv preprint https:\/\/arxiv.org\/abs\/1712.00960"},{"key":"6830_CR90","doi-asserted-by":"crossref","unstructured":"Li Z, Peng C, Yu G, Zhang X, Deng Y, Sun J (2018) DetNet: design backbone for object detection. In: Ferrari V, Hebert M, Sminchisescu C, Weiss Y (eds) Computer vision\u2014ECCV 2018. Springer International Publishing, Cham, pp 339\u2013354","DOI":"10.1007\/978-3-030-01240-3_21"},{"key":"6830_CR91","doi-asserted-by":"publisher","unstructured":"Qin Z, Li Z, Zhang Z, Bao Y, Yu G, Peng Y, Sun J (2019) ThunderNet: towards real-time generic object detection on mobile devices. In: 2019 IEEE\/CVF International conference on computer vision (ICCV), 27 Oct\u20132 Nov 2019, pp 6717\u20136726. https:\/\/doi.org\/10.1109\/ICCV.2019.00682","DOI":"10.1109\/ICCV.2019.00682"},{"key":"6830_CR92","doi-asserted-by":"publisher","first-page":"86564","DOI":"10.1109\/ACCESS.2020.2992516","volume":"8","author":"D Chen","year":"2020","unstructured":"Chen D, Shen H (2020) MAOD: an efficient anchor-free object detector based on MobileDet. IEEE Access 8:86564\u201386572. https:\/\/doi.org\/10.1109\/ACCESS.2020.2992516","journal-title":"IEEE Access"},{"key":"6830_CR93","unstructured":"Law H, Teng Y, Russakovsky O, Deng J (2020) CornerNet-Lite: efficient keypoint based object detection. In: 31st British machine vision conference 2020 (BMVC), Virtual Event, UK, 7\u201310 Sept 2020"},{"key":"6830_CR94","doi-asserted-by":"publisher","unstructured":"Tang Q, Li J, Shi Z, Hu Y (2020) Lightdet: a lightweight and accurate object detection network. In: ICASSP 2020\u20132020 IEEE International conference on acoustics, speech and signal processing (ICASSP), 4\u20138 May 2020, pp 2243\u20132247. https:\/\/doi.org\/10.1109\/ICASSP40776.2020.9054101","DOI":"10.1109\/ICASSP40776.2020.9054101"},{"key":"6830_CR95","unstructured":"Li Y, Li JJ, Lin W, Li JJ (2018) Tiny-DSOD: lightweight object detection for resource-restricted usages. In: 29th British machine vision conference (BMVC), 2018"},{"key":"6830_CR96","doi-asserted-by":"publisher","unstructured":"Wong A, Shafiee MJ, Li F, Chwyl B (2018) Tiny SSD: a tiny single-shot detection deep convolutional neural network for real-time embedded object detection. In: 2018 15th Conference on computer and robot vision (CRV), 8\u201310 May 2018, pp 95\u2013101. https:\/\/doi.org\/10.1109\/CRV.2018.00023","DOI":"10.1109\/CRV.2018.00023"},{"key":"6830_CR97","doi-asserted-by":"publisher","unstructured":"Azimi SM (2019) ShuffleDet: real-time vehicle detection network in on-board embedded UAV imagery. In: Leal-Taix\u00e9 L, Roth S (eds) Computer vision\u2014ECCV 2018 workshops, 2019. Springer International Publishing, Cham, pp 88\u201399. https:\/\/doi.org\/10.1007\/978-3-030-11012-3_7","DOI":"10.1007\/978-3-030-11012-3_7"},{"key":"6830_CR98","doi-asserted-by":"publisher","first-page":"133529","DOI":"10.1109\/ACCESS.2019.2941547","volume":"7","author":"QC Mao","year":"2019","unstructured":"Mao QC, Sun HM, Liu YB, Jia RS (2019) Mini-YOLOv3: real-time object detector for embedded applications. IEEE Access 7:133529\u2013133538. https:\/\/doi.org\/10.1109\/ACCESS.2019.2941547","journal-title":"IEEE Access"},{"key":"6830_CR99","doi-asserted-by":"publisher","unstructured":"Chiu YC, Tsai CY, Ruan MD, Shen GY, Lee TT (2020) Mobilenet-SSDv2: an improved object detection model for embedded systems. In: 2020 International conference on system science and engineering (ICSSE), 31 Aug\u20133 Sept 2020, pp 1\u20135. https:\/\/doi.org\/10.1109\/ICSSE50014.2020.9219319","DOI":"10.1109\/ICSSE50014.2020.9219319"},{"key":"6830_CR100","unstructured":"Oh S, You J-H, Kim Y-K (2020) FRDet: balanced and lightweight object detector based on fire-residual modules for embedded processor of autonomous driving. arXiv preprint https:\/\/arxiv.org\/abs\/2011.08061"},{"key":"6830_CR101","doi-asserted-by":"publisher","unstructured":"Chen C, Liu M, Meng X, Xiao W, Ju Q (2020) RefineDetLite: a lightweight one-stage object detection framework for CPU-only devices. In: 2020 IEEE\/CVF Conference on computer vision and pattern recognition workshops (CVPRW), 14\u201319 June 2020, pp 2997\u20133007. https:\/\/doi.org\/10.1109\/CVPRW50498.2020.00358","DOI":"10.1109\/CVPRW50498.2020.00358"},{"key":"6830_CR102","unstructured":"Ling H, Zhang L, Qin Y, Shi Y, Wu L, Chen J, Zhang B (2020) BMNet: a reconstructed network for lightweight object detection via branch merging. In: 2019 30th British machine vision conference (BMVC), 2019, pp 1\u201312"},{"key":"6830_CR103","doi-asserted-by":"publisher","first-page":"1935","DOI":"10.1109\/ACCESS.2019.2961959","volume":"8","author":"W Fang","year":"2020","unstructured":"Fang W, Wang L, Ren P (2020) Tinier-YOLO: a real-time object detection method for constrained environments. IEEE Access 8:1935\u20131944. https:\/\/doi.org\/10.1109\/ACCESS.2019.2961959","journal-title":"IEEE Access"},{"key":"6830_CR104","doi-asserted-by":"publisher","DOI":"10.1007\/s11554-021-01145-4","author":"J Han","year":"2021","unstructured":"Han J, Yang Y (2021) L-Net: lightweight and fast object detector-based ShuffleNetV2. J Real-Time Image Proc. https:\/\/doi.org\/10.1007\/s11554-021-01145-4","journal-title":"J Real-Time Image Proc"},{"issue":"1","key":"6830_CR105","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1007\/s11036-020-01723-z","volume":"26","author":"Q Zhou","year":"2021","unstructured":"Zhou Q, Wang J, Liu J, Li S, Ou W, Jin X (2021) RSANet: towards real-time object detection with residual semantic-guided attention feature pyramid network. Mobile Netw Appl 26(1):77\u201387. https:\/\/doi.org\/10.1007\/s11036-020-01723-z","journal-title":"Mobile Netw Appl"},{"key":"6830_CR106","doi-asserted-by":"crossref","unstructured":"Wang C-Y, Bochkovskiy A, Liao H-YM (2021) Scaled-yolov4: scaling cross stage partial network. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, 2021, pp 13029\u201313038","DOI":"10.1109\/CVPR46437.2021.01283"},{"key":"6830_CR107","unstructured":"Wang RJ, Li X, Ao S, Ling CX (2018) Pelee: a real-time object detection system on mobile devices. In: 6th International conference on learning representations, ICLR 2018\u2014workshop track proceedings, Montr\u00e9al, Canada, 2018. Curran Associates Inc., pp 1963\u20131972"},{"key":"6830_CR108","unstructured":"Liau HF, Yamini N, Wong YL (2018) Fire SSD: wide fire modules based single shot detector on edge device. arXiv preprint https:\/\/arxiv.org\/abs\/1806.05363"},{"key":"6830_CR109","doi-asserted-by":"publisher","unstructured":"Gong H, Li H, Xu K, Zhang Y (2019) Object detection based on improved YOLOv3-tiny. In: 2019 Chinese automation congress (CAC), 22\u201324 Nov 2019, pp 3240\u20133245. https:\/\/doi.org\/10.1109\/CAC48633.2019.8996750","DOI":"10.1109\/CAC48633.2019.8996750"},{"key":"6830_CR110","doi-asserted-by":"publisher","unstructured":"Jiun-In G, Chi-Chi T, Ching-Kan T (2019) Pvalite CLN: lightweight object detection with classfication and localization network. In: 2019 32nd IEEE International system-on-chip conference (SOCC), 3\u20136 Sept 2019, pp 118\u2013121. https:\/\/doi.org\/10.1109\/SOCC46988.2019.1570561207","DOI":"10.1109\/SOCC46988.2019.1570561207"},{"key":"6830_CR111","doi-asserted-by":"publisher","unstructured":"Ghiasi G, Lin TY, Le QV (2019) NAS-FPN: learning scalable feature pyramid architecture for object detection. In: 2019 IEEE\/CVF Conference on computer vision and pattern recognition (CVPR), 15\u201320 June 2019, pp 7029\u20137038. https:\/\/doi.org\/10.1109\/CVPR.2019.00720","DOI":"10.1109\/CVPR.2019.00720"},{"key":"6830_CR112","doi-asserted-by":"publisher","unstructured":"Howard A, Sandler M, Chen B, Wang W, Chen L, Tan M, Chu G, Vasudevan V, Zhu Y, Pang R, Adam H, Le Q (2019) Searching for MobileNetV3. In: 2019 IEEE\/CVF International conference on computer vision (ICCV), 27 Oct\u20132 Nov 2019, pp 1314\u20131324. https:\/\/doi.org\/10.1109\/ICCV.2019.00140","DOI":"10.1109\/ICCV.2019.00140"},{"key":"6830_CR113","doi-asserted-by":"publisher","unstructured":"Sun Y, Wang C, Qu L (2019) An object detection network for embedded system. In: 2019 IEEE International conferences on ubiquitous computing & communications (IUCC) and data science and computational intelligence (DSCI) and smart computing, networking and services (SmartCNS), 21\u201323 Oct 2019, pp 506\u2013512. https:\/\/doi.org\/10.1109\/IUCC\/DSCI\/SmartCNS.2019.00110","DOI":"10.1109\/IUCC\/DSCI\/SmartCNS.2019.00110"},{"key":"6830_CR114","doi-asserted-by":"publisher","first-page":"1861","DOI":"10.3390\/s20071861","volume":"20","author":"H Zhao","year":"2020","unstructured":"Zhao H, Zhou Y, Zhang L, Peng Y, Hu X, Peng H, Cai X (2020) Mixed YOLOv3-LITE: a lightweight real-time object detection method. Sensors (Switzerland) 20:1861. https:\/\/doi.org\/10.3390\/s20071861","journal-title":"Sensors (Switzerland)"},{"key":"6830_CR115","doi-asserted-by":"publisher","unstructured":"Fan B, Chen Y, Qu J, Chai Y, Xiao C, Huang P (2019) FFBNet: lightweight backbone for object detection based feature fusion block. In: 2019 IEEE International conference on image processing (ICIP), 22\u201325 Sept 2019, pp 3920\u20133924. https:\/\/doi.org\/10.1109\/ICIP.2019.8803683","DOI":"10.1109\/ICIP.2019.8803683"},{"key":"6830_CR116","doi-asserted-by":"publisher","unstructured":"Hu L, Li Y (2021) Micro-YOLO: exploring efficient methods to compress CNN based object detection model. In: Proceedings of the 13th International conference on agents and artificial intelligence (ICAART), 2021. SciTePress, pp 151\u2013158. https:\/\/doi.org\/10.5220\/0010234401510158","DOI":"10.5220\/0010234401510158"},{"key":"6830_CR117","doi-asserted-by":"publisher","unstructured":"Guo S, Liu Y, Ni Y, Ni W (2021) Lightweight SSD: real-time lightweight single shot detector for mobile devices. In: Proceedings of the 16th international joint conference on computer vision, imaging and computer graphics theory and applications (VISIGRAPP), 2021, pp 25\u201335. https:\/\/doi.org\/10.5220\/0010188000250035","DOI":"10.5220\/0010188000250035"},{"key":"6830_CR118","doi-asserted-by":"publisher","unstructured":"Wu B, Wan A, Iandola F, Jin PH, Keutzer K (2017) SqueezeDet: unified, small, low power fully convolutional neural networks for real-time object detection for autonomous driving. In: 2017 IEEE Conference on computer vision and pattern recognition workshops (CVPRW), 21\u201326 July 2017, pp 446\u2013454. https:\/\/doi.org\/10.1109\/CVPRW.2017.60","DOI":"10.1109\/CVPRW.2017.60"},{"key":"6830_CR119","doi-asserted-by":"crossref","unstructured":"Szegedy C, Ioffe S, Vanhoucke V, Alemi AA (2017) Inception-v4, inception-ResNet and the impact of residual connections on learning. Paper presented at the Proceedings of the thirty-first AAAI conference on artificial intelligence, San Francisco, California, USA","DOI":"10.1609\/aaai.v31i1.11231"},{"key":"6830_CR120","unstructured":"YOLO: Real-time object detection. https:\/\/pjreddie.com\/darknet\/yolo\/. Accessed 2021-03-09"},{"key":"6830_CR121","doi-asserted-by":"publisher","unstructured":"Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: 2018 IEEE\/CVF Conference on computer vision and pattern recognition, 18\u201323 June 2018, pp 7132\u20137141. https:\/\/doi.org\/10.1109\/CVPR.2018.00745","DOI":"10.1109\/CVPR.2018.00745"},{"key":"6830_CR122","unstructured":"Hong S, Roh B, Kim K-H, Cheon Y, Park M (2016) PVANet: lightweight deep neural networks for real-time object detection. arXiv preprint https:\/\/arxiv.org\/abs\/1611.08588v2"},{"key":"6830_CR123","doi-asserted-by":"publisher","unstructured":"Woo S, Park J, Lee J-Y, Kweon IS (2018) CBAM: convolutional block attention module. In: Ferrari V, Hebert M, Sminchisescu C, Weiss Y (eds) Computer vision\u2014ECCV 2018. Springer International Publishing, Cham, pp 3\u201319. https:\/\/doi.org\/10.1007\/978-3-030-01234-2_1","DOI":"10.1007\/978-3-030-01234-2_1"},{"issue":"2","key":"6830_CR124","doi-asserted-by":"publisher","first-page":"652","DOI":"10.1109\/tpami.2019.2938758","volume":"43","author":"S-H Gao","year":"2021","unstructured":"Gao S-H, Cheng M-M, Zhao K, Zhang X-Y, Yang M-H, Torr P (2021) Res2Net: a new multi-scale backbone architecture. IEEE Trans Pattern Anal Mach Intell 43(2):652\u2013662. https:\/\/doi.org\/10.1109\/tpami.2019.2938758","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6830_CR125","doi-asserted-by":"publisher","unstructured":"Wang C, Liao HM, Wu Y, Chen P, Hsieh J, Yeh I (2020) CSPNet: a new backbone that can enhance learning capability of CNN. In: 2020 IEEE\/CVF Conference on computer vision and pattern recognition workshops (CVPRW), 14\u201319 June 2020, pp 1571\u20131580. https:\/\/doi.org\/10.1109\/CVPRW50498.2020.00203","DOI":"10.1109\/CVPRW50498.2020.00203"},{"key":"6830_CR126","doi-asserted-by":"publisher","first-page":"3349","DOI":"10.1109\/TPAMI.2020.2983686","volume":"43","author":"J Wang","year":"2020","unstructured":"Wang J, Sun K, Cheng T, Jiang B, Deng C, Zhao Y, Liu D, Mu Y, Tan M, Wang X, Liu W, Xiao B (2020) Deep high-resolution representation learning for visual recognition. IEEE Trans Pattern Anal Mach Intell 43:3349\u20133364. https:\/\/doi.org\/10.1109\/TPAMI.2020.2983686","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6830_CR127","doi-asserted-by":"publisher","unstructured":"Tian Z, Shen C, Chen H, He T (2019) FCOS: fully convolutional one-stage object detection. In: 2019 IEEE\/CVF International conference on computer vision (ICCV), 27 Oct\u20132 Nov 2019, pp 9626\u20139635. https:\/\/doi.org\/10.1109\/ICCV.2019.00972","DOI":"10.1109\/ICCV.2019.00972"},{"key":"6830_CR128","doi-asserted-by":"publisher","unstructured":"Wu B, Wan A, Yue X, Jin P, Zhao S, Golmant N, Gholaminejad A, Gonzalez J, Keutzer K (2018) Shift: a zero FLOP, zero parameter alternative to spatial convolutions. In: 2018 IEEE\/CVF Conference on computer vision and pattern recognition, 18\u201323 June 2018. IEEE Computer Society, pp 9127\u20139135. https:\/\/doi.org\/10.1109\/CVPR.2018.00951","DOI":"10.1109\/CVPR.2018.00951"},{"key":"6830_CR129","doi-asserted-by":"publisher","unstructured":"Lee Y, Hwang J, Lee S, Bae Y, Park J (2019) An energy and GPU-computation efficient backbone network for real-time object detection. In: 2019 IEEE\/CVF Conference on computer vision and pattern recognition workshops (CVPRW), 16\u201317 June 2019, pp 752\u2013760. https:\/\/doi.org\/10.1109\/CVPRW.2019.00103","DOI":"10.1109\/CVPRW.2019.00103"},{"key":"6830_CR130","doi-asserted-by":"publisher","unstructured":"Zhang D (2018) clcNet: improving the efficiency of convolutional neural network using channel local convolutions. In: 2018 IEEE\/CVF Conference on computer vision and pattern recognition, 18\u201323 June 2018, pp 7912\u20137919. https:\/\/doi.org\/10.1109\/CVPR.2018.00825","DOI":"10.1109\/CVPR.2018.00825"},{"issue":"8","key":"6830_CR131","doi-asserted-by":"publisher","first-page":"2570","DOI":"10.1109\/TPAMI.2020.2975796","volume":"43","author":"H Gao","year":"2021","unstructured":"Gao H, Wang Z, Cai L, Ji S (2021) ChannelNets: compact and efficient convolutional neural networks via channel-wise convolutions. IEEE Trans Pattern Anal Mach Intell 43(8):2570\u20132581. https:\/\/doi.org\/10.1109\/TPAMI.2020.2975796","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6830_CR132","unstructured":"Xiong Y, Kim HJ, Hedau V (2019) ANTNets: mobile convolutional neural networks for resource efficient image classification. arXiv preprint https:\/\/arxiv.org\/abs\/1904.03775"},{"key":"6830_CR133","doi-asserted-by":"publisher","unstructured":"Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) GhostNet: more features from cheap operations. In: 2020 IEEE\/CVF Conference on computer vision and pattern recognition (CVPR), 13\u201319 June 2020, pp 1577\u20131586. https:\/\/doi.org\/10.1109\/CVPR42600.2020.00165","DOI":"10.1109\/CVPR42600.2020.00165"},{"key":"6830_CR134","doi-asserted-by":"publisher","DOI":"10.1155\/2020\/8817849","author":"W Wang","year":"2020","unstructured":"Wang W, Hu Y, Zou T, Liu H, Wang J, Wang X (2020) A new image classification approach via improved MobileNet models with local receptive field expansion in shallow layers. Comput Intell Neurosci. https:\/\/doi.org\/10.1155\/2020\/8817849","journal-title":"Comput Intell Neurosci"},{"key":"6830_CR135","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2020.3041871","author":"S Mehta","year":"2020","unstructured":"Mehta S, Hajishirzi H, Rastegari M (2020) DiCENet: dimension-wise convolutions for efficient networks. IEEE Trans Pattern Anal Mach Intell. https:\/\/doi.org\/10.1109\/TPAMI.2020.3041871","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6830_CR136","doi-asserted-by":"publisher","unstructured":"Gholami A, Kwon K, Wu B, Tai Z, Yue X, Jin P, Zhao S, Keutzer K (2018) SqueezeNext: hardware-aware neural network design. In: 2018 IEEE\/CVF Conference on computer vision and pattern recognition workshops (CVPRW), 18\u201322 June 2018, pp 1719\u20131728. https:\/\/doi.org\/10.1109\/CVPRW.2018.00215","DOI":"10.1109\/CVPRW.2018.00215"},{"key":"6830_CR137","doi-asserted-by":"publisher","unstructured":"Huang G, Liu Z, Maaten LVD, Weinberger KQ (2017) Densely connected convolutional networks. In: 2017 IEEE Conference on computer vision and pattern recognition (CVPR), 21\u201326 July 2017, pp 2261\u20132269. https:\/\/doi.org\/10.1109\/CVPR.2017.243","DOI":"10.1109\/CVPR.2017.243"},{"key":"6830_CR138","doi-asserted-by":"publisher","unstructured":"Zoph B, Vasudevan V, Shlens J, Le QV (2018) Learning transferable architectures for scalable image recognition. In: 2018 IEEE\/CVF Conference on computer vision and pattern recognition, 18\u201323 June 2018, pp 8697\u20138710. https:\/\/doi.org\/10.1109\/CVPR.2018.00907","DOI":"10.1109\/CVPR.2018.00907"},{"key":"6830_CR139","doi-asserted-by":"publisher","unstructured":"Tan M, Chen B, Pang R, Vasudevan V, Sandler M, Howard A, Le QV (2019) MnasNet: platform-aware neural architecture search for mobile. In: 2019 IEEE\/CVF Conference on computer vision and pattern recognition (CVPR), 15\u201320 June 2019, pp 2815\u20132823. https:\/\/doi.org\/10.1109\/CVPR.2019.00293","DOI":"10.1109\/CVPR.2019.00293"},{"key":"6830_CR140","doi-asserted-by":"publisher","unstructured":"Stamoulis D, Ding R, Wang D, Lymberopoulos D, Priyantha B, Liu J, Marculescu D (2020) Single-path NAS: designing hardware-efficient ConvNets in less than 4 h. In: Brefeld U, Fromont E, Hotho A, Knobbe A, Maathuis M, Robardet C (eds) Machine learning and knowledge discovery in databases. Springer International Publishing, Cham, pp 481\u2013497. https:\/\/doi.org\/10.1007\/978-3-030-46147-8_29","DOI":"10.1007\/978-3-030-46147-8_29"},{"key":"6830_CR141","doi-asserted-by":"publisher","unstructured":"Wu B, Dai X, Zhang P, Wang Y, Sun F, Wu Y, Tian Y, Vajda P, Jia Y, Keutzer K (2019) FBNet: hardware-aware efficient ConvNet design via differentiable neural architecture search. In: 2019 IEEE\/CVF Conference on computer vision and pattern recognition (CVPR), 15\u201320 June 2019, pp 10726\u201310734. https:\/\/doi.org\/10.1109\/CVPR.2019.01099","DOI":"10.1109\/CVPR.2019.01099"},{"key":"6830_CR142","doi-asserted-by":"publisher","unstructured":"Guo Z, Zhang X, Mu H, Heng W, Liu Z, Wei Y, Sun J (2020) Single path one-shot neural architecture search with uniform sampling. In: Vedaldi A, Bischof H, Brox T, Frahm J-M (eds) Computer vision\u2014ECCV 2020. Springer International Publishing, Cham, pp 544\u2013560. https:\/\/doi.org\/10.1007\/978-3-030-58517-4_32","DOI":"10.1007\/978-3-030-58517-4_32"},{"key":"6830_CR143","doi-asserted-by":"publisher","unstructured":"Cai H, Wang T, Wu Z, Wang K, Lin J, Han S (2019) On-device image classification with proxyless neural architecture search and quantization-aware fine-tuning. In: 2019 IEEE\/CVF International conference on computer vision workshop (ICCVW), 27\u201328 Oct 2019, pp 2509\u20132513. https:\/\/doi.org\/10.1109\/ICCVW.2019.00307","DOI":"10.1109\/ICCVW.2019.00307"},{"key":"6830_CR144","doi-asserted-by":"publisher","unstructured":"Wan A, Dai X, Zhang P, He Z, Tian Y, Xie S, Wu B, Yu M, Xu T, Chen K, Vajda P, Gonzalez JE (2020) FBNetV2: differentiable neural architecture search for spatial and channel dimensions. In: 2020 IEEE\/CVF Conference on computer vision and pattern recognition (CVPR), 13\u201319 June 2020, pp 12962\u201312971. https:\/\/doi.org\/10.1109\/CVPR42600.2020.01298","DOI":"10.1109\/CVPR42600.2020.01298"}],"container-title":["Neural Computing and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-021-06830-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00521-021-06830-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-021-06830-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,6,25]],"date-time":"2022-06-25T09:09:42Z","timestamp":1656148182000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00521-021-06830-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12,27]]},"references-count":144,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2022,7]]}},"alternative-id":["6830"],"URL":"https:\/\/doi.org\/10.1007\/s00521-021-06830-w","relation":{},"ISSN":["0941-0643","1433-3058"],"issn-type":[{"value":"0941-0643","type":"print"},{"value":"1433-3058","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,12,27]]},"assertion":[{"value":"31 March 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 December 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 December 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}