{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,16]],"date-time":"2026-06-16T12:11:18Z","timestamp":1781611878586,"version":"3.54.5"},"reference-count":154,"publisher":"MDPI AG","issue":"16","license":[{"start":{"date-parts":[[2024,8,22]],"date-time":"2024-08-22T00:00:00Z","timestamp":1724284800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"CEA"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>In modern cyber-physical systems, the integration of AI into vision pipelines is now a standard practice for applications ranging from autonomous vehicles to mobile devices. Traditional AI integration often relies on cloud-based processing, which faces challenges such as data access bottlenecks, increased latency, and high power consumption. This article reviews embedded AI vision systems, examining the diverse landscape of near-sensor and in-sensor processing architectures that incorporate convolutional neural networks. We begin with a comprehensive analysis of the critical characteristics and metrics that define the performance of AI-integrated vision systems. These include sensor resolution, frame rate, data bandwidth, computational throughput, latency, power efficiency, and overall system scalability. Understanding these metrics provides a foundation for evaluating how different embedded processing architectures impact the entire vision pipeline, from image capture to AI inference. Our analysis delves into near-sensor systems that leverage dedicated hardware accelerators and commercially available components to efficiently process data close to their source, minimizing data transfer overhead and latency. These systems offer a balance between flexibility and performance, allowing for real-time processing in constrained environments. In addition, we explore in-sensor processing solutions that integrate computational capabilities directly into the sensor. This approach addresses the rigorous demand constraints of embedded applications by significantly reducing data movement and power consumption while also enabling in-sensor feature extraction, pre-processing, and CNN inference. By comparing these approaches, we identify trade-offs related to flexibility, power consumption, and computational performance. Ultimately, this article provides insights into the evolving landscape of embedded AI vision systems and suggests new research directions for the development of next-generation machine vision systems.<\/jats:p>","DOI":"10.3390\/s24165446","type":"journal-article","created":{"date-parts":[[2024,8,23]],"date-time":"2024-08-23T12:58:07Z","timestamp":1724417887000},"page":"5446","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":29,"title":["From Near-Sensor to In-Sensor: A State-of-the-Art Review of Embedded AI Vision Systems"],"prefix":"10.3390","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5365-699X","authenticated-orcid":false,"given":"William","family":"Fabre","sequence":"first","affiliation":[{"name":"Universit\u00e9 Paris-Saclay, CEA, List, F-91120 Palaiseau, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-6972-6019","authenticated-orcid":false,"given":"Karim","family":"Haroun","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris-Saclay, CEA, List, F-91120 Palaiseau, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-9909-2017","authenticated-orcid":false,"given":"Vincent","family":"Lorrain","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris-Saclay, CEA, List, F-91120 Palaiseau, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8426-7789","authenticated-orcid":false,"given":"Maria","family":"Lepecq","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris-Saclay, CEA, List, F-91120 Palaiseau, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8940-1000","authenticated-orcid":false,"given":"Gilles","family":"Sicard","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris-Saclay, CEA, List, F-91120 Palaiseau, France"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2024,8,22]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Beckman, P., Sankaran, R., Catlett, C., Ferrier, N., Jacob, R., and Papka, M. (November, January 30). Waggle: An open sensor platform for edge computing. Proceedings of the 2016 IEEE Sensors, Orlando, FL, USA.","DOI":"10.1109\/ICSENS.2016.7808975"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Theuwissen, A. (2021, January 13\u201322). There\u2019s More to the Picture Than Meets the Eye and in the Future It Will Only Become More so. Proceedings of the 2021 IEEE International Solid-State Circuits Conference (ISSCC), San Francisco, CA, USA.","DOI":"10.1109\/ISSCC42613.2021.9366058"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive Image Features from Scale-Invariant Keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.cviu.2007.09.014","article-title":"Speeded-Up Robust Features (SURF)","volume":"110","author":"Bay","year":"2008","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_5","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201325). Histograms of oriented gradients for human detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905), San Diego, CA, USA."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1109\/JPROC.2020.2976475","article-title":"Model Compression and Hardware Acceleration for Neural Networks: A Comprehensive Survey","volume":"108","author":"Deng","year":"2020","journal-title":"Proc. IEEE"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"2295","DOI":"10.1109\/JPROC.2017.2761740","article-title":"Efficient Processing of Deep Neural Networks: A Tutorial and Survey","volume":"105","author":"Sze","year":"2017","journal-title":"Proc. IEEE"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Capra, M., Bussolino, B., Marchisio, A., Shafique, M., Masera, G., and Martina, M. (2020). An Updated Survey of Efficient Hardware Architectures for Accelerating Deep Convolutional Neural Networks. Future Internet, 12.","DOI":"10.3390\/fi12070113"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Li, F.-F. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Bourdev, L., Girshick, R., Hays, J., Perona, P., Ramanan, D., Zitnick, C.L., and Doll\u00e1r, P. (2014, January 6\u201312). Microsoft COCO: Common Objects in Context. Proceedings of the European Conference on Computer Vision (ECCV), Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_11","first-page":"1097","article-title":"ImageNet Classification with Deep Convolutional Neural Networks","volume":"Volume 25","author":"Pereira","year":"2012","journal-title":"Advances in Neural Information Processing Systems"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Minaee, S., Boykov, Y., Porikli, F., Plaza, A., Kehtarnavaz, N., and Terzopoulos, D. (2021). Image Segmentation Using Deep Learning: A Survey. IEEE Trans. Pattern Anal. Mach. Intell.","DOI":"10.1109\/TPAMI.2021.3059968"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"103514","DOI":"10.1016\/j.dsp.2022.103514","article-title":"A Survey of Modern Deep Learning Based Object Detection Models","volume":"126","author":"Zaidi","year":"2022","journal-title":"Digital Signal Processing"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_15","unstructured":"Tolstikhin, I.O., Houlsby, N., Kolesnikov, A., Beyer, L., Zhai, X., Unterthiner, T., Yung, J., Steiner, A., Keysers, D., and Uszkoreit, J. (2021, January 6\u201314). MLP-Mixer: An all-MLP Architecture for Vision. Proceedings of the Advances in Neural Information Processing Systems, NeurIPS, Online. Available online: https:\/\/proceedings.neurips.cc\/paper\/2021\/hash\/cba0a4ee5ccd02fda0fe3f9a3e7b89fe-Abstract.html."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"5314","DOI":"10.1109\/TPAMI.2022.3206148","article-title":"ResMLP: Feedforward networks for image classification with data-efficient training","volume":"45","author":"Touvron","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_17","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2021, January 4). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. Proceedings of the International Conference on Learning Representations (ICLR), Vienna, Austria."},{"key":"ref_18","unstructured":"Touvron, H., Cord, M., Douze, M., Massa, F., Sablayrolles, A., and J\u00e9gou, H. (2021, January 18\u201324). Training data-efficient image transformers & distillation through attention. Proceedings of the 38th International Conference on Machine Learning, Virtual."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 10\u201317). Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"14284","DOI":"10.1109\/TPAMI.2023.3303397","article-title":"CycleMLP: A MLP-Like Architecture for Dense Visual Predictions","volume":"45","author":"Chen","year":"2023","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_21","unstructured":"Liu, H., Dai, Z., So, D.R., and Le, Q.V. (2021). Pay Attention to MLPs. Advances in Neural Information Processing Systems, NeurIPS."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Horowitz, M. (2014, January 9\u201313). 1.1 Computing\u2019s Energy Problem (and What We Can Do about It). Proceedings of the 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC), San Francisco, CA, USA.","DOI":"10.1109\/ISSCC.2014.6757323"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Lepecq, M., Dalgaty, T., Fabre, W., and Chevobbe, S. (2023). End-to-End Implementation of a Convolutional Neural Network on a 3D-Integrated Image Sensor with Macropixel Array. Sensors, 23.","DOI":"10.3390\/s23041909"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Liu, P., Yang, Z., Kang, L., and Wang, J. (2022). A Heterogeneous Architecture for the Vision Processing Unit with a Hybrid Deep Neural Network Accelerator. Micromachines, 13.","DOI":"10.3390\/mi13020268"},{"key":"ref_25","first-page":"2957","article-title":"ViP: A Hierarchical Parallel Vision Processor for Hybrid Vision Chip","volume":"69","author":"Zheng","year":"2022","journal-title":"IEEE Trans. Circuits Syst. II Express Briefs"},{"key":"ref_26","unstructured":"Khronos Group (2023, April 03). OpenCL: The Open Standard for Parallel Programming of Heterogeneous Systems. Available online: https:\/\/www.khronos.org\/opencl\/."},{"key":"ref_27","unstructured":"NVIDIA Corporation (2023, April 03). CUDA Documentation. Available online: https:\/\/docs.nvidia.com\/cuda\/."},{"key":"ref_28","unstructured":"NVIDIA Corporation (2023, April 03). TensorRT Documentation. Available online: https:\/\/developer.nvidia.com\/tensorrt."},{"key":"ref_29","unstructured":"ADLINK Technology (2023, April 03). NEON-1000-MDX Starter Kit Series Datasheet. Available online: https:\/\/www.adlinktech.com\/Products\/Machine_Vision\/SmartCamera\/NEON-1000-MDX_Series."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Garofalo, A., Rusci, M., Conti, F., Rossi, D., and Benini, L. (2019, January 27\u201329). PULP-NN: Accelerating Quantized Neural Networks on Parallel Ultra-Low-Power RISC-V Processors. Proceedings of the 26th IEEE International Conference on Electronics, Circuits and Systems (ICECS), Genoa, Italy.","DOI":"10.1098\/rsta.2019.0155"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Eki, R., Yamada, S., Ozawa, H., Kai, H., Okuike, K., Gowtham, H., Nakanishi, H., Almog, E., Livne, Y., and Yuval, G. (2021, January 13\u201322). 9.6 a 1\/2.3inch 12.3Mpixel with on-Chip 4.97TOPS\/W CNN Processor Back-Illuminated Stacked CMOS Image Sensor. Proceedings of the 2021 IEEE International Solid-State Circuits Conference (ISSCC), San Francisco, CA, USA.","DOI":"10.1109\/ISSCC42613.2021.9365965"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Bong, K., Choi, S., Kim, C., Kang, S., Kim, Y., and Yoo, H.J. (2017, January 5\u20139). 14.6 a 0.62mW Ultra-Low-Power Convolutional-Neural-Network facial recognition Processor and a CIS Integrated with Always-on Haar-like Face Detector. Proceedings of the 2017 IEEE International Solid-State Circuits Conference (ISSCC), San Francisco, CA, USA.","DOI":"10.1109\/ISSCC.2017.7870354"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"8357","DOI":"10.1109\/JIOT.2019.2917066","article-title":"A 64-mW DNN-based Visual Navigation Engine for Autonomous Nano-Drones","volume":"6","author":"Palossi","year":"2019","journal-title":"IEEE Internet Things J."},{"key":"ref_34","unstructured":"JeVois (2023, April 03). JeVois Pro: Smart Embedded Vision Toolkit. Available online: http:\/\/jevois.org\/doc\/ProUserConnect.html."},{"key":"ref_35","unstructured":"(2023, April 03). Teledyne. FireflyDL Datasheet, Available online: https:\/\/www.flir.com\/support-center\/iis\/machine-vision\/knowledge-base\/technical-documentation-firefly-s\/."},{"key":"ref_36","unstructured":"(2024, August 20). Luxonis. OAK-1 DepthAI Hardware Product Page, Available online: https:\/\/shop.luxonis.com\/products\/oak-1?variant=42664380334303."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Bose, L., Dudek, P., Chen, J., Carey, S., Everitt, P., Wang, H., Davies, P., Liu, J., Wang, Y., and Uusitalo, A. (2019, January 27). A Camera That Cnns: Towards Embedded Neural Networks on Pixel Processor Arrays. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00142"},{"key":"ref_38","unstructured":"ADLINK Technology (2024, August 18). NEON-2000-JNX Series Datasheet. Available online: https:\/\/www.adlinktech.com\/Products\/Machine_Vision\/SmartCamera\/NEON-2000-JNX_Series."},{"key":"ref_39","unstructured":"DFRobot (2024, August 03). Huskylens AI Machine Vision Sensor. Available online: https:\/\/store-usa.arduino.cc\/products\/gravity-huskylens-ai-machine-vision-sensor."},{"key":"ref_40","unstructured":"AIY, G. (2024, August 03). AIY Vision Kit v1.1. Available online: https:\/\/pi3g.com\/products\/machine-learning\/google-coral\/aiy-vision-kit\/."},{"key":"ref_41","unstructured":"ImagoTechnologies (2023, August 03). VisionAI v1.2 Datasheet. Available online: https:\/\/imago-technologies.com\/wp-content\/uploads\/2021\/01\/Specification-VisionAI-V1.2.pdf."},{"key":"ref_42","unstructured":"Seeed Studio (2024, August 18). SenseCAP A1101 LoRaWAN Vision AI Sensor User Guide. Available online: https:\/\/files.seeedstudio.com\/wiki\/SenseCAP-A1101\/SenseCAP_A1101_LoRaWAN_Vision_AI_Sensor_User_Guide_V1.0.2.pdf."},{"key":"ref_43","unstructured":"JeVois (2023, April 03). JeVois Smart Machine. Available online: https:\/\/www.jevoisinc.com\/pages\/hardware."},{"key":"ref_44","unstructured":"Amazon Web Services (2024, August 04). AWS DeepLens AI Vision. Available online: https:\/\/bloggeek.me\/aws-deeplens-ai-vision\/."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"1071","DOI":"10.1109\/JSSC.2020.3041858","article-title":"An Ultra-Low-Power Image Signal Processor for Hierarchical Image Recognition with Deep Neural Networks","volume":"56","author":"An","year":"2021","journal-title":"IEEE J. Solid-State Circuits"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1109\/JSSC.2017.2767705","article-title":"A Low-Power Convolutional Neural Network Face Recognition Processor and a CIS Integrated with Always-on Face Detector","volume":"53","author":"Bong","year":"2018","journal-title":"IEEE J. Solid-State Circuits"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Verdant, A., Guicquero, W., Royer, N., Moritz, G., Seifert, T., Hsu, C.-H., and He, Z. (2020, January 16\u201319). A 3.0\u03bcW5fps QQVGA Self-Controlled Wake-up Imager with on-Chip Motion Detection, Auto-Exposure and Object Recognition. Proceedings of the 2020 IEEE Symposium on VLSI Circuits, Honolulu, HI, USA.","DOI":"10.1109\/VLSICircuits18222.2020.9162854"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Lefebvre, M., Moreau, L., Dekimpe, R., and Bol, D. (2021, January 13\u201322). A 0.2-to-3.6TOPS\/W Programmable Convolutional Imager SoC with in-Sensor Current-Domain Ternary-Weighted MAC Operations for Feature Extraction and Region-of-Interest Detection. Proceedings of the 2021 IEEE International Solid-State Circuits Conference (ISSCC), San Francisco, CA, USA.","DOI":"10.1109\/ISSCC42613.2021.9365839"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Singh, R., Bailey, S., Chang, P., Olyaei, A., Hekmat, M., and Winoto, R. (2021, January 13\u201322). 34.2 A 21pJ\/frame\/pixel Imager and 34pJ\/frame\/pixel Image Processor for a Low-Vision Augmented-Reality Smart Contact Lens. Proceedings of the 2021 IEEE International Solid-State Circuits Conference (ISSCC), San Francisco, CA, USA.","DOI":"10.1109\/ISSCC42613.2021.9365771"},{"key":"ref_50","unstructured":"Wong, M.Z., Guillard, B., Murai, R., Saeedi, S., and Kelly, P.H.J. (2020). AnalogNet: Convolutional Neural Network Inference on Analog Focal Plane Sensor Processors. arXiv, Available online: http:\/\/arxiv.org\/abs\/2006.01765."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Gontard, L.C., Carmona-Galan, R., and Rodriguez-Vazquez, A. (2020, January 12\u201314). Cellular-Neural-Network Focal-Plane Processor as Pre-Processor for ConvNet Inference. Proceedings of the 2020 IEEE International Symposium on Circuits and Systems (ISCAS), Seville, Spain.","DOI":"10.1109\/ISCAS45731.2020.9181102"},{"key":"ref_52","first-page":"5535","article-title":"Evolving Image Sensor Architecture through Stacking Devices","volume":"68","author":"Oike","year":"2021","journal-title":"IEEE Trans. Electron Devices"},{"key":"ref_53","unstructured":"Carey, S.J., Lopich, A., Barr, D.R.W., Wang, B., and Dudek, P. (2013, January 12\u201314). A 100,000 fps Vision Sensor with Embedded 535GOPS\/W 256\u00d7256 SIMD Processor Array. Proceedings of the 2013 Symposium on VLSI Circuits, Kyoto, Japan."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1109\/MCAS.2018.2821772","article-title":"CMOS Vision Sensors: Embedding Computer Vision at Imaging Front-Ends","volume":"18","author":"Vornicu","year":"2018","journal-title":"IEEE Circuits Syst. Mag."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Millet, L., Chevobbe, S., Andriamisaina, C., Beign\u00e9, E., Benaissa, L., Deschaseaux, E., Benchehida, K., Lepecq, M., Darouich, M., and Guellec, F. (2018, January 18\u201322). A 5500FPS 85GOPS\/W 3D Stacked BSI Vision Chip Based on Parallel in-Focal-Plane Acquisition and Processing. Proceedings of the 2018 IEEE Symposium on VLSI Circuits, Honolulu, HI, USA.","DOI":"10.1109\/VLSIC.2018.8502290"},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"1592","DOI":"10.1109\/TVLSI.2023.3299509","article-title":"H3DAtten: Heterogeneous 3-D Integrated Hybrid Analog and Digital Compute-in-Memory Accelerator for Vision Transformer Self-Attention","volume":"31","author":"Li","year":"2023","journal-title":"IEEE Trans. Very Large Scale Integr. (VLSI) Syst."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"4:1","DOI":"10.1147\/JRD.2019.2947373","article-title":"Heterogeneous Integration for Artificial Intelligence: Challenges and Opportunities","volume":"63","author":"Mukhopadhyay","year":"2019","journal-title":"IBM J. Res. Dev."},{"key":"ref_58","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv, Available online: http:\/\/arxiv.org\/abs\/1704.04861."},{"key":"ref_59","unstructured":"Tan, M., and Le, Q.V. (2019, January 9\u201315). EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. Proceedings of the 36th International Conference on Machine Learning, PMLR 97:6105\u20136114, Long Beach, CA, USA."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Xu, C., Mo, Y., Ren, G., Ma, W., Wang, X., Shi, W., Hou, J., Shao, K., Wang, H., and Xiao, P. (2019, January 17\u201321). 5.1 A Stacked Global-Shutter CMOS Imager with SC-Type Hybrid-GS Pixel and Self-Knee Point Calibration Single Frame HDR and On-Chip Binarization Algorithm for Smart Vision Applications. Proceedings of the 2019 IEEE International Solid-State Circuits Conference (ISSCC), San Francisco, CA, USA.","DOI":"10.1109\/ISSCC.2019.8662441"},{"key":"ref_61","unstructured":"Viola, P., and Jones, M. (2001, January 8\u201314). Rapid object detection using a boosted cascade of simple features. Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Kauai, HI, USA."},{"key":"ref_62","unstructured":"Chevobbe, S., Lepecq, M., Benchehida, K., Darouich, M., Dombek, T., Guellec, F., and Millet, L. (2019, January 23\u201327). A Versatile 3D Stacked Vision Chip with Massively Parallel Processing Enabling Low Latency Image Analysis. Proceedings of the 2019 International Image Sensor Workshop, Snowbird, UT, USA. Available online: https:\/\/api.semanticscholar.org\/CorpusID:210114478."},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Tsugawa, H., Takahashi, H., Nakamura, R., Umebayashi, T., Ogita, T., Okano, H., and Nomoto, T. (2017, January 2\u20136). Pixel\/DRAM\/Logic 3-Layer Stacked CMOS Image Sensor Technology. Proceedings of the 2017 IEEE International Electron Devices Meeting (IEDM), San Francisco, CA, USA.","DOI":"10.1109\/IEDM.2017.8268317"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Haruta, T., Nakajima, T., Hashizume, J., Umebayashi, T., Gotoh, Y., Maeda, S., Kaneko, M., Morimoto, S., Murakami, K., and Iwahashi, S. (2017, January 5\u20139). 4.6 a 1\/2.3inch 20mpixel 3-Layer Stacked CMOS Image Sensor with DRAM. Proceedings of the 2017 IEEE International Solid-State Circuits Conference (ISSCC), San Francisco, CA, USA.","DOI":"10.1109\/ISSCC.2017.7870268"},{"key":"ref_65","unstructured":"PCI-SIG (2024, August 20). PCI Express Base Specification Revision 5.0, Version 1.0. PCI-SIG. Available online: https:\/\/pcisig.com\/specifications\/pciexpress\/."},{"key":"ref_66","unstructured":"NVIDIA (2024, August 20). Jetson AGX Xavier Developer Kit User Guide. NVIDIA Corporation, Online Resource. Available online: https:\/\/developer.nvidia.com\/embedded\/downloads."},{"key":"ref_67","unstructured":"MIPI Alliance (2024, August 20). MIPI CSI-2 Specification Version 4.0. MIPI Alliance. Available online: https:\/\/www.mipi.org\/specifications\/csi-2."},{"key":"ref_68","unstructured":"Google (2023, April 03). Coral Edge TPU Datasheet. Google, Online Resource. Available online: https:\/\/coral.ai\/products\/edgetpu\/."},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Bonnin, P., Blazevic, P., Pissaloux, E., and Al Nachar, R. (2022, January 15\u201317). Methodology of Evaluation of Low Cost Electronic Devices: Raspberry PI and Nvidia Jetson Nano for Perception System Implementation in Robotic Applications. Proceedings of the 2022 IEEE Information Technologies & Smart Industrial Systems (ITSIS), Paris, France.","DOI":"10.1109\/ITSIS56166.2022.10118365"},{"key":"ref_70","unstructured":"Arm Limited (2024, August 20). AMBA AXI and ACE Protocol Specification Version 4.0. Arm Limited. Available online: https:\/\/developer.arm.com\/documentation\/ihi0022\/g."},{"key":"ref_71","unstructured":"NXP Semiconductors (2024, August 20). I2C Bus Specification and User Manual, Revision 7. NXP Semiconductors. Available online: https:\/\/www.nxp.com\/docs\/en\/user-guide\/UM10204.pdf."},{"key":"ref_72","unstructured":"VTI Technologies (2024, August 20). TN15 SPI Interface Specification. VTI Technologies. Available online: https:\/\/www.mouser.com\/pdfdocs\/tn15_spi_interface_specification.PDF."},{"key":"ref_73","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-Based Learning Applied to Document Recognition","volume":"86","author":"Lecun","year":"1998","journal-title":"Proc. IEEE"},{"key":"ref_74","unstructured":"Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning, MIT Press."},{"key":"ref_75","doi-asserted-by":"crossref","first-page":"64270","DOI":"10.1109\/ACCESS.2018.2877890","article-title":"Benchmark Analysis of Representative Deep Neural Network Architectures","volume":"6","author":"Bianco","year":"2018","journal-title":"IEEE Access"},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201322). Squeeze-and-Excitation Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_77","unstructured":"Ramachandran, P., Zoph, B., and Le, Q.V. (2017, January 24\u201326). Searching for Activation Functions. Proceedings of the International Conference on Learning Representations (ICLR), Toulon, France."},{"key":"ref_78","doi-asserted-by":"crossref","unstructured":"Chollet, F. (2017, January 21\u201326). Xception: Deep Learning with Depthwise Separable Convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.195"},{"key":"ref_79","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 7\u20139). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. Proceedings of the 32nd International Conference on Machine Learning (ICML), Lille, France."},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going Deeper with Convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_81","unstructured":"Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., and Keutzer, K. (2024, August 20). SqueezeNet: AlexNet-level Accuracy with 50x Fewer Parameters and <0.5 MB Model Size. Available online: https:\/\/openreview.net\/forum?id=S1xh5sYgx."},{"key":"ref_82","doi-asserted-by":"crossref","unstructured":"Howard, A., Sandler, M., Chen, B., Wang, W., Chen, L.C., Tan, M., Chu, G., Vasudevan, V., Zhu, Y., and Pang, R. (November, January 27). Searching for MobileNetV3. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00140"},{"key":"ref_83","doi-asserted-by":"crossref","unstructured":"Gholami, A., Kwon, K., Wu, B., Tai, Z., Yue, X., Jin, P., Zhao, S., and Keutzer, K. (2018, January 18\u201322). SqueezeNext: Hardware-Aware Neural Network Design. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPRW.2018.00215"},{"key":"ref_84","doi-asserted-by":"crossref","unstructured":"Tan, M., Chen, B., Pang, R., Vasudevan, V., Sandler, M., Howard, A., and Le, Q.V. (2019, January 16\u201320). MnasNet: Platform-Aware Neural Architecture Search for Mobile. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00293"},{"key":"ref_85","unstructured":"(2023, April 03). Google AI Blog. Introducing the Next Generation of On-Device Vision Models: MobileNetV3 and MobileNetEdgeTPU. Available online: https:\/\/ai.googleblog.com\/2019\/11\/introducing-next-generation-on-device.html."},{"key":"ref_86","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.C. (2018, January 18\u201322). MobileNetV2: Inverted Residuals and Linear Bottlenecks. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_87","unstructured":"Simonyan, K., and Zisserman, A. (2015). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv, Available online: http:\/\/arxiv.org\/abs\/1409.1556."},{"key":"ref_88","unstructured":"Mitchell, J.C. (2024, July 27). MobileNetV2 Keras Implementation. Available online: https:\/\/github.com\/JonathanCMitchell\/mobilenet_v2_keras\/blob\/master\/mobilenetv2.py."},{"key":"ref_89","unstructured":"White, C., Safari, M., Sukthanker, R., Ru, B., Lindauer, M., and Hutter, F. (2023). Neural Architecture Search: Insights from 1000 Papers. arXiv, Available online: https:\/\/arxiv.org\/abs\/2301.08727."},{"key":"ref_90","unstructured":"Keras Team (2024, July 27). Keras Applications-MobileNet. Available online: https:\/\/keras.io\/api\/applications\/mobilenet\/."},{"key":"ref_91","unstructured":"TensorFlow Blog (2023, April 03). Higher Accuracy on Vision Models with EfficientNet-Lite. Available online: https:\/\/blog.tensorflow.org\/2020\/03\/higher-accuracy-on-vision-models-with-efficientnet-lite.html."},{"key":"ref_92","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1007\/BF00342633","article-title":"Cognitron: A self-organizing multilayered neural network","volume":"20","author":"Fukushima","year":"1975","journal-title":"Biol. Cybern."},{"key":"ref_93","doi-asserted-by":"crossref","unstructured":"Vasu, P.K.A., Gabriel, J., Zhu, J., Tuzel, O., and Ranjan, A. (2023, January 18\u201322). MobileOne: An Improved One millisecond Mobile Backbone. Proceedings of the 2023 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00764"},{"key":"ref_94","doi-asserted-by":"crossref","first-page":"6:1","DOI":"10.1147\/JRD.2019.2942284","article-title":"Co-design of deep neural nets and neural net accelerators for embedded vision applications","volume":"63","author":"Amid","year":"2019","journal-title":"IBM J. Res. Dev."},{"key":"ref_95","unstructured":"Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., and Isard, M. (2016, January 2\u20134). TensorFlow: A system for large-scale machine learning. Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Savannah, GA, USA."},{"key":"ref_96","unstructured":"Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., and Antiga, L. (2019, January 8\u201314). PyTorch: An Imperative Style, High-Performance Deep Learning Library. Proceedings of the 32nd Advances in Neural Information Processing Systems (NeurIPS), Vancouver, BC, Canada. Available online: https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2019\/file\/bdbca288fee7f92f2bfa9f7012727740-Paper.pdf."},{"key":"ref_97","unstructured":"CEA-LIST (2024, April 03). N2D2. Available online: https:\/\/github.com\/CEA-LIST\/N2D2."},{"key":"ref_98","unstructured":"Khadas (2023, April 03). Khadas VIM3 Online Datasheet. Available online: https:\/\/docs.khadas.com\/vim3\/."},{"key":"ref_99","unstructured":"Google Research (2023, April 03). TensorFlow Lite. Available online: https:\/\/www.tensorflow.org\/lite."},{"key":"ref_100","unstructured":"Teledyne (2023, April 03). Spinnaker SDK Teledyne FLIR. Available online: https:\/\/www.flir.fr\/products\/spinnaker-sdk?vertical=machine+vision&segment=iis."},{"key":"ref_101","doi-asserted-by":"crossref","first-page":"4730","DOI":"10.1109\/TNNLS.2017.2774288","article-title":"Quantized CNN: A Unified Approach to Accelerate and Compress Convolutional Networks","volume":"29","author":"Cheng","year":"2018","journal-title":"IEEE Trans. Neural Netw. Learning Syst."},{"key":"ref_102","unstructured":"Krishnamoorthi, R. (2018). Quantizing Deep Convolutional Networks for Efficient Inference: A Whitepaper. arXiv, Available online: http:\/\/arxiv.org\/abs\/1806.08342."},{"key":"ref_103","unstructured":"Nagel, M., Fournarakis, M., Amjad, R.A., Bondarenko, Y., and Blankevoort, T. (2021). A White Paper on Neural Network Quantization. arXiv, Available online: https:\/\/arxiv.org\/abs\/2106.08295."},{"key":"ref_104","doi-asserted-by":"crossref","first-page":"370","DOI":"10.1016\/j.neucom.2021.07.045","article-title":"Pruning and Quantization for Deep Neural Network Acceleration: A Survey","volume":"461","author":"Liang","year":"2021","journal-title":"Neurocomputing"},{"key":"ref_105","unstructured":"Han, S., Pool, J., Tran, J., and Dally, W.J. (2015, January 7\u201312). Learning Both Weights and Connections for Efficient Neural Networks. Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS\u201915), Montreal, QC, Canada. Available online: https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2015\/file\/ae0eb3eed39d2bcef4622b2499a05fe6-Paper.pdf."},{"key":"ref_106","doi-asserted-by":"crossref","unstructured":"Lin, M., Ji, R., Wang, Y., Zhang, Y., Tian, Y., and Shao, L. (2020, January 13\u201319). HRank: Filter Pruning Using High-Rank Feature Map. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00160"},{"key":"ref_107","unstructured":"Yvinec, E., Dapogny, A., Cord, M., and Bailly, K. (2024, January 9\u201315). RED: Looking for Redundancies for Data-Free Structured Compression of Deep Neural Networks. Proceedings of the 35th International Conference on Neural Information Processing Systems (NIPS\u201921), Red Hook, NY, USA. Available online: https:\/\/dl.acm.org\/doi\/10.5555\/3540261.3541857."},{"key":"ref_108","unstructured":"Courbariaux, M., Bengio, Y., and David, J.P. (2015, January 7\u201312). BinaryConnect: Training Deep Neural Networks with Binary Weights during Propagations. Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS\u201915), Montreal, Canada. Available online: https:\/\/dl.acm.org\/doi\/10.5555\/2969442.2969588."},{"key":"ref_109","first-page":"525","article-title":"Xnor-net: Imagenet classification using binary convolutional neural networks","volume":"Volume 9908","author":"Leibe","year":"2016","journal-title":"Proceedings of the European Conference on Computer Vision"},{"key":"ref_110","doi-asserted-by":"crossref","unstructured":"Gholami, A., Kim, S., Dong, Z., Yao, Z., Mahoney, M.W., and Keutzer, K. (2021). A Survey of Quantization Methods for Efficient Neural Network Inference. CoRR, Available online: https:\/\/arxiv.org\/abs\/2103.13630.","DOI":"10.1201\/9781003162810-13"},{"key":"ref_111","unstructured":"Grootendorst, M. (2024, August 04). A Visual Guide to Quantization. Available online: https:\/\/newsletter.maartengrootendorst.com\/p\/a-visual-guide-to-quantization."},{"key":"ref_112","unstructured":"Nagel, M., Amjad, R.A., Van Baalen, M., Louizos, C., and Blankevoort, T. (2020, January 13\u201318). Up or down? Adaptive rounding for post-training quantization. Proceedings of the 37th International Conference on Machine Learning, Virtual."},{"key":"ref_113","unstructured":"Li, Y., Gong, R., Tan, X., Yang, Y., Hu, P., Zhang, Q., Yu, F., Wang, W., and Gu, S. (2021, January 3\u20137). BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction. Proceedings of the International Conference on Learning Representations (ICLR), Virtual. Available online: https:\/\/iclr.cc\/virtual\/2021\/poster\/2596."},{"key":"ref_114","doi-asserted-by":"crossref","unstructured":"Xu, K., Li, Z., Wang, S., and Zhang, X. (2024, January 18\u201322). PTMQ: Post-training Multi-Bit Quantization of Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, Virtual.","DOI":"10.1609\/aaai.v38i14.29553"},{"key":"ref_115","doi-asserted-by":"crossref","first-page":"3245","DOI":"10.1007\/s10994-021-06053-z","article-title":"Loss Aware Post-training Quantization","volume":"110","author":"Nahshan","year":"2021","journal-title":"Mach. Learn."},{"key":"ref_116","unstructured":"Choi, J., Wang, Z., Venkataramani, S., Chuang, P.I.J., Srinivasan, V., and Gopalakrishnan, K. (2018). PACT: Parameterized Clipping Activation for Quantized Neural Networks. arXiv, Available online: https:\/\/arxiv.org\/abs\/1805.06085."},{"key":"ref_117","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1109\/JSSC.2016.2616357","article-title":"Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks","volume":"52","author":"Chen","year":"2017","journal-title":"IEEE J. Solid-State Circuits"},{"key":"ref_118","unstructured":"NVIDIA (2023, April 03). Jetson TX2 Module Online Datasheet. Available online: https:\/\/developer.nvidia.com\/embedded\/jetson-tx2."},{"key":"ref_119","unstructured":"Khadas (2023, April 03). Khadas VIM4 Online Datasheet. Available online: https:\/\/www.khadas.com\/vim4."},{"key":"ref_120","unstructured":"Reddi, V.J., Cheng, C., Kanter, D., Mattson, P., Schmuelling, G., Wu, C.-J., Anderson, B., Breughe, M., Charlebois, M., and Chou, W. (2019). MLPerf Inference Benchmark. CoRR, Available online: https:\/\/arxiv.org\/abs\/1911.02549."},{"key":"ref_121","unstructured":"Allwinner Technology (2023, April 03). A33 Datasheet. Available online: https:\/\/github.com\/allwinner-zh\/documents\/blob\/master\/A33\/A33%20Datasheet%20release%201.1.pdf."},{"key":"ref_122","unstructured":"Intel (2023, April 03). Intel Movidius Myriad X Vision Processing Unit (VPU). Available online: https:\/\/www.intel.com\/content\/www\/us\/en\/products\/details\/processors\/movidius-vpu\/movidius-myriad-x.html."},{"key":"ref_123","doi-asserted-by":"crossref","unstructured":"Liu, P., and Song, Y. (2021). A Hybrid Vision Processing Unit with a Pipelined Workflow for Convolutional Neural Network Accelerating and Image Signal Processing. Electronics, 10.","DOI":"10.3390\/electronics10232989"},{"key":"ref_124","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., and Wojna, Z. (2016, January 27\u201330). Rethinking the Inception Architecture for Computer Vision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.308"},{"key":"ref_125","unstructured":"Teledyne (2023, April 03). FireflyDL Online Datasheet: Neural Networks Supported by the Firefly DL. Available online: https:\/\/www.flir.com\/support-center\/iis\/machine-vision\/application-note\/neural-networks-supported-by-the-firefly-dl\/."},{"key":"ref_126","unstructured":"Nair, D., Pakdaman, A., and Pl\u00f6ger, P.G. (2020). Performance Evaluation of Low-Cost Machine Vision Cameras for Image-Based Grasp Verification. arXiv, Available online: http:\/\/arxiv.org\/abs\/2003.10167."},{"key":"ref_127","unstructured":"GreenWaves Technologies (2023, April 03). GAP8 CNN Benchmarks. Available online: https:\/\/greenwaves-technologies.com\/gap8-cnn-benchmarks\/."},{"key":"ref_128","unstructured":"Himax Technologies, Inc (2023, April 03). Image Sensors by Himax Technologies, Inc. Available online: https:\/\/www.himax.com.tw\/en\/products\/cmos-image-sensor\/image-sensors\/."},{"key":"ref_129","doi-asserted-by":"crossref","first-page":"1088","DOI":"10.1109\/LRA.2018.2795643","article-title":"DroNet: Learning to Fly by Driving","volume":"3","author":"Loquercio","year":"2018","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_130","doi-asserted-by":"crossref","unstructured":"Kagawa, Y., and Iwamoto, H. (, January 8\u201310). 3D Integration Technologies for the Stacked CMOS Image Sensors. Proceedings of the 2019 International 3D Systems Integration Conference (3DIC), Sendai, Japan.","DOI":"10.1109\/3DIC48104.2019.9058895"},{"key":"ref_131","doi-asserted-by":"crossref","unstructured":"Pham, N., Sabuncuoglu Tezcan, D., Jamieson, G., Tutunjyan, N., Peng, L., and Volkaerts, D. (2015, January 2\u20134). 3D Integration Technology Using W2W Direct Bonding and TSV for CMOS Based Image Sensors. Proceedings of the 2015 IEEE 17th Electronics Packaging and Technology Conference (EPTC), Singapore.","DOI":"10.1109\/EPTC.2015.7412378"},{"key":"ref_132","doi-asserted-by":"crossref","unstructured":"Kagawa, Y., Kamibayashi, T., Yamano, Y., Nishio, K., Sakamoto, A., Yamada, T., Shimizu, K., Hirano, T., and Iwamoto, H. (June, January 31). Development of face-to-face and face-to-back ultra-fine pitch Cu-Cu hybrid bonding. Proceedings of the 2022 IEEE 72nd Electronic Components and Technology Conference (ECTC), San Diego, CA, USA.","DOI":"10.1109\/ECTC51906.2022.00057"},{"key":"ref_133","doi-asserted-by":"crossref","unstructured":"Batude, P., Fenouillet-Beranger, C., Pasini, L., Lu, V., Deprat, F., Brunet, L., Sklenard, B., Piegas-Luce, F., Cass\u00e9, M., and Mathieu, B. (2015, January 16\u201318). 3DVLSI with CoolCube process: An alternative path to scaling. Proceedings of the 2015 Symposium on VLSI Technology (VLSI Technology), Kyoto, Japan.","DOI":"10.1109\/VLSIT.2015.7223698"},{"key":"ref_134","doi-asserted-by":"crossref","unstructured":"Kwon, M., Lim, S., Lee, H., Ha, I.S., Kim, M.-Y., Seo, I.-J., Lee, S., Choi, Y., Kim, K., and Lee, H. (2020, January 12\u201314). A Low-Power 65\/14 nm Stacked CMOS Image Sensor. Proceedings of the 2020 IEEE International Symposium on Circuits and Systems (ISCAS), Seville, Spain.","DOI":"10.1109\/ISCAS45731.2020.9180435"},{"key":"ref_135","doi-asserted-by":"crossref","first-page":"723","DOI":"10.1109\/JETCAS.2012.2223552","article-title":"CMOS-3D Smart Imager Architectures for Feature Detection","volume":"2","author":"Suarez","year":"2012","journal-title":"IEEE J. Emerg. Sel. Top. Circuits Syst."},{"key":"ref_136","doi-asserted-by":"crossref","unstructured":"Sukegawa, S., Umebayashi, T., Nakajima, T., Kawanobe, H., Koseki, K., Hirota, I., Haruta, T., Kasai, M., Fukumoto, K., and Wakano, T. (2013, January 17\u201321). A 1\/4-inch 8Mpixel back-illuminated stacked CMOS image sensor. Proceedings of the 2013 IEEE International Solid-State Circuits Conference Digest of Technical Papers, San Francisco, CA, USA.","DOI":"10.1109\/ISSCC.2013.6487825"},{"key":"ref_137","unstructured":"(2023, April 03). TensorFlow. MobileNet_v1, MobileNet_v2, Inception_v1, SSD MobileNet_v1, SSD MobileNet_v2. Available from Tensorflow models. Available online: https:\/\/github.com\/tensorflow\/models\/blob\/master\/research\/object%5Fdetection\/g3doc\/tf1%5Fdetection%5Fzoo.md."},{"key":"ref_138","doi-asserted-by":"crossref","first-page":"663","DOI":"10.1109\/JETCAS.2021.3121259","article-title":"Near-Sensor Distributed DNN Processing for Augmented and Virtual Reality","volume":"11","author":"Pinkham","year":"2021","journal-title":"IEEE J. Emerg. Sel. Top. Circuits Syst."},{"key":"ref_139","unstructured":"Bose, L., Chen, J., Carey, S.J., Dudek, P., and Mayol-Cuevas, W.W. Fully Embedding Fast Convolutional Networks on Pixel Processor Arrays. Proceedings of the European Conference on Computer Vision (ECCV)."},{"key":"ref_140","doi-asserted-by":"crossref","unstructured":"Li, Z., Xu, H., Liu, Z., Luo, L., Wei, Q., and Qiao, F. (2022, January 17\u201320). A 2.17\u03bcW@120fps Ultra-Low-Power Dual-Mode CMOS Image Sensor with Senputing Architecture. Proceedings of the 2022 27th Asia and South Pacific Design Automation Conference (ASP-DAC), Taipei, Taiwan.","DOI":"10.1109\/ASP-DAC52403.2022.9712591"},{"key":"ref_141","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1109\/MSP.2012.2211477","article-title":"The MNIST Database of Handwritten Digit Images for Machine Learning Research","volume":"29","author":"Deng","year":"2012","journal-title":"IEEE Signal Process. Mag."},{"key":"ref_142","doi-asserted-by":"crossref","first-page":"1096","DOI":"10.1109\/JSSC.2018.2886325","article-title":"A 5500-Frames\/s 85-GOPS\/W 3-D Stacked BSI Vision Chip Based on Parallel in-Focal-Plane Acquisition and Processing","volume":"54","author":"Millet","year":"2019","journal-title":"IEEE J. Solid-State Circuits"},{"key":"ref_143","doi-asserted-by":"crossref","unstructured":"Kao, S.C., Pellauer, M., Parashar, A., and Krishna, T. (2022, January 14\u201323). DiGamma: Domain-aware Genetic Algorithm for HW-Mapping Co-optimization for DNN Accelerators. Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition (DATE), Antwerp, Belgium. Available online: https:\/\/dl.acm.org\/doi\/abs\/10.5555\/3539845.3539906.","DOI":"10.23919\/DATE54114.2022.9774568"},{"key":"ref_144","doi-asserted-by":"crossref","first-page":"1160","DOI":"10.1109\/TC.2021.3059962","article-title":"ZigZag: Enlarging Joint Architecture-Mapping Design Space Exploration for DNN Accelerators","volume":"70","author":"Mei","year":"2021","journal-title":"IEEE Trans. Comput."},{"key":"ref_145","doi-asserted-by":"crossref","unstructured":"Kao, S.C., Jeong, G., and Krishna, T. (2020, January 17\u201321). ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators Using Reinforcement Learning. Proceedings of the 2020 53rd Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO), Athens, Greece.","DOI":"10.1109\/MICRO50266.2020.00058"},{"key":"ref_146","doi-asserted-by":"crossref","unstructured":"Parashar, A., Raina, P., Shao, Y.S., Chen, Y.H., Ying, V.A., Mukkara, A., Venkatesan, R., Khailany, B., Keckler, S.W., and Emer, J. (2019, January 24\u201326). Timeloop: A Systematic Approach to DNN Accelerator Evaluation. Proceedings of the 2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Madison, WI, USA.","DOI":"10.1109\/ISPASS.2019.00042"},{"key":"ref_147","doi-asserted-by":"crossref","unstructured":"Wu, Y.N., Tsai, P.A., Parashar, A., Sze, V., and Emer, J.S. (2022, January 1\u20135). Sparseloop: An Analytical Approach to Sparse Tensor Accelerator Modeling. Proceedings of the 2022 55th IEEE\/ACM International Symposium on Microarchitecture (MICRO), Chicago, IL, USA.","DOI":"10.1109\/MICRO56248.2022.00096"},{"key":"ref_148","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1007\/s42514-022-00093-0","article-title":"Survey on chiplets: Interface, interconnect and integration methodology","volume":"4","author":"Ma","year":"2022","journal-title":"CCF Trans. High Perform. Comput."},{"key":"ref_149","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1109\/MSSC.2021.3111386","article-title":"2.5D and 3D Heterogeneous Integration: Emerging applications","volume":"13","author":"Sheikh","year":"2021","journal-title":"IEEE Solid-State Circuits Mag."},{"key":"ref_150","doi-asserted-by":"crossref","unstructured":"Lenihan, T.G., Matthew, L., and Vardaman, E.J. (2013, January 11\u201313). Developments in 2.5D: The role of silicon interposers. Proceedings of the 2013 IEEE 15th Electronics Packaging Technology Conference (EPTC 2013), Singapore.","DOI":"10.1109\/EPTC.2013.6745683"},{"key":"ref_151","doi-asserted-by":"crossref","unstructured":"Stow, D., Akgun, I., Barnes, R., Gu, P., and Xie, Y. (2016, January 11\u201313). Cost and Thermal Analysis of High-Performance 2.5D and 3D Integrated Circuit Design Space. Proceedings of the 2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), Pittsburgh, PA, USA.","DOI":"10.1109\/ISVLSI.2016.133"},{"key":"ref_152","doi-asserted-by":"crossref","unstructured":"Wang, Z., Sun, J., Goksoy, A., Mandal, S.K., Liu, Y., Seo, J., Chakrabarti, C., Ogras, U.Y., Chhabria, V., and Zhang, J. (2024, January 22\u201325). Exploiting 2.5D\/3D Heterogeneous Integration for AI Computing. Proceedings of the 29th Asia and South Pacific Design Automation Conference (ASP-DAC), Incheon, Republic of Korea.","DOI":"10.1109\/ASP-DAC58780.2024.10473875"},{"key":"ref_153","doi-asserted-by":"crossref","first-page":"2424","DOI":"10.1109\/TVLSI.2020.3015494","article-title":"Architecture, Chip, and Package Codesign Flow for Interposer-Based 2.5-D Chiplet Integration Enabling Heterogeneous IP Reuse","volume":"28","author":"Kim","year":"2020","journal-title":"IEEE Trans. Very Large Scale Integr. (VLSI) Syst."},{"key":"ref_154","doi-asserted-by":"crossref","first-page":"35007","DOI":"10.1109\/ACCESS.2023.3264828","article-title":"A Pipelining-Based Heterogeneous Scheduling and Energy-Throughput Optimization Scheme for CNNs Leveraging Apache TVM","volume":"11","author":"Goossens","year":"2023","journal-title":"IEEE Access"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/16\/5446\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T15:41:46Z","timestamp":1760110906000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/16\/5446"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8,22]]},"references-count":154,"journal-issue":{"issue":"16","published-online":{"date-parts":[[2024,8]]}},"alternative-id":["s24165446"],"URL":"https:\/\/doi.org\/10.3390\/s24165446","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,8,22]]}}}