{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,22]],"date-time":"2026-01-22T00:58:16Z","timestamp":1769043496919,"version":"3.49.0"},"reference-count":28,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2022,1,1]],"date-time":"2022-01-01T00:00:00Z","timestamp":1640995200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>Object detection is a significant activity in computer vision, and various approaches have been proposed to detect varied objects using deep neural networks (DNNs). However, because DNNs are computation-intensive, it is difficult to apply them to resource-constrained devices. Here, we propose an on-device object detection method using domain-specific models. In the proposed method, we define object of interest (OOI) groups that contain objects with a high frequency of appearance in specific domains. Compared with the existing DNN model, the layers of the domain-specific models are shallower and narrower, reducing the number of trainable parameters; thus, speeding up the object detection. To ensure a lightweight network design, we combine various network structures to obtain the best-performing lightweight detection model. The experimental results reveal that the size of the proposed lightweight model is 21.7 MB, which is 91.35% and 36.98% smaller than those of YOLOv3-SPP and Tiny-YOLO, respectively. The f-measure achieved on the MS COCO 2017 dataset were 18.3%, 11.9% and 20.3% higher than those of YOLOv3-SPP, Tiny-YOLO and YOLO-Nano, respectively. The results demonstrated that the lightweight model achieved higher efficiency and better performance on non-GPU devices, such as mobile devices and embedded boards, than conventional models.<\/jats:p>","DOI":"10.3390\/e24010077","type":"journal-article","created":{"date-parts":[[2022,1,3]],"date-time":"2022-01-03T22:51:50Z","timestamp":1641250310000},"page":"77","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Domain-Specific On-Device Object Detection Method"],"prefix":"10.3390","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8306-2028","authenticated-orcid":false,"given":"Seongju","family":"Kang","sequence":"first","affiliation":[{"name":"Department of Electronics and Communications Engineering, Kwangwoon University, Seoul 01897, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jaegi","family":"Hwang","sequence":"additional","affiliation":[{"name":"Department of Electronics and Communications Engineering, Kwangwoon University, Seoul 01897, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kwangsue","family":"Chung","sequence":"additional","affiliation":[{"name":"Department of Electronics and Communications Engineering, Kwangwoon University, Seoul 01897, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,1,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"102897","DOI":"10.1016\/j.cviu.2019.102897","article-title":"Monocular human pose estimation: A survey of deep learning-based methods","volume":"192","author":"Chen","year":"2020","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"100014","DOI":"10.1016\/j.array.2019.100014","article-title":"An Improved Face Recognition Algorithm and Its Application in Attendance Management System","volume":"5","author":"Bah","year":"2020","journal-title":"Array"},{"key":"ref_3","unstructured":"Wang, T., Wu, D.J., Coates, A., and Ng, A.Y. (2012, January 11\u201315). End-to-end Text Recognition with Convolutional Neural Networks. Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), Tsukuba, Ibaraki, Japan."},{"key":"ref_4","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale recognition. arXiv."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhouche, V., and Rabinovich, A. (2014). Going deeper with convolutions. arXiv.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (July, January 26). Deep residual learning for image recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Bucilua, C., Caruana, R., and Niculescu-Mizil, A. (2006, January 20\u201323). Model compression. Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Philadelphia, PA, USA.","DOI":"10.1145\/1150402.1150464"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Wu, J., Leng, C., Wang, Y., Hu, Q., and Cheng, J. (2016, January 27\u201330). Quantized convolutional neural networks for mobile devices. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.521"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Rastegari, M., Ordonez, V., Redmon, J., and Farhadi, A. (2016, January 11\u201314). XNOR-Net: ImageNet classification using binary convolutional neural networks. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46493-0_32"},{"key":"ref_10","unstructured":"Han, S., Mao, H., and Dally, W.J. (2015). Deep compression: Compressing deep neural networks with pruning trained quantization and Huffman coding. arXiv."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.future.2019.04.039","article-title":"Deep learning based mobile data offloading in mobile edge computing systems","volume":"99","author":"Zhao","year":"2019","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Mochizuki, D., Abiko, Y., Saito, T., Ikeda, D., and Mineno, H. (2019). Delay-tolerance-based mobile data offloading using deep reinforcement learning. Sensors, 19.","DOI":"10.3390\/s19071674"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1109\/MNET.2019.1800286","article-title":"In-Edge AI: Intelligentizing mobile edge computing, caching, and communication by federated learning","volume":"33","author":"Wang","year":"2019","journal-title":"IEEE Netw."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1109\/TWC.2019.2946140","article-title":"Edge AI: On-demand accelerating deep neural network inference via edge computing","volume":"19","author":"Li","year":"2020","journal-title":"IEEE Trans. Wireless Commun."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1145\/3371154","article-title":"Adaptive deep learning model selection on embedded systems","volume":"19","author":"Marco","year":"2020","journal-title":"ACM Trans. Embedded Comput. Syst."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Van Der Maaten, L., and Weinberger, K.Q. (2017, January 21\u201326). Densely connected convolutional networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_17","first-page":"740","article-title":"Microsoft COOC: Common objects in context","volume":"8693","author":"Lin","year":"2014","journal-title":"Comput. Vis."},{"key":"ref_18","unstructured":"Zhu, P., Wen, L., Bian, X., Ling, H., and Hu, Q. (2018). Vision meets drones: A challenge. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The Pascal visual object classes (VOC) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L. (2018, January 18\u201322). MobileNetV2: Inverted residuals and linear bottlenecks. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_21","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An incremental improvement. arXiv."},{"key":"ref_22","unstructured":"Pedoeem, J., and Huang, R. (2018, January 10\u201313). YOLO-LITE: A real-time object detection algorithm optimized for non-GPU computers. Proceedings of the 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Wong, A., Famuori, M., Shafiee, J.M., Li, F., Chwyl, B., and Chung, J. (2019). YOLO Nano: A highly compact you only look once convolutional neural network for object detection. arXiv.","DOI":"10.1109\/EMC2-NIPS53020.2019.00013"},{"key":"ref_24","unstructured":"Redmon, J. (2021, April 21). Darknet: Open Source Neural Networks in C. 2013\u20132016. Available online: http:\/\/pjreddie.com\/darknet\/."},{"key":"ref_25","unstructured":"Krizhevsky, A. (2009). Learning multiple layers of features from tiny images. Technol. Rep., Available online: http:\/\/citeseerx.ist.psu.edu\/viewdoc\/download?doi=10.1.1.222.9220&rep=rep1&type=pdf."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., and Zagoruyko, S. (2020). End-to-end object detection with transformers. arXiv.","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"ref_27","unstructured":"Kuznetsova, A., Rom, H., Alldrin, N., Uijlings, J., Krasin, I., Pont-Tuset, J., Kamali, S., Popov, S., Malloci, M., and Kolesnikov, A. (2018). The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale. arXiv."},{"key":"ref_28","first-page":"37","article-title":"Evaluation: From Precision, Recall and F-measure to ROC, Informedness, Markedness and Correlation","volume":"2","author":"David","year":"2011","journal-title":"J. Mach. Learn. Technol."}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/24\/1\/77\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T13:35:53Z","timestamp":1760362553000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/24\/1\/77"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1,1]]},"references-count":28,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,1]]}},"alternative-id":["e24010077"],"URL":"https:\/\/doi.org\/10.3390\/e24010077","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,1,1]]}}}