{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,29]],"date-time":"2026-05-29T11:27:23Z","timestamp":1780054043992,"version":"3.54.0"},"reference-count":25,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2024,4,8]],"date-time":"2024-04-08T00:00:00Z","timestamp":1712534400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,4,8]],"date-time":"2024-04-08T00:00:00Z","timestamp":1712534400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Machine Vision and Applications"],"published-print":{"date-parts":[[2024,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The quality of annotations in the datasets is crucial for supervised machine learning as it significantly affects the performance of models. While many public datasets are widely used, they often suffer from annotations errors, including missing annotations, incorrect bounding box sizes, and positions. It results in low accuracy of machine learning models. However, most researchers have traditionally focused on improving model performance by enhancing algorithms, while overlooking concerns regarding data quality. This so-called model-centric AI approach has been predominant. In contrast, a data-centric AI approach, advocated by Andrew Ng at the DATA and AI Summit 2022, emphasizes enhancing data quality while keeping the model fixed, which proves to be more efficient in improving performance. Building upon this data-centric approach, we propose a method to enhance the quality of public datasets such as MS-COCO and Open Image Dataset. Our approach involves automatically retrieving missing annotations and correcting the size and position of existing bounding boxes in these datasets. Specifically, our study deals with human object detection, which is one of the prominent applications of artificial intelligence. Experimental results demonstrate improved performance with models such as Faster-RCNN, EfficientDet, and RetinaNet. We can achieve up to 32% compared to original datasets in the term of mAP after applying both proposed methods to dataset which is transformed the grouped of instances to individual instance. In summary, our methods significantly enhance the model\u2019s performance by improving the quality of annotations at a lower cost with less time than manual improvement employed in other studies.<\/jats:p>","DOI":"10.1007\/s00138-024-01527-1","type":"journal-article","created":{"date-parts":[[2024,4,8]],"date-time":"2024-04-08T14:02:12Z","timestamp":1712584932000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["The improvement of ground truth annotation in public datasets for human detection"],"prefix":"10.1007","volume":"35","author":[{"given":"Sotheany","family":"Nou","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Joong-Sun","family":"Lee","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nagaaki","family":"Ohyama","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Takashi","family":"Obi","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2024,4,8]]},"reference":[{"key":"1527_CR1","unstructured":"Databricks: Data centric AI development from big data to good data Andrew NG data+AI summit (2022). https:\/\/www.youtube.com\/watch?v=avoijDORAlc"},{"key":"1527_CR2","doi-asserted-by":"crossref","unstructured":"Qiao, S., Chen, L., Yuille, A.: Detectors: detecting objects with recursive feature pyramid and switchable atrous convolution (2021)","DOI":"10.1109\/CVPR46437.2021.01008"},{"key":"1527_CR3","unstructured":"Long, X., et\u00a0al.: Pp\u2013yolo: an effective and efficient implementation of object detector. CoRR (2020). arXiv:abs\/2007.12099"},{"key":"1527_CR4","doi-asserted-by":"crossref","unstructured":"Tan, M., Pang, R., Le, Q.V.: EfficientDet: scalable and efficient object detection (2020)","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"1527_CR5","doi-asserted-by":"crossref","unstructured":"Ma, J., Ushiku, Y., Sagara, M.: The effect of improving annotation quality on object detection datasets: a preliminary study (2022)","DOI":"10.1109\/CVPRW56347.2022.00532"},{"key":"1527_CR6","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., et\u00a0al.: Microsoft coco: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) Computer Vision\u2014ECCV 2014, pp. 740\u2013755. Springer, Cham (2014)","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"1527_CR7","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1007\/s11263-014-0733-5","volume":"111","author":"M Everingham","year":"2015","unstructured":"Everingham, M., et al.: The Pascal visual object classes challenge: a retrospective. Int. J. Comput. Vis. 111, 98\u2013136 (2015)","journal-title":"Int. J. Comput. Vis."},{"key":"1527_CR8","doi-asserted-by":"crossref","unstructured":"Deng, J., et\u00a0al.: ImageNet: a large-scale hierarchical image database (2009)","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"1527_CR9","doi-asserted-by":"publisher","first-page":"1956","DOI":"10.1007\/s11263-020-01316-z","volume":"128","author":"A Kuznetsova","year":"2020","unstructured":"Kuznetsova, A., et al.: The open images dataset v4: unified image classification, object detection, and visual relationship detection at scale. Int. J. Comput. Vis. 128, 1956\u20131981 (2020)","journal-title":"Int. J. Comput. Vis."},{"key":"1527_CR10","unstructured":"Enderes, S.: The impact of annotation errors on neural networks (2021). https:\/\/understand.ai\/blog\/annotation\/machine-learning\/autonomous-driving\/2021\/06\/01\/impact-of-annotation-errors-on-neural-networks.html"},{"key":"1527_CR11","doi-asserted-by":"crossref","unstructured":"Mao, J., Yu, Q., Yamakata, Y., Aizawa, K.: Noisy annotation refinement for object detection. In: Paper Presented at the Conference of the 32nd British Machine Vision Conference (2021)","DOI":"10.1109\/ICIP40778.2020.9190728"},{"key":"1527_CR12","first-page":"691","volume":"39","author":"C-Y Wang","year":"2021","unstructured":"Wang, C.-Y., Yeh, I.-H., Liao, H.: You only learn one representation: unified network for multiple tasks. J. Inf. Sci. Eng. 39, 691\u2013709 (2021)","journal-title":"J. Inf. Sci. Eng."},{"key":"1527_CR13","doi-asserted-by":"crossref","unstructured":"Wang, C., Bochkovskiy, A., Liao, H.: Scaled-yolov4: scaling cross stage partial network (2021)","DOI":"10.1109\/CVPR46437.2021.01283"},{"key":"1527_CR14","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S Ren","year":"2017","unstructured":"Ren, S., He, K., Girshick, R., Sun, J.: Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39, 1137\u20131149 (2017)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"1527_CR15","doi-asserted-by":"crossref","unstructured":"Lin, T., Goyal, P., Girshick, R., He, K., Dollar, P.: Focal loss for dense object detection (2017)","DOI":"10.1109\/ICCV.2017.324"},{"key":"1527_CR16","doi-asserted-by":"crossref","unstructured":"Liu, W., et\u00a0al. SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds) Computer Vision\u2014ECCV 2016, pp. 21\u201337. Springer, Berlin (2016)","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"1527_CR17","unstructured":"Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. CoRR (2018). arXiv:abs\/1804.02767"},{"key":"1527_CR18","doi-asserted-by":"crossref","unstructured":"Carion, N., et\u00a0al.: End-to-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) Computer Vision\u2014ECCV 2020, pp. 213\u2013229. Springer, Cham (2020)","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"1527_CR19","doi-asserted-by":"publisher","first-page":"1373","DOI":"10.1613\/jair.1.12125","volume":"70","author":"CG Northcutt","year":"2021","unstructured":"Northcutt, C.G., Jiang, L., Chuang, I.L.: Confident learning: estimating uncertainty in dataset labels. J. Artif. Intell. Res. (JAIR) 70, 1373\u20131411 (2021)","journal-title":"J. Artif. Intell. Res. (JAIR)"},{"key":"1527_CR20","unstructured":"Krizhevsky, A.: Learning multiple layers of features from tiny images. Master\u2019s Thesis, Department of Computer Science, University of Toronto (2009)"},{"key":"1527_CR21","unstructured":"Xu, M., Bai, Y., Ghanem, B.: Missing labels in object detection. In: Paper Presented at the Conference on Computer Vision and Pattern Recognition (CVPR) Workshops (2019)"},{"key":"1527_CR22","unstructured":"Rosebrock, A.: Intersection over union (IOU) for object detection (2016). https:\/\/pyimagesearch.com\/2016\/11\/07\/intersection-over-union-iou-for-object-detection\/"},{"key":"1527_CR23","doi-asserted-by":"crossref","unstructured":"Wang, C., Bochkovskiy, A., Liao, H.: Yolov7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors (2023)","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"1527_CR24","unstructured":"Zhang, H., et\u00a0al.: Dino: Detr with improved denoising anchor boxes for end-to-end object detection. Comput. Vis. Pattern Recognit. (2022). arXiv:abs\/2203.03605"},{"key":"1527_CR25","unstructured":"Lv, W., et\u00a0al.: Detrs beat yolos on real-time object detection. Comput. Vis. Pattern Recognit. (2023). arXiv:abs\/2304.08069"}],"container-title":["Machine Vision and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00138-024-01527-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00138-024-01527-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00138-024-01527-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,14]],"date-time":"2024-05-14T04:21:30Z","timestamp":1715660490000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00138-024-01527-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,8]]},"references-count":25,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,5]]}},"alternative-id":["1527"],"URL":"https:\/\/doi.org\/10.1007\/s00138-024-01527-1","relation":{},"ISSN":["0932-8092","1432-1769"],"issn-type":[{"value":"0932-8092","type":"print"},{"value":"1432-1769","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,8]]},"assertion":[{"value":"31 May 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 February 2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 March 2024","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 April 2024","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"49"}}