Browse > Article
http://dx.doi.org/10.9708/jksci.2022.27.12.077

Small-Scale Object Detection Label Reassignment Strategy  

An, Jung-In (Dept. of Computer Science and Engineering, Kangwon National University)
Kim, Yoon (Dept. of Computer Science and Engineering, Kangwon National University)
Choi, Hyun-Soo (Dept. of Computer Science and Engineering, Kangwon National University, Dept. of Computer Science and Engineering, Seoul National University of Science and Technology)
Abstract
In this paper, we propose a Label Reassignment Strategy to improve the performance of an object detection algorithm. Our approach involves two stages: an inference stage and an assignment stage. In the inference stage, we perform multi-scale inference with predefined scale sizes on a trained model and re-infer masked images to obtain robust classification results. In the assignment stage, we calculate the IoU between bounding boxes to remove duplicates. We also check box and class occurrence between the detection result and annotation label to re-assign the dominant class type. We trained the YOLOX-L model with the re-annotated dataset to validate our strategy. The model achieved a 3.9% improvement in mAP and 3x better performance on AP_S compared to the model trained with the original dataset. Our results demonstrate that the proposed Label Reassignment Strategy can effectively improve the performance of an object detection model.
Keywords
Artificial Intelligence; Object Detection; Dataset Refinement; Dataset Reassignment;
Citations & Related Records
연도 인용수 순위
  • Reference
1 P. Chen, B. Liao, G. Chen, S. Zhang, "Understanding and utilizing deep neural networks trained with noisy labels", in Proceedings of International Conference on Machine Learning (ICML), pp. 1062-1070, Jun. 2019
2 M. Everingham, L. Van Gool, C.K.I. Williams, et al., "The PASCAL Visual Object Classes (VOC) Challenge", International Journal of Computer Vision, Vol. 88, pp. 303-338, Sep. 2009. DOI: 10.1007/s11263-009-0275-4   DOI
3 T. Y. Lin, et al., "Microsoft COCO: Common Objects in Context", Lecture Notes in Computer Science, Vol. 8693, pp. 740-755, Sep. 2014, DOI: 10.1007/978-3-319-10602-1_48   DOI
4 J. Redmon, S. Divvala, R. Girshick, A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779-788, Jun. 2016
5 J. Sukhwan, L. Joowon, K. Mingyun, H. Sungeun, B. Junil, K. Hwajong, "A Study on Helmet Wear Identification Using YOLOv5 Object Recognition Model", The Korean Institute of Communications and Information Sciences (KICS), pp. 1293-1294, Pyeongchang, Korea, Feb. 2022
6 P. Micikevicius, S. Narang, et al., "Mixed Precision Training", International Conference on Learning Representations (ICLR), Poster Session, Canada, May 2018
7 Y. Shen, S. Sanghavi, "Learning with bad training data via iterative trimmed loss minimization", in Proceedings of International Conference on Machine Learning (ICML), pp. 5739-5748, Jun. 2019
8 J. Jaegyung, G. Taehun, K. Gyeongmin, L. Jaemoon, K. Sungyoung, O. Byoung-woo, "Development of Personal Safety Equipment Wearing Confirmation System", Proceedings of KIIT Conference, pp. 664-667, Jeju, Korea, Jun. 2022
9 A. Bochkovskiy, C. Wang, H. M. Liao, "YOLOv4: Optimal Speed and Accuracy of Object Detection", arXiv preprint, arXiv:2004.10934, Apr. 2020
10 J. Redmon, A. Farhadi, "YOLOv3: An Incremental Improvement", arXiv preprint, Apr. 2018
11 Ministry of Employment and Labor, Analysis of the current status of industrial accidents in 2020, https://www.moel.go.kr/info/publict/publictDataView.do?bbs_seq=20211201900
12 C. Northcutt, A. Athalye, J. Mueller, "Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks", Conference on Neural Information Processing Systems (NeurIPS), Dataset and Benchmark Track 1, Virtual Conference, Dec. 2021
13 Z. Ge, S. Liu, F. Wang, Z. Li, J. Sun, "YOLOX: Exceeding YOLO Series in 2021", arXiv preprint, Aug. 2021, DOI: 10.48550/arXiv.2107.08430   DOI
14 J. Redmon, A. Farhadi, "YOLO9000: Better, Faster, Stronger", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7263-7271, Jul. 2017, DOI: 10.1109/CVPR.2017.690   DOI
15 C. Northcutt, L. Jiang, I. Chuang, "Confident Learning: Estimating Uncertainty in Dataset Labels", ACM Journal of Artificial Intelligence Research, Vol. 70, pp. 1373-1411, May 2021, DOI: 10.1613/jair.1.12125   DOI
16 Construction Site Safety Equipment Detection Image Dataset, National Information Society Agency, https://aihub.or.kr/aihubdata/data/view.do?dataSetSn=163