DOI QR코드

DOI QR Code

A Study of a Method for Maintaining Accuracy Uniformity When Using Long-tailed Dataset

불균형 데이터세트 학습에서 정확도 균일화를 위한 학습 방법에 관한 연구

  • Geun-pyo Park (School of Electrical Engineering, Korea University) ;
  • XinYu Piao (School of Electrical Engineering, Korea University) ;
  • Jong-Kook Kim (School of Electrical Engineering, Korea University)
  • 박근표 (고려대학교 전자전자공학과) ;
  • 박흠우 (고려대학교 전기전자공학과) ;
  • 김종국 (고려대학교 전기전자학부)
  • Published : 2023.05.18

Abstract

Long-tailed datasets have an imbalanced distribution because they consist of a different number of data samples for each class. However, there are problems of the performance degradation in tail-classes and class-accuracy imbalance for all classes. To address these problems, this paper suggests a learning method for training of long-tailed dataset. The proposed method uses and combines two methods; one is a resampling method to generate a uniform mini-batch to prevent the performance degradation in tail-classes, and the other is a reweighting method to address the accuracy imbalance problem. The purpose of our proposed method is to train the learning models to have uniform accuracy for each class in a long-tailed dataset.

Keywords