Active Selection of Label Data for Semi-Supervised Learning Algorithm

  • Received : 2013.07.18
  • Accepted : 2013.08.21
  • Published : 2013.09.30

Abstract

This paper proposes VCNN (Vector Centroid Neural Network), an unsupervised competitive learning algorithm, for actively selecting the small set of labeled data needed to train a semi-supervised learning algorithm. The choice of labeled data strongly affects the performance of the resulting classifier, and selecting that data is costly and requires expert knowledge. The proposed selection method is evaluated on the UCI database and the Caltech dataset. In these experiments it outperforms conventional label data selection methods, giving stable classification results and the lowest error rate.
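The abstract does not describe how VCNN works internally, so the following is only a minimal sketch of the general idea of centroid-based label selection. It assumes plain k-means (Lloyd's algorithm) as a stand-in for VCNN, and the function names `kmeans` and `select_samples_to_label` are hypothetical. The sketch clusters the unlabeled data and proposes, for each cluster, the sample closest to the centroid as the one worth sending to an annotator.

```python
import numpy as np


def kmeans(X, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means. A stand-in for VCNN, not the paper's algorithm."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct samples.
    centroids = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    for _ in range(n_iter):
        # Squared distance from every sample to every centroid: shape (n, k).
        d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        assign = d2.argmin(axis=1)
        new_centroids = centroids.copy()
        for j in range(k):
            members = X[assign == j]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                new_centroids[j] = members.mean(axis=0)
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return centroids


def select_samples_to_label(X, k, seed=0):
    """Return indices of the samples nearest to each cluster centroid.

    Assumption: one representative per cluster is requested for labeling.
    """
    centroids = kmeans(X, k, seed=seed)
    d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    assign = d2.argmin(axis=1)
    chosen = []
    for j in range(k):
        members = np.flatnonzero(assign == j)
        if len(members) == 0:
            continue  # an empty cluster contributes no candidate
        # Among the cluster's members, pick the one closest to its centroid.
        chosen.append(members[d2[members, j].argmin()])
    return np.array(chosen, dtype=int)


if __name__ == "__main__":
    # Synthetic two-cluster data, used only to exercise the functions.
    rng = np.random.default_rng(1)
    X_demo = np.vstack([rng.normal(0.0, 0.5, (20, 2)),
                        rng.normal(10.0, 0.5, (20, 2))])
    print(select_samples_to_label(X_demo, k=2))
```

The selected indices would then be labeled by hand and passed, together with the remaining unlabeled samples, to whichever semi-supervised learner is in use. Choosing the sample nearest each centroid is one simple selection rule; the paper's own criterion may differ.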
