Bio-data Classification using Modified Additive Factor Model

변형된 팩터 분석 모델을 이용한 생체데이타 분류 시스템

  • 조민국 (경북대학교 전자전기컴퓨터학부) ;
  • 박혜영 (경북대학교 전자전기컴퓨터학부)
  • Published : 2007.07.15

Abstract

The bio-data processing is used for a suitable purpose with bio-signals, which are obtained from human individuals. Recently, there is increasing demand that the bio-data has been widely applied to various applications. However, it is often that the number of data within each class is limited and the number of classes is large due to the property of problem domain. Therefore, the conventional pattern recognition systems and classification methods are suffering form low generalization performance because the system using the lack of data is influenced by noises of that. To solve this problem, we propose a modified additive factor model for bio-data generation, with two factors; the class factor which affects properties of each individuals and the environment factor such as noises which affects all classes. We then develop a classification system through defining a new similarity function using the proposed model. The proposed method maximizes to use an information of the class classification. So, we can expect to obtain good generalization performances with robust noises from small number of datas for bio-data. Experimental results show that proposed method outperforms significantly conventional method with real bio-data.

생체데이타 프로세싱이란 인간개체로부터 얻을 수 있는 고유의 생체 신호를 이용하여 다양한 목적으로 사용하는 것으로, 최근 이에 대한 요구가 높아지고 있다. 생체데이타는 도메인의 특성상, 클래스의 수는 많고 해당 클래스 내의 데이타는 상당히 제한적일 수 있어서 그만큼 데이타 내에 포함된 노이즈에 민감하게 된다. 따라서 기존의 패턴 인식과 분류 방법을 그대로 적용하여 개발된 시스템의 경우는 높은 일반화 성능을 기대하기 힘들다. 이를 해결하기 위해 본 논문에서는 생체데이타가 가지는 특성을 고려하여 각 클래스 고유의 특성에 영향을 미치는 클래스 요인과 노이즈와 같이 전체 데이타에 영향을 미치는 환경 요인으로 구성된 변형된 팩터 분석 모델로 생체데이타 생성 모델을 정의한다. 이를 바탕으로 분류에 필요한 데이타간 이격(inter-data discrepancy) 정보를 추출하고 새로운 유사도 함수를 정의하여 분류기에 적용한다. 제안하는 방법은 분류 대상이 되는 클래스의 정보 팔용을 극대화 하여 적은 수의 데이터로부터 노이즈에 강인한 결과를 얻을 수 있다. 실제 생체데이타를 적용한 실험에서 제안하는 방법이 기존의 방법 보다 우수한 분류 성능을 보임을 확인할 수 있었다.

Keywords

References

  1. http://www.biometrics.org
  2. http://www.amia.org
  3. http://bioinformatics.org
  4. R. Rifkin and et al. 'An Analytical Method for Multiclass Molecular Cancer Classification,' SIAM Review, vol.45, issue 4, pp. 706-723, 2003 https://doi.org/10.1137/S0036144502411986
  5. M. Bartlett, and T. Sejnowsky, 'Viewpoint Invariant Face Recognition using Independent Component Analysis and Attractor Networks,' Neural Information Proc. Systems - Natural and Synthetic, vol.9, pp. 817-823, 1997
  6. T. S. Furey et al., 'Support vector machine classification and validation of cancer tissue samples using microarray expression data,' Bioinformatics, vol.16, pp. 906-914, 2000 https://doi.org/10.1093/bioinformatics/16.10.906
  7. R.P. Wildes, 'Iris Recognition: An Emerging Biometric Technology,' Proc. of the IEEE, vol.85, no.9, pp. 1348-1363, 1997 https://doi.org/10.1109/5.628669
  8. John D. Woodward, Jr., Nicholas M. Orlans, Peter T. Higgins, 'BIOMETRICS,' OSBORNE Press. 2003
  9. Thomas Hofmann, Joachim M. Buhmann, 'Pairwise Data Clustering by Deterministic Annealing,' IEEE Trans on PAMI, vol.19, no.1, pp. 1-14, 1997
  10. Johannes Fürnkranz, 'Pairwise Classification as an Ensemble Technique,' LNCS, vol.2430, pp. 97-110, 2002 https://doi.org/10.1007/3-540-36755-1_9
  11. Jacob Goldberger, Sam Roweis, Geoff Hinton, Ruslan Salakhutdinov, 'Neighbourhood Components Analysis,' Advances in Neural Information Processing Systems, vol.17, pp. 513-520, 2004
  12. Kilian Q. Weinberger, John Blitzer, Lawrence K. Saul, 'Distance Metric Learning for Large Margin Nearest Neighbor Classification,' Advances in Neural Information Processing Systems, vol.18, pp. 1473-1480, 2005
  13. Sumit Chopra, Raia Hadsell, Yann LeCun, 'Learning a Similarity Metric Discriminatively, with Application to Face Verification,' Proc. of International Conference on Computer Vision on Pattern Recognition, pp. 539-546, 2005
  14. Mardia, K., Kent, J., & Bibby, J., 'Multivariate analysis. London,' Academic Press. 1979
  15. Bell, A., & Sejnowski, T., 'An informationmaximization approach to blind separation and blind deconvolution', Neural Computation, vol.7(6), pp. 1129-1159, 1995 https://doi.org/10.1162/neco.1995.7.6.1129
  16. Hinton, G.E., & Zemel, R., 'Autoencoders, minimuum description length and Helmholtz free energy,' In J. Cowan, G. Tesauro, and J. Alspector (Eds.), Advances in neural information processing systems, vol.6, pp. 3-10, San Mateo, CA: Morgan Kauffman, 1994
  17. Ghahramani, Z., 'Factorial learning and the EM algorithm,' In G. Tesauro, D. Touretzky, and T. Leen (Eds), Advances in neural information processing systems Vol.7, pp. 617-624. Cambridge, MA: MIT Press, 1995
  18. Hinton, G., Dayan, P., Frey, B., & Neal, R., 'The wake-sleep algorithm for unsupervised neural networks,' Science, vol.268, pp. 1158-1161, 1995 https://doi.org/10.1126/science.7761831
  19. Dayan, P., Hinton, G., Neal, R., & Zemel, R. 'The Helmholtz machine,' Neural Computation, vol.7(5), pp. 889-904, 1995 https://doi.org/10.1162/neco.1995.7.5.889
  20. Hinton, G., & Ghahramani, Z., 'Generative models for discovering sparse distributed representations,' Phil. Trans. Royal Soc. B, vol.352, pp. 1177-1190, 1997 https://doi.org/10.1098/rstb.1997.0101
  21. Joshua B. Tenenbaum, William T. Freeman. 'Separating Style and Content with Bilinear Models,' Neural Computation, vol.12, pp. 1247-1283, 2000 https://doi.org/10.1162/089976600300015349
  22. Tammy Riklin-Raviv and Amnon Shashua, 'The quotient image: class-based rerendering and recognition with varying illuminations,' IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.23, issue 2, pp. 129-139, 2001 https://doi.org/10.1109/34.908964
  23. J.G. Daugman, 'High Confidence Visual Recognition of Persons by a Test of Statistical Independence,' IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.15(11), pp. 1148-1161, 1993 https://doi.org/10.1109/34.244676
  24. Gorsuch, Richard L., 'Factor Analysis,' Erlbaum, 1983
  25. K. Fukunaga, Introduction to Statistical Pattern Recognition, 2ed, Academic Press, 1990
  26. Ethem Alpaydin, 'Introduction to Machine Learning,' MIT Press, 2004
  27. Bernhard Schölkopf, Alexander J. Smola, 'Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond (Adaptive Computation and Machine Learning),' MIT Press, 2001
  28. Gyundo Kee, Kwanyong Lee, Hyeyoung Park, Yillbyung Lee, 'A New Approach to Human Iris Recognition based on Statistical Information Theory,' International Conference on Neural Information Processing, vol.1, pp. 134-139, 2000