Browse > Article
http://dx.doi.org/10.5351/KJAS.2015.28.5.847

A Study of Outlier Detection Using the Mixture of Extreme Distributions Based on Deep-Sea Fishery Data  

Lee, Jung Jin (Department of Statistics, Soongsil University)
Kim, Jae Kyoung (Department of Statistics, Soongsil University)
Publication Information
The Korean Journal of Applied Statistics / v.28, no.5, 2015 , pp. 847-858 More about this Journal
Abstract
Deep-sea fishery in the Antarctic Ocean has been actively progressed by the developed countries including Korea. In order to prevent the environmental destruction of the Antarctic Ocean, related countries have established the Commission for the Conservation of Antarctic Marine Living Resources (CCAMLR) and have monitored any illegal unreported or unregulated fishing. Fishing of tooth fish, an expensive fish, in the Antarctic Ocean has increased recently and high catches per unit effort (CPUE) of fishing boats, which is suspicious for an illegal activity, have been frequently reported. The data of CPUEs in a fishing area of the Antarctic Ocean often show an extreme Distribution or a mixture of two extreme distributions. This paper proposes an algorithm to detect an outlier of CPUEs by using the mixture of two extreme distributions. The parameters of the mixture distribution are estimated by the EM algorithm. Log likelihood value and posterior probabilities are used to detect an outlier. Experiments show that the proposed algorithm to detect outlier of the data can be adopted instead of simple criteria such as a CPUE is greater than 1.
Keywords
outlier detection; mixture of extreme distributions;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Aitkin, M. and Wilson, G. T. (1980). Mixture models, outliers, and the EM algorithm. Technometrics, 22, 325-331.   DOI   ScienceOn
2 Kang, C. W., Kang, H. C., Park, S. H., Seung, H. W., Yong, H. S., Lee, D. H., Lee, S. K., Lee, Y. S., Jin, S. H., Choi, J. H. and Han, S. T. (2007). Data Mining Concepts and Methods, Cyprus.
3 Kim, S., Cho, N. W. and Kang, S. H. (2010). Density based outlier detection for massive data analysis. Korean Journal of Management Science, 15, 71-88.
4 Lee, J. J. (2011). Data Mining Using R, SAS and MS-SQL, Freedom Academy.
5 Seo, H. S. and Yoon, M. (2011). Outlier detection using support vector machines. Communications for Statistical Applications and Methods, 18, 171-177   DOI
6 Yong, H. S., Na, Y. M., Seung, H. W., Lee, M. S., Lee, S. J. and Choi, L. (2007). Data Mining, Infinity Books.