참고문헌
- Batista, G. E. A. P. A., Prati, R. C. and Monard, M. C. (2004). A study of the behavior of several methods for balancing machine learning training data, Special Interest Groups Knowledge Discovery in Data Explorations Newsletter, 6, 20-29.
- Breiman, L. (1984). Algorithm CART, California Wadsworth International Group, Belmont, CA., 6, 20-29.
- Breiman, L. (1996). Bagging predictors, Machine Learning., 24, 123-140.
- Breiman, L. (2001). Random forests, Machine Learning, 45, 5-32. https://doi.org/10.1023/A:1010933404324
- Chawla, N. V., Bowyer, W. K., Hall, L. O. and Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique, Journal Of Artificial Intelligence Research , 16, 321-357.
- Chawla, N. V., Cieslak, D., Hall, L. and Joshi, A. (2008). Automatically countering imbalance and its empirical relationship to cost, Data Mining and Knowledge Discovery, 17, 225-252. https://doi.org/10.1007/s10618-008-0087-0
- Chawla, N. V., Japkowicz, N. and Kolcz, A. (2004). Special issue learning imbalanced datasets, Special Interest Groups Knowledge Discovery in Data Explorations Newsletter, 6, 1-6.
- Chawla, N. V., Lazarevic, A., Hall, L. O. and Bowyer, W. K. (2003). Smoteboost: Improving prediction of the minority class in boosting, Seventh European Conference on Principles and Practice of Knowledge Discovery in Databases, 107-119.
- Culp, M., Johnson, K. and Michailidis, G. (2006). Ada : An r package for stochastic boosting, Journal of Statistical Software, 16, 321-357.
- Freitas, A., Costa- Pereira, A. and Brazdil, P. (2007). Cost-sensitive decision trees applied to medical data, Data Warehousing and Knowledge Discovery, 4654, 302-312.
- Freund, Y. and Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences, 55, 119-139. https://doi.org/10.1006/jcss.1997.1504
- Galar, M., Fernandez, A., Barrenechea, E., Bustince, H. and Herrera, F. (2012). A review on ensembles for the class imbalance problem: Bagging-, boosting-, and hybrid-based approaches, Institute of Electrical and Electronics Engineers, 42, 463-484.
- Hido, S., Kashima, H. and Takahashi, Y. (2009). Roughly balanced bagging for imbalanced data, Statistical Analysis and Data Mining, 2, 412-426. https://doi.org/10.1002/sam.10061
- Huang, J. and Ling, C. (2005). Using AUC and accuracy in evaluating learning algorithms, Knowledge and Data Engineering, Institute of Electrical and Electronics Engineers, 17, 299-310.
- Hur, J. and Kim, J. (2007). Decision tree induction with imbalanced data set: A case of health insurance bill audit in a general hospital, Information systems review, 9, 45-65.
- Kang, P. and Cho, S. (2006). EUS SVMS: Ensemble of under-sampled SVMS for data imbalance problems, Lecture Notes in Computer Science, 4232, 837-846.
- Khreich, W., Granger, E., Miri, A. and Sabourin, R. (2010). Iterative boolean combination of classifiers in the roc space: An application to anomaly detection with HMMs, Pattern Recognition, 43, 2732-2752. https://doi.org/10.1016/j.patcog.2010.03.006
- Kim, J. and Jeong, J. (2004). Classification of class-imbalanced data: Effect of over-sampling and undersampling of training data, The Korean Journal of Applied Statistics, 17, 445-457. https://doi.org/10.5351/KJAS.2004.17.3.445
- Kim, J. and Park, H. (2012). Imbalanced data analysis using sampling methods, Inha University.
- Kubat, M., Holte, R. C. and Matwin, S. (1998). Machine learning for the detection of oil spills in satellite radar images, Machine Learning, 30, 195-215. https://doi.org/10.1023/A:1007452223027
- Kubat, M. and Matwin, S. (1997). Addressing the curse of imbalanced data sets: One-sided sampling, Proceedings of the Fourteenth International Conference on Machine Learning, 179-186.
- Ling, C. X. and Li, C. (1998). Data mining for direct marketing: Problems and solutions., Knowledge Discovery in Data-98.
- Liu, X., Wu, J. and Zhou, Z. (2009). Machine learning for the detection of oil spills in satellite radar images, Institute of Electrical and Electronics Engineers, 39, 539-550.
- Mazurowski, M. A., Habas, P. A., Zurada, J. M., Lo, J. Y., Baker, J. A. and Tourassi, G. D. (2008). Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance, Neural Networks, 21, 427-436. https://doi.org/10.1016/j.neunet.2007.12.031
- Nathalie, J. and Shaju, S. (2002). The class imbalance problem: A systematic study, Intelligent Data Analysis, 6, 429-449.
- Newman, D., Hettich, S., Blake, C. and Merz, C. (1998). UCI repository of machine learning databases, Department of Information and Computer Science.
- Oh, J. and Zhang, B. (2001). Kernel perceptron boosting for effective learning of imbalanced data, Proceedings of the Korean Information Science Society Conference, 304-306.
- Sayyad, S. J. and Menzies, T. J. (2005). The promise repository of software engineering databases, http://promise.site.uottawa.ca/SERepository.
- Seiffert, C., Khoshgoftaar, T., Van Hulse, J. and Napolitano, A. (2010). Rusboost: A hybrid approach to alleviating class imbalance, Institute of Electrical and Electronics Engineers, 40, 185-197.
- Wu, G. and Chang, E. (2005). KBA: Kernel boundary alignment considering imbalanced data distribution, Institute of Electrical and Electronics Engineers, 17, 786-795.
- Yang, Q. and Wu, X. (2006). 10 challenging problems in data mining research, International Journal of Information Technology and Decision Making, 5, 597-604. https://doi.org/10.1142/S0219622006002258
- Zadrozny, B. and Elkan, C. (2001). Learning and making decisions when costs and probabilities are both unknown, Knowledge Discovery in Data '01 Proceedings of the seventh Association for Computing Machinery Special Interest Groups Knowledge Discovery in Data international conference on Knowledge discovery and data mining, 204-213.
- Zhu, Z. and Song, Z. (2010). Fault diagnosis based on imbalance modified kernel fisher discriminant analysis, Chemical Engineering Research and Design, 88, 936-951. https://doi.org/10.1016/j.cherd.2010.01.005
피인용 문헌
- Weighted L1-Norm Support Vector Machine for the Classification of Highly Imbalanced Data vol.28, pp.1, 2015, https://doi.org/10.5351/KJAS.2015.28.1.009