References
-
김판준. 2007, 로치오 알고리즘을 이용한 자동분류에서 용어 가중치 기법.
$\ulcorner$ 문헌정보학논집$\lrcorner$ , 명지대학교, 문헌정보학회, 제9호: 157-185 -
김판준. 200sa. 기계학습을 통한 디스크립터 자동부여에 관한 연구.
$\ulcorner$ 정보관리학회지$\lrcorner$ , 23(1) : 279-299 https://doi.org/10.3743/KOSIM.2006.23.1.279 -
김판준. 2006b. 로치오 알고리즘을 이용한 학술지 논문의 디스크립터 자동부여에 관한연구
$\ulcorner$ 정보관리학회지$\lrcorner$ , 23(3): 69-89 https://doi.org/10.3743/KOSIM.2006.23.3.069 -
이재윤. 2005. 자질 선정 기준과 가중치 할당 방식간의 관계를 고려한 문서 자동분류의개선에 대한 연구
$\ulcorner$ 한국문헌정보학회지$\lrcorner$ , 39(2) 123-146 - 이재윤, 최보영, 정영미. 2000. 문헌 자동분류에서 용어가중치 기법에 대한 연구. 제7회 한국정보관리학회 학술대회 논문집, 2000년 8월 17일 이화여자대학교, pp.41-44
-
정영미. 1993.
$\ulcorner$ 정보검색론$\lrcorner$ . 서울: 구미무역 (주) 출판부 -
Brank, J., M. Grobelnik, N. Milic-Frayling & D. Mladenic. 2002. "Interaction of feature selection methods and linear classification models." In: Proceedings of the ICML-02 Workshop on Text Learning, Sydney. [cited 2007. 5. 3].
- Castillo M. D. and Serrano J. I. 2004. "A multistrategy approach for digital text categorization from imbalanced documents." ACM SIGKDD Explorations Newsletter: Special Issue on Leaning from Imbalanced Datasets, 6(1): 70-79 https://doi.org/10.1145/1007730.1007740
- Debole, Franca and F. Sebastiani. 2003. "Supervised term weighting for automated text categorization." In: Proceedings of SAC-03, 18th ACM Symposium on Applied Computing, New York: ACM. 784-788 https://doi.org/10.1145/952532.952688
- Deng, Zhi-Hong et al. 2004. A Comparative study on feature weight in text categorization. In Proceedings of The Sixth Asia Pacific Web Conference(APWEB 2004), Hangzhou, China, April 14-17, LNCS 3007, 588-597
- Forman G. 2003. "An extensive empirical study of feature selection metrics for text classification." The Journal of Machine Learning Research. Special Issue on Variable and Feature Selection. 3: 1289-1305 https://doi.org/10.1162/153244303322753670
- Geng, L. and Howard J. Hamilton. 2006. "Choosing the right lens: finding what is interesting in data mining." eds. by Guillet, Fabrice, Howard J. Hamilton. Quality Measures in Data Mining. Springer, pp. 3-24
- How, Bong Chih and Narayanan K. 2004. An empirical study of feature selection for text categorization based on term weightage. In Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence. pp.592-602 https://doi.org/10.1109/WI.2004.10060
- Joachims. Thorsten. 1996. "A probabilistic analysis of the rocchio algorithm with TFIDF for text categorization." Proceedings of ICML-97. 14th International Conference on Machine Learning, Nashville. TN: 143-151
- Joachims, Thorsten. 1998. "Text categorization with support vector machines: learning with many relevant features." In: Proceedings of the 10th European Conference on Machine Learning: 137-142
-
Lan, Man. Chew-Lim Tan, and Hwee-Boon LOW. 2006. "Proposing a new term weighting scheme for text categorization." In 21st National Conference on Artificial Intelligence, AAAI-2006, 16-20 July. 2006. Boston. Massachusetts. USA. [cited 2007. 3. 7.].
- Liu, Ying. Han Tong Loh. Kamal Yousef Tourni. and Shu Beng Tor. 2007. "Han dling of imbalanced data in text classification: category- based term weights." In: Kao, Anne and Stephen R. Poteet eds. Natural Language Processing and Text Mining. Springer.. pp.171-192
- Papineni, K. 2001. "Why inverse document frequency?" Proceedings of the North American Association for Computational Linguistics, NAACI New York. pp. 25- 32
- Prabowo. Ruby and Mike Thelwall. 2006. "A comparison of feature selection methods for an evloving RSS feed corpus." Information Processing and Management. 42: 1491-1512 https://doi.org/10.1016/j.ipm.2006.03.018
- Robertson S. 2004. Understanding inverse document frequency: on theoretical arguments for IDF. Journal of Documentation. 60(5): 503-520 https://doi.org/10.1108/00220410410560582
- Robertson. S. E. and K. Sparck Jones. 1976. "Relevance weighting of search terms." JASIS. 27(3): 129-146 https://doi.org/10.1002/asi.4630270302
- Rogati, M and Y. Yang. 2002. High-Performing Feature Selection for Text Classification." In: Proceedings of the eleventh international conference on Information and knowledge management. CIKM '02. [cited 2007. 8. 23.]. < citeseer .ist.psu.edu/rogati02highperforming.html>
- Salton. G.. H. Wu, and C. T. Yu. 1981. "The Measurement of term importance in automatic indexing." JASIS, 32(3): 175-186 https://doi.org/10.1002/asi.4630320304
- Salton. G. and M. J. McGill. 1983. Introduction to Modern Information Retrieval. N. Y.: McGraw-Hill
- Sebastiani. Fabrizio. 2002. "Machine learning in automated text categorization." ACM Computing Surveys, 34(1): 1-47 https://doi.org/10.1145/505282.505283
-
Soucy P. and Guy W. Mineau. 2005. "Beyond TFIDF weighting for text categorization in the vector space model." IJCAI-05 proceedings, 1130-1135. [cited 2007. 2. 20].
- Yang, Y and Liu X. 1999. "A re-examination of text categorization methods." In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval https://doi.org/10.1145/312624.312647
- Yang, Y. 1999. "Evaluation of statistical approaches to text categorization." Information Retrieval. 1: 69-90 https://doi.org/10.1023/A:1009982220290
- Yang. Y. and Pedersen J. O. 1997. "A comparative Study on Feature Selection in Text Categorization." In: Proceedings of ICML-97, 14th International Conference on Machine Learning: 412420 https://doi.org/10.1023/A:1009982220290
- Yu. C. T. and G. Salton 1976. "Precision weighting-an effective automatic indexing method." Journal of Association for Computing Machinery, 23(1): 7688
- Yu. C. T.. K. Lam. and G. Salton. 1982. "Term weighting in information retrieval using the term precision model." Journal of Association for Computing Machinery, 29(1): 152-170 https://doi.org/10.1145/322290.322300
- Zheng Z., X. Wu and R. Srihari. 2004. "Feature selection for text categorization on imbalanced data." ACM SIGKDD Explorations Newsletter: Special Issue on Leaning from Imbalanced Datasets, 6(1): 80-89 https://doi.org/10.1145/1007730.1007741