Browse > Article
http://dx.doi.org/10.14699/kbiblia.2021.32.1.133

A Study on Applicability of Machine Learning for Book Classification of Public Libraries: Focusing on Social Science and Arts  

Kwak, Chul Wan (강남대학교 산업데이터사이언스학부 데이터사이언스전공)
Publication Information
Journal of the Korean BIBLIA Society for library and Information Science / v.32, no.1, 2021 , pp. 133-150 More about this Journal
Abstract
The purpose of this study is to identify the applicability of machine learning targeting titles in the classification of books in public libraries. Data analysis was performed using Python's scikit-learn library through the Jupiter notebook of the Anaconda platform. KoNLPy analyzer and Okt class were used for Hangul morpheme analysis. The units of analysis were 2,000 title fields and KDC classification class numbers (300 and 600) extracted from the KORMARC records of public libraries. As a result of analyzing the data using six machine learning models, it showed a possibility of applying machine learning to book classification. Among the models used, the neural network model has the highest accuracy of title classification. The study suggested the need for improving the accuracy of title classification, the need for research on book titles, tokenization of titles, and stop words.
Keywords
Machine Learning; Title Classification; Library Classification; Python; Scikit-learn Library;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Golub, K., Hagelback, J., & Ardo, A. (2018). Automatic classification using DDC on the Swedish Union Catalogue. Proceedings of the 18th European Networked Knowledge Organization Systems (NKOS) Workshop co-located with the 22nd International Conference on Theory and Practice of Digital Libraries 2018 (TPDL 2018). Available: http://ceur-ws.org/Vol-2200/paper1.pdf
2 Muller, A. C. & Guido, S. (2017). Introduction to Machine Learning with Python. 박해선 옮김.(2019). 파이썬 라이브러리를 활용한 머신러닝 (번역개정판). 서울: 한빛미디어.
3 Cho, H. (2017). A experimental study on the development of a book recommendation system using automatic classification, based on the personality type. Journal of Korean Library and Information Science Society, 48(2), 215-236.   DOI
4 Han, M, Rye, J., & Seo, S. (2014). Classification and analysis of emotion in Korean texts using machine learning. The Korean Institute of Information Scientists and Engineers, 1722-1724.
5 Kim, S. & Eom, J. (2008). A study on the documents's automatic classification using machine learning. Journal of Information Management, 39(4), 47-66.   DOI
6 Korean Library Association (2013). Korean Decimal Classification. 6th edition. Seoul: Korean Library Association.
7 Lee, C., Choi, D., Kim, S., & Kang, J. (2013). Classification and analysis of emotion in Korean microblog texts. Journal of KISS: Databases, 40(3), 159-167.
8 Aggarwal, C. C. (2018). Machine Learning. Cham: Springer.