Browse > Article
http://dx.doi.org/10.5351/KJAS.2016.29.5.935

Analysis of patterns in meteorological research and development using a text-mining algorithm  

Park, Hongju (Department of Applied Statistics, Yonsei University)
Kim, Habin (Department of Statistics, Dongguk University)
Park, Taeyoung (Department of Applied Statistics, Yonsei University)
Lee, Yung-Seop (Department of Statistics, Dongguk University)
Publication Information
The Korean Journal of Applied Statistics / v.29, no.5, 2016 , pp. 935-947 More about this Journal
Abstract
This paper considers the analysis of patterns in meteorological research and development using a text-mining algorithm as the method of analyzing unstructured data. To analyze text data, we define a list of terms related to meteorological research and development, construct times series of a term-document matrix through data preprocessing, and identify terms that have upward or downward patterns over time. The proposed methodology is applied to multi-year plans funded by Korea Meteorological Administration research and development programs from 2011 to 2015.
Keywords
text-mining; term-document matrix; unstructured data; meteorological data;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 Attali, Y. and Burstein, J. (2006). Automated Essay Scoring With $e-rater^{(R)}$ V.2, The Journal of Technology, Learning, and Assessment, 4, Available from: http://www.jtla.org.
2 Bae, K. Y., Park, J. H., Kim, J. S., and Lee, Y. S. (2013). Analysis of the abstracts of research article in food related to climate change using a text-mining algorithm, Journal of the Korean Data and Information Science Society, 24, 1429-1437.   DOI
3 Feinerer, I. (2013). Introduction to the tm package text mining in R, http://CRAN.R-project.org/doc/Rnews/
4 Feinerer, I., Hornik, K., and Meyer, D. (2008). Text mining infrastructure in R, Journal of Statistical Software, 25, 1-54.
5 Goo, J. and Kim, K. (2014). Text mining for Korean: characteristics and application to 2011 Korean Economic Census Data, Korean Journal of Applied Statistics, 27, 1207-1217.   DOI
6 Jeon, H. (2013). KoNLP: Korean NLP package, R package version 0.76, 8.
7 Jeon, H. (2015). Package KoNLP, Available from: https://cran.r-project.org/web/packages/KoNLP/KoNLP.pdf.
8 Jin, S. A., Heo, G. E., Jeong, Y. K., and Song, M. (2013). Topic-network based topic shift detection on twitter, Korea Society for Information Management, 30, 285-302.   DOI
9 Kang, M. M., Kim, S. R., and Park, S. M. (2012). Analysis and utilization of big data, Korea Information Science Society review, 30, 25-32.
10 Srivastava, A. N. and Sahami, M. (2009). Text Mining: Classication, Clustering, and Applications, CRC Press.
11 Zhang, B. T. (2007). Next-generation machine learning technologies, Communications of the Korea Information Science Society, 3, 96-107.