http://dx.doi.org/10.6109/jkiice.2021.25.9.1199

Text Mining and Visualization of Unstructured Data Using Big Data Analytical Tool R  

Nam, Soo-Tai (Institute of General Education, Pusan National University)
Shin, Seong-Yoon (School of Computer Information & Communication Engineering, Kunsan National University)
Jin, Chan-Yong (Division of Information & Electronic Commerce, Wonkwang University)
Abstract
In the era of big data, it is very important to effectively analyze not only structured data that is well organized in databases, but also unstructured big data such as web documents, e-mails, and social data generated in real time on the Internet, on social network services, and in mobile environments. Big data analysis is the process of creating new value by discovering meaningful new correlations, patterns, and trends in big data held in data storage. We summarize and visualize the results of a frequency analysis of unstructured article data using the R language, a big data analysis tool. The data analyzed in this study consist of a total of 104 papers published in Mon-May 2021 in the journals of the Korea Institute of Information and Communication Engineering. In the final analysis results, the most frequently mentioned keyword was "Data", which ranked first with 1,538 occurrences. Based on the results of the analysis, the limitations of the study and its theoretical implications are suggested.
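The keyword frequency analysis described in the abstract can be illustrated with a minimal sketch. It is written in Python rather than the R language used in the study, and it is not the authors' pipeline: the function name `keyword_frequencies`, the stop-word list, and the sample sentences are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the study's actual filtering rules are not given.
STOP_WORDS = {"the", "of", "and", "in", "a", "is", "to", "for"}


def keyword_frequencies(documents, top_n=5):
    """Count how often each word occurs across all documents."""
    counts = Counter()
    for text in documents:
        # Lowercase the text and keep alphabetic tokens only.
        tokens = re.findall(r"[a-z]+", text.lower())
        counts.update(t for t in tokens if t not in STOP_WORDS)
    # Return the top_n (word, count) pairs, most frequent first.
    return counts.most_common(top_n)


# Toy input standing in for the article texts (not the study's data).
sample_docs = [
    "Big data analysis discovers patterns in data.",
    "Text mining turns unstructured data into keywords.",
]

if __name__ == "__main__":
    for word, count in keyword_frequencies(sample_docs):
        print(f"{word}: {count}")
```

A ranked list of this kind is the usual input to a word cloud or bar chart, which is one way the visualization step mentioned above could be fed.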
Keywords
Network analysis; Unstructured data; Big data; Association analysis; Text mining;