Browse > Article
http://dx.doi.org/10.22693/NIAIP.2020.27.2.066

Big Data Analysis of Busan Civil Affairs Using the LDA Topic Modeling Technique  

Park, Ju-Seop (Smart Governance Research Center, Dong-A University)
Lee, Sae-Mi (Smart Governance Research Center, Dong-A University)
Publication Information
Informatization Policy / v.27, no.2, 2020 , pp. 66-83 More about this Journal
Abstract
Local issues that occur in cities typically garner great attention from the public. While local governments strive to resolve these issues, it is often difficult to effectively eliminate them all, which leads to complaints. In tackling these issues, it is imperative for local governments to use big data to identify the nature of complaints, and proactively provide solutions. This study applies the LDA topic modeling technique to research and analyze trends and patterns in complaints filed online. To this end, 9,625 cases of online complaints submitted to the city of Busan from 2015 to 2017 were analyzed, and 20 topics were identified. From these topics, key topics were singled out, and through analysis of quarterly weighting trends, four "hot" topics(Bus stops, Taxi drivers, Praises, and Administrative handling) and four "cold" topics(CCTV installation, Bus routes, Park facilities including parking, and Festivities issues) were highlighted. The study conducted big data analysis for the identification of trends and patterns in civil affairs and makes an academic impact by encouraging follow-up research. Moreover, the text mining technique used for complaint analysis can be used for other projects requiring big data processing.
Keywords
big data analysis; online filings; local administration; LDA topic modeling; text mining;
Citations & Related Records
Times Cited By KSCI : 8  (Citation Analysis)
연도 인용수 순위
1 Hagen, L., Harrison, T. M., Uzuner, O., May, W., Fake, T. & Katragadda, S. E. (2016). "Petition Popularity: Do Linguistic and Semantic Factors Matter?" Government Information Quarterly, 33(4), 783-795.   DOI
2 Hofmann, T. (2001). "Unsupervised Learning by Probabilistic Latent Semantic Analysis." Machine Learning, 42(1-2), 177-196.   DOI
3 Hu, Y., Boyd-Graber, J., Satinoff, B. & Smith, A. (2014). "Interactive Topic Modeling." Machine Learning, 95(3), 423-469.   DOI
4 Jacobi, C., Atteveldt, W. V. & Welbers, K. (2015). "Quantitative Analysis of Large Amounts of Journalistic Texts Using Topic Modelling." Digital Journalism, 4(1), 89-106.   DOI
5 Jang, B. M. (2015). "Analysis of Public Big Data for Promoting Benefits of Community Residents." Master's Thesis. Kyungpook National University.
6 Kang, K. J. (2019). "Uijeongbu City, Big Data Analysis Project Completion Report Meeting Held." The Financial News. January 21.
7 Kim, G. & Yun, H. (2016). "Topic Modeling Approach to Understand Changes in Customer Perceptions on Hotel Services in Seoul." Journal of Korea Service Management Society, 17(3), 217-231.   DOI
8 Kim, H. W. (2017). "Seoul City, Unfavorable Rate Refund, Civil Service Regulation, 40% Reduction in Corporate Taxi Complaints." Dongyang News Agency, August 13.
9 Korea Data Agency. (2017). 2017 Data Industry White Paper. Seoul: Korea Data Agency.
10 Kim, C. S., Choi, S. J. & Kwahk, K. Y. (2017a). "Investigation of Research Trends in Information Systems Domain Using Topic Modeling and Time Series Regression Analysis." Journal of Digital Contents Society, 18(6), 1143-1150.   DOI
11 Kim, C. S., Kwahk, K. Y. & Yoon, H. J. (2017b). "An Analysis of Research Trends in Tourism Studies: Applying Topic Modeling and Time Series Regression Analysis." Journal of Tourism and Leisure Research, 29(12), 25-39.
12 Kim, J. H., & Chen, W. (2018). "Research Topic Analysis in Engineering Management Using a Latent Dirichlet Allocation Model." Journal of Industrial Integration and Management, 3(4), 1850016.   DOI
13 Kim, K. W. (2018a). "Daegu City Bus Passenger's Biggest Complaint is 'Unkind Bus Driver'." Maeil Shinmun, October 30.
14 Korea Institute of Sports Science (2016). Improvement plan of public sports facility management. Seoul: Korea Institute of Sports Science.
15 Kim, S. K. & Jang, S. Y. (2016). "A Study on the Research Trends in Domestic Industrial and Management Engineering Using Topic Modeling." Journal of the Korea Management Engineers Society, 21(3), 71-95.
16 Kim, Y. H. (2018b). "Incheon Bupyeong-gu Civil Big Data Analysis. 2nd Half Best 7." Maeil Ilbo, February 11.
17 National Information Society Agency. (2015). Strategy for Building Administrative Service Integration Delivery Platform. Seoul: National Information Society Agency.
18 Kwak, J. O. (2016). "Unkind Taxi Driver." The Transportation News Korea, May 31.
19 Lee, J. M., Lee, J. A. & Jeong, J. H. (2017). "The Jeonse Price Forecasting Used by News Big Data - Focusing on Topic Modeling Analysis." Korea Real Estate Academy Review, 69, 43-57.
20 Lee, S. S. (2016). "A Study on the Application of Topic Modeling for the Book Report Text." Journal of Korean Library and Information Science Society, 47(4), 1-18.   DOI
21 Liu, L., Tang, L., Dong, W., Yao, S. & Zhou, W. (2016). "An Overview of Topic Modeling and Its Current Applications in Bioinformatics." Springerplus, 5(1), 1608.   DOI
22 Mannila, H. (2000). "Theoretical Frameworks for Data Mining." ACM SIGKDD Explorations Newsletter, 1(2), 30-32.   DOI
23 Mergel, I., Rethemeyer, R. K. & Isett, K. (2016). "Big data in public affairs." Public Administration Review, 76 (6), 928-937.   DOI
24 Mika, W., Seppo, L. & Mervi, R. (2018). "A Topic Modelling Analysis of Living Labs Research." Technology Innovation Management Review, 8(7), 40-51.   DOI
25 Ministry of the Interior and Safety. (2018). Good Use of Big Data in Civil, Tourism and National Safety. Sejong: Ministry of the Interior and Safety.
26 Na, Y. W., Park, H. J. & Jung, J. W. (2015). "Pattern analysis of environment complaint using the spatial big data." Journal of the Korean Society of Civil Engineers, 63(7), 29-35.
27 Park, D. S., Moon, Y. S., Park, Y. H., Yoon, C. H., Jeong, Y. S. & Jang, H. S. (2014). Big data computing technology. Seoul: Hanbit Academy, Inc.
28 Park, H. J., Kim, H. N. & Hong, Y. J. (2017a). "A Topic Modeling Analysis on the Major Social Issues of the Students' Human Rights Ordinance in Korea." Asian Journal of Education, 18(4), 683-711.   DOI
29 Park, J. S., Hong, S. G. & Kim, J. W. (2017b). "A study on science technology trend and prediction using topic modeling." Journal of the Korea Industrial Information Systems Research, 22(4), 19-28.   DOI
30 Park, S. H., Moon, H. S. & Kim, J. K. (2017c). "Online reviews analysis for prediction of product ratings based on topic modeling." Journal of Information Technology Services, 16(3), 113-125.   DOI
31 Park, W. D. (2016). "Improvement Plan for the Civil Affairs Administration Service based on the Level of Resident Satisfaction." Master's Thesis. Myongji University.
32 Ramirez, E. H., Brena, R., Magatti, D., Stella, F. (2012). "Topic model validation." Neurocomputing, 76(1), 125-133.   DOI
33 Seol, D. H., Ko, J. H., & Yoo, S. H. (2018). "Korean Sociological Association and sociological research: Changes in the areas of sociology in Korea 1964-2017." Korean Journal of Sociology, 52(1), 153-213.   DOI
34 Shi, Z., Lee, G. M., Whinston, A. B. (2016). "Toward a Better Measure of Business Proximity: Topic Modeling for Industry Intelligence." MIS Quarterly, 40(4), 1035-1056.   DOI
35 Shin, H. C. (2009). "Administrative Service Improvement Program of Inhabitants Evaluation." Master's Thesis. Kyungpook National University.
36 Son, N. R. & Kim, S. Y. (2017). "Complaints Statistics and Department of Automated Classifications System through Public Complaints Big Data Analysis." The Journal of Korean Institute of Next Generation Computing, 13(1), 22-35.
37 Song, M. & Kim, S. Y. (2013). "Detecting the Knowledge Structure of Bioinformatics by Mining Full-text Collections." Scientometrics, 96(1), 183-201.   DOI
38 Stylios, G., Christodoulakis, D., Besharat, J., Vonitsanou, M. A., Kotrotsos, I., Koumpouri, A. & Stamou, S. (2010). "Public Opinion Mining for Governmental Decisions." Electronic Journal of e-Government, 8(2), 202-213.
39 Suh, J. H., Park, C. H. & Jeon, S. H. (2010). "Applying Text and Data Mining Techniques to Forecasting the Trend of Petitions Filed to E-People." Expert Systems with Applications, 37(10), 7255-7268.   DOI
40 van der Meer, T. G. (2016). "Automated Content Analysis and Crisis Communication Research." Public Relations Review, 425, 952-961.   DOI
41 Won, T. H. & Yoo, H. H. (2016). "Pattern Analysis for Civil Complaints of Local Governments Using a Text Mining." Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography, 34(3), 319-327.   DOI
42 Yang, H. C. (2018). "Big Data Analysis on Gimpo City bus civil complaint, The Most Frequent Complaint is Nonstop Bus." Kyeong Gi Ilbo, October 11.
43 Yang, H. L., Chang, T. W. & Choi, Y. (2018). "Exploring the Research Trend of Smart Factory with Topic Modeling." Sustainability, 10(8), 2779.   DOI
44 Yoon, J. E. & Suh, C. J. (2018). "Research Trend Analysis on Smart Healthcare by Using Topic Modeling and Ego Network Analysis." Journal of Digital Contents Society, 19(5), 981-993.   DOI
45 Yoon, M. Y. (2013). "Analysis of Major Data Promotion Strategies and Implications." The Journal of Science and Technology Policy, 23(3), 31-43.
46 Yoon, S. Y, & Yoon, D. K. (2017). "A Trends Analysis on Disaster and Safety Management Using Topic Modeling." Journal of Korean Society for Geospatial Information System, 25(3), 75-85.   DOI
47 Yu, Y. L. (2017). "Analysis of Media Coverage on 2015 Revised Curriculum Policy using Big Data Analysis." Doctoral Thesis, Department of Education, Seoul National University.
48 Blei, D. M. (2012). "Probabilistic Topic Models." Communications of the ACM, 55(4), 77-84.   DOI
49 Abuhay, T. M., Nigatie, Y. G. & Kovalchuk, S. V. (2018). "Towards Predicting Trend of Scientific Research Topics Using Topic Modeling." Procedia Computer Science, 136, 304-310.   DOI
50 Alghamdi, R. & Alfalqi, K. A. (2015). "Survey of Topic Modeling in Text Mining." International Journal of Advanced Computer Science and Application, s6(1), 147-153.
51 Blei, D. M., Ng, A. Y. & Jordan, M. (2003). "Latent Dirichlet Allocation." Journal of machine Learning research, 3(Jan), 993-1022.
52 Chang, J., Gerrish, S., Wang, C., Boyd-Graber, J. L. & Blei, D. M. (2009). "Reading Tea Leaves: How Humans Interpret Topic Models." Advances in neural information processing systems, 22, 288-296.
53 Cheng, X., Yan, X., Lan, Y. & Guo, J. (2014). "BTM: Topic Modeling Over Short Texts." IEEE Transactions on Knowledge and Data Engineering, 26(12), 2928-2941.   DOI
54 Cho, T. I. (2016). "Spatiotemporal Characteristics Analysis of Complaints on Officially Assessed Land Price by Big Data Mining." Doctoral Thesis, Department of Civil and Environmental Engineering, Incheon University.
55 Deerwester, S., Dumais, S., Landauer, T., Furnas, G. & Harshman, R. (1990). "Indexing by Latent Semantic Analysis." Journal of the American Society for Information Science, 41(6), 391-407.   DOI
56 DiMaggio, P., Nag, M. & Blei, D. (2013). "Exploiting Affinities Between Topic Modeling and the Sociological Perspective on Culture: Application to Newspaper Coverage of U.S. Government Arts Funding." Poetics, 41(6), 570-606.   DOI
57 Evangelopoulos, N. & Visinescu, L. (2012). "Text-mining the voice of the people." Communications of the ACM, 55(2), 62-69.   DOI
58 Hagen, L. (2018). "Content Analysis of E-petitions with Topic Modeling: How to Train and Evaluate LDA Models?" Information Processing & Management, 54(6), 1292-1307.   DOI