DOI QR코드

DOI QR Code

A Text Mining Study on Endangered Wildlife Complaints - Discovery of Key Issues through LDA Topic Modeling and Network Analysis -

멸종위기 야생생물 민원 텍스트 마이닝 연구 - LDA 토픽 모델링과 네트워크 분석을 통한 주요 이슈 발굴 -

  • Kim, Na-Yeong (National Institute of Ecology, Research Center for Endangered Species) ;
  • Nam, Hee-Jung (National Institute of Ecology, Research Center for Endangered Species) ;
  • Park, Yong-Su (National Institute of Ecology, Research Center for Endangered Species)
  • 김나영 (국립생태원 멸종위기종복원센터 ) ;
  • 남희정 (국립생태원 멸종위기종복원센터 ) ;
  • 박용수 (국립생태원 멸종위기종복원센터)
  • Received : 2023.10.26
  • Accepted : 2023.11.15
  • Published : 2023.12.30

Abstract

This study aimed to analyze the needs and interests of the public on endangered wildlife using complaint big data. We collected 1,203 complaints and their corresponding text data on endangered wildlife, pre-processed them, and constructed a document-term matrix for 1,739 text data. We performed LDA (Latent Dirichlet Allocation) topic modeling and network analysis. The results revealed that the complaints on endangered wildlife peaked in June-August, and the interest shifted from insects to various endangered wildlife in the living area, such as mammals, birds, and amphibians. In addition, the complaints on endangered wildlife could be categorized into 8 topics and 5 clusters, such as discovery report, habitat protection and response request, information inquiry, investigation and action request, and consultation request. The co-occurrence network analysis for each topic showed that the keywords reflecting the call center reporting procedure, such as photo, send, and take, had high centrality in common, and other keywords such as dung beetle, know, absence and think played an important role in the network. Through this analysis, we identified the main keywords and their relationships within each topic and derived the main issues for each topic. This study confirmed the increasing and diversifying public interest and complaints on endangered wildlife and highlighted the need for professional response. We also suggested developing and extending participatory conservation plans that align with the public's preferences and demands. This study demonstrated the feasibility of using complaint big data on endangered wildlife and its implications for policy decision-making and public promotion on endangered wildlife.

Keywords

References

  1. Blei, D.M. 2012. Probabilistic Topic Models. Communications of the ACM, 55, 77-84. http://dx.doi.org/10.1145/2133806.2133826
  2. Choi CM, Lee JH, Jessica Machado, and Kim GW. 2022. Big-Data-Based Text Mining and Social Network Analysis of Landscape Response to Future Environmental Change Land 11, no. 12: 2183. https://doi.org/10.3390/land11122183
  3. Choi HO. 2016. Study on Selecting Priority Criteria Utilizing Civil Complaint Data in the Field of Environment and Sanitation. Journal of Environmental Policy and Administration, 24(2), 45-57. https://doi.org/10.15301/jepa.2016.24.2.45
  4. Chae HH, Kim YC, Son SW. 2022. Korean and Worldwide Research Trends on Rare Plant and Endemic Plant in Korea. Korean J. Environ. Ecol. 36(3): 257-276, June 2022 https://doi.org/10.13047/KJEE.2022.36.3.257
  5. Chung JM, Kim WJ, Koo CD. 2016. Social Media Big Data Analysis for ICT Policy Agenda in Education. Korea Education and Research Service(KERIS), pp.4
  6. CIVIL PETITIONS TREATMENT ACT article No.2 [Act No. 18742, Jan. 11, 2022]
  7. Do MS, Choi GR, Hwang JW, Lee JY, Hur LH, Choi YS, Son SJ, Kwon IK, Yoo SY, Nam HK. 2020. Research topics and trends of endangered species using text mining in Korea. Journal of Asia-Pacific Biodiversity, 13(4), 2020, 518-523. ISSN 2287-884X, https://doi.org/10.1016/j.japb.2020.09.008
  8. Duberry, J. 2019. Big data and environmental civil society organizations. In Global Environmental Governance in the Information Age:
  9. Jiao Y, Li C, and Lin Y. 2021. Can Urban Environmental Problems Be Accurately Identified? A Complaint Text Mining Method. Applied Sciences, 11(9):4087. https://doi.org/10.3390/app11094087
  10. Jin DY, Kang SW, Han KJ, Kim JH, Kim DY and Kang SA. 2019. A Study on Improving Reflection of Demands for Settling Environmental Issues in Daily Lives: Focusing on the Analysis of Civil Complaint Big Data. Korea Environment Institute, 2019-09
  11. Lee HJ & Sung KJ. 2018. Analysis of Domestic and Foreign Local Biodiversity Strategies and Action Plan (LBSAP) using Semantic Network Analysis. Journal of Environmental Impact Assessment, 27(1), 92-104. https://doi.org/10.14249/EIA.2018.27.1.92
  12. Lee SY. 2021. An Analysis of Civil Complaints about Traffic Policing Using the LDA Model. The Journal of The Korea Institute of Intelligent Transport Systems, 20(4), 57-70. https://doi.org/10.12815/kits.2021.20.4.57
  13. Lim HI, Park GW, and Kim AR. 2023. Challenges in the Forestry Sector for Achieving the Kunming-Montreal Global Biodiversity Framework, a New Leap Forward for Global Biodiversity Conservation. International forest agenda vol.123. National Institute of Forest Science
  14. Lim HJ. 2014. Analysis of Chungnam Province Policy Keywords Using Big Data. Chungnam Development Institute
  15. Ministry of Environment [website]. (2022.12.20.). URL: http://www.me.go.kr/home/web/board/read.do?boardMasterId=1&boardId=1569450&menuId=10525
  16. Ministry of Environment. 2023. National Biodiversity Strategy(2024~2028)
  17. National Institute of Ecology. 2020. National Survey on Public Awareness of Endangered Wildlife-1st
  18. National Institute of Ecology. 2022. National Survey on Public Awareness of Endangered Wildlife-2nd
  19. Park JS & Lee SM. 2020. Big Data Analysis of Busan Civil Affairs Using the LDA Topic Modeling Technique. Informatization policy, 27(2), 66-83. https://doi.org/10.22693/NIAIP.2020.27.2.066
  20. Park KC. 2020. Big data analysis of complaints using text mining techniques: Gangnam-gu. Seoul: Seoul Digital Foundation
  21. Ryu SE, HONG SG, Lee TH, KIM NR. 2018. A Pattern Analysis of Bus Civil Complaint in Busan City Using the Text Network Analysis. Korean Computers and Accounting Review, 16(2), 19-43.
  22. Secretariat of the Convention on Biological Diversity. 2020. Global biodiversity Outlook 5.Montreal
  23. Sung JH & Lee KJ. 2022. An Analysis of Citizen's Complaints in Urban Parks Using Text Mining. Journal of recreation and landscape, 16(4), 99-104
  24. Wallach, Hanna & Murray, Iain & Salakhutdinov, Ruslan & Mimno, David. 2009. Evaluation methods for topic models. Proceedings of the 26th International Conference On Machine Learning, ICML 2009. 382. 139. https://doi.org/10.1145/1553374.1553515.
  25. Yang HJ, Ahn JM, and Lee TH. 2021. A Study of Korean's Experiences of Unfairness Based on Analysis of Text Big Data Posted on the Blue House National Petition. Survey Research, 22(1), 25-59. https://doi.org/10.20997/SR.22.1.2
  26. Yoon J., Kwon K., Yoo, J., and Yoo, N. 2021. Current Status and Future Prospects of Endangered Species Restoration Projects for Freshwater Fishes, Amphibians, and Reptiles in South Korea. Proce dings of the National Institute of Ecology of the Republic of Korea, 2(4), 247-258, https://doi.org/10.22920/PNIE.2021.2.4.24