DOI QR코드

DOI QR Code

Investigation of Research Trends in the D(Data)·N(Network)·A(A.I) Field Using the Dynamic Topic Model

다이나믹 토픽 모델을 활용한 D(Data)·N(Network)·A(A.I) 중심의 연구동향 분석

  • Wo, Chang Woo (Department of Computer Science, Chungbuk National University, SW&Cloud Planning Team, Institute of Information & Communications Technology Planning & Evaluation) ;
  • Lee, Jong Yun (Department of Computer Science, Chungbuk National University)
  • 우창우 (충북대학교 컴퓨터과학과 및 정보통신기획평가원 SW클라우드기획팀) ;
  • 이종연 (충북대학교 소프트웨어학과)
  • Received : 2020.07.24
  • Accepted : 2020.09.20
  • Published : 2020.09.28

Abstract

The Topic Modeling research, the methodology for deduction keyword within literature, has become active with the explosion of data from digital society transition. The research objective is to investigate research trends in D.N.A.(Data, Network, Artificial Intelligence) field using DTM(Dynamic Topic Model). DTM model was applied to the 1,519 of research projects with SW·A.I technology classifications among ICT(Information and Communication Technology) field projects between 6 years(2015~2020). As a result, technology keyword for D.N.A. field; Big data, Cloud, Artificial Intelligence, extended keyword; Unstructured, Edge Computing, Learning, Recognition was appeared every year, and accordingly that the above technology is being researched inclusively from other projects can be inferred. Finally, it is expected that the result from this paper become useful for future policy·R&D planning and corporation's technology·marketing strategy.

최근 디지털 사회의 도래로 다양한 데이터가 폭발적으로 증가하고, 그중 문헌 내 주제어를 도출하는 토픽 모델링에 관한 연구가 활발히 진행되고 있다. 본 논문의 연구목표는 토픽 모델링 방법 중 하나인 DTM(Dynamic Topic Model) 모델을 적용해 D.N.A.(Data, Network, A.I) 분야에 대한 연구동향을 탐색하는데 있다. 실험 데이터는 최근 6년간(2015~2020) ICT(Information and Communication Technology) 분야 중 기술대분류가 SW·AI에 해당하는 연구과제 1,519개 사업에 대해 DTM 모델을 적용하였다. 실험결과로, D.N.A. 분야의 기술 키워드 Big data, Cloud, Artificial Intelligence와 확장된 의미의 기술 키워드 Unstructured, Edge Computing, Learning, Recognition 등이 매년 연구에 표출되었으며, 해당 키워드 들이 특정 연구과제에 종속되지 않고 다른 연구과제에서도 포괄적으로 연구되고 있음을 확인하였다. 끝으로 본 논문의 연구결과는 향후 D.N.A. 분야에 대한 정책기획·과제기획 등 연구개발 기획 과정과 기업의 기술 확보전략·마케팅 전략 등 다양한 곳에 활용될 수 있을 것으로 기대한다.

Keywords

References

  1. H. J. Lee, P. M. Ryu, S. J. Lim, M. K. Jang & H. K. Ki. (2014). Technology Trends of AI for Big Data Knowledge Processing. Electronics and Telecommunications Trends, 29(4), 30-38. DOI : 10.22648/ETRI.2014.J.290404
  2. T. K. Kang, Y. H. Kang, Y. C. Ryoo & T. S. Cheung. (2019). Research Trend in Ultra-Low Latency Networking for Fourth Industrial Revolution. Electronics and Telecommunications Trends, 34(6), 108-122. DOI : 10.22648/ETRI.2019.J.340610
  3. J. Y. Lee & B. S. Cho. (2016). Suggestions for Nurturing Ecosystem to Spur Artificial Intelligence Industry. Electronics and Telecommunications Trends, 31(2), 51-62. DOI : 10.22648/ETRI.2016.J.310206
  4. David M. Blei, Andrew Y. Ng. & Michael I. Jordan. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993-1022. DOI : 10.1162/jmlr.2003.3.4.-5.993
  5. David M. Blei & John D. Lafferty. (2006). Dynamic Topic Models. Proceedings of the 23rd international conference on Machine learning, 113-120. DOI : 10.1145/1143844.1143859
  6. Michal Rosen Zvi, Thomas Griffiths, Mark Steyvers & Padhraic Smyth. (2004). The Author-Topic Model for Authors and Documents. Proceedings of the 20th conference on Uncertainty in artificial intelligence, 487-494.
  7. Wei Li & Andrew McCallum. (2006). Pachinko Allocation : DAG-Structured Mixture Models of Topic Correlations. Proceedings of the 23rd international conference on Machine learning, 577-584. DOI : 10.1145/1143844.1143917
  8. Xuerui Wang & Andrew McCallum. (2006). Topics over time: a non-Markov continuous-time model of topical trends. Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, 424-433. DOI : 10.1145/1150402.1150450
  9. Hamed Jelodar, Yongli Wang, Chi Yuan, Xia Feng, Xiahui Jiang, Yanchao Li & Liang Zhao. (2018). Latent Dirichlet Allocation (LDA) and Topic modeling: models, applications, a survey. Multimedia Tools and Applications, 78, 15169-15211. DOI : 10.1007/s11042-018-6894-4
  10. T. J. Kim. (2020). COVID-19 News Analysis Using News Big Data : Focusing on Topic Modeling Analysis. The Journal of the Korea Contents Association, 20(5), 457-466. DOI : 10.5392/JKCA.2020.20.05.457.
  11. H. C. Lee, J. H. Jang & K. T. Kim. (2020). A Study on the Conflict Structure of the Standing Committee through Topic A[4nalysis of the National Assembly Minutes : Health and Welfare Committee in the First Half of the 20th National Assembly. Korean Party Studies Review, 19(2), 131-167. DOI : 10.30992/KPSR.2020.06.19.2.131
  12. G. C. Kim & H. J. Noh. (2019). Research Trends of Regional Geography Education Using Topic Modeling. Social Studies Education, 58(4), 49-67. DOI : 10.37561/sse.2019.12.58.4.49
  13. W. J. Lee & H. Y. Lee. (2019). A Technology Landscape of Artificial Intelligence: Technological Structure and Firms' Competitive Advantages. Journal of Korea technology innovation society, 22(3), 340-361. DOI : 10.35978/jktis.2019.06.22.3.340
  14. K. W. Cho & Y. W. Woo. (2019). Topic Modeling on Research Trends of Industry 4.0 Using Text Mining. Journal of the Korea Institute of Information and Communication Engineering, 23(7), 764-770. DOI : 10.6109/jkiice.2019.23.7.764
  15. S. I. Hwang & M. K. Kim. (2019). An Analysis of Artificial Intelligence(A.I.) : related Studies' Trends in Korea Focused on Topic Modeling and Semantic Network Analysis. Journal of Digital Contents Society, 20(9), 1847-1855. DOI : 10.9728/dcs.2019.20.9.1847
  16. Scott Deerwester, Susan T Dumais, George W Furnas, Thomas K Landauer & Richard Harshman. (1990). Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6), 391-407. DOI : 10.1002/(SICI)1097-4571(199009)41:6<391
  17. Thomas Hofmann. (2001). Unsupervised learning by probabilistic latent semantic analysis. Machine Learning, 42(1-2), 177-196. DOI : 10.1023/A:1007617005950
  18. Arnab Bhadury, Jianfei Chen, Jun Zhu & Shixia Liu. (2016). Scaling up Dynamic Topic Models. WWW '16 : Proceedings of the 25th International Conference on World Wide Web, 381-390. DOI : 10.1145/2872427.2883046
  19. Lin Liu, Lin Tang, Wen Dong, Shaowen Yao & Wei Zhou. (2016). An Overview of Topic Modeling and its current applications in bioinformatics. Springerplus, 5(1), 1608-1630. DOI : 10.1186/s40064-016-3252-8
  20. Derek Greene & James P. Cross. (2017). Exploring the Political Agenda of the European Parliament Using a Dynamic Topic Modeling Approach. Political Analysis, 25(1), 77-94. DOI : 10.1017/pan.2016.7
  21. Allen H. Huang, Reuven Lehavy, Amy Y. Zang & Rong Zheng. (2017). Analyst Information Discovery and Interpretation Roles: A Topic Modeling Approach. MANAGEMENT SCIENCE, 64(6), 2473-2972. DOI : 10.1287/mnsc.2017.2751
  22. Andra-Selina Pietsch & Stefan Lessmannb. (2018). Topic modeling for analyzing open-ended survey responses, JOURNAL OF BUSINESS ANALYTICS, 1(2), 93-116. DOI : 10.1080/2573234X.2019.1590131
  23. Paulo Bicalho, Marcelo Pita, Gabriel Pedrosa, Anisio Lacerda & Gisele L. Papp. (2017). A general framework to expand short text for topic modeling. Information Sciences, 393, 66-81. DOI : 10.1016/j.ins.2017.02.007
  24. Pooja Kherwa & Poonam Bansal. (2019). Topic Modeling: A Comprehensive Review. EAI Endorsed Transactions on Scalable Information Systems, 7(24), 1-16. DOI : 10.4108/eai.13-7-2018.159623