DOI QR코드

DOI QR Code

Research of Topic Analysis for Extracting the Relationship between Science Data

과학기술용어 간 관계 도출을 위한 토픽 분석 연구

  • Kim, Mucheol (Department of Multimedia, Sungkyul University)
  • Received : 2016.02.04
  • Accepted : 2016.02.22
  • Published : 2016.02.28

Abstract

With the development of web, amount of information are generated in social web. Then many researchers are focused on the extracting and analyzing social issues from various social data. The proposed approach performed gathering the science data and analyzing with LDA algorithm. It generated the clusters which represent the social topics related to 'health'. As a result, we could deduce the relationship between science data and social issues.

웹의 발달과 함께 많은 정보들이 쏟아지기 시작했다. 그에 따라서 사회 이슈들을 소셜 데이터로부터 추출하고, 이에 대한 해결 방법을 모색하는 연구에 대한 관심이 많아지고 있다. 이에 본 연구에서는 과학기술문헌들을 수집하고, 분석해서 이슈 토픽 별로 군집화 하는 연구를 수행한다. 이를 위해서 보건분야의 주요 용어들을 중심으로 수집하고, 효과적인 분석을 위한 데이터 처리 및 토픽들을 중심으로 군집화 연구를 수행한다. 그 결과, 연구 이슈들을 도출하고 사회 현상에 대한 해결 방안을 마련할 수 있는 토대를 구축하고자 한다.

Keywords

References

  1. Wang, R., Liu, W., and McDonald, C., "Corpus-independent Generic Keyphrase Extraction Using Word Embedding Vectors," Software Engineering Research Conference, 2014.
  2. Jung, D., Kim, J., Kim, K., Hur, J., Ohn, B., and Kang, M., "A Proposal of a Keyword Extraction System for Detecting Social Issues," Journal of Intelligence and Information Systems, Vol. 19, No. 3, pp. 1-23, 2013. https://doi.org/10.13088/jiis.2013.19.3.001
  3. Hyun, Y., Han, H., Choi, H., Park, J., Lee, K., Kwak, K., and Kim, N., "Methodology Using Text Analysis for Packaging R&D Information Services on Pending National Issues," Journal of Information Technology Applications and Management, pp. 231-257, 2013.
  4. Kang, N., Cho, M., and Kwon, O., "A Relation Analysis between NDSL User Queries and Technical Terms," Journal of Information Management, Vol. 39, No. 3, pp. 163-177, 2008. https://doi.org/10.1633/JIM.2008.39.3.163
  5. Park, J. and Song, M., "A Study on the Research Trends in Library & Information Science in Korea using Topic Modeling," Journal of the Korean Society for Information Management, Vol. 30, No. 1, pp. 7-32, 2013. https://doi.org/10.3743/KOSIM.2013.30.1.007
  6. Kim, K. and Park, C., "Analysis of English abstracts in Journal of the Korean Data & Information Science Society using topicmodels and social network analysis," Journal of the Korean Data and Information Science Society, Vol. 26, No. 1, pp. 151-159, 2015. https://doi.org/10.7465/jkdi.2015.26.1.151
  7. NDSL, http://www.ndsl.kr.
  8. Blei, D., A. Ng, M. Jordan, and J. Lafferty, "Latent Dirichlet Allocations," Journal of Machine Learning Research, Vol. 3, No. 4-5, pp. 993-1022, 2003.
  9. Blei, D. M., "Probabilistic topic models," Communications of the ACM, Vol. 55, No. 4, pp. 77-84, 2012. https://doi.org/10.1145/2133806.2133826
  10. Misra, H., Anuj K. G., and Jose, J. M., "Topic Modeling for Content Based Image Retrieval," Multimedia Processing, Communication and Computing Applications. Springer India, pp. 63-76, 2013.
  11. Doulaty, M., Saz, O., and Hain, T., "Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition," in Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 2015.
  12. Kim, S. and Kim., H., "Keyword Extraction from News Corpus using Modified TF-IDF," The Journal of Society for e-Business Studies, Vol. 14, No. 4, pp. 59-73, 2009.
  13. Kim, M., Seo, J., Noh, S., and Han, S., "Identity management‐based social trust model for mediating information sharing and privacy enhancement," Security and Communication Networks, Vol. 5, No. 8, pp. 887-897, 2012. https://doi.org/10.1002/sec.379
  14. Oh, S., "A Model for Ranking Semantic Associations in a Social Network," The Journal of Society for e-Business Studies, Vol. 18, No. 3, pp. 93-105, 2013. https://doi.org/10.7838/jsebs.2013.18.3.093
  15. Kim, J., Kim, N., Cho, Y., "User-Perspective Issue Clustering Using Multi-Layered Two-Mode Network Analysis," Journal of Intelligence and Information Systems, Vol. 20, No. 2, pp. 93-107, 2014. https://doi.org/10.13088/JIIS.2014.20.2.093
  16. Gupta, S. and Manning, C. D., "Analyzing the Dynamics of Research by Extracting Key Aspects of Scientific Papers," In IJCNLP (pp. 1-9), 2011.
  17. Teh, Y. W., Newman, D., and Welling, M., "A collapsed variational Bayesian inference algorithm for latent Dirichlet allocation," In Advances in Neural Information Processing Systems, pp. 1353-1360, 2006.

Cited by

  1. 국내 학술논문 주제 분류 알고리즘 비교 및 분석 vol.18, pp.8, 2018, https://doi.org/10.5392/jkca.2018.18.08.178