DOI QR코드

DOI QR Code

Reorganizing Social Issues from R&D Perspective Using Social Network Analysis

  • Shun Wong, William Xiu (Graduate School of Business Information Technology, Kookmin University) ;
  • Kim, Namgyu (School of Management Information Systems, Kookmin University)
  • 투고 : 2015.08.25
  • 심사 : 2015.09.16
  • 발행 : 2015.09.30

초록

The rapid development of internet technologies and social media over the last few years has generated a huge amount of unstructured text data, which contains a great deal of valuable information and issues. Therefore, text mining-extracting meaningful information from unstructured text data-has gained attention from many researchers in various fields. Topic analysis is a text mining application that is used to determine the main issues in a large volume of text documents. However, it is difficult to identify related issues or meaningful insights as the number of issues derived through topic analysis is too large. Furthermore, traditional issue-clustering methods can only be performed based on the co-occurrence frequency of issue keywords in many documents. Therefore, an association between issues that have a low co-occurrence frequency cannot be recognized using traditional issue-clustering methods, even if those issues are strongly related in other perspectives. Therefore, in this research, a methodology to reorganize social issues from a research and development (R&D) perspective using social network analysis is proposed. Using an R&D perspective lexicon, issues that consistently share the same R&D keywords can be further identified through social network analysis. In this study, the R&D keywords that are associated with a particular issue imply the key technology elements that are needed to solve a particular issue. Issue clustering can then be performed based on the analysis results. Furthermore, the relationship between issues that share the same R&D keywords can be reorganized more systematically, by grouping them into clusters according to the R&D perspective lexicon. We expect that our methodology will contribute to establishing efficient R&D investment policies at the national level by enhancing the reusability of R&D knowledge, based on issue clustering using the R&D perspective lexicon. In addition, business companies could also utilize the results by aligning the R&D with their business strategy plans, to help companies develop innovative products and new technologies that sustain innovative business models.

키워드

참고문헌

  1. Agrawal, R. and Batra, M., "A detailed study on text mining techniques", International Journal of Soft Computing and Engineering, Vol. 2, No. 6, 2013, pp. 2231- 2307.
  2. Aizawa, A., "An information-theoretic perspective of tf-idf measures", Information Processing and Management, Vol. 39, No. 1, 2003, pp. 45-65. https://doi.org/10.1016/S0306-4573(02)00021-3
  3. Albright, R., Taming text with the SVD, SAS Institute Inc., 2004.
  4. Bae, J., Shon, J., and Song, M., "Analysis of twitter for 2012 South Korea presidential election by text mining techniques", Journal of Information Technology Applications and Management, Vol. 19, No. 3, 2013, pp. 141-156.
  5. Barbakh, W. A., Wu, Y., and Fyfe, C., Nonstandard parameter adaption for exploratory data analysis, Springer Berlin Heidelberg, 2009, pp. 1-6.
  6. Borgatti, S. P. and Everett, M. G., "Network analysis of 2-mode data", Social Networks, Vol. 19, 1997, pp. 243-269. https://doi.org/10.1016/S0378-8733(96)00301-2
  7. Chen, S. Y. and Liu, X., "The contribution of data mining to information science", Journal of Information Science, Vol. 30, No. 6, 2004, pp. 550-558. https://doi.org/10.1177/0165551504047928
  8. Cho, I. and Kim, N., "Recommending core and connecting keywords of research area using social network and data mining techniques", Journal of Intelligence and Information Systems, Vol. 17, No. 1, 2011, pp. 127-138.
  9. Choi, C., "Research on informal organizational network: Social network analysis", Korea Society and Public Administration, Vol. 17, No. 1, 2006, pp. 1-23.
  10. Duan, L., Xu, L., Liu, Y., and Lee, J., "Cluster-based outlier detection", Annals of Operations Research, Vol. 168, No. 1, 2009, pp. 151-168. https://doi.org/10.1007/s10479-008-0371-9
  11. Everett, M. G. and Borgatti, S. P., "The dual- projection approach for two-mode networks", Social Networks, Vol. 35, No. 2, 2013, pp. 204-210. https://doi.org/10.1016/j.socnet.2012.05.004
  12. Fan, W., Wallace, W., Rich, S., and Zhang, Z., "Tapping the power of text mining", Communications of the ACM, Vol. 49, No. 9, 2006, pp. 76-82. https://doi.org/10.1145/1151030.1151032
  13. Feldman, R. and Sanger, J., The text mining handbook: Advanced approaches in analyzing unstructured data, Cambridge University Press, 2007.
  14. Han, J., Kamber, M., and Pei, J., Data mining: Concepts and techniques, (3rd ed.), Morgan Kaufmann Publishers, 2011.
  15. Hearst, M. A., "Untangling text data mining" in Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, ACL '99, ACL, 1999, pp. 3-10.
  16. Holton, C., "Identifying disgruntled employee systems fraud risk through text mining: A simple solution for a multi-billion dollar problem", Decision Support Systems, Vol. 46, No. 4, 2009, pp. 853-864. https://doi.org/10.1016/j.dss.2008.11.013
  17. Hong, S., Social network world and big data applications, Seoul, Powerbook, 2013, pp. 235-238.
  18. Hyun, Y., Han, H., Choi, H., Park, J., Lee, K., Kwahk, K-Y., and Kim, N., "Methodology using text analysis for packaging R&D information services on pending national issues", Journal of Information Technology Applications and Management, Vol. 20, 2013, pp. 231-257.
  19. Jain, A. K., Duin, R. P. W., and Mao, J., "Statistical pattern recognition: A review", Pattern Analysis and Machine Intelligence, IEEE Transactions on, Vol. 22, No. 1, 2000, pp. 4-37. https://doi.org/10.1109/34.824819
  20. Jain, A. K., Murty, M. N., and Flynn, P. J., "Data clustering: A review", ACM Computing Surveys (CSUR), Vol. 31, No. 3, 1999, pp. 264-323. https://doi.org/10.1145/331499.331504
  21. Kang, M. and Hau, Y. S., "Multi-level analysis of the antecedents of knowledge transfer: Integration of social capital theory and social network theory", Asia Pacific Journal of Information Systems, Vol. 22, No. 3, 2012, pp. 75-97.
  22. Kim, I., "The Value of Big Data and Strategy", in 2012 Big Data Search Analysis Technology Insight, 2012.
  23. Kim, Y. H., Social network analysis, Seoul, 2007.
  24. Kwahk, K-Y., Social network analysis, Cheongram, Seoul, 2014.
  25. Latapy, M., Magnien, C., and Del Vecchio, N., "Basic notions for the analysis of large two-mode networks", Social Networks, Vol. 30, No. 1, 2008, pp. 31-48. https://doi.org/10.1016/j.socnet.2007.04.006
  26. Li, J., Wang, K., and Xu, L., "Chameleon based on clustering feature tree and its application in customer segmentation", Annals of Operation Research, Vol. 168, No. 1, 2009, pp. 225-245. https://doi.org/10.1007/s10479-008-0368-4
  27. Liebowitz, J., Business analytics: An introduction, CRC Press, 2013.
  28. Lin, F. R., Hsieh, L. S., and Chuang, F. T., "Discovering genres of online discussion threads via text mining", Computer and Education, Vol. 52, No. 2, 2009, pp. 481-495. https://doi.org/10.1016/j.compedu.2008.10.005
  29. Liu, B., Sentiment analysis and opinion mining, Morgan and Claypool Publishers, 2012.
  30. Mooney, R. J. and Bunescu, R., "Mining knowledge from text using information extraction", ACM SIGKDD Exploration, Vol. 7, No. 1, 2005, pp. 3-10.
  31. Myung, J., Lee, D., and Lee, S., "A Korean product review analysis system using a semi-automatically constructed semantic dictionary", Journal of KIISE: Software and Applications, Vol. 35, No. 6, 2008, pp. 347-405.
  32. Nagaraj, R., Thiagarasu, V., and Vijayakumar, P., "A novel semantic level text classification by combining NLP and Thesaurus concepts", IOSR Journal of Computer Engineering, Vol. 16, No. 4, 2014, pp. 14-26. https://doi.org/10.9790/0661-16461426
  33. Opsahl, T., "Triadic closure in two-mode networks: Redefining the global and local clustering coefficients", Social Networks, Vol. 35, No. 2, 2013, pp. 159-167. https://doi.org/10.1016/j.socnet.2011.07.001
  34. Provost, F. and Fawcett, T., Data science for business, O'Reilly Media, 2013.
  35. Punj, G. and Stewart, D. W., "Cluster analysis in marketing research: Review and suggestions for application", Journal of Marketing Research, Vol. 20, No. 2, 1983, pp. 134-148. https://doi.org/10.2307/3151680
  36. Romero, C., Ventura, S., and Garcia, E., "Data mining in course management systems: Moodle case study and tutorial", Computer and Education, Vol. 51, No. 1, 2008, pp. 368- 384. https://doi.org/10.1016/j.compedu.2007.05.016
  37. Salton, G., Wong, A., and Yang, C. S., "A vector space model for automatic indexing", Communications of the ACM, Vol. 18, No. 11, 1975, pp. 613-620. https://doi.org/10.1145/361219.361220
  38. Sebastiani, F., "Classification of text", Automatic, the Encyclopedia of Language and Linguistics, (2nd ed.), Vol. 14, Elsevier Science Pub, 2006.
  39. Sivanandini, L. D. and Raj, M. M., "A survey on data clustering algorithm based on fuzzy techniques", International Journal of Science and Research, Vol. 2, No. 4, 2013, pp. 246- 251.
  40. Stanvrianou, A., Andritsos, P., and Nicoloyannis, N., "Overview and semantic issues of text mining", ACM SIGMOD Record, Vol. 36, No. 3, 2007, pp. 23-24. https://doi.org/10.1145/1324185.1324190
  41. Wang, T., Krim, H., and Viniotis, Y., "A generalized Markov graph model: Application to social network analysis", IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, 2013, pp. 318-332. https://doi.org/10.1109/JSTSP.2013.2246767
  42. Witten, I. H., "Text mining", Practical Handbook of Internet Computing, CRC Press, 2004.
  43. Yoon, S., "A study of churn prediction model for department store customers using data mining technique", Asia Marketing Journal, Vol. 6, No. 4, 2005, pp. 45-72.
  44. Zeng, L., Li, L., and Duan, L., "Business intelligence in enterprise computing environment", Information Technology and Management, Vol. 13, No. 4, 2012, pp. 297-310. https://doi.org/10.1007/s10799-012-0123-z