• Title/Summary/Keyword: Degree centrality analysis

Search Result 332, Processing Time 0.032 seconds

Movie Popularity Classification Based on Support Vector Machine Combined with Social Network Analysis

  • Dorjmaa, Tserendulam;Shin, Taeksoo
    • Journal of Information Technology Services
    • /
    • v.16 no.3
    • /
    • pp.167-183
    • /
    • 2017
  • The rapid growth of information technology and mobile service platforms, i.e., internet, google, and facebook, etc. has led the abundance of data. Due to this environment, the world is now facing a revolution in the process that data is searched, collected, stored, and shared. Abundance of data gives us several opportunities to knowledge discovery and data mining techniques. In recent years, data mining methods as a solution to discovery and extraction of available knowledge in database has been more popular in e-commerce service fields such as, in particular, movie recommendation. However, most of the classification approaches for predicting the movie popularity have used only several types of information of the movie such as actor, director, rating score, language and countries etc. In this study, we propose a classification-based support vector machine (SVM) model for predicting the movie popularity based on movie's genre data and social network data. Social network analysis (SNA) is used for improving the classification accuracy. This study builds the movies' network (one mode network) based on initial data which is a two mode network as user-to-movie network. For the proposed method we computed degree centrality, betweenness centrality, closeness centrality, and eigenvector centrality as centrality measures in movie's network. Those four centrality values and movies' genre data were used to classify the movie popularity in this study. The logistic regression, neural network, $na{\ddot{i}}ve$ Bayes classifier, and decision tree as benchmarking models for movie popularity classification were also used for comparison with the performance of our proposed model. To assess the classifier's performance accuracy this study used MovieLens data as an open database. Our empirical results indicate that our proposed model with movie's genre and centrality data has by approximately 0% higher accuracy than other classification models with only movie's genre data. The implications of our results show that our proposed model can be used for improving movie popularity classification accuracy.

A Preliminary Study on the Semantic Network Analysis of Book Report Text (독후감 텍스트의 언어 네트워크 분석에 관한 기초연구)

  • Lee, Soo-Sang
    • Journal of Korean Library and Information Science Society
    • /
    • v.47 no.3
    • /
    • pp.95-114
    • /
    • 2016
  • The purpose of this preliminary study is to collect specific examples of book reports and understand semantic characteristics of them through semantic network. The analysis was conducted with 23 book reports which classified by three groups. The keywords were selected from the of book reports. Five types of keyword network were composed based on co-occurrence relations with keywords. The result of this study is following these. First, each keyword network of book reports of groups and individuals is shown to have different structural characteristics. Second, each network has different high centrality keywords according to the result analysis of 3 types of centrality(degree centrality, closeness centrality, betweenness centrality). These characteristic means that keyword network analysis is useful in recognizing the characteristics of not only groups' and but also individual's book reports.

Research Trends in Global Cruise Industry Using Keyword Network Analysis (키워드 네트워크 분석을 활용한 세계 크루즈산업 연구동향)

  • Jhang, Se-Eun;Lee, Su-Ho
    • Journal of Navigation and Port Research
    • /
    • v.38 no.6
    • /
    • pp.607-614
    • /
    • 2014
  • This article aims to explore and discuss research trends in global cruise industry using keyword network analysis. We visualize keyword networks in each of four groups of 1982-1999, 2000-2004, 2005-2009, 2010-2014 based on the top 20 keyword nodes' degree centrality and betweenness centrality which are selected among four centrality measurements, comparing them with frequency order. The article shows that keyword frequency collected from 240 articles published in international journals is subject to Zipf's law and nodes degree distribution also exhibits power law. We try to find out research trends in global cruise industry to change some important keywords diachronically, visualizing several networks focusing on the top two keywords, cruise and tourism, belonging to all the four year groups, with high degree and betweenness centrality values. Interestingly enough, a new node, China, connecting the top most keywords, appears in the most recent period of 2010-2014 when China has emerged as one of the rapid development countries in global cruise industry. Therefore keyword network analysis used in this article will be useful to understand research trends in global cruise industry because of increase and decrease of numbers of network types in different year groups and the visual connection between important nodes in giant components.

Semantic Network Analysis of Online News and Social Media Text Related to Comprehensive Nursing Care Service (간호간병통합서비스 관련 온라인 기사 및 소셜미디어 빅데이터의 의미연결망 분석)

  • Kim, Minji;Choi, Mona;Youm, Yoosik
    • Journal of Korean Academy of Nursing
    • /
    • v.47 no.6
    • /
    • pp.806-816
    • /
    • 2017
  • Purpose: As comprehensive nursing care service has gradually expanded, it has become necessary to explore the various opinions about it. The purpose of this study is to explore the large amount of text data regarding comprehensive nursing care service extracted from online news and social media by applying a semantic network analysis. Methods: The web pages of the Korean Nurses Association (KNA) News, major daily newspapers, and Twitter were crawled by searching the keyword 'comprehensive nursing care service' using Python. A morphological analysis was performed using KoNLPy. Nodes on a 'comprehensive nursing care service' cluster were selected, and frequency, edge weight, and degree centrality were calculated and visualized with Gephi for the semantic network. Results: A total of 536 news pages and 464 tweets were analyzed. In the KNA News and major daily newspapers, 'nursing workforce' and 'nursing service' were highly rated in frequency, edge weight, and degree centrality. On Twitter, the most frequent nodes were 'National Health Insurance Service' and 'comprehensive nursing care service hospital.' The nodes with the highest edge weight were 'national health insurance,' 'wards without caregiver presence,' and 'caregiving costs.' 'National Health Insurance Service' was highest in degree centrality. Conclusion: This study provides an example of how to use atypical big data for a nursing issue through semantic network analysis to explore diverse perspectives surrounding the nursing community through various media sources. Applying semantic network analysis to online big data to gather information regarding various nursing issues would help to explore opinions for formulating and implementing nursing policies.

Relationship between Genre Centrality and Performance in the Motion Picture Industry (네트워크 중심성과 성과에 관한 연구: 영화산업을 중심으로)

  • Lee, Wonhee;Jung, Dong-Il
    • The Journal of the Korea Contents Association
    • /
    • v.17 no.6
    • /
    • pp.153-168
    • /
    • 2017
  • Existing researches on movie genre have been focusing on the relationship between a specific genre and performance of a movie. However, most of films cross into multiple genres and new approach is needed for analyzing a genre network. In this study social network analysis was used to analyze the genre centrality and its relationship with movie performance by developing a genre network, i.e. network among multiple genres constructed via genre co-occurrence pattern in a specific movie. Three index of genre centrality, eigenvector centrality, degree centrality, and bonacich power centrality, were tested for the valued genre network. Results showed that the relationship between genre centrality and movie performance appeared to be inverted U-shaped. This empirical finding is in line with the theory of ambidexterity which emphasizes the balance of exploration and exploitation. In addition, this study can provide practical implications for movie producers, distributors, and theaters that need to develop genre strategies.

A Study on the Development Trend of Artificial Intelligence Using Text Mining Technique: Focused on Open Source Software Projects on Github (텍스트 마이닝 기법을 활용한 인공지능 기술개발 동향 분석 연구: 깃허브 상의 오픈 소스 소프트웨어 프로젝트를 대상으로)

  • Chong, JiSeon;Kim, Dongsung;Lee, Hong Joo;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.1-19
    • /
    • 2019
  • Artificial intelligence (AI) is one of the main driving forces leading the Fourth Industrial Revolution. The technologies associated with AI have already shown superior abilities that are equal to or better than people in many fields including image and speech recognition. Particularly, many efforts have been actively given to identify the current technology trends and analyze development directions of it, because AI technologies can be utilized in a wide range of fields including medical, financial, manufacturing, service, and education fields. Major platforms that can develop complex AI algorithms for learning, reasoning, and recognition have been open to the public as open source projects. As a result, technologies and services that utilize them have increased rapidly. It has been confirmed as one of the major reasons for the fast development of AI technologies. Additionally, the spread of the technology is greatly in debt to open source software, developed by major global companies, supporting natural language recognition, speech recognition, and image recognition. Therefore, this study aimed to identify the practical trend of AI technology development by analyzing OSS projects associated with AI, which have been developed by the online collaboration of many parties. This study searched and collected a list of major projects related to AI, which were generated from 2000 to July 2018 on Github. This study confirmed the development trends of major technologies in detail by applying text mining technique targeting topic information, which indicates the characteristics of the collected projects and technical fields. The results of the analysis showed that the number of software development projects by year was less than 100 projects per year until 2013. However, it increased to 229 projects in 2014 and 597 projects in 2015. Particularly, the number of open source projects related to AI increased rapidly in 2016 (2,559 OSS projects). It was confirmed that the number of projects initiated in 2017 was 14,213, which is almost four-folds of the number of total projects generated from 2009 to 2016 (3,555 projects). The number of projects initiated from Jan to Jul 2018 was 8,737. The development trend of AI-related technologies was evaluated by dividing the study period into three phases. The appearance frequency of topics indicate the technology trends of AI-related OSS projects. The results showed that the natural language processing technology has continued to be at the top in all years. It implied that OSS had been developed continuously. Until 2015, Python, C ++, and Java, programming languages, were listed as the top ten frequently appeared topics. However, after 2016, programming languages other than Python disappeared from the top ten topics. Instead of them, platforms supporting the development of AI algorithms, such as TensorFlow and Keras, are showing high appearance frequency. Additionally, reinforcement learning algorithms and convolutional neural networks, which have been used in various fields, were frequently appeared topics. The results of topic network analysis showed that the most important topics of degree centrality were similar to those of appearance frequency. The main difference was that visualization and medical imaging topics were found at the top of the list, although they were not in the top of the list from 2009 to 2012. The results indicated that OSS was developed in the medical field in order to utilize the AI technology. Moreover, although the computer vision was in the top 10 of the appearance frequency list from 2013 to 2015, they were not in the top 10 of the degree centrality. The topics at the top of the degree centrality list were similar to those at the top of the appearance frequency list. It was found that the ranks of the composite neural network and reinforcement learning were changed slightly. The trend of technology development was examined using the appearance frequency of topics and degree centrality. The results showed that machine learning revealed the highest frequency and the highest degree centrality in all years. Moreover, it is noteworthy that, although the deep learning topic showed a low frequency and a low degree centrality between 2009 and 2012, their ranks abruptly increased between 2013 and 2015. It was confirmed that in recent years both technologies had high appearance frequency and degree centrality. TensorFlow first appeared during the phase of 2013-2015, and the appearance frequency and degree centrality of it soared between 2016 and 2018 to be at the top of the lists after deep learning, python. Computer vision and reinforcement learning did not show an abrupt increase or decrease, and they had relatively low appearance frequency and degree centrality compared with the above-mentioned topics. Based on these analysis results, it is possible to identify the fields in which AI technologies are actively developed. The results of this study can be used as a baseline dataset for more empirical analysis on future technology trends that can be converged.

Simulation Nursing Education Research Topics Trends Using Text Network Analysis (텍스트네트워크분석을 적용하여 탐색한 국내 시뮬레이션간호교육 연구주제 동향)

  • Park, Chan Sook
    • Journal of East-West Nursing Research
    • /
    • v.26 no.2
    • /
    • pp.118-129
    • /
    • 2020
  • Purpose: The purpose of this study was to analyze the topic trend of domestic simulation nursing education research using text network analysis(TNA). Methods: This study was conducted in four steps. TNA was performed using the NetMiner (version 4.4.1) program. Firstly, 245 articles from 4 databases (RISS, KCI, KISS, DBpia) published from 2008 to 2018, were collected. Secondly, keyword-forms were unified and representative words were selected. Thirdly, co-occurrence matrices of keywords with a frequency of 2 or higher were generated. Finally, social network-related measures-indices of degree centrality and betweenness centrality-were obtained. The topic trend over time was visualized as a sociogram and presented. Results: 178 author keywords were extracted. Keywords with high degree centrality were "Nursing student", "Clinical competency", "Knowledge", "Critical thinking", "Communication", and "Problem-solving ability." Keywords with high betweenness centrality were "CPR", "Knowledge", "Attitude", "Self-efficacy", "Performance ability", and "Nurse." Over time, the topic trends on simulation nursing education have diversified. For example, topics such as "Neonatal nursing", "Obstetric nursing", "Pediatric nursing", "Blood transfusion", "Community visit nursing", and "Core basic nursing skill" appeared. The core-topics that emerged only recently (2017-2018) were "High-fidelity", "Heart arrest", "Clinical judgment", "Reflection", "Core basic nursing skill." Conclusion: Although simulation nursing education research has been increasing, it is necessary to continue studies on integrated simulation learning designs based on various nursing settings. Additionally, in simulation nursing education, research is required not only on learner-centered educational outcomes, but also factors that influence educational outcomes from the perspective of the instructors.

New Evaluation Method of Patents by National R&D Program with Patent Citation Network Analysis (특허 인용 네트워크 분석을 활용한 국가연구개발사업 특허의 평가 방안)

  • Lim, Hongrae
    • Journal of Technology Innovation
    • /
    • v.27 no.4
    • /
    • pp.1-19
    • /
    • 2019
  • This study presents a new method to evaluate patents by public R&D program using patent citation network analysis. I used forward citation, degree centrality, betweenness centrality and page rank as the dependent variables which represents the quality of patents. I used primary independent variable as a dummy of public R&D program and controlled patents characteristics, applicant characteristics, technological characteristics and year effect. The empirical result shows that the patents of public R&D program is superior to other patents in regard to the number of forward citation, the degree centrality, the betweenness centrality and the page rank. This empirical result implies that patents of public R&D program directly and effectively connects technologies. Also patents from public R&D program connects important technologies.

Keyword Network Analysis on Global Research Trend in Design (1999~2018) (글로벌 디자인 연구동향에 대한 키워드 네트워크 분석 연구 (1999~2018))

  • Choi, Chool-Heon;Jang, Phill-Sik
    • Journal of Convergence for Information Technology
    • /
    • v.9 no.2
    • /
    • pp.7-16
    • /
    • 2019
  • The purpose of this study is to identify the characteristics of researches that have been conducted for the last 20 years through analyzing global research trends and evolutions of design articles from 1999 to 2018 with keyword network analysis. For this purpose, we selected 3,569 articles in 22 journals related to design research retrieved from the Scopus database and constructed keyword network model through the author keyword and index keyword. The frequency of the author and index keyword, the centrality of betweenness and degree were analyzed with the keyword network. The results show that design has been applied to various fields for recent 20 years, and the research trends of design could be quantitatively characterized by keyword network analysis. The result of this study could be used to suggest future research topics in the field of design based on quantitative and empirical data.

An Analysis of the Research Topics of the Academic Papers Published in the Journal of Korean Society of Archives and Records Management: From 2001 to 2017 (『한국기록관리학회지』 논문의 연구 주제 분석 - 2001년부터 2017년까지 -)

  • Kim, Heesop;Kang, Bora
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.18 no.4
    • /
    • pp.183-204
    • /
    • 2018
  • The main purpose of this study was to investigate the research topics of the Journal of Korean Society of Archives and Records Management, which is one of the main academic journals of archival research in Korea. To achieve this objective, a total of 875 author-assigned Korean keywords were collected from the 390 papers published from the first issue (i.e., 2001) to the current issue (i.e., 2017) in the target journal. The collected keywords were analyzed using NetMiner V.4 to discover their frequency, degree centrality, and betweenness centrality. Results showed that "Archival Information Services," "Electronic Records," "Historical Archives," "Archivists," and "National Archives of Korea" showed the most frequently conducted research topics; whereas "Archival Information Services," "Electronic Records," "Evaluation," "Locality Archives," and "Retrieval System" were the most influencing research topics. On the other hand, "Archival Information Services," "Archivists," "Electronic Records," "Archive," and "Metadata" showed the most widely intervening research topics in this research.