• Title/Summary/Keyword: Degree centrality

Search Result 359, Processing Time 0.023 seconds

Movie Popularity Classification Based on Support Vector Machine Combined with Social Network Analysis

  • Dorjmaa, Tserendulam;Shin, Taeksoo
    • Journal of Information Technology Services
    • /
    • v.16 no.3
    • /
    • pp.167-183
    • /
    • 2017
  • The rapid growth of information technology and mobile service platforms, i.e., internet, google, and facebook, etc. has led the abundance of data. Due to this environment, the world is now facing a revolution in the process that data is searched, collected, stored, and shared. Abundance of data gives us several opportunities to knowledge discovery and data mining techniques. In recent years, data mining methods as a solution to discovery and extraction of available knowledge in database has been more popular in e-commerce service fields such as, in particular, movie recommendation. However, most of the classification approaches for predicting the movie popularity have used only several types of information of the movie such as actor, director, rating score, language and countries etc. In this study, we propose a classification-based support vector machine (SVM) model for predicting the movie popularity based on movie's genre data and social network data. Social network analysis (SNA) is used for improving the classification accuracy. This study builds the movies' network (one mode network) based on initial data which is a two mode network as user-to-movie network. For the proposed method we computed degree centrality, betweenness centrality, closeness centrality, and eigenvector centrality as centrality measures in movie's network. Those four centrality values and movies' genre data were used to classify the movie popularity in this study. The logistic regression, neural network, $na{\ddot{i}}ve$ Bayes classifier, and decision tree as benchmarking models for movie popularity classification were also used for comparison with the performance of our proposed model. To assess the classifier's performance accuracy this study used MovieLens data as an open database. Our empirical results indicate that our proposed model with movie's genre and centrality data has by approximately 0% higher accuracy than other classification models with only movie's genre data. The implications of our results show that our proposed model can be used for improving movie popularity classification accuracy.

A study on women's welfare organization's network -Focusing on network centrality and organizational effectiveness- (여성복지조직의 네트워크에 관한 연구 -네트워크 중심성(centrality)과 조직효과성을 중심으로-)

  • Jang, Yeon Jin
    • Korean Journal of Social Welfare Studies
    • /
    • v.41 no.4
    • /
    • pp.313-343
    • /
    • 2010
  • The aim of this study is to examine the factors influencing network centrality on women's welfare organizations, and to investigate how the level of network centrality influence the effectiveness of the organization. To achieve this goal, this study conducted a survey on women's welfare organizations in Seoul from March to June, 2009. Network analysis method was used to get each organization's network centrality value. Also, through the Structural Equation Modelling, organizational characteristics predicting network centrality and effect of network centrality on organizational effectiveness. The main results are as follows. First, the significant affecting factors were different between three types of centralities with regards to the type of organization, recognition of resource dependency, attitude of top manager, and established year. Second, the common factors affecting three network centralities were the number of informal ties, accepting feminism as the main organizational philosophy, and the number of qualified staffs. Third, only closeness centrality positively predicted the level of organizational effectiveness among three types of centralities. The faster the organization reaches to other organizations in a network, the organizational effectiveness becomes higher, which means high closeness centrality is more important factor than high degree centrality or high betweenness centrality to increase organizational effectiveness. This result shows social welfare organization should consider changing inter-organizational network strategy from quantity-focused to quality-focused.

Improved Social Network Analysis Method in SNS (SNS에서의 개선된 소셜 네트워크 분석 방법)

  • Sohn, Jong-Soo;Cho, Soo-Whan;Kwon, Kyung-Lag;Chung, In-Jeong
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.4
    • /
    • pp.117-127
    • /
    • 2012
  • Due to the recent expansion of the Web 2.0 -based services, along with the widespread of smartphones, online social network services are being popularized among users. Online social network services are the online community services which enable users to communicate each other, share information and expand human relationships. In the social network services, each relation between users is represented by a graph consisting of nodes and links. As the users of online social network services are increasing rapidly, the SNS are actively utilized in enterprise marketing, analysis of social phenomenon and so on. Social Network Analysis (SNA) is the systematic way to analyze social relationships among the members of the social network using the network theory. In general social network theory consists of nodes and arcs, and it is often depicted in a social network diagram. In a social network diagram, nodes represent individual actors within the network and arcs represent relationships between the nodes. With SNA, we can measure relationships among the people such as degree of intimacy, intensity of connection and classification of the groups. Ever since Social Networking Services (SNS) have drawn increasing attention from millions of users, numerous researches have made to analyze their user relationships and messages. There are typical representative SNA methods: degree centrality, betweenness centrality and closeness centrality. In the degree of centrality analysis, the shortest path between nodes is not considered. However, it is used as a crucial factor in betweenness centrality, closeness centrality and other SNA methods. In previous researches in SNA, the computation time was not too expensive since the size of social network was small. Unfortunately, most SNA methods require significant time to process relevant data, and it makes difficult to apply the ever increasing SNS data in social network studies. For instance, if the number of nodes in online social network is n, the maximum number of link in social network is n(n-1)/2. It means that it is too expensive to analyze the social network, for example, if the number of nodes is 10,000 the number of links is 49,995,000. Therefore, we propose a heuristic-based method for finding the shortest path among users in the SNS user graph. Through the shortest path finding method, we will show how efficient our proposed approach may be by conducting betweenness centrality analysis and closeness centrality analysis, both of which are widely used in social network studies. Moreover, we devised an enhanced method with addition of best-first-search method and preprocessing step for the reduction of computation time and rapid search of the shortest paths in a huge size of online social network. Best-first-search method finds the shortest path heuristically, which generalizes human experiences. As large number of links is shared by only a few nodes in online social networks, most nods have relatively few connections. As a result, a node with multiple connections functions as a hub node. When searching for a particular node, looking for users with numerous links instead of searching all users indiscriminately has a better chance of finding the desired node more quickly. In this paper, we employ the degree of user node vn as heuristic evaluation function in a graph G = (N, E), where N is a set of vertices, and E is a set of links between two different nodes. As the heuristic evaluation function is used, the worst case could happen when the target node is situated in the bottom of skewed tree. In order to remove such a target node, the preprocessing step is conducted. Next, we find the shortest path between two nodes in social network efficiently and then analyze the social network. For the verification of the proposed method, we crawled 160,000 people from online and then constructed social network. Then we compared with previous methods, which are best-first-search and breath-first-search, in time for searching and analyzing. The suggested method takes 240 seconds to search nodes where breath-first-search based method takes 1,781 seconds (7.4 times faster). Moreover, for social network analysis, the suggested method is 6.8 times and 1.8 times faster than betweenness centrality analysis and closeness centrality analysis, respectively. The proposed method in this paper shows the possibility to analyze a large size of social network with the better performance in time. As a result, our method would improve the efficiency of social network analysis, making it particularly useful in studying social trends or phenomena.

Smart Store in Smart City: The Development of Smart Trade Area Analysis System Based on Consumer Sentiments (Smart Store in Smart City: 소비자 감성기반 상권분석 시스템 개발)

  • Yoo, In-Jin;Seo, Bong-Goon;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.25-52
    • /
    • 2018
  • This study performs social network analysis based on consumer sentiment related to a location in Seoul using data reflecting consumers' web search activities and emotional evaluations associated with commerce. The study focuses on large commercial districts in Seoul. In addition, to consider their various aspects, social network indexes were combined with the trading area's public data to verify factors affecting the area's sales. According to R square's change, We can see that the model has a little high R square value even though it includes only the district's public data represented by static data. However, the present study confirmed that the R square of the model combined with the network index derived from the social network analysis was even improved much more. A regression analysis of the trading area's public data showed that the five factors of 'number of market district,' 'residential area per person,' 'satisfaction of residential environment,' 'rate of change of trade,' and 'survival rate over 3 years' among twenty two variables. The study confirmed a significant influence on the sales of the trading area. According to the results, 'residential area per person' has the highest standardized beta value. Therefore, 'residential area per person' has the strongest influence on commercial sales. In addition, 'residential area per person,' 'number of market district,' and 'survival rate over 3 years' were found to have positive effects on the sales of all trading area. Thus, as the number of market districts in the trading area increases, residential area per person increases, and as the survival rate over 3 years of each store in the trading area increases, sales increase. On the other hand, 'satisfaction of residential environment' and 'rate of change of trade' were found to have a negative effect on sales. In the case of 'satisfaction of residential environment,' sales increase when the satisfaction level is low. Therefore, as consumer dissatisfaction with the residential environment increases, sales increase. The 'rate of change of trade' shows that sales increase with the decreasing acceleration of transaction frequency. According to the social network analysis, of the 25 regional trading areas in Seoul, Yangcheon-gu has the highest degree of connection. In other words, it has common sentiments with many other trading areas. On the other hand, Nowon-gu and Jungrang-gu have the lowest degree of connection. In other words, they have relatively distinct sentiments from other trading areas. The social network indexes used in the combination model are 'density of ego network,' 'degree centrality,' 'closeness centrality,' 'betweenness centrality,' and 'eigenvector centrality.' The combined model analysis confirmed that the degree centrality and eigenvector centrality of the social network index have a significant influence on sales and the highest influence in the model. 'Degree centrality' has a negative effect on the sales of the districts. This implies that sales decrease when holding various sentiments of other trading area, which conflicts with general social myths. However, this result can be interpreted to mean that if a trading area has low 'degree centrality,' it delivers unique and special sentiments to consumers. The findings of this study can also be interpreted to mean that sales can be increased if the trading area increases consumer recognition by forming a unique sentiment and city atmosphere that distinguish it from other trading areas. On the other hand, 'eigenvector centrality' has the greatest effect on sales in the combined model. In addition, the results confirmed a positive effect on sales. This finding shows that sales increase when a trading area is connected to others with stronger centrality than when it has common sentiments with others. This study can be used as an empirical basis for establishing and implementing a city and trading area strategy plan considering consumers' desired sentiments. In addition, we expect to provide entrepreneurs and potential entrepreneurs entering the trading area with sentiments possessed by those in the trading area and directions into the trading area considering the district-sentiment structure.

A Comparative Analysis Study of IFLA School Library Guidelines Using Semantic Network Analysis (언어 네트워크 분석을 통한 IFLA의 학교도서관 가이드라인 비교·분석에 관한 연구)

  • Lee, Byeong-Kee
    • Journal of Korean Library and Information Science Society
    • /
    • v.51 no.2
    • /
    • pp.1-21
    • /
    • 2020
  • The purpose of this study is to explore semantic characteristics of IFLA school library guidelines through network analysis. There are two versions, 2002 edition and 2015 revision of the guidelines. This study analyzed the 2002 edition and 2015 revision of the IFLA school library guidelines view point of semantic network, and compared characteristics of two versions. The keywords were to extracted from two texts, semantic network were composed based on co-occurrence relations with keywords. The centrality(degree centrality, closeness centrality, betweenness centrality) was analyzed from the network. In addition, this study conducted topic modeling analysis using LDA function of NetMiner4.0. The result of this study is following these. First, When comparing the centrality, the 'Program, Teaching, Reading, Inquiry, Literacy, Media' keyword was higher in the 2015 revision than in the 2002 edition. Second, 'Inquiry' in degree centrality and 'Achievement' in closeness centrality which were not included in the 2002 edition top-ranked keyword list, have new appeared in 2015 revision. third, As a result of the analysis of topic modeling, compared to the 2002 version, the importance of topics on programs and services, teaching and learning activities of librarian teacher, and media and information literacy is increasing in the 2015 revision.

A Comparison of First Time and Repeat Visitors' Tourism Destination -Focusing on Seoul City (최초방문자와 재방문자의 관광목적지 선택차이 연구 -서울지역을 중심으로)

  • Kim, Min-Sun;Um, Hyemi
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.17 no.11
    • /
    • pp.648-654
    • /
    • 2016
  • This paper investigates differences of tourism destination choices for sightseeing in Seoul between first-time visitors and repeat visitors. We constructed social network using secondary data from '2015 International Visitor Survey' and analyzed its density and centrality. Study results find that: (1) first-time and repeat visitors' tourism destinations are concentrated in areas located north of the Han river. The proximity of destinations suggests the positive effects resulting from the movement network. (2) As the result of degree centrality, closeness centrality, betweenness centrality, the highest ranking tourism destinations for both visitor groups are identical, but indexes of centralities in repeat visitors' destinations increase, including Shinchon/ Hongik University, Gangnam station, and Garosu-gil. Therefore, the roles of these destinations are becoming established as tourism hubs and are popular among younger visitors as well as attract repeat visitors. Results of this study will be a useful reference in developing and managing new tourism products.

The Influence of Small World and Centrality on the Paper Achievement of Government-Funded Research Institutes (과학기술계 정부출연연구기관의 논문 성과에 좁은 세상 구조와 중심성이 미치는 영향)

  • Lee, Hyekyung;Kim, Somin;Kim, Jeongheum
    • Journal of Technology Innovation
    • /
    • v.29 no.1
    • /
    • pp.39-73
    • /
    • 2021
  • The cooperative network structure influences the academic performance of the research institute. In particular, South Korea's Government-Funded Research Institutes(GRI) need to establish an efficient cooperative system as a leading national R&D implementer. This study applied the Small World structure, which has been discussed as an efficient network structure, and the centrality of representing the characteristics of nodes to the cooperative network of GRI in Korea. Based on the SCIE published data from 2010 to 2019, we analyze how the Small World characteristics and centrality of GRI contribute to academic performance using a network analysis and Feasible GLS regression. The GRI cooperative network has shown that the Small World network structure facilitates the academic performance. In addition, centrality indicating the degree of direct connection showed positive significance, but centrality indicating the degree of intermediary was not significant or negative. The results of this study explain that the higher the number of institutions that exchange and cooperate, the higher the academic performance, and the higher the performance of the institutions that serve as the center of cooperation. In addition, it was established that the stronger the cooperative network of GRIs have the characteristics of Small World, the more effective it is to create research results. This study applies centrality and Small World previously discussed as an efficient network structure to the GRI cooperation network and provide implications for establishing policies and strategies related to R&D cooperation among GRIs.

Social Network Analysis of Shared Bicycle Usage Pattern Based on Urban Characteristics: A Case Study of Seoul Data (도시특성에 기반한 공유 자전거 이용 패턴의 소셜 네트워크 분석 연구: 서울시 데이터 사례 분석)

  • Byung Hyun Lee;Il Young Choi;Jae Kyeong Kim
    • Information Systems Review
    • /
    • v.22 no.1
    • /
    • pp.147-165
    • /
    • 2020
  • The sharing economy service is now spreading in various fields such as accommodation, cars and bicycles. In particular, bicycle-sharing service have become very popular around the world, and since September 2015, Seoul has been providing a bicycle-sharing service called 'Ttareungi'. However, the number of bicycles is unbalanced among rental stations continuously according to the user's bicycle use. In order to solve these problems, we employed social network analysis using Ttareungi data in Seoul, Korea. We analyzed degree centrality, closeness centrality, betweenness centrality and k-core. As a result, the degree centrality was found to be closely linked with bus or subway transfer center. Closeness centrality was found to be in an unbalanced departure and arrival frequency or poor public transport proximity. Betweenness centrality means where the frequency of departure and arrival occurs frequently. Finally, the k-core analysis showed that Mapo-gu was the most important group by time zone. Therefore, the results of this study may contribute to the planning of relocation and additional installation of bike rental station in Seoul.

A Social network analysis on the research subjects in Journal of Korean Safety Management and Science (대한안전경영과학회지의 연구 주제에 대한 사회 연결망 분석)

  • Yang, Byoung-Hak
    • Journal of the Korea Safety Management & Science
    • /
    • v.15 no.2
    • /
    • pp.161-166
    • /
    • 2013
  • The purpose of this research is to analyse the research subject in journal of Korean safety management and sciences. Total 1850 key words in 560 papers were analysed by the Pajek system which is one of well known social network analysis tool. Key words trend from 2008 to 2012 was examined. Then the relationship among each key words was visualized. There were five key words group which strongly connected among key words. The degree centrality, between centrality, proximity prestige on each key words were calculated to verify influence degree to other key words.

A Study on Inter-Organizational Service Network for the Primary School Children in Need (결식아동 지원조직간 서비스 연계망(network)에 관한 연구)

  • Lee, Hye-Won
    • Korean Journal of Social Welfare
    • /
    • v.49
    • /
    • pp.190-224
    • /
    • 2002
  • The purpose of this study was to find out the characteristics of the direct workers and the organizations delivering services to the children in low-income families, and to identify major factors that affect the number and degree centrality of service network. The research sample was 141 organizations. and the data were collected by a survey questionnaire and analyzed by UCINET V and multiple regression. The results show that the classification of organizations, the work-autonomy, the license of social welfare have a significant effect on the number of network organization, and the license of social welfare, the subjective body of organizations, the number of children in charge have a significant effect on the degree centrality of service network, among the independent variables. Based on the research results, implications for the future practice are discussed.

  • PDF