• Title/Summary/Keyword: Movie analysis

Search Result 491, Processing Time 0.033 seconds

Movie Popularity Classification Based on Support Vector Machine Combined with Social Network Analysis

  • Dorjmaa, Tserendulam;Shin, Taeksoo
    • Journal of Information Technology Services
    • /
    • v.16 no.3
    • /
    • pp.167-183
    • /
    • 2017
  • The rapid growth of information technology and mobile service platforms, i.e., internet, google, and facebook, etc. has led the abundance of data. Due to this environment, the world is now facing a revolution in the process that data is searched, collected, stored, and shared. Abundance of data gives us several opportunities to knowledge discovery and data mining techniques. In recent years, data mining methods as a solution to discovery and extraction of available knowledge in database has been more popular in e-commerce service fields such as, in particular, movie recommendation. However, most of the classification approaches for predicting the movie popularity have used only several types of information of the movie such as actor, director, rating score, language and countries etc. In this study, we propose a classification-based support vector machine (SVM) model for predicting the movie popularity based on movie's genre data and social network data. Social network analysis (SNA) is used for improving the classification accuracy. This study builds the movies' network (one mode network) based on initial data which is a two mode network as user-to-movie network. For the proposed method we computed degree centrality, betweenness centrality, closeness centrality, and eigenvector centrality as centrality measures in movie's network. Those four centrality values and movies' genre data were used to classify the movie popularity in this study. The logistic regression, neural network, $na{\ddot{i}}ve$ Bayes classifier, and decision tree as benchmarking models for movie popularity classification were also used for comparison with the performance of our proposed model. To assess the classifier's performance accuracy this study used MovieLens data as an open database. Our empirical results indicate that our proposed model with movie's genre and centrality data has by approximately 0% higher accuracy than other classification models with only movie's genre data. The implications of our results show that our proposed model can be used for improving movie popularity classification accuracy.

An Analysis of the Factors Affecting the Movie's Popularity (영화 흥행에 영향을 미치는 요인 분석)

  • Lee, Jeongwon;Jeon, Byungil;Kim, Semin;Lee, Gyujeon;Lee, Choong Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2019.05a
    • /
    • pp.496-499
    • /
    • 2019
  • The study aims to collect detailed movie information from box office of the Korea Film Council and data on Naver's movie ratings to analyze important factors affecting the movie's popularity based on movie audiences and ratings.

  • PDF

Pairwise fusion approach to cluster analysis with applications to movie data (영화 데이터를 위한 쌍별 규합 접근방식의 군집화 기법)

  • Kim, Hui Jin;Park, Seyoung
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.2
    • /
    • pp.265-283
    • /
    • 2022
  • MovieLens data consists of recorded movie evaluations that was often used to measure the evaluation score in the recommendation system research field. In this paper, we provide additional information obtained by clustering user-specific genre preference information through movie evaluation data and movie genre data. Because the number of movie ratings per user is very low compared to the total number of movies, the missing rate in this data is very high. For this reason, there are limitations in applying the existing clustering methods. In this paper, we propose a convex clustering-based method using the pairwise fused penalty motivated by the analysis of MovieLens data. In particular, the proposed clustering method execute missing imputation, and at the same time uses movie evaluation and genre weights for each movie to cluster genre preference information possessed by each individual. We compute the proposed optimization using alternating direction method of multipliers algorithm. It is shown that the proposed clustering method is less sensitive to noise and outliers than the existing method through simulation and MovieLens data application.

An Analysis of Movie Consumption Behavior from Transaction Cost Perspectives (거래비용관점에서 본 영화 소비행위 분석)

  • Park, Hye Youn;Kim, Jai Beom;Lee, Chang Jin
    • Review of Culture and Economy
    • /
    • v.20 no.3
    • /
    • pp.3-33
    • /
    • 2017
  • The present study analyzed movie consumption behavior from the perspective of transaction cost, taking into account the possible incurrence of additional costs in the process of consumers obtaining movie information to choose movies. Regression and multinomial logistic regression analyses were performed in the analysis by taking movie information and the individuals' social demographic characteristics as independent variables and the number and frequency of movies watched as dependent variables, using information from the "2015 movie consumer survey." The results showed that consumers considering elements such as "directors" and "online reviews" were found to be more active in movie consumption. The analysis of movie-watching frequency showed that the information considered when choosing a movie was different for high- and low-frequency movie viewers. Putting these factors together suggests that movie consumption can vary according to an individual's cultural capital, preferences, and their degree of movie information awareness. While existing studies have mostly analyzed the determinants of box office performance, the significance of the present study is its empirical analysis of individual movie information in terms of transaction cost. Based on the results above, it can be inferred that the cyclical structure of trading expenses influences movie consumption and, once preferences are formed through a certain level of consumption, the trading cost expenses decrease, which results in increasing consumption. Therefore, film makers need to establish and execute marketing strategies that appropriately use movie information so that consumers can reduce the trading costs necessary for movie watching.

Sentiment Analysis on Movie Reviews Using Word Embedding and CNN (워드 임베딩과 CNN을 사용하여 영화 리뷰에 대한 감성 분석)

  • Ju, Myeonggil;Youn, Seongwook
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.15 no.1
    • /
    • pp.87-97
    • /
    • 2019
  • Reaction of people is importantly considered about specific case as a social network service grows. In the previous research on analysis of social network service, they predicted tendency of interesting topic by giving scores to sentences written by user. Based on previous study we proceeded research of sentiment analysis for social network service's sentences, which predict the result as positive or negative for movie reviews. In this study, we used movie review to get high accuracy. We classify the movie review into positive or negative based on the score for learning. Also, we performed embedding and morpheme analysis on movie review. We could predict learning result as positive or negative with a number 0 and 1 by applying the model based on learning result to social network service. Experimental result show accuracy of about 80% in predicting sentence as positive or negative.

Semantic analysis via application of deep learning using Naver movie review data (네이버 영화 리뷰 데이터를 이용한 의미 분석(semantic analysis))

  • Kim, Sojin;Song, Jongwoo
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.1
    • /
    • pp.19-33
    • /
    • 2022
  • With the explosive growth of social media, its abundant text-based data generated by web users has become an important source for data analysis. For example, we often witness online movie reviews from the 'Naver Movie' affecting the general public to decide whether they should watch the movie or not. This study has conducted analysis on the Naver Movie's text-based review data to predict the actual ratings. After examining the distribution of movie ratings, we performed semantics analysis using Korean Natural Language Processing. This research sought to find the best review rating prediction model by comparing machine learning and deep learning models. We also compared various regression and classification models in 2-class and multi-class cases. Lastly we explained the causes of review misclassification related to movie review data characteristics.

A Visualization of Movie Review based on a Semantic Network Analysis (의미연결망 분석을 활용한 영화 리뷰 시각화)

  • Kim, Seul-gi;Kim, Jang Hyun
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.10a
    • /
    • pp.197-200
    • /
    • 2018
  • The aim of current research is to suggest a interface for movie reviews at a glance through semantic network analysis. The implication of this study is to systematically investigate the structure of eWoM. Specifically, by visualizing semantic networks of movie reviews this study attempts to provide a prototype of a possible review system that can check the response of movie viewer at a glance.

  • PDF

A Study on Movie Consumption and Concentration Trends in Theaters and Online (극장과 온라인의 영화 소비와 소비집중도 추세에 관한 연구)

  • Kim, Jun Sung
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.8
    • /
    • pp.170-179
    • /
    • 2022
  • In the theater-based movie industry, it is known that the diversity of movie consumption is hindered due to concentrated consumption. This study extends the existing discussions on the concentration of movie consumption in theaters to the concentration of online movie consumption. In addition, the study analyzes the impact of Covid-19 pandemic on movie consumption and the concentration thereof. For analysis, panel data for the period from 2012 through 2021 were collected by utilizing the box office data of the Korean Film Council. As a result of the analysis, it was found that the concentration of consumption by movie, country, and genre was higher in theaters than online. Further, the concentration of movie consumption has increased both in theaters and online until the outbreak of Covid-19 pandemic. During the Covid-19 pandemic period, the size of consumption has decreased both in theaters and online, while the concentration of consumption by movie online has increased. The result of this study implies a need for policy-level efforts to convert the trend of consumption concentration for long-term development of the movie industry with secured diversity of movie consumption, and for this, the study suggests that the use of online media would be useful.

Movie Retrieval System by Analyzing Sentimental Keyword from User's Movie Reviews (사용자 영화평의 감정어휘 분석을 통한 영화검색시스템)

  • Oh, Sung-Ho;Kang, Shin-Jae
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.3
    • /
    • pp.1422-1427
    • /
    • 2013
  • This paper proposed a movie retrieval system based on sentimental keywords extracted from user's movie reviews. At first, sentimental keyword dictionary is manually constructed by applying morphological analysis to user's movie reviews, and then keyword weights in the dictionary are calculated for each movie with TF-IDF. By using these results, the proposed system classify sentimental categories of movies and rank classified movies. Without reading any movie reviews, users can retrieve movies through queries composed by sentimental keywords.

Research on the Movie Reviews Regarded as Unsuccessful in Box Office Outcomes in Korea: Based on Big Data Posted on Naver Movie Portal

  • Jeon, Ho-Seong
    • Asia-Pacific Journal of Business
    • /
    • v.12 no.3
    • /
    • pp.51-69
    • /
    • 2021
  • Purpose - Based on literature studies of movie reviews and movie ratings, this study raised two research questions on the contents of online word of mouth and the number of movie screens as mediator variables. Research question 1 wanted to figure out which topics of word groups had a positive or negative impact on movie ratings. Research question 2 tried to identify the role of the number of movie screens between movie ratings and box office outcomes. Design/methodology/approach - Through R program, this study collected about 82,000 movie reviews and movie ratings posted on Naver's movie website to examine the role of online word of mouths and movie screen counts in 10 movies that were considered commercially unsuccessful with fewer than 2 million viewers despite securing about 1,000 movie screens. To confirm research question 1, topic modeling, a text mining technique, was conducted on movie reviews. In addition, this study linked the movie ratings posted on Naver with information of KOBIS by date, to identify the research question 2. Findings - Through topic modeling, 5 topics were identified. Topics found in this study were largely organized into two groups, the content of the movie (topic 1, 2, 3) and the evaluation of the movie (topics 4, 5). When analyzing the relationship between movie reviews and movie ratings with 5 mediators identified in topic modeling to probe research question 1, the topic word groups related to topic 2, 3 and 5 appeared having a negative effect on the netizen's movie ratings. In addition, by connecting two secondary data by date, analysis for research question 2 was implemented. The outcomes showed that the causal relationship between movie ratings and audience numbers was mediated by the number of movie screens. Research implications or Originality - The results suggested that the information presented in text format was harder to quantify than the information provided in scores, but if content information could be digitalized through text mining techniques, it could become variable and be analyzed to identify causality with other variables. The outcomes in research question 2 showed that movie ratings had a direct impact on the number of viewers, but also had indirect effects through changes in the number of movie screens. An interesting point is that the direct effect of movie ratings on the number of viewers is found in most American films released in Korea.