Search | Korea Science

A Model of Predictive Movie 10 Million Spectators through Big Data Analysis (빅데이터 분석을 통한 천만 관객 영화 예측 모델)

Yu, Jong-Pil;Lee, Eung-hwan
- The Journal of Bigdata
- /
- v.3 no.1
- /
- pp.63-71
- /
- 2018
In the last five years (2013~2017), we analyzed what factors influenced Korean films that have surpassed 10 million viewers in the Korean movie industry, where the total number of moviegoers is over 200 million. In general, many people consider the number of screens and ratings as important factors that affect the audience's success. In this study, four additional factors, including the number of screens and ratings, were established to establish a hypothesis and correlate it with the presence of 10 million spectators through big data analysis. The results were significant, with 91 percent accuracy in predicting 10 million viewers and 99.4 percent accuracy in estimating cumulative attendance.
PDF KSCI

An Analysis of Corelation between Movie Attendance and Related Tweets for Predicting Box Office (영화 흥행 예측을 위한 영화 관객 수와 관련 트윗간의 상관관계 분석)

Yim, Junyeob;Hwang, Byung-Yeon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2013.11a
- /
- pp.1245-1247
- /
- 2013
최근 들어 영화에 대한 수요가 증가하면서 국내 영화시장규모는 지속적으로 성장하고 있다. 이와 관련하여 여러 가지 위험요소를 제거하고 시장에서의 성공을 위해 영화의 흥행을 예측하기 위한 다양한 연구들이 진행되고 있다. 그러나 그러한 예측을 위한 관련 요소들 간의 상관관계를 정확한 수치로 표현하는 일은 매우 어려우며 관련연구 또한 아직 미흡하다. 본 논문에서는 트위터에서 발생되는 트윗을 설문 표본으로 삼고 영화 관련 트윗과 영화의 흥행을 의미하는 관객 수와의 상관관계를 분석하여 상관계수를 도출하였다. 실험 결과 실험에 사용된 영화 10편의 관객 수에 대한 데이터 모두 관련 트윗의 발생비율과 양의 상관관계를 가짐을 알 수 있었으며 이를 통해 트위터를 이용한 영화의 흥행 여부 예측에 대한 가능성을 제시했다.
https://doi.org/10.3745/PKIPS.y2013m11a.1245 인용 PDF

A Study on the Performance Evaluation of Machine Learning for Predicting the Number of Movie Audiences (영화 관객 수 예측을 위한 기계학습 기법의 성능 평가 연구)

Jeong, Chan-Mi;Min, Daiki
- The Journal of Society for e-Business Studies
- /
- v.25 no.2
- /
- pp.49-63
- /
- 2020
The accurate prediction of box office in the early stage is crucial for film industry to make better managerial decision. With aims to improve the prediction performance, the purpose of this paper is to evaluate the use of machine learning methods. We tested both classification and regression based methods including k-NN, SVM and Random Forest. We first evaluate input variables, which show that reputation-related information generated during the first two-week period after release is significant. Prediction test results show that regression based methods provides lower prediction error, and Random Forest particularly outperforms other machine learning methods. Regression based method has better prediction power when films have small box office earnings. On the other hand, classification based method works better for predicting large box office earnings.
https://doi.org/10.7838/jsebs.2020.25.2.049 인용 PDF KSCI

Assessing Box Office Performance Using Movie Scripts Text Mining (영화 스크립트 텍스트 마이닝을 통한 흥행성과 예측)

Ha, Hyunsoo;Hwang, Byeong-Yeon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2016.10a
- /
- pp.556-558
- /
- 2016
영화 흥행 실패의 리스크를 줄이기 위해 객관적인 흥행 예측 지표가 요구된다. 본 논문에서는 영화 스크립트의 텍스트를 분석하여 흥행성과를 예측하는 기법을 제안한다. 객관적인 흥행 예측 지표는 누적 관객 수와 누적 매출액으로 설정하였다. 실험은 2010년 1월 1일부터 2016년 8월까지 개봉한 영화중에서 누적 관객 수와 누적 매출액을 기준으로 상위 50위까지의 영화 스크립트를 분석하여 진행했다. 실험을 통해 영화 제작에 앞서 스크립트 분석만을 활용한 영화 흥행성과 예측이 가능함을 보였다.
https://doi.org/10.3745/PKIPS.y2016m10a.556 인용 PDF

Prediction of movie audience numbers using hybrid model combining GLS and Bass models (GLS와 Bass 모형을 결합한 하이브리드 모형을 이용한 영화 관객 수 예측)

Kim, Bokyung;Lim, Changwon
- The Korean Journal of Applied Statistics
- /
- v.31 no.4
- /
- pp.447-461
- /
- 2018
Domestic film industry sales are increasing every year. Theaters are the primary sales channels for movies and the number of audiences using the theater affects additional selling rights. Therefore, the number of audiences using the theater is an important factor directly linked to movie industry sales. In this paper we consider a hybrid model that combines a multiple linear regression model and the Bass model to predict the audience numbers for a specific day. By combining the two models, the predictive value of the regression analysis was corrected to that of the Bass model. In the analysis, three films with different release dates were used. All subset regression method is used to generate all possible combinations and 5-fold cross validation to estimate the model 5 times. In this case, the predicted value is obtained from the model with the smallest root mean square error and then combined with the predicted value of the Bass model to obtain the final predicted value. With the existence of past data, it was confirmed that the weight of the Bass model increases and the compensation is added to the predicted value.
https://doi.org/10.5351/KJAS.2018.31.4.447 인용 PDF KSCI

Box Office Hit Prediction Using Data mining and Text mining (데이터마이닝과 텍스트마이닝을 활용한 영화 흥행 예측)

Jo, Hyo-jung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2021.05a
- /
- pp.316-318
- /
- 2021
영화 수익에 있어 영화의 흥행 여부는 중요한 영향을 끼친다. 영화 흥행 요인은 영화 산업의 규모가 커지면서 많은 제작사들 및 투자자들이 고려해야 하는 사항이 되었다. 따라서 영화의 흥행을 예측하기 위한 많은 모델이 연구되었다. 본 연구의 목적은 선행연구에서 흥행에 유의미한 영향을 끼친다고 밝혀진 스크린 수, 감독명, 제작사명 등의 내재적인 속성과 더불어 온라인 구전 변수를 사용하여 영화 흥행 예측 모델을 만드는 것이다. 이때 기사 수, 블로그 수와 같이 온라인 구전의 크기를 나타내는 변수들을 사용하는 대신 개봉 후 첫 주간의 관람객 리뷰를 텍스트마이닝을 이용하여 전체 리뷰 중 긍정 리뷰의 비율에 따라 점수를 매긴 후 독립변수로 사용한다. 그 후, 데이터 마이닝 기법을 활용하여 만든 모델에 앞서 언급한 독립변수를 입력 값으로 사용하여 영화의 흥행을 예측한다. 최종적으로 의사결정트리와 로지스틱회귀를 수행한 결과 영화 흥행에 영향을 주는 독립변수를 찾고 모델의 성능을 평가하였다. 로지스틱회귀의 결과 관객 수, 평점이 영화의 흥행에 특히 유의한 영향을 끼치는 변수로 선정되었고 리뷰 역시 유의한 변수로 선정되었다. 이때 만들어진 모델은 약 90%의 높은 수준의 정확도를 보여주었다. 의사결정트리의 결과 관객 수가 가장 중요한 변수로 선정되었다.
https://doi.org/10.3745/PKIPS.y2021m05a.316 인용 PDF

Prediction of Number of Movie Audience Using Feature Minimization and Data Selection (특징 최소화와 데이터 선별을 활용한 영화 관객수 예측)

Yang, Youngbo;Yu, Heonchang
- Proceedings of the Korea Information Processing Society Conference
- /
- 2019.05a
- /
- pp.443-446
- /
- 2019
빅데이터 분석을 위해 많이 사용하고 있는 기계학습 알고리즘들 중 딥러닝 알고리즘이 많이 활용되고 있으며 분류와 예측에 높은 정확도를 나타내고 있다. 딥러닝 알고리즘의 적용에 따른 많은 장단점들이 있지만, 단점은 분석에 사용되는 특징들이 너무 많다는 것과 분석 모델을 만드는데 사용되는 알고리즘도 여러 가지를 적용하다 보니 분석 시간이 오래 걸린다는 것이다. 이런 단점들은 업무를 파악하면 특징을 최소화할 수 있고 필요로 하는 정보만 선별해서 대표적인 딥러닝 알고리즘 하나에 분석을 하게 되면 분석 시간을 단축시킬 수 있다. 이 실험은 [1], [2]에서 연구한 영화 관객수 예측 모델을 4개의 특징으로 최소화하고 선별된 데이터를 인공신경망 알고리즘 하나로 예측 모델을 생성하였을 때 유의미한 정보를 도출해 낼 수 있는지를 알아보기 위한 것이다. 실험결과는 최종 관객수를 1명 단위까지 정확하게 예측하지는 못했지만 비슷한 수준의 관객수 정보를 예측하였다. 학문적인 접근으로 보았을 때 예측 정확도가 높지 않으면 사용이 불가능한 모델이라고 판단할 수 있지만, 기업 입장으로 접근해 보았을 때 예측 정보가 [1]. [2] 연구 결과에 비해 부족한 수준은 아니다. 총 소요된 시간은 기획 3일, 데이터 수집 및 모델 개발 5일, 분석 시간 10분으로 개발 시간 단축, 업무 효율성 향상, 비용 절감을 기대할 수 있다.
https://doi.org/10.3745/PKIPS.y2019m05a.443 인용 PDF

Movie Box-office Analysis using Social Big Data (소셜 빅데이터를 이용한 영화 흥행 요인 분석)

Lee, O-Joun;Park, Seung-Bo;Chung, Daul;You, Eun-Soon
- The Journal of the Korea Contents Association
- /
- v.14 no.10
- /
- pp.527-538
- /
- 2014
The demand prediction is a critical issue for the film industry. As the social media, such as Twitter and Facebook, gains momentum of late, considerable efforts are being dedicated to prediction and analysis of hit movies based on unstructured text data. For prediction of trends found in commercially successful films, the correlations between the amount of data and hit movies may be analyzed by estimating the data variation by period while opinion mining that assigns sentiment polarity score to data may be employed. However, it is not possible to understand why the audience chooses a certain movie or which attribute of a movie is preferred by using such a quantitative approach. This has limited the efforts to identify factors driving a movie's commercial success. In this regard, this study aims to investigate a movie's attributes that reflect the interests of the audience. This would be done by extracting topic keywords that represent the contents of Twits through frequency measurement based on the collected Twitter data while analyzing responses displayed by the audience. The objective is to propose factors driving a movie's commercial success.
https://doi.org/10.5392/JKCA.2014.14.10.527 인용 PDF KSCI

Movie attendance and sales forecast model through big data analysis (빅데이터 분석을 통한 영화 관객수, 매출액 예측 모델)

Lee, Eung-hwan;Yu, Jong-Pil
- The Journal of Bigdata
- /
- v.4 no.2
- /
- pp.185-194
- /
- 2019
In the 100-year history of Korean films, Korean films have grown to more than 100 million viewers every year since 2012, and their total sales are estimated at 1 trillion. It is assumed that the influence on the popularity of Korean movies is related to 2012, when 60% of smartphone penetration rate and 30 million subscribers exceeded. As a result, before and after 2012, changes in movie boxing factor variables were needed, and the prediction model trained as a new independent variable was applied to actual data.
PDF

Analysis of Spectator Mobilizing Power for 2000's Korea Movies Based on Construction of Network (네트워크 기반 2000년대 한국영화의 관객 동원력 분석)

Kim, Hak-Yong
- The Journal of the Korea Contents Association
- /
- v.11 no.1
- /
- pp.429-437
- /
- 2011
Movie network as a social network shows power-law distribution that is one of distinct properties in scale-free network. We constructed movie network from 799 Korea movies that screened from 2000 to 2009 and analyzed structural properties of the network. The 799 movies was classified three groups as a spectator mobilizing power. One million spectators mobilizing power movie was denoted the first class. The best 10 movie directors who produced at least three movies for ten years and had 70% the first class movie of them were selected. We also preferred the best 20 movie actors who played at least five movies for ten years and had 70% the first class movie of them. We re-constructed core movie network that composed the best 10 directors, the best 20 movie stars, and 157 movies that were produced by the directors or were played by the movie stars. We predict a possible combination of the director and movie actor as a category of the movie that has highly spectator mobilizing power. Here, we provide insight and method for producing high spectators mobilizing power movies
https://doi.org/10.5392/JKCA.2011.11.1.429 인용 PDF KSCI

Search Result 28, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)