Analysis of drama viewership related words through unstructured data collection

Kang, Sun-Kyoung; Lee, Hyun-Chang; Shin, Seong-Yoon

doi:10.6109/jkiice.2017.21.8.1567

Journal of the Korea Institute of Information and Communication Engineering (한국정보통신학회논문지)

Volume 21 Issue 8 / Pages 1567-1574 / 2017 / 2234-4772 (pISSN) / 2288-4165 (eISSN)

The Korea Institute of Information and Communication Engineering (한국정보통신학회)

비정형데이터 수집을 통한 드라마 시청률 연관어 분석

Kang, Sun-Kyoung (Department of Computer Software Engineering, Wonkwang University) ;
Lee, Hyun-Chang (Department of Digital Contents Engineering, Wonkwang University) ;
Shin, Seong-Yoon (School of Computer Information & Communication Engineering, Kunsan National University)

Received : 2017.06.08
Accepted : 2017.07.05
Published : 2017.08.31

https://doi.org/10.6109/jkiice.2017.21.8.1567

Abstract

In this paper, we analyze structured and unstructured data in order to identify related words that affect drama viewer ratings. For the structured data, we collected a total of 19 items across four areas for each broadcasting company: drama information, person information, broadcast information, and viewer rating information. The unstructured data were collected with a crawling technique from the bulletin boards that each broadcasting company operates for each drama, and from blogs written before and after broadcast. Comparing the collected structured data across the four areas showed that the broadcasting companies produced similar results. From the unstructured data collected from the per-drama bulletin boards and blogs, we derived seven related words through correlation analysis of occurrence frequencies. The derived related words were obtained through reliability analysis.
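The abstract does not say how the correlation of occurrence frequencies was computed, so the following is only a minimal sketch of one plausible reading: count how often each word appears in the posts collected for each episode, then rank words by the Pearson correlation between those counts and the episode ratings. The function names `pearson` and `related_words`, the whitespace tokenization, and the toy data are all assumptions made for illustration and are not taken from the paper; real Korean text would need a morphological analyzer rather than a plain split.

```python
from collections import Counter
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        # A constant series has no defined correlation; treat it as 0.
        return 0.0
    return cov / sqrt(vx * vy)


def related_words(posts_per_episode, ratings, top_k=7):
    """Rank words by correlation between per-episode frequency and rating.

    posts_per_episode: list of lists of post texts, one inner list per episode.
    ratings: list of ratings, one per episode (hypothetical values).
    """
    # Word counts per episode (assumption: whitespace tokenization).
    counts = [Counter(" ".join(posts).lower().split())
              for posts in posts_per_episode]
    vocab = set().union(*counts)
    scored = {w: pearson([c[w] for c in counts], ratings) for w in vocab}
    # Highest correlation first; ties broken alphabetically for determinism.
    return sorted(scored, key=lambda w: (-scored[w], w))[:top_k]


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    posts = [["actor great"], ["actor actor boring"], ["actor actor actor"]]
    ratings = [5.0, 7.0, 9.0]
    print(related_words(posts, ratings, top_k=3))
```

The default `top_k=7` mirrors the seven related words reported in the abstract. The reliability analysis mentioned there is not reproduced in this sketch.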

