Sentiment Analysis Engine for Cambodian Music Industry Re-building

A Study on a Sentiment Analysis Engine for Rebuilding the Cambodian Music Industry

  • Received : 2017.11.16
  • Accepted : 2017.12.07
  • Published : 2017.12.31


During the Khmer Rouge regime, Cambodian pop music was completely forgotten, as 90% of artists were killed. After the country began recovering from war in 1979, the music started to grow again in 1990. However, the dynamics and flows of Cambodian popular music are observably directed by multifaceted socioeconomic, political, and creative forces. The major problems are plagiarism and piracy, which have prevailed in the industry for years. Recently, awareness among both fans and artists of the need to preserve original Khmer songs has increased and become a new trend among Cambodia's young population. Still, music quality remains limited. To strengthen this mindset, feedback and inspiration are needed. This study proposes a music ranking website based on sentiment analysis, with data collected from the posts and comments on production companies' Facebook pages. It proposes an algorithm that translates the text from Khmer to English, performs sentiment analysis, and generates the ranking. The results showed 80% accuracy for translation and sentiment analysis in the proposed system. The songs that rank high in the system are those that are original and fit the occasion in Cambodia. The proposed ranking algorithm could help increase the competitive advantage of music productions and encourage producers to compose new songs that fit particular activities and events.

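Cambodian popular music was completely forgotten after 90% of its artists died during the Khmer Rouge regime. After the country began recovering from war in 1979, the music started to grow again in 1990. However, while the dynamics and flows of Cambodian popular music are observed to be shaped by multifaceted socioeconomic, political, and creative forces, plagiarism and piracy have been widespread in the popular music industry for years and have caused many problems. Recently, awareness among both fans and artists of the need to preserve traditional Khmer (the Cambodian language) music has grown and become a new trend among Cambodia's young population, but music quality remains limited, and feedback and inspiration from the public are needed to raise the professionalism of traditional popular music. This study implemented a music ranking chart (website) by applying sentiment analysis to sentences collected from the posts and comments of Facebook pages, the most widely used popular-music-related site in Cambodia. It developed an algorithm that translates Khmer to English, performs sentiment analysis, and generates a ranking. The results showed that the accuracy of translation and sentiment analysis in the proposed system was 80%. The songs ranked highly were traditional popular music in Khmer (the Cambodian language), which is consistent with the aim of this paper. Using the proposed system and ranking algorithm to revive Cambodian traditional popular music is expected to increase the competitive advantage of music production and help producers compose new songs suited to particular activities and events.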
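The three-step pipeline named in the abstract (translate comments from Khmer to English, score their sentiment, rank the songs) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the tiny word lists, the glossary-based `translate_to_english` stand-in for a real translation service, the per-comment score formula, and the use of the mean score per song as the ranking key.

```python
from collections import defaultdict

# Hypothetical mini-lexicon; the paper's actual sentiment model is not specified here.
POSITIVE = {"love", "great", "beautiful", "original", "good"}
NEGATIVE = {"bad", "copy", "copied", "boring", "hate"}


def translate_to_english(text, glossary=None):
    """Stand-in for the Khmer-to-English step (assumption).

    Looks the whole comment up in an optional glossary and returns it
    unchanged when no entry exists. A real system would call a
    translation service here.
    """
    glossary = glossary or {}
    return glossary.get(text, text)


def sentiment_score(english_text):
    """Return (positive words - negative words) / total words, in [-1, 1]."""
    words = [w.strip(".,!?").lower() for w in english_text.split()]
    words = [w for w in words if w]
    if not words:
        return 0.0
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    return (pos - neg) / len(words)


def rank_songs(comments, glossary=None):
    """Rank songs by mean comment sentiment, best first.

    comments: iterable of (song_title, comment_text) pairs.
    Returns a list of (song_title, mean_score) tuples.
    """
    scores = defaultdict(list)
    for song, text in comments:
        english = translate_to_english(text, glossary)
        scores[song].append(sentiment_score(english))
    ranking = [(song, sum(vals) / len(vals)) for song, vals in scores.items()]
    return sorted(ranking, key=lambda item: item[1], reverse=True)


# Sample comments, already in English so the translation stub passes them through.
sample = [
    ("Song A", "Great original song!"),
    ("Song A", "I love it"),
    ("Song B", "Boring copy"),
]
ranking = rank_songs(sample)
for song, score in ranking:
    print(f"{song}: {score:+.2f}")
```

Averaging per song keeps a heavily commented song from outranking others on volume alone; a production ranking would likely also weight by comment count or recency, which this sketch leaves out.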


