Browse > Article
http://dx.doi.org/10.9708/jksci.2010.15.11.021

A Study on the Characteristics of Opinion Retrieval Using Term Statistical Analysis in Opinion Documents  

Han, Kyoung-Soo (성결대학교 컴퓨터공학부)
Abstract
Opinion retrieval which searches the opinions expressed in documents by users cannot outperform significantly yet traditional topical retrieval which searches the facts. Therefore, the focus of this paper is to identify the statistical characteristics which can be applied to opinion retrieval by comparing and analyzing the term statistics of opinion and non-opinion documents in the blog domain. The TREC Blogs06 collection and 150 TREC topics are used in the experiments. The difference between term probability distributions in opinion documents is measured by JS divergence, and the difference according to the topic types and topic domains is also investigated. Moreover, the term probabilities of opinion terms are analyzed comparatively. The main findings of this study include the following: it is necessary to consider the topic-specific characteristics for the opinion detection; it is effective to extract positive and negative opinion terms according to the topics; the topic types are complementary to the topic domains; and special attention has to be given to the usage of the positive opinion terms.
Keywords
Opinion Retrieval; Opinion Detection; Opinion Terms;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 Robert Krovetz, "Viewing Morphology as an Inference Process," Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR-1993), pp. 191-202, Pittsburgh, USA, Jun. 1993.
2 Thomas M. Cover and Joy A. Thomas, "Elements of Information Theory," Wiley-Interscience, New York, 1991.
3 Lillian Jane Lee, "Similarity-Based Approaches to Natural Language Processing," Phd Thesis, The Division of Engineering and Applied Sciences, Harvard University, May 1997.
4 Craig Macdonald and Ladh Ounis, "The TREC Blog06 Collection: Creating and Analysing a Blog Test Collection," DCS Technical Report TR-2006-224, University of Glasgow, 2006.
5 The Blogs08 Test Collection, http://ir.dcs.gla.ac.uk/ test_collections/blogs08info.html.
6 TREC 2008 Blog Track, http://trec.nist.gov/data/blog08.html.
7 Lifeng Jia, Clement Yu, and Wei Zhang, "UIC at TREC 2008 Blog Track," Proceedings of the 17th Text Retrieval Conference (TREC-2008), Gaithersburg, Maryland, USA, Nov. 2008.
8 이승욱, 송영인, 임해창, "혼합 방식에 기반한 의견 문서 검색 시스템," 정보관리학회지, 제 25권, 제 4호, 115-129 쪽, 2008년 12월.   과학기술학회마을   DOI
9 남상협, 나승훈, 이예하, 이용훈, 김준기, 이종혁, "의견 어 구추출을위한생성모델과분류모델을결합한부분지도 학습 방법," 한국정보과학회 2008 종합학술대회 논문집, 제 35권, 제 1호(C), 268-273쪽, 2008년 6월.
10 주해종, 홍봉화, 정복철, "의견정보 모니터링을 위한 웹 마 이닝 시스템에 관한 연구," 한국컴퓨터정보학회논문지, 제 15권, 제 1호, 149-157쪽, 2010년 1월.   과학기술학회마을   DOI
11 GuangXu Zhou, Hemant Joshi, and Coskun Bayrak, "Topic Categorization for Relevancy and Opinion Detection," Proceedings of the 16th Text Retrieval Conference (TREC-2007), Gaithersburg, Maryland, USA, Nov. 2007.
12 윤홍준, 김한준, "오피니언 마이닝 기술을 이용한 효율적 상품평 검색 기법," 정보과학회논문지: 컴퓨팅의 실제 및 레터, 제 16권, 제 2호, 222-226쪽, 2010년 2월.   과학기술학회마을
13 Min Zhang and Xingyao Ye, "A Generation Model to Unify Topic Relevance and Lexicon-based Sentiment for Opinion Retrieval," Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR-2008), pp .411-418, Singapore, Jul. 2008.
14 Kiduk Yang, Ning Yu, Alejandro Valerio, Hui Zhang, and Weimao Ke, "Fusion Approach to Finding Opinions in Blogosphere," Proceedings of the 1st International Conference on Weblogs and Social Media(ICWSM-2007), Boulder, Colorado, USA, Mar. 2007.
15 Iadh Ounis, Craig Macdonald, and Ian Soboroff, "Overview of the TREC-2008 Blog Track," Proceedings of the 17th Text Retrieval Conference (TREC-2008), Gaithersburg, Maryland, USA, Nov. 2008.
16 Craig Macdonald, Iadh Ounis, and Ian Soboroff, "Overview of the TREC-2009 Blog Track," Proceedings of the 18th Text Retrieval Conference (TREC-2009), Gaithersburg, Maryland, USA, Nov. 2009.
17 신현일, 유은일, 류근호, "주제어가중치기법에의한효율적인 블로그 검색 시스템," 한국컴퓨터정보학회논문지, 제 15권, 제 4호, 1-9쪽, 2010년 4월.   과학기술학회마을   DOI
18 Olga Vechtomova, "Using Subjective Adjectives in Opinion Retrieval from Blogs," Proceedings of the 16th Text Retrieval Conference (TREC-2007), Gaithersburg, Maryland, USA, Nov. 2007.
19 Soo-Min Kim and Eduard Hovy, "Automatic Detection of Opinion Bearing Words and Sentences," Proceedings of the 2nd International Joint Conference on Natural Language Processing (IJCNLP-2005), pp. 61-66, Jeju Island, Korea, Oct. 2005.
20 Ethan Zhang and Yi Zhang, "UCSC on TREC 2006 Blog Opinion Mining," Proceedings of the 15th Text Retrieval Conference (TREC-2006), Gaithersburg, Maryland, USA, Nov. 2006.
21 Ben He, Craig Macdonald, Jiyin He, and Ladh Ounis, "An Effective Statistical Approach to Blog Post Opinion Retrieval," Proceeding of the 17th ACM Conference on Information and Knowledge Management (CIKM-2008), pp. 1063-1072, California, USA, Oct. 2008.